Sascha Kirch

AI

Here Comes Mamba: The Selective State Space Model

šŸ Towards Mamba State Space Models for Images, Videos and Time Series Part 3 ā€” Towards Mamba State Space Models for Images, Videos and Time Series Sascha Kirch Ā· Follow Published in Towards Data Science Ā· 17 min read Ā· 7 hours ago — Image by Sascha Kirch. This is part 3 of my new multi-part series šŸ Towards Mamba State Space Models for Images, Videos and Time Series. Mamba, the model to be said

Read More Ā»
AI

Towards Mamba State Space Models for Images, Videos and Time Series

šŸ Towards Mamba State Space Models for Images, Videos and Time Series Part 1 Sascha Kirch Ā· Follow Published in Towards Data Science Ā· 16 min read Ā· 8 hours ago — Image by Sascha Kirch This is part 1 of my new multi-part series šŸ Towards Mamba State Space Models for Images, Videos and Time Series. Is Mamba all you need? Certainly, people have thought that for a long time of the Transformer architecture

Read More Ā»
AI

The Rise of Diffusion Modelsā€Šā€”ā€ŠA new Era of Generative Deep Learning

Paper Walkthrough: Denoising Diffusion Probabilistic Models Sascha Kirch Ā· Follow Published in Towards Data Science Ā· 13 min read Ā· 10 hours ago — This walkthrough is about a paper that kicked off a new era of generative deep learning in computer vision and many other fields subsequently: the era of diffusion models. Itā€™s titled ā€œDenoising Diffusion Probabilistic Modelsā€ and it introduces a new framework known as DDPM, the abbreviation of the paperā€™s title. While

Read More Ā»
AI

Depth Anything ā€”A Foundation Model for Monocular Depth Estimation

Paper Walkthrough ā€” Depth Anything: Unleashing the Power of Large-Scale Unlabeled Data Sascha Kirch Ā· Follow Published in Towards Data Science Ā· 11 min read Ā· 12 hours ago — Monocular depth estimation, the prediction of distance in 3D space from a 2D image. The ā€œill posed and inherently ambiguous problemā€, as stated in literally every paper on depth estimation, is a fundamental problem in computer vision and robotics. At the same time foundation models

Read More Ā»