Diffusion Family: Score Matching Langevin Dynamics and More

References: Tutorial on Diffusion Models for Imaging and Vision by Stanley Chan Dr. Yang Song’s blog on Score Matching Basics of Langevin Dynamics Unlike DDPM, which models the generative model as a hidden variable model with $x_{1:T}$ as the hidden variables, score-matching models, while deeply linked to DDPM, starts from a sampling view and later concerns about the distribution we sampled from. Let’s start with an assumption that we have a distribution $p(x)$ that we can sample from, and this distribution is exactly the distribution we want (say, the image distribution of a cat)....

July 15, 2024 · Dibbla

Diffusion Family: DDPM? Implementation Matters!

Implementation really matters when it comes to training Diffusion Models. In this post, we will discuss some tircks, some problems I encountered, coding details and model architectures that avoid awkward failure of noise in noise out. Our goal is to generate satisfying samples from 2 common datasets: MNIST (easier) and CIFAR-10 (harder). References: The Annotated Diffusion Model Diffusers Lil’Log What are Diffusion Models? UNet Architecture UNet is a common architecture for image generation....

June 28, 2024 · Dibbla

Diffusion Family: DDPM

In this blog we will try to understand the member of diffusion family, DDPM. Given the common structure shared by variational autoencoders (VAEs) and diffusion models, we will first discuss some important content in VAE, and then introduce the DDPM. The blog will try to be self-contained and mathematically clear, but some prior knowledge of VAEs and generative models is assumed. References: VAE on Wikipedia Lil’Log What are Diffusion Models?...

June 13, 2024 · Dibbla