Welcome!

Hi👋, welcome to Dibbla's space!


I'm Dibbla and this is my blog.
I write about programming, technology, and other things that interest me.
I hope you find something useful here.

This site is under aggressive development. Maybe return later for more content.

Diffusion Family: Score Matching Langevin Dynamics and More

References: Tutorial on Diffusion Models for Imaging and Vision by Stanley Chan Dr. Yang Song’s blog on Score Matching Basics of Langevin Dynamics Unlike DDPM, which models the generative model as a hidden variable model with $x_{1:T}$ as the hidden variables, score-matching models, while deeply linked to DDPM, starts from a sampling view and later concerns about the distribution we sampled from. Let’s start with an assumption that we have a distribution $p(x)$ that we can sample from, and this distribution is exactly the distribution we want (say, the image distribution of a cat)....

July 15, 2024 · Dibbla

Diffusion Family: DDPM? Implementation Matters!

Implementation really matters when it comes to training Diffusion Models. In this post, we will discuss some tircks, some problems I encountered, coding details and model architectures that avoid awkward failure of noise in noise out. Our goal is to generate satisfying samples from 2 common datasets: MNIST (easier) and CIFAR-10 (harder). References: The Annotated Diffusion Model Diffusers Lil’Log What are Diffusion Models? UNet Architecture UNet is a common architecture for image generation....

June 28, 2024 · Dibbla

Sutton & Barto Reading Note: Chapter 3

In the last note, we have covered first 2 chapters of the book, and discussed about the tabular cases of RL(Bandit problems). In this note, we will discuss the Finite Markov Decision Process(MDP) and the Bellman Equation. Agent-Environment Interface, Goals and Rewards As in this series we assume readers have some ideas about “RL learns from interactions with the environment”, we will only briefly introduce the agent-environment interface here. It can be illustrated in a diagram as below:...

June 16, 2024 · Dibbla

Diffusion Family: DDPM

In this blog we will try to understand the member of diffusion family, DDPM. Given the common structure shared by variational autoencoders (VAEs) and diffusion models, we will first discuss some important content in VAE, and then introduce the DDPM. The blog will try to be self-contained and mathematically clear, but some prior knowledge of VAEs and generative models is assumed. References: VAE on Wikipedia Lil’Log What are Diffusion Models?...

June 13, 2024 · Dibbla

Sutton & Barto Reading Note: Chapter 1-2

I am reviewing the book, Reinforcement Learning: An Introduction by Sutton and Barto. This post covers the first two chapters of the book. As the very first note in this series, it is good to explain why I write these notes. First of all, it is good to review RL even in this era where LLM/AIGC is the new hype. Secondly, I am preparing for my job search and grad study....

June 13, 2024 · Dibbla