Diffusion Models for Video Generation

Diffusion models have demonstrated strong results on image synthesis in recent years. Now the research community has started working on a harder task: using them for video generation. Video generation is a superset of the image case, since an image is a video of a single frame, but it is much more challenging because the model must additionally keep frames temporally consistent, and large amounts of high-quality video data are much harder to collect than image data.

🥑 Required Pre-read: Please make sure you have read the previous blog on “What are Diffusion Models?” for image generation before continuing here.

First let’s review approaches for designing and training diffusion video models from scratch, meaning that we do not rely on pre-trained image generators.

Here we use a slightly different variable definition from the previous post, but the math stays the same. Let $\mathbf{x} \sim q_\text{real}$ be a data point sampled from the real data distribution. We add Gaussian noise in small steps over time, creating a sequence of noisy variations of $\mathbf{x}$, denoted as $\{\mathbf{z}_t \mid t = 1, \dots, T\}$, with an increasing amount of noise as $t$ increases, until the last one $q(\mathbf{z}_T)$ is approximately $\mathcal{N}(\mathbf{0}, \mathbf{I})$. The noise-adding forward process is a Gaussian process. Let $\alpha_t, \sigma_t$ define a differentiable noise schedule of the Gaussian process:

$$q(\mathbf{z}_t \vert \mathbf{x}) = \mathcal{N}(\mathbf{z}_t; \alpha_t \mathbf{x}, \sigma^2_t \mathbf{I})$$
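As a minimal sketch of this forward process (assuming a simple cosine schedule and treating a video clip as a `(frames, height, width, channels)` array; `cosine_schedule` and `q_sample` are illustrative names, not from the post), sampling $\mathbf{z}_t \sim q(\mathbf{z}_t \mid \mathbf{x})$ might look like:

```python
import numpy as np

def cosine_schedule(t):
    """One common choice of differentiable noise schedule (an assumption for
    this sketch): alpha_t = cos(pi*t/2), sigma_t = sin(pi*t/2) for t in [0, 1],
    so that alpha_t^2 + sigma_t^2 = 1 (variance preserving)."""
    alpha_t = np.cos(0.5 * np.pi * t)
    sigma_t = np.sin(0.5 * np.pi * t)
    return alpha_t, sigma_t

def q_sample(x, t, rng):
    """Sample z_t ~ q(z_t | x) = N(alpha_t * x, sigma_t^2 * I).

    `x` is a video array of shape (frames, height, width, channels); the same
    scalar noise level t is shared across all frames of the clip.
    """
    alpha_t, sigma_t = cosine_schedule(t)
    eps = rng.standard_normal(x.shape)
    return alpha_t * x + sigma_t * eps

# Example: noise a random 8-frame 64x64 RGB "video" at t = 0.5.
rng = np.random.default_rng(0)
video = rng.standard_normal((8, 64, 64, 3))
z_half = q_sample(video, t=0.5, rng=rng)
```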

Let the log signal-to-noise ratio be $\lambda_t = \log[\alpha^2_t / \sigma^2_t]$; then we can write the DDIM (Song et al. 2020) update as:

$$\mathbf{z}_s = \alpha_s \hat{\mathbf{x}}_\theta(\mathbf{z}_t) + \sigma_s \frac{\mathbf{z}_t - \alpha_t \hat{\mathbf{x}}_\theta(\mathbf{z}_t)}{\sigma_t} \quad \text{for } s < t$$

where $\hat{\mathbf{x}}_\theta(\mathbf{z}_t)$ is the model’s prediction of the clean data $\mathbf{x}$ from the noisy input $\mathbf{z}_t$.
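To make the step concrete, here is a minimal sketch of a single deterministic DDIM update under the $\hat{\mathbf{x}}_\theta$-prediction parameterization above (the function names and the standalone `log_snr` helper are illustrative assumptions; in practice `x_hat` would come from the trained denoising network):

```python
import numpy as np

def log_snr(alpha_t, sigma_t):
    """Log signal-to-noise ratio lambda_t = log(alpha_t^2 / sigma_t^2)."""
    return 2.0 * (np.log(alpha_t) - np.log(sigma_t))

def ddim_step(z_t, x_hat, alpha_t, sigma_t, alpha_s, sigma_s):
    """One deterministic DDIM update from noise level t to an earlier level s < t.

    `x_hat` stands in for the model prediction x_hat_theta(z_t) of the clean
    video. The implied noise is eps_hat = (z_t - alpha_t * x_hat) / sigma_t,
    and the prediction is re-noised to the lower noise level s:
        z_s = alpha_s * x_hat + sigma_s * eps_hat
    """
    eps_hat = (z_t - alpha_t * x_hat) / sigma_t
    return alpha_s * x_hat + sigma_s * eps_hat
```

Applying `ddim_step` repeatedly, from $t = T$ down to $t = 0$ with the model re-evaluated at each step, gives a deterministic sampler.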
