Planning with Diffusion

submited by
Style Pass
2022-05-23 03:00:19

Diffuser is a denoising diffusion probabilistic model that plans by iteratively refining randomly sampled noise. The denoising process lends itself to flexible conditioning, by either using gradients of an objective function to bias plans toward high-reward regions or conditioning the plan to reach a specified goal.

Diffuser's planning horizon is determined by the size of the random noise used to initialize the denoising process.

Diffuser acts as an unconditional prior over possible behaviors. We can plan for new test-time tasks by guiding its sampled plans with reward functions or constraints. All of the plans below are executed by a single model.

Leave a Comment