Imitating Human Behaviour with Diffusion Models

01/25/2023
by   Tim Pearce, et al.
0

Diffusion models have emerged as powerful generative models in the text-to-image domain. This paper studies their application as observation-to-action models for imitating human behaviour in sequential environments. Human behaviour is stochastic and multimodal, with structured correlations between action dimensions. Meanwhile, standard modelling choices in behaviour cloning are limited in their expressiveness and may introduce bias into the cloned policy. We begin by pointing out the limitations of these choices. We then propose that diffusion models are an excellent fit for imitating human behaviour, since they learn an expressive distribution over the joint action space. We introduce several innovations to make diffusion models suitable for sequential environments; designing suitable architectures, investigating the role of guidance, and developing reliable sampling strategies. Experimentally, diffusion models closely match human demonstrations in a simulated robotic control task and a modern 3D gaming environment.

READ FULL TEXT

page 1

page 5

page 18

page 21

research
06/07/2023

On the Design Fundamentals of Diffusion Models: A Survey

Diffusion models are generative models, which gradually add and remove n...
research
03/25/2023

Better Aligning Text-to-Image Models with Human Preference

Recent years have witnessed a rapid growth of deep generative models, wi...
research
06/12/2023

Latent Dynamical Implicit Diffusion Processes

Latent dynamical models are commonly used to learn the distribution of a...
research
02/26/2023

Diffusion Model-Augmented Behavioral Cloning

Imitation learning addresses the challenge of learning by observing an e...
research
09/29/2022

Human Motion Diffusion Model

Natural and expressive human motion generation is the holy grail of comp...
research
11/21/2022

Investigating Prompt Engineering in Diffusion Models

With the spread of the use of Text2Img diffusion models such as DALL-E 2...
research
09/04/2023

DiffHPE: Robust, Coherent 3D Human Pose Lifting with Diffusion

We present an innovative approach to 3D Human Pose Estimation (3D-HPE) b...

Please sign up or login with your details

Forgot password? Click here to reset