DreamAvatar: Text-and-Shape Guided 3D Human Avatar Generation via Diffusion Models

04/03/2023
by Yukang Cao, et al.

We present DreamAvatar, a text-and-shape guided framework for generating high-quality 3D human avatars with controllable poses. While recent methods have produced encouraging results on text-guided generation of common 3D objects, generating high-quality human avatars remains an open challenge due to the complexity of the human body's shape, pose, and appearance. To tackle this challenge, DreamAvatar uses a trainable NeRF to predict density and color features for 3D points and a pre-trained text-to-image diffusion model to provide 2D self-supervision. Specifically, we leverage SMPL models to provide rough pose and shape guidance for the generation. We introduce a dual-space design comprising a canonical space and an observation space, which are related by a learnable deformation field through the NeRF; this allows well-optimized texture and geometry to be transferred from the canonical space to the target posed avatar. Additionally, we exploit a normal-consistency regularization to obtain more vivid generation with detailed geometry and texture. Through extensive evaluations, we demonstrate that DreamAvatar significantly outperforms existing methods, establishing a new state of the art for text-and-shape guided 3D human generation.
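
To make the phrase "density and color features for 3D points" concrete, the following is a minimal sketch of the standard NeRF volume-rendering quadrature, which alpha-composites per-sample densities and colors along one camera ray into a pixel color. It is not taken from the DreamAvatar implementation: the function name `composite_ray`, its arguments, and the use of NumPy are assumptions made for illustration only, and the paper's actual rendering, deformation field, and diffusion-based supervision are not shown.

```python
import numpy as np

def composite_ray(densities, colors, deltas):
    """Alpha-composite samples along one ray (standard NeRF quadrature).

    densities: (N,) non-negative volume densities at the ray samples
    colors:    (N, 3) RGB colors predicted at the same samples
    deltas:    (N,) distances between consecutive samples
    Returns the rendered RGB value and the per-sample weights.
    Hypothetical helper for illustration, not the authors' code.
    """
    densities = np.asarray(densities, dtype=float)
    colors = np.asarray(colors, dtype=float)
    deltas = np.asarray(deltas, dtype=float)

    # Opacity contributed by each sample segment.
    alphas = 1.0 - np.exp(-densities * deltas)
    # Transmittance: fraction of light that reaches sample i unobstructed.
    transmittance = np.concatenate([[1.0], np.cumprod(1.0 - alphas[:-1])])
    weights = transmittance * alphas
    rgb = (weights[:, None] * colors).sum(axis=0)
    return rgb, weights


# Usage: an empty sample, then a very dense green sample that hides the blue one behind it.
rgb, weights = composite_ray(
    densities=[0.0, 1e9, 5.0],
    colors=[[1.0, 0.0, 0.0], [0.0, 1.0, 0.0], [0.0, 0.0, 1.0]],
    deltas=[0.1, 0.1, 0.1],
)
```

In a text-guided pipeline of the kind the abstract describes, images rendered this way would be the inputs scored by the pre-trained diffusion model; that coupling is left out of the sketch.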


research
08/01/2022

AvatarGen: a 3D Generative Model for Animatable Human Avatars

Unsupervised generation of clothed virtual humans with various appearanc...
research
09/07/2023

Text2Control3D: Controllable 3D Avatar Generation in Neural Radiance Fields using Geometry-Guided Text-to-Image Diffusion Model

Recent advances in diffusion models such as ControlNet have enabled geom...
research
05/25/2023

ZeroAvatar: Zero-shot 3D Avatar Generation from a Single Image

Recent advancements in text-to-image generation have enabled significant...
research
03/30/2023

AvatarCraft: Transforming Text into Neural Human Avatars with Parameterized Shape and Pose Control

Neural implicit fields are powerful for representing 3D scenes and gener...
research
07/10/2023

Articulated 3D Head Avatar Generation using Text-to-Image Diffusion Models

The ability to generate diverse 3D articulated head avatars is vital to ...
research
11/13/2022

VGFlow: Visibility guided Flow Network for Human Reposing

The task of human reposing involves generating a realistic image of a pe...
research
08/16/2023

TeCH: Text-guided Reconstruction of Lifelike Clothed Humans

Despite recent research advancements in reconstructing clothed humans fr...
