Guide3D: Create 3D Avatars from Text and Image Guidance

08/18/2023
by Yukang Cao, et al.

Recently, text-to-image generation has exhibited remarkable advancements, with the ability to produce visually impressive results. In contrast, text-to-3D generation has not yet reached a comparable level of quality. Existing methods primarily rely on text-guided score distillation sampling (SDS), and they encounter difficulties in transferring 2D attributes of the generated images to 3D content. In this work, we aim to develop an effective 3D generative model capable of synthesizing high-resolution textured meshes by leveraging both textual and image information. To this end, we introduce Guide3D, a zero-shot text-and-image-guided generative model for 3D avatar generation based on diffusion models. Our model involves (1) generating sparse-view images of a text-consistent character using diffusion models, and (2) jointly optimizing multi-resolution differentiable marching tetrahedral grids with pixel-aligned image features. We further propose a similarity-aware feature fusion strategy for efficiently integrating features from different views. Moreover, we introduce two novel training objectives as an alternative to calculating SDS, significantly enhancing the optimization process. We thoroughly evaluate the performance and components of our framework, which outperforms the current state-of-the-art in producing topologically and structurally correct geometry and high-resolution textures. Guide3D enables the direct transfer of 2D-generated images to the 3D space. Our code will be made publicly available.
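
The abstract does not say how the similarity-aware feature fusion is computed, so the following is only an illustrative sketch of one plausible reading, not the authors' method. It assumes each 3D query point has one pixel-aligned feature vector per view, scores each view by its cosine similarity to the mean feature, and blends the views with softmax weights. The names `fuse_views`, `cosine`, and `temperature` are hypothetical.

```python
import math


def cosine(a, b):
    """Cosine similarity between two equal-length vectors."""
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    return dot / (norm_a * norm_b + 1e-8)  # epsilon guards against zero vectors


def fuse_views(features, temperature=1.0):
    """Blend per-view feature vectors for a single query point.

    ASSUMPTION: views that agree with the cross-view mean receive more
    weight. The paper's actual similarity measure and weighting may differ.
    """
    n_views = len(features)
    dim = len(features[0])

    # Reference vector: plain average over all views.
    mean = [sum(f[i] for f in features) / n_views for i in range(dim)]

    # Similarity of each view to the reference.
    sims = [cosine(f, mean) for f in features]

    # Numerically stable softmax over the similarities.
    top = max(sims)
    exps = [math.exp((s - top) / temperature) for s in sims]
    total = sum(exps)
    weights = [e / total for e in exps]

    # Weighted sum of the per-view features.
    fused = [sum(w * f[i] for w, f in zip(weights, features)) for i in range(dim)]
    return fused, weights


if __name__ == "__main__":
    # Two views agree, one disagrees; the outlier should be down-weighted.
    views = [[1.0, 0.0], [1.0, 0.0], [0.0, 1.0]]
    fused, weights = fuse_views(views)
    print(fused, weights)
```

A lower `temperature` sharpens the weighting toward the most consistent views, while a high value approaches a plain average.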

research
06/05/2023

HeadSculpt: Crafting 3D Head Avatars with Text

Recently, text-guided 3D generative methods have made remarkable advance...
research
11/29/2022

DATID-3D: Diversity-Preserved Domain Adaptation Using Text-to-Image Diffusion for 3D Generative Model

Recent 3D generative models have achieved remarkable performance in synt...
research
11/14/2022

Latent-NeRF for Shape-Guided Generation of 3D Shapes and Textures

Text-guided image generation has progressed rapidly in recent years, ins...
research
05/10/2023

Text-guided High-definition Consistency Texture Model

With the advent of depth-to-image diffusion models, text-guided generati...
research
08/21/2023

TADA! Text to Animatable Digital Avatars

We introduce TADA, a simple-yet-effective approach that takes textual de...
research
09/22/2022

Implementing and Experimenting with Diffusion Models for Text-to-Image Generation

Taking advantage of the many recent advances in deep learning, text-to-i...
research
07/04/2020

BézierSketch: A generative model for scalable vector sketches

The study of neural generative models of human sketches is a fascinating...
