DreamBooth3D: Subject-Driven Text-to-3D Generation

03/23/2023
by   Amit Raj, et al.
0

We present DreamBooth3D, an approach to personalize text-to-3D generative models from as few as 3-6 casually captured images of a subject. Our approach combines recent advances in personalizing text-to-image models (DreamBooth) with text-to-3D generation (DreamFusion). We find that naively combining these methods fails to yield satisfactory subject-specific 3D assets due to personalized text-to-image models overfitting to the input viewpoints of the subject. We overcome this through a 3-stage optimization strategy where we jointly leverage the 3D consistency of neural radiance fields together with the personalization capability of text-to-image models. Our method can produce high-quality, subject-specific 3D assets with text-driven modifications such as novel poses, colors and attributes that are not seen in any of the input images of the subject.

READ FULL TEXT

page 1

page 4

page 6

page 7

page 8

page 12

page 13

research
01/14/2022

A Survey of Pretrained Language Models Based Text Generation

Text Generation aims to produce plausible and readable text in human lan...
research
05/07/2020

Learning Implicit Text Generation via Feature Matching

Generative feature matching network (GFMN) is an approach for training i...
research
09/07/2023

Chasing Consistency in Text-to-3D Generation from a Single Image

Text-to-3D generation from a single-view image is a popular but challeng...
research
04/01/2023

Subject-driven Text-to-Image Generation via Apprenticeship Learning

Recent text-to-image generation models like DreamBooth have made remarka...
research
05/18/2023

Collaborative Generative AI: Integrating GPT-k for Efficient Editing in Text-to-Image Generation

The field of text-to-image (T2I) generation has garnered significant att...
research
11/09/2020

MUSE: Illustrating Textual Attributes by Portrait Generation

We propose a novel approach, MUSE, to illustrate textual attributes visu...
research
06/12/2023

Controlling Text-to-Image Diffusion by Orthogonal Finetuning

Large text-to-image diffusion models have impressive capabilities in gen...

Please sign up or login with your details

Forgot password? Click here to reset