Free-style and Fast 3D Portrait Synthesis

06/27/2023
by   Tianxiang Ma, et al.
0

Efficiently generating a free-style 3D portrait with high quality and consistency is a promising yet challenging task. The portrait styles generated by most existing methods are usually restricted by their 3D generators, which are learned in specific facial datasets, such as FFHQ. To get a free-style 3D portrait, one can build a large-scale multi-style database to retrain the 3D generator, or use a off-the-shelf tool to do the style translation. However, the former is time-consuming due to data collection and training process, the latter may destroy the multi-view consistency. To tackle this problem, we propose a fast 3D portrait synthesis framework in this paper, which enable one to use text prompts to specify styles. Specifically, for a given portrait style, we first leverage two generative priors, a 3D-aware GAN generator (EG3D) and a text-guided image editor (Ip2p), to quickly construct a few-shot training set, where the inference process of Ip2p is optimized to make editing more stable. Then we replace original triplane generator of EG3D with a Image-to-Triplane (I2T) module for two purposes: 1) getting rid of the style constraints of pre-trained EG3D by fine-tuning I2T on the few-shot dataset; 2) improving training efficiency by fixing all parts of EG3D except I2T. Furthermore, we construct a multi-style and multi-identity 3D portrait database to demonstrate the scalability and generalization of our method. Experimental results show that our method is capable of synthesizing high-quality 3D portraits with specified styles in a few minutes, outperforming the state-of-the-art.

READ FULL TEXT

page 2

page 3

page 6

page 7

page 8

page 9

research
08/12/2021

MISS GAN: A Multi-IlluStrator Style Generative Adversarial Network for image to illustration translation

Unsupervised style transfer that supports diverse input styles using onl...
research
04/04/2019

In Other News: A Bi-style Text-to-speech Model for Synthesizing Newscaster Voice with Limited Data

Neural text-to-speech synthesis (NTTS) models have shown significant pro...
research
10/18/2021

StyleNeRF: A Style-based 3D-Aware Generator for High-resolution Image Synthesis

We propose StyleNeRF, a 3D-aware generative model for photo-realistic hi...
research
06/01/2023

StyleDrop: Text-to-Image Generation in Any Style

Pre-trained large text-to-image models synthesize impressive images with...
research
03/20/2019

Im2Pencil: Controllable Pencil Illustration from Photographs

We propose a high-quality photo-to-pencil translation method with fine-g...
research
10/29/2019

Disentangling Timbre and Singing Style with Multi-singer Singing Synthesis System

In this study, we define the identity of the singer with two independent...
research
06/27/2017

Auto-Encoder Guided GAN for Chinese Calligraphy Synthesis

In this paper, we investigate the Chinese calligraphy synthesis problem:...

Please sign up or login with your details

Forgot password? Click here to reset