Zero-shot Text-driven Physically Interpretable Face Editing

08/11/2023
by   Yapeng Meng, et al.
0

This paper proposes a novel and physically interpretable method for face editing based on arbitrary text prompts. Different from previous GAN-inversion-based face editing methods that manipulate the latent space of GANs, or diffusion-based methods that model image manipulation as a reverse diffusion process, we regard the face editing process as imposing vector flow fields on face images, representing the offset of spatial coordinates and color for each image pixel. Under the above-proposed paradigm, we represent the vector flow field in two ways: 1) explicitly represent the flow vectors with rasterized tensors, and 2) implicitly parameterize the flow vectors as continuous, smooth, and resolution-agnostic neural fields, by leveraging the recent advances of implicit neural representations. The flow vectors are iteratively optimized under the guidance of the pre-trained Contrastive Language-Image Pretraining (CLIP) model by maximizing the correlation between the edited image and the text prompt. We also propose a learning-based one-shot face editing framework, which is fast and adaptable to any text prompt input. Our method can also be flexibly extended to real-time video face editing. Compared with state-of-the-art text-driven face editing methods, our method can generate physically interpretable face editing results with high identity consistency and image quality. Our code will be made publicly available.

READ FULL TEXT

page 2

page 3

page 4

page 5

page 6

page 8

page 9

page 10

research
08/30/2023

Zero-shot Inversion Process for Image Attribute Editing with Diffusion Models

Denoising diffusion models have shown outstanding performance in image e...
research
05/24/2023

ChatFace: Chat-Guided Real Face Editing via Diffusion Latent Space Manipulation

Editing real facial images is a crucial task in computer vision with sig...
research
06/14/2023

VidEdit: Zero-Shot and Spatially Aware Text-Driven Video Editing

Recently, diffusion-based generative models have achieved remarkable suc...
research
11/30/2018

The GAN that Warped: Semantic Attribute Editing with Unpaired Data

Deep neural networks have recently been used to edit images with great s...
research
03/02/2023

Zero-Shot Text-to-Parameter Translation for Game Character Auto-Creation

Recent popular Role-Playing Games (RPGs) saw the great success of charac...
research
09/08/2021

FaceCook: Face Generation Based on Linear Scaling Factors

With the excellent disentanglement properties of state-of-the-art genera...
research
07/21/2023

FaceCLIPNeRF: Text-driven 3D Face Manipulation using Deformable Neural Radiance Fields

As recent advances in Neural Radiance Fields (NeRF) have enabled high-fi...

Please sign up or login with your details

Forgot password? Click here to reset