Third Time's the Charm? Image and Video Editing with StyleGAN3

01/31/2022
by   Yuval Alaluf, et al.
7

StyleGAN is arguably one of the most intriguing and well-studied generative models, demonstrating impressive performance in image generation, inversion, and manipulation. In this work, we explore the recent StyleGAN3 architecture, compare it to its predecessor, and investigate its unique advantages, as well as drawbacks. In particular, we demonstrate that while StyleGAN3 can be trained on unaligned data, one can still use aligned data for training, without hindering the ability to generate unaligned imagery. Next, our analysis of the disentanglement of the different latent spaces of StyleGAN3 indicates that the commonly used W/W+ spaces are more entangled than their StyleGAN2 counterparts, underscoring the benefits of using the StyleSpace for fine-grained editing. Considering image inversion, we observe that existing encoder-based techniques struggle when trained on unaligned data. We therefore propose an encoding scheme trained solely on aligned data, yet can still invert unaligned images. Finally, we introduce a novel video inversion and editing workflow that leverages the capabilities of a fine-tuned StyleGAN3 generator to reduce texture sticking and expand the field of view of the edited video.

READ FULL TEXT

page 17

page 19

page 22

page 23

page 24

page 26

page 27

page 28

research
01/31/2023

ReGANIE: Rectifying GAN Inversion Errors for Accurate Real Image Editing

The StyleGAN family succeed in high-fidelity image generation and allow ...
research
06/21/2022

Temporally Consistent Semantic Video Editing

Generative adversarial networks (GANs) have demonstrated impressive imag...
research
04/18/2023

UPGPT: Universal Diffusion Model for Person Image Generation, Editing and Pose Transfer

Existing person image generative models can do either image generation o...
research
03/08/2023

Video-P2P: Video Editing with Cross-attention Control

This paper presents Video-P2P, a novel framework for real-world video ed...
research
03/22/2023

Make Encoder Great Again in 3D GAN Inversion through Geometry and Occlusion-Aware Encoding

3D GAN inversion aims to achieve high reconstruction fidelity and reason...
research
02/09/2023

In-N-Out: Face Video Inversion and Editing with Volumetric Decomposition

3D-aware GANs offer new capabilities for creative content editing, such ...
research
06/13/2021

Inverting Adversarially Robust Networks for Image Synthesis

Recent research in adversarially robust classifiers suggests their repre...

Please sign up or login with your details

Forgot password? Click here to reset