StylePrompter: All Styles Need Is Attention

07/30/2023
by   Chenyi Zhuang, et al.
GAN inversion aims to invert given images into corresponding latent codes for Generative Adversarial Networks (GANs), especially StyleGAN, whose disentangled latent space allows attribute-based image manipulation at the latent level. Whereas most inversion methods build upon Convolutional Neural Networks (CNNs), we adapt a hierarchical vision Transformer backbone to predict 𝒲^+ latent codes at the token level. We further apply a Style-driven Multi-scale Adaptive Refinement Transformer (SMART) in ℱ space to refine the intermediate style features of the generator. By treating style features as queries to retrieve lost identity information from the encoder's feature maps, SMART not only produces high-quality inverted images but also adapts surprisingly well to editing tasks. We then prove that StylePrompter lies in a more disentangled 𝒲^+ space and show the controllability of SMART. Finally, quantitative and qualitative experiments demonstrate that StylePrompter achieves a desirable balance between reconstruction quality and editability, and is "smart" enough to fit into most edits, outperforming other ℱ-involved inversion methods.
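The idea of treating style features as queries over the encoder's feature maps can be illustrated with plain scaled dot-product cross-attention. The sketch below is only an illustration under stated assumptions, not the SMART module itself: the function names (`cross_attention`, `refine_style_features`), the residual update, and the omission of learned query/key/value projections and multi-scale handling are all simplifications introduced here and are not taken from the paper.

```python
import math


def softmax(scores):
    """Numerically stable softmax over a list of floats."""
    m = max(scores)
    exps = [math.exp(s - m) for s in scores]
    total = sum(exps)
    return [e / total for e in exps]


def cross_attention(queries, keys, values):
    """Scaled dot-product attention: each query attends over all keys.

    queries: list of vectors of dimension d
    keys:    list of vectors of dimension d
    values:  list of vectors (one per key)
    Returns one attended vector per query.
    """
    scale = 1.0 / math.sqrt(len(keys[0]))
    outputs = []
    for q in queries:
        scores = [scale * sum(qi * ki for qi, ki in zip(q, k)) for k in keys]
        weights = softmax(scores)
        out = [
            sum(w * v[j] for w, v in zip(weights, values))
            for j in range(len(values[0]))
        ]
        outputs.append(out)
    return outputs


def refine_style_features(style_tokens, encoder_tokens):
    """Hypothetical refinement step (an assumption, not the paper's design):
    style tokens act as queries, encoder tokens as keys and values, and the
    attended result is added back to the style tokens as a residual.
    """
    attended = cross_attention(style_tokens, encoder_tokens, encoder_tokens)
    return [
        [s + a for s, a in zip(style, att)]
        for style, att in zip(style_tokens, attended)
    ]


# Toy usage: two style tokens query three encoder tokens (dimension 2).
style = [[1.0, 0.0], [0.0, 2.0]]
encoder = [[0.5, 0.1], [0.2, 0.9], [0.3, 0.3]]
refined = refine_style_features(style, encoder)
```

In this reading, each refined style token is the original token plus a weighted average of encoder tokens, with weights given by query-key similarity. A real module would typically add learned projections and normalization, which are left out here to keep the mechanism visible.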

research
03/15/2022

Style Transformer for Image Inversion and Editing

Existing GAN inversion methods fail to provide latent codes for reliable...
research
02/04/2022

Feature-Style Encoder for Style-Based GAN Inversion

We propose a novel architecture for GAN inversion, which we call Feature...
research
08/31/2023

Robust GAN inversion

Recent advancements in real image editing have been attributed to the ex...
research
10/17/2021

AE-StyleGAN: Improved Training of Style-Based Auto-Encoders

StyleGANs have shown impressive results on data generation and manipulat...
research
08/26/2022

User-Controllable Latent Transformer for StyleGAN Image Layout Editing

Latent space exploration is a technique that discovers interpretable lat...
research
07/20/2020

Generative Hierarchical Features from Synthesizing Images

Generative Adversarial Networks (GANs) have recently advanced image synt...
research
08/16/2019

TunaGAN: Interpretable GAN for Smart Editing

In this paper, we introduce a tunable generative adversary network (Tuna...
