Improving Negative-Prompt Inversion via Proximal Guidance

06/08/2023
by   Ligong Han, et al.
0

DDIM inversion has revealed the remarkable potential of real image editing within diffusion-based methods. However, the accuracy of DDIM reconstruction degrades as larger classifier-free guidance (CFG) scales being used for enhanced editing. Null-text inversion (NTI) optimizes null embeddings to align the reconstruction and inversion trajectories with larger CFG scales, enabling real image editing with cross-attention control. Negative-prompt inversion (NPI) further offers a training-free closed-form solution of NTI. However, it may introduce artifacts and is still constrained by DDIM reconstruction quality. To overcome these limitations, we propose Proximal Negative-Prompt Inversion (ProxNPI), extending the concepts of NTI and NPI. We enhance NPI with a regularization term and reconstruction guidance, which reduces artifacts while capitalizing on its training-free nature. Our method provides an efficient and straightforward approach, effectively addressing real image editing tasks with minimal computational overhead.

READ FULL TEXT

page 1

page 3

page 4

page 5

research
05/26/2023

Negative-prompt Inversion: Fast Image Inversion for Editing with Text-guided Diffusion Models

In image editing employing diffusion models, it is crucial to preserve t...
research
09/10/2023

Effective Real Image Editing with Accelerated Iterative Diffusion Inversion

Despite all recent progress, it is still challenging to edit and manipul...
research
03/28/2023

StyleDiffusion: Prompt-Embedding Inversion for Text-Based Editing

A significant research effort is focused on exploiting the amazing capac...
research
03/08/2023

Video-P2P: Video Editing with Cross-attention Control

This paper presents Video-P2P, a novel framework for real-world video ed...
research
03/15/2022

Style Transformer for Image Inversion and Editing

Existing GAN inversion methods fail to provide latent codes for reliable...
research
04/15/2021

A Simple Baseline for StyleGAN Inversion

This paper studies the problem of StyleGAN inversion, which plays an ess...

Please sign up or login with your details

Forgot password? Click here to reset