UniTune: Text-Driven Image Editing by Fine Tuning an Image Generation Model on a Single Image

10/17/2022
by   Dani Valevski, et al.
3

We present UniTune, a simple and novel method for general text-driven image editing. UniTune gets as input an arbitrary image and a textual edit description, and carries out the edit while maintaining high semantic and visual fidelity to the input image. UniTune uses text, an intuitive interface for art-direction, and does not require additional inputs, like masks or sketches. At the core of our method is the observation that with the right choice of parameters, we can fine-tune a large text-to-image diffusion model on a single image, encouraging the model to maintain fidelity to the input image while still allowing expressive manipulations. We used Imagen as our text-to-image model, but we expect UniTune to work with other large-scale models as well. We test our method in a range of different use cases, and demonstrate its wide applicability.

READ FULL TEXT

page 1

page 2

page 8

page 9

page 10

page 12

page 16

page 18

research
10/17/2022

Imagic: Text-Based Real Image Editing with Diffusion Models

Text-conditioned image editing has recently attracted considerable inter...
research
05/08/2023

Prompt Tuning Inversion for Text-Driven Image Editing Using Diffusion Models

Recently large-scale language-image models (e.g., text-guided diffusion ...
research
05/10/2023

iEdit: Localised Text-guided Image Editing with Weak Supervision

Diffusion models (DMs) can generate realistic images with text guidance ...
research
05/27/2023

Text-to-image Editing by Image Information Removal

Diffusion models have demonstrated impressive performance in text-guided...
research
04/05/2022

Text2LIVE: Text-Driven Layered Image and Video Editing

We present a method for zero-shot, text-driven appearance manipulation i...
research
05/30/2023

LayerDiffusion: Layered Controlled Image Editing with Diffusion Models

Text-guided image editing has recently experienced rapid development. Ho...
research
08/11/2020

Text as Neural Operator: Image Manipulation by Text Instruction

In this paper, we study a new task that allows users to edit an input im...

Please sign up or login with your details

Forgot password? Click here to reset