SpaceEdit: Learning a Unified Editing Space for Open-Domain Image Editing

by Jing Shi, et al.

Recently, large pretrained models (e.g., BERT, StyleGAN, CLIP) have shown strong knowledge transfer and generalization on various downstream tasks within their domains. Inspired by these efforts, we propose a unified model for open-domain image editing that focuses on color and tone adjustment while preserving the original content and structure of the image. Our model learns a unified editing space that is more semantic, more intuitive, and easier to manipulate than the operation space (e.g., contrast, brightness, color curve) used in many existing photo editing tools. The model follows the image-to-image translation framework: it consists of an image encoder and decoder, and it is trained on pairs of before- and after-images to produce multimodal outputs. We show that by inverting image pairs into latent codes of the learned editing space, the model can be leveraged for various downstream editing tasks such as language-guided image editing, personalized editing, editing-style clustering, and retrieval. We extensively study the properties of the editing space in experiments and demonstrate superior performance on these tasks.
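
The idea of inverting a before/after pair into a reusable code can be illustrated with a toy sketch. This is not the SpaceEdit model: in place of a learned encoder and decoder, it assumes a hypothetical six-number "edit code" (a gain and a bias per color channel) fitted by ordinary least squares, which is much closer to the classic operation space than to a learned editing space. The function names `fit_edit_code` and `apply_edit_code` are illustrative only.

```python
from statistics import fmean


def fit_edit_code(before, after):
    """Fit a per-channel gain and bias that maps `before` pixels to `after` pixels.

    Images are flat lists of (r, g, b) tuples with values in [0, 1].
    Returns a toy "edit code": a list of three (gain, bias) pairs.
    Assumption: the edit is a global affine change per channel.
    """
    code = []
    for c in range(3):
        x = [p[c] for p in before]
        y = [p[c] for p in after]
        mx, my = fmean(x), fmean(y)
        var = sum((xi - mx) ** 2 for xi in x)
        cov = sum((xi - mx) * (yi - my) for xi, yi in zip(x, y))
        # A constant channel carries no slope information; fall back to gain 1.
        gain = cov / var if var > 0 else 1.0
        bias = my - gain * mx
        code.append((gain, bias))
    return code


def apply_edit_code(image, code):
    """Apply a fitted edit code to another image, clamping results to [0, 1]."""
    edited = []
    for pixel in image:
        new_pixel = []
        for c, (gain, bias) in enumerate(code):
            value = gain * pixel[c] + bias
            new_pixel.append(min(1.0, max(0.0, value)))
        edited.append(tuple(new_pixel))
    return edited


# Usage: recover the edit from one pair, then transfer it to a different image.
before = [(0.1, 0.2, 0.3), (0.4, 0.5, 0.6), (0.7, 0.8, 0.2)]
after = [(0.17, 0.18, 0.4), (0.53, 0.45, 0.7), (0.89, 0.72, 0.3)]
edit_code = fit_edit_code(before, after)
transferred = apply_edit_code([(0.5, 0.5, 0.5)], edit_code)
```

The point of the sketch is only the workflow: a pair of images is reduced to a compact code, and that code can then be applied to, compared with, or clustered alongside other edits. A learned editing space would replace the hand-picked gain and bias parameters with latent codes produced by a trained network.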



SemanticStyleGAN: Learning Compositional Generative Priors for Controllable Image Synthesis and Editing

Recent studies have shown that StyleGANs provide promising prior models ...

StyleGAN of All Trades: Image Manipulation with Only Pretrained StyleGAN

Recently, StyleGAN has enabled various image manipulation and editing ta...

LSC-GAN: Latent Style Code Modeling for Continuous Image-to-image Translation

Image-to-image (I2I) translation is usually carried out among discrete d...

Learning by Planning: Language-Guided Global Image Editing

Recently, language-guided global image editing draws increasing attentio...

Topologically-Guided Color Image Enhancement

Enhancement is an important step in post-processing digital images for p...

Nested Scale Editing for Conditional Image Synthesis

We propose an image synthesis approach that provides stratified navigati...

Automatically eliminating seam lines with Poisson editing in complex relative radiometric normalization mosaicking scenarios

Relative radiometric normalization (RRN) mosaicking among multiple remot...