Semantic Image Manipulation with Background-guided Internal Learning

03/24/2022
by   Zhongping Zhang, et al.
0

Image manipulation has attracted a lot of interest due to its wide range of applications. Prior work modifies images either from low-level manipulation, such as image inpainting or through manual edits via paintbrushes and scribbles, or from high-level manipulation, employing deep generative networks to output an image conditioned on high-level semantic input. In this study, we propose Semantic Image Manipulation with Background-guided Internal Learning (SIMBIL), which combines high-level and low-level manipulation. Specifically, users can edit an image at the semantic level by applying changes on a scene graph. Then our model manipulates the image at the pixel level according to the modified scene graph. There are two major advantages of our approach. First, high-level manipulation of scene graphs requires less manual effort from the user compared to manipulating raw image pixels. Second, our low-level internal learning approach is scalable to images of various sizes without reliance on external visual datasets for training. We outperform the state-of-the-art in a quantitative and qualitative evaluation on the CLEVR and Visual Genome datasets. Experiments show 8 points improvement on FID scores (CLEVR) and 27 improvement on user evaluation (Visual Genome), demonstrating the effectiveness of our approach.

READ FULL TEXT

page 6

page 7

page 10

page 11

page 21

page 22

page 23

page 24

research
04/07/2020

Semantic Image Manipulation Using Scene Graphs

Image manipulation can be considered a special case of image generation ...
research
03/28/2019

Attention-Guided Generative Adversarial Networks for Unsupervised Image-to-Image Translation

The state-of-the-art approaches in Generative Adversarial Networks (GANs...
research
08/04/2020

Open-Edit: Open-Domain Image Manipulation with Open-Vocabulary Instructions

We propose a novel algorithm, named Open-Edit, which is the first attemp...
research
03/31/2021

Smart Scribbles for Image Mating

Image matting is an ill-posed problem that usually requires additional u...
research
01/05/2018

VSE-ens: Visual-Semantic Embeddings with Efficient Negative Sampling

Jointing visual-semantic embeddings (VSE) have become a research hotpot ...
research
08/22/2018

Manipulating Attributes of Natural Scenes via Hallucination

In this study, we explore building a two-stage framework for enabling us...
research
12/21/2022

TruFor: Leveraging all-round clues for trustworthy image forgery detection and localization

In this paper we present TruFor, a forensic framework that can be applie...

Please sign up or login with your details

Forgot password? Click here to reset