Language Guided Fashion Image Manipulation with Feature-wise Transformations

08/12/2018
by   Mehmet Günel, et al.
0

Developing techniques for editing an outfit image through natural sentences and accordingly generating new outfits has promising applications for art, fashion and design. However, it is considered as a certainly challenging task since image manipulation should be carried out only on the relevant parts of the image while keeping the remaining sections untouched. Moreover, this manipulation process should generate an image that is as realistic as possible. In this work, we propose FiLMedGAN, which leverages feature-wise linear modulation (FiLM) to relate and transform visual features with natural language representations without using extra spatial information. Our experiments demonstrate that this approach, when combined with skip connections and total variation regularization, produces more plausible results than the baseline work, and has a better localization capability when generating new outfits consistent with the target description.

READ FULL TEXT
research
10/05/2022

LDEdit: Towards Generalized Text Guided Image Manipulation via Latent Diffusion Models

Research in vision-language models has seen rapid developments off-late,...
research
09/12/2016

Generative Visual Manipulation on the Natural Image Manifold

Realistic image manipulation is challenging because it requires modifyin...
research
07/27/2021

Remember What You have drawn: Semantic Image Manipulation with Memory

Image manipulation with natural language, which aims to manipulate image...
research
02/23/2018

Interactive Image Manipulation with Natural Language Instruction Commands

We propose an interactive image-manipulation system with natural languag...
research
08/02/2023

ImageBrush: Learning Visual In-Context Instructions for Exemplar-Based Image Manipulation

While language-guided image manipulation has made remarkable progress, t...
research
12/13/2022

Structure-Guided Image Completion with Image-level and Object-level Semantic Discriminators

Structure-guided image completion aims to inpaint a local region of an i...

Please sign up or login with your details

Forgot password? Click here to reset