TD-GEM: Text-Driven Garment Editing Mapper

05/29/2023
by   Reza Dadfar, et al.
0

Language-based fashion image editing allows users to try out variations of desired garments through provided text prompts. Inspired by research on manipulating latent representations in StyleCLIP and HairCLIP, we focus on these latent spaces for editing fashion items of full-body human datasets. Currently, there is a gap in handling fashion image editing due to the complexity of garment shapes and textures and the diversity of human poses. In this paper, we propose an editing optimizer scheme method called Text-Driven Garment Editing Mapper (TD-GEM), aiming to edit fashion items in a disentangled way. To this end, we initially obtain a latent representation of an image through generative adversarial network inversions such as Encoder for Editing (e4e) or Pivotal Tuning Inversion (PTI) for more accurate results. An optimization-based Contrasive Language-Image Pre-training (CLIP) is then utilized to guide the latent representation of a fashion image in the direction of a target attribute expressed in terms of a text prompt. Our TD-GEM manipulates the image accurately according to the target attribute, while other parts of the image are kept untouched. In the experiments, we evaluate TD-GEM on two different attributes (i.e., "color" and "sleeve length"), which effectively generates realistic images compared to the recent manipulation schemes.

READ FULL TEXT

page 6

page 7

page 8

page 9

research
07/17/2023

CLIP-Guided StyleGAN Inversion for Text-Driven Real Image Editing

Researchers have recently begun exploring the use of StyleGAN-based mode...
research
07/06/2022

Towards Counterfactual Image Manipulation via CLIP

Leveraging StyleGAN's expressivity and its disentangled latent codes, ex...
research
05/26/2023

StyleHumanCLIP: Text-guided Garment Manipulation for StyleGAN-Human

This paper tackles text-guided control of StyleGAN for editing garments ...
research
06/03/2019

Fashion Editing with Multi-scale Attention Normalization

Interactive fashion image manipulation, which enables users to edit imag...
research
07/03/2019

Semi-supervised Image Attribute Editing using Generative Adversarial Networks

Image attribute editing is a challenging problem that has been recently ...
research
04/04/2023

Multimodal Garment Designer: Human-Centric Latent Diffusion Models for Fashion Image Editing

Fashion illustration is used by designers to communicate their vision an...
research
07/25/2023

Fashion Matrix: Editing Photos by Just Talking

The utilization of Large Language Models (LLMs) for the construction of ...

Please sign up or login with your details

Forgot password? Click here to reset