Leveraging Off-the-shelf Diffusion Model for Multi-attribute Fashion Image Manipulation

10/12/2022
by   Chaerin Kong, et al.
0

Fashion attribute editing is a task that aims to convert the semantic attributes of a given fashion image while preserving the irrelevant regions. Previous works typically employ conditional GANs where the generator explicitly learns the target attributes and directly execute the conversion. These approaches, however, are neither scalable nor generic as they operate only with few limited attributes and a separate generator is required for each dataset or attribute set. Inspired by the recent advancement of diffusion models, we explore the classifier-guided diffusion that leverages the off-the-shelf diffusion model pretrained on general visual semantics such as Imagenet. In order to achieve a generic editing pipeline, we pose this as multi-attribute image manipulation task, where the attribute ranges from item category, fabric, pattern to collar and neckline. We empirically show that conventional methods fail in our challenging setting, and study efficient adaptation scheme that involves recently introduced attention-pooling technique to obtain a multi-attribute classifier guidance. Based on this, we present a mask-free fashion attribute editing framework that leverages the classifier logits and the cross-attention map for manipulation. We empirically demonstrate that our framework achieves convincing sample quality and attribute alignments.

READ FULL TEXT

page 5

page 7

page 8

research
04/16/2019

Fashion-AttGAN: Attribute-Aware Fashion Editing with Multi-Objective GAN

In this paper, we introduce attribute-aware fashion-editing, a novel tas...
research
07/12/2023

DiffuseGAE: Controllable and High-fidelity Image Manipulation from Disentangled Representation

Diffusion probabilistic models (DPMs) have shown remarkable results on v...
research
07/11/2019

Semi-supervised Feature-Level Attribute Manipulation for Fashion Image Retrieval

With a growing demand for the search by image, many works have studied t...
research
10/02/2022

ManiCLIP: Multi-Attribute Face Manipulation from Text

In this paper we present a novel multi-attribute face manipulation metho...
research
11/03/2022

Efficient Spatially Sparse Inference for Conditional GANs and Diffusion Models

During image editing, existing deep generative models tend to re-synthes...
research
09/09/2020

MU-GAN: Facial Attribute Editing based on Multi-attention Mechanism

Facial attribute editing has mainly two objectives: 1) translating image...
research
11/09/2018

Changing the Image Memorability: From Basic Photo Editing to GANs

Memorability is considered to be an important characteristic of visual c...

Please sign up or login with your details

Forgot password? Click here to reset