Cloth Interactive Transformer for Virtual Try-On

04/12/2021
by   Bin Ren, et al.
0

2D image-based virtual try-on has attracted increased attention from the multimedia and computer vision communities. However, most of the existing image-based virtual try-on methods directly put both person and the in-shop clothing representations together, without considering the mutual correlation between them. What is more, the long-range information, which is crucial for generating globally consistent results, is also hard to be established via the regular convolution operation. To alleviate these two problems, in this paper we propose a novel two-stage Cloth Interactive Transformer (CIT) for virtual try-on. In the first stage, we design a CIT matching block, aiming to perform a learnable thin-plate spline transformation that can capture more reasonable long-range relation. As a result, the warped in-shop clothing looks more natural. In the second stage, we propose a novel CIT reasoning block for establishing the global mutual interactive dependence. Based on this mutual dependence, the significant region within the input data can be highlighted, and consequently, the try-on results can become more realistic. Extensive experiments on a public fashion dataset demonstrate that our CIT can achieve the new state-of-the-art virtual try-on performance both qualitatively and quantitatively. The source code and trained models are available at https://github.com/Amazingren/CIT.

READ FULL TEXT

page 1

page 4

page 7

page 8

page 9

research
07/20/2018

Toward Characteristic-Preserving Image-based Virtual Try-On Network

Image-based virtual try-on systems for fitting a new in-shop clothes int...
research
11/16/2021

Data Augmentation using Random Image Cropping for High-resolution Virtual Try-On (VITON-CROP)

Image-based virtual try-on provides the capacity to transfer a clothing ...
research
04/18/2023

PG-VTON: A Novel Image-Based Virtual Try-On Method via Progressive Inference Paradigm

Virtual try-on is a promising computer vision topic with a high commerci...
research
10/19/2021

Cascaded Cross MLP-Mixer GANs for Cross-View Image Translation

It is hard to generate an image at target view well for previous cross-v...
research
11/23/2022

GhostNetV2: Enhance Cheap Operation with Long-Range Attention

Light-weight convolutional neural networks (CNNs) are specially designed...
research
05/22/2023

LaDI-VTON: Latent Diffusion Textual-Inversion Enhanced Virtual Try-On

The rapidly evolving fields of e-commerce and metaverse continue to seek...
research
06/28/2022

High-Resolution Virtual Try-On with Misalignment and Occlusion-Handled Conditions

Image-based virtual try-on aims to synthesize an image of a person weari...

Please sign up or login with your details

Forgot password? Click here to reset