SCAM! Transferring humans between images with Semantic Cross Attention Modulation

10/10/2022
by   Nicolas Dufour, et al.
36

A large body of recent work targets semantically conditioned image generation. Most such methods focus on the narrower task of pose transfer and ignore the more challenging task of subject transfer that consists in not only transferring the pose but also the appearance and background. In this work, we introduce SCAM (Semantic Cross Attention Modulation), a system that encodes rich and diverse information in each semantic region of the image (including foreground and background), thus achieving precise generation with emphasis on fine details. This is enabled by the Semantic Attention Transformer Encoder that extracts multiple latent vectors for each semantic region, and the corresponding generator that exploits these multiple latents by using semantic cross attention modulation. It is trained only using a reconstruction setup, while subject transfer is performed at test time. Our analysis shows that our proposed architecture is successful at encoding the diversity of appearance in each semantic region. Extensive experiments on the iDesigner and CelebAMask-HD datasets show that SCAM outperforms SEAN and SPADE; moreover, it sets the new state of the art on subject transfer.

READ FULL TEXT

page 12

page 13

page 23

page 24

page 26

page 27

page 28

page 29

research
04/14/2021

Learning Semantic Person Image Generation by Region-Adaptive Normalization

Human pose transfer has received great attention due to its wide applica...
research
03/11/2021

HumanGAN: A Generative Model of Humans Images

Generative adversarial networks achieve great performance in photorealis...
research
11/25/2018

PCGAN: Partition-Controlled Human Image Generation

Human image generation is a very challenging task since it is affected b...
research
05/27/2022

Image Harmonization with Region-wise Contrastive Learning

Image harmonization task aims at harmonizing different composite foregro...
research
02/14/2023

DiffFashion: Reference-based Fashion Design with Structure-aware Transfer by Diffusion Models

Image-based fashion design with AI techniques has attracted increasing a...
research
10/03/2018

Assessing Performance of Aerobic Routines using Background Subtraction and Intersected Image Region

It is recommended for a novice to engage a trained and experience person...
research
03/20/2023

Open-World Pose Transfer via Sequential Test-Time Adaption

Pose transfer aims to transfer a given person into a specified posture, ...

Please sign up or login with your details

Forgot password? Click here to reset