Mix and match networks: multi-domain alignment for unpaired image-to-image translation

03/08/2019
by   Yaxing Wang, et al.
8

This paper addresses the problem of inferring unseen cross-domain and cross-modal image-to-image translations between multiple domains and modalities. We assume that only some of the pairwise translations have been seen (i.e. trained) and infer the remaining unseen translations (where training pairs are not available). We propose mix and match networks, an approach where multiple encoders and decoders are aligned in such a way that the desired translation can be obtained by simply cascading the source encoder and the target decoder, even when they have not interacted during the training stage (i.e. unseen). The main challenge lies in the alignment of the latent representations at the bottlenecks of encoder-decoder pairs. We propose an architecture with several tools to encourage alignment, including autoencoders and robust side information and latent consistency losses. We show the benefits of our approach in terms of effectiveness and scalability compared with other pairwise image-to-image translation approaches. We also propose zero-pair cross-modal image translation, a challenging setting where the objective is inferring semantic segmentation from depth (and vice-versa) without explicit segmentation-depth pairs, and only from two (disjoint) segmentation-RGB and depth-segmentation training sets. We observe that certain part of the shared information between unseen domains might not be reachable, so we further propose a variant that leverages pseudo-pairs to exploit all shared information.

READ FULL TEXT

page 3

page 7

page 8

page 12

page 13

page 15

page 16

research
04/06/2018

Mix and match networks: encoder-decoder alignment for zero-pair image translation

We address the problem of image translation between domains or modalitie...
research
11/11/2020

Zero-Pair Image to Image Translation using Domain Conditional Normalization

In this paper, we propose an approach based on domain conditional normal...
research
09/17/2019

Multi-mapping Image-to-Image Translation via Learning Disentanglement

Recent advances of image-to-image translation focus on learning the one-...
research
04/15/2019

Implicit Pairs for Boosting Unpaired Image-to-Image Translation

In image-to-image translation the goal is to learn a mapping from one im...
research
01/11/2019

Image Disentanglement and Uncooperative Re-Entanglement for High-Fidelity Image-to-Image Translation

Cross-domain image-to-image translation should satisfy two requirements:...
research
09/09/2021

Leveraging Local Domains for Image-to-Image Translation

Image-to-image (i2i) networks struggle to capture local changes because ...
research
09/23/2022

Image-to-Image Translation for Autonomous Driving from Coarsely-Aligned Image Pairs

A self-driving car must be able to reliably handle adverse weather condi...

Please sign up or login with your details

Forgot password? Click here to reset