Zero-shot-Learning Cross-Modality Data Translation Through Mutual Information Guided Stochastic Diffusion

01/31/2023
by   Zihao Wang, et al.
0

Cross-modality data translation has attracted great interest in image computing. Deep generative models (e.g., GANs) show performance improvement in tackling those problems. Nevertheless, as a fundamental challenge in image translation, the problem of Zero-shot-Learning Cross-Modality Data Translation with fidelity remains unanswered. This paper proposes a new unsupervised zero-shot-learning method named Mutual Information guided Diffusion cross-modality data translation Model (MIDiffusion), which learns to translate the unseen source data to the target domain. The MIDiffusion leverages a score-matching-based generative model, which learns the prior knowledge in the target domain. We propose a differentiable local-wise-MI-Layer (LMI) for conditioning the iterative denoising sampling. The LMI captures the identical cross-modality features in the statistical domain for the diffusion guidance; thus, our method does not require retraining when the source domain is changed, as it does not rely on any direct mapping between the source and target domains. This advantage is critical for applying cross-modality data translation methods in practice, as a reasonable amount of source domain dataset is not always available for supervised training. We empirically show the advanced performance of MIDiffusion in comparison with an influential group of generative models, including adversarial-based and other score-matching-based models.

READ FULL TEXT

page 2

page 7

page 13

research
04/05/2023

Zero-shot Medical Image Translation via Frequency-Guided Diffusion Models

Recently, the diffusion model has emerged as a superior generative model...
research
04/06/2023

Zero-shot Generative Model Adaptation via Image-specific Prompt Learning

Recently, CLIP-guided image synthesis has shown appealing performance on...
research
06/01/2019

ZstGAN: An Adversarial Approach for Unsupervised Zero-Shot Image-to-Image Translation

Image-to-image translation models have shown remarkable ability on trans...
research
08/14/2023

Jurassic World Remake: Bringing Ancient Fossils Back to Life via Zero-Shot Long Image-to-Image Translation

With a strong understanding of the target domain from natural language, ...
research
11/02/2021

Zero-Shot Translation using Diffusion Models

In this work, we show a novel method for neural machine translation (NMT...
research
09/15/2015

Zero-Shot Learning via Semantic Similarity Embedding

In this paper we consider a version of the zero-shot learning problem wh...
research
12/06/2021

Prototypical Model with Novel Information-theoretic Loss Function for Generalized Zero Shot Learning

Generalized zero shot learning (GZSL) is still a technical challenge of ...

Please sign up or login with your details

Forgot password? Click here to reset