Cognitively Inspired Cross-Modal Data Generation Using Diffusion Models

05/28/2023
by   Zizhao Hu, et al.
0

Most existing cross-modal generative methods based on diffusion models use guidance to provide control over the latent space to enable conditional generation across different modalities. Such methods focus on providing guidance through separately-trained models, each for one modality. As a result, these methods suffer from cross-modal information loss and are limited to unidirectional conditional generation. Inspired by how humans synchronously acquire multi-modal information and learn the correlation between modalities, we explore a multi-modal diffusion model training and sampling scheme that uses channel-wise image conditioning to learn cross-modality correlation during the training phase to better mimic the learning process in the brain. Our empirical results demonstrate that our approach can achieve data generation conditioned on all correlated modalities.

READ FULL TEXT

page 5

page 7

page 8

page 9

page 12

page 13

page 14

page 15

research
07/07/2022

A Novel Unified Conditional Score-based Generative Framework for Multi-modal Medical Image Completion

Multi-modal medical image completion has been extensively applied to all...
research
11/03/2020

Robust Latent Representations via Cross-Modal Translation and Alignment

Multi-modal learning relates information across observation modalities o...
research
06/07/2023

Multi-modal Latent Diffusion

Multi-modal data-sets are ubiquitous in modern applications, and multi-m...
research
06/15/2022

Discrete Contrastive Diffusion for Cross-Modal and Conditional Generation

Diffusion probabilistic models (DPMs) have become a popular approach to ...
research
03/12/2023

One Transformer Fits All Distributions in Multi-Modal Diffusion at Scale

This paper proposes a unified diffusion framework (dubbed UniDiffuser) t...
research
05/19/2023

MaGIC: Multi-modality Guided Image Completion

The vanilla image completion approaches are sensitive to the large missi...
research
11/06/2019

A coupled autoencoder approach for multi-modal analysis of cell types

Recent developments in high throughput profiling of individual neurons h...

Please sign up or login with your details

Forgot password? Click here to reset