Cross-modal Variational Auto-encoder with Distributed Latent Spaces and Associators

05/30/2019
by   Dae Ung Jo, et al.
0

In this paper, we propose a novel structure for a cross-modal data association, which is inspired by the recent research on the associative learning structure of the brain. We formulate the cross-modal association in Bayesian inference framework realized by a deep neural network with multiple variational auto-encoders and variational associators. The variational associators transfer the latent spaces between auto-encoders that represent different modalities. The proposed structure successfully associates even heterogeneous modal data and easily incorporates the additional modality to the entire network via the proposed cross-modal associator. Furthermore, the proposed structure can be trained with only a small amount of paired data since auto-encoders can be trained by unsupervised manner. Through experiments, the effectiveness of the proposed structure is validated on various datasets including visual and auditory data.

READ FULL TEXT

page 7

page 8

research
12/05/2021

Variational Autoencoder with CCA for Audio-Visual Cross-Modal Retrieval

Cross-modal retrieval is to utilize one modality as a query to retrieve ...
research
07/07/2019

A methodology for multisensory product experience design using cross-modal effect: A case of SLR camera

Throughout the course of product experience, a user employs multiple sen...
research
09/16/2019

Learning Controls Using Cross-Modal Representations: Bridging Simulation and Reality for Drone Racing

Machines are a long way from robustly solving open-world perception-cont...
research
10/08/2020

A Variational Auto-Encoder Approach for Image Transmission in Wireless Channel

Recent advancements in information technology and the widespread use of ...
research
06/14/2019

Modality Conversion of Handwritten Patterns by Cross Variational Autoencoders

This research attempts to construct a network that can convert online an...
research
05/14/2019

Correlated Variational Auto-Encoders

Variational Auto-Encoders (VAEs) are capable of learning latent represen...
research
12/01/2020

Learning Disentangled Latent Factors from Paired Data in Cross-Modal Retrieval: An Implicit Identifiable VAE Approach

We deal with the problem of learning the underlying disentangled latent ...

Please sign up or login with your details

Forgot password? Click here to reset