Multimodal Representation Learning using Deep Multiset Canonical Correlation

04/03/2019
by Krishna Somandepalli, et al.

We propose Deep Multiset Canonical Correlation Analysis (dMCCA) as an extension of CCA-based representation learning to settings where the underlying signal is observed across multiple (more than two) modalities. We use a deep learning framework to learn non-linear transformations from the different modalities to a shared subspace such that the representations maximize the ratio of between-modality to within-modality covariance of the observations. Unlike linear discriminant analysis, our method does not need class information to learn these representations, and we show that the model can be trained on complex data using mini-batches. In synthetic data experiments, we show that dMCCA can effectively recover the common signal across modalities corrupted by multiplicative and additive noise. We also analyze how sensitive the recovery of the correlated components is to mini-batch size and embedding dimension. Performance evaluation on noisy handwritten datasets shows that our model outperforms other CCA-based approaches and is comparable to deep neural network models trained end-to-end on these datasets.
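
The objective described above (a ratio of between-modality to within-modality covariance) can be illustrated with a minimal linear sketch in NumPy. This is not the authors' implementation. It assumes every modality has already been mapped to the same d-dimensional space (in dMCCA this mapping would be the output of the learned networks), that all modalities share a single projection, and that the ratio is normalized by N - 1 so its maximum is 1. The function name `inter_set_correlation` and the regularizer `eps` are hypothetical choices made for illustration.

```python
import numpy as np

def inter_set_correlation(views, eps=1e-8):
    """Linear multiset CCA sketch (illustrative, not the paper's code).

    views: list of N arrays, each of shape (d, n): d features, n samples,
           with the same d and n for every modality (an assumption here).
    Returns (rho, v): correlations sorted in descending order and the
    matching projection vectors as columns of v.
    """
    n_views = len(views)
    # Remove each feature's mean within every modality.
    centered = [x - x.mean(axis=1, keepdims=True) for x in views]
    d = centered[0].shape[0]

    # Within-modality scatter: sum of each modality's own scatter matrix.
    r_w = sum(x @ x.T for x in centered)
    # Total scatter of the summed views equals within + between scatter,
    # so subtracting r_w leaves only the cross-modality terms.
    summed = sum(centered)
    r_b = summed @ summed.T - r_w

    # Small ridge term (assumed) keeps r_w invertible.
    r_w = r_w + eps * np.eye(d)

    # Solve r_b v = lambda r_w v by whitening with r_w^(-1/2).
    w_vals, w_vecs = np.linalg.eigh(r_w)
    whiten = w_vecs @ np.diag(w_vals ** -0.5) @ w_vecs.T
    m = whiten @ r_b @ whiten
    m = (m + m.T) / 2.0  # enforce symmetry against round-off
    vals, vecs = np.linalg.eigh(m)

    order = np.argsort(vals)[::-1]
    rho = vals[order] / (n_views - 1)  # normalize so the maximum is 1
    v = whiten @ vecs[:, order]
    return rho, v

if __name__ == "__main__":
    rng = np.random.default_rng(0)
    shared = rng.standard_normal((4, 500))  # common signal
    noisy = [shared + 0.1 * rng.standard_normal((4, 500)) for _ in range(3)]
    rho, _ = inter_set_correlation(noisy)
    print(rho)
```

In a deep variant, a quantity like the sum of the leading `rho` values computed on each mini-batch could serve as the training objective. That is one plausible reading of the abstract rather than a detail confirmed by it.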

