Generalized Multi-view Embedding for Visual Recognition and Cross-modal Retrieval

05/31/2016
by   Guanqun Cao, et al.
0

In this paper, the problem of multi-view embedding from different visual cues and modalities is considered. We propose a unified solution for subspace learning methods using the Rayleigh quotient, which is extensible for multiple views, supervised learning, and non-linear embeddings. Numerous methods including Canonical Correlation Analysis, Partial Least Sqaure regression and Linear Discriminant Analysis are studied using specific intrinsic and penalty graphs within the same framework. Non-linear extensions based on kernels and (deep) neural networks are derived, achieving better performance than the linear ones. Moreover, a novel Multi-view Modular Discriminant Analysis (MvMDA) is proposed by taking the view difference into consideration. We demonstrate the effectiveness of the proposed multi-view embedding methods on visual object recognition and cross-modal image retrieval, and obtain superior results in both applications compared to related methods.

READ FULL TEXT

page 2

page 12

research
04/02/2020

Randomized Kernel Multi-view Discriminant Analysis

In many artificial intelligence and computer vision systems, the same ob...
research
02/13/2018

A probabilistic framework for multi-view feature learning with many-to-many associations via neural networks

A simple framework Probabilistic Multi-view Graph Embedding (PMvGE) is p...
research
07/09/2020

Multi-view Orthonormalized Partial Least Squares: Regularizations and Deep Extensions

We establish a family of subspace-based learning method for multi-view l...
research
10/14/2019

A unified framework of predicting binary interestingness of images based on discriminant correlation analysis and multiple kernel learning

In the modern content-based image retrieval systems, there is an increas...
research
08/23/2015

MultiView Diffusion Maps

In this study we consider learning a reduced dimensionality representati...
research
07/30/2017

Deep Multi-View Learning with Stochastic Decorrelation Loss

Multi-view learning aims to learn an embedding space where multiple view...
research
07/17/2019

Deep Multi-View Learning via Task-Optimal CCA

Canonical Correlation Analysis (CCA) is widely used for multimodal data ...

Please sign up or login with your details

Forgot password? Click here to reset