Linking Image and Text with 2-Way Nets

08/29/2016
by   Aviv Eisenschtat, et al.
0

Linking two data sources is a basic building block in numerous computer vision problems. Canonical Correlation Analysis (CCA) achieves this by utilizing a linear optimizer in order to maximize the correlation between the two views. Recent work makes use of non-linear models, including deep learning techniques, that optimize the CCA loss in some feature space. In this paper, we introduce a novel, bi-directional neural network architecture for the task of matching vectors from two data sources. Our approach employs two tied neural network channels that project the two views into a common, maximally correlated space using the Euclidean loss. We show a direct link between the correlation-based loss and Euclidean loss, enabling the use of Euclidean loss for correlation maximization. To overcome common Euclidean regression optimization problems, we modify well-known techniques to our problem, including batch normalization and dropout. We show state of the art results on a number of computer vision matching tasks including MNIST image matching and sentence-image matching on the Flickr8k, Flickr30k and COCO datasets.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
06/26/2019

Task-Driven Common Representation Learning via Bridge Neural Network

This paper introduces a novel deep learning based method, named bridge n...
research
04/01/2018

Unsupervised Correlation Analysis

Linking between two data sources is a basic building block in numerous c...
research
06/01/2020

Bi-directional Exponential Angular Triplet Loss for RGB-Infrared Person Re-Identification

RGB-Infrared person re-identification (RGB-IR ReID) is a cross-modality ...
research
06/26/2018

EmbNum: Semantic labeling for numerical values with deep metric learning

Semantic labeling is a task of matching unknown data source to labeled d...
research
10/31/2017

Image Patch Matching Using Convolutional Descriptors with Euclidean Distance

In this work we propose a neural network based image descriptor suitable...
research
10/30/2020

Multiview Variational Graph Autoencoders for Canonical Correlation Analysis

We present a novel multiview canonical correlation analysis model based ...
research
05/20/2020

Adversarial Canonical Correlation Analysis

Canonical Correlation Analysis (CCA) is a statistical technique used to ...

Please sign up or login with your details

Forgot password? Click here to reset