Cross-Modal Common Representation Learning with Triplet Loss Functions

02/16/2022
by   Felix Ott, et al.
18

Common representation learning (CRL) learns a shared embedding between two or more modalities to improve in a given task over using only one of the modalities. CRL from different data types such as images and time-series data (e.g., audio or text data) requires a deep metric learning loss that minimizes the distance between the modality embeddings. In this paper, we propose to use the triplet loss, which uses positive and negative identities to create sample pairs with different labels, for CRL between image and time-series modalities. By adapting the triplet loss for CRL, higher accuracy in the main (time-series classification) task can be achieved by exploiting additional information of the auxiliary (image classification) task. Our experiments on synthetic data and handwriting recognition data from sensor-enhanced pens show an improved classification accuracy, faster convergence, and a better generalizability.

READ FULL TEXT

page 4

page 6

page 10

research
08/10/2019

Deep Triplet Neural Networks with Cluster-CCA for Audio-Visual Cross-modal Retrieval

Cross-modal retrieval aims to retrieve data in one modality by a query i...
research
11/18/2020

Vector Embeddings with Subvector Permutation Invariance using a Triplet Enhanced Autoencoder

The use of deep neural network (DNN) autoencoders (AEs) has recently exp...
research
11/07/2022

Complete Cross-triplet Loss in Label Space for Audio-visual Cross-modal Retrieval

The heterogeneity gap problem is the main challenge in cross-modal retri...
research
04/11/2017

Deep Multimodal Representation Learning from Temporal Data

In recent years, Deep Learning has been successfully applied to multimod...
research
12/20/2014

Deep metric learning using Triplet network

Deep learning has proven itself as a successful set of models for learni...
research
09/27/2021

Audio-to-Image Cross-Modal Generation

Cross-modal representation learning allows to integrate information from...
research
01/24/2022

Ordinal-Quadruplet: Retrieval of Missing Classes in Ordinal Time Series

In this paper, we propose an ordered time series classification framewor...

Please sign up or login with your details

Forgot password? Click here to reset