Cross-Modal Subspace Learning with Scheduled Adaptive Margin Constraints

09/30/2019
by David Semedo, et al.

Cross-modal embeddings between textual and visual modalities aim to organise multimodal instances by their semantic correlations. State-of-the-art approaches use maximum-margin methods, based on the hinge loss, to enforce a constant margin m that separates projections of multimodal instances from different categories. In this paper, we propose a novel scheduled adaptive maximum-margin (SAM) formulation that infers triplet-specific constraints during training, thereby organising instances by adaptively enforcing inter-category and inter-modality correlations. This is supported by a scheduled adaptive margin function that is smoothly activated, replacing the static margin with an adaptively inferred one that reflects triplet-specific semantic correlations, while accounting for the incremental learning behaviour of neural networks to encourage the formation and enforcement of category clusters. Experiments on widely used datasets show that our model improves upon state-of-the-art approaches, achieving a relative improvement of up to 12.5%, which supports the effectiveness of the scheduled adaptive margin formulation.
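To make the contrast between a constant margin and a scheduled adaptive one concrete, the following is a minimal Python sketch. It is not the authors' formulation: the abstract does not give the margin function, so the logistic schedule, the linear blend, and all names (`activation`, `scheduled_margin`, `sam_style_loss`, `midpoint`, `steepness`) are illustrative assumptions. The triplet-specific `adaptive_margin` is passed in as a plain number, since the abstract does not say how it is inferred.

```python
import math


def hinge_triplet_loss(d_pos, d_neg, margin):
    """Standard max-margin triplet loss: max(0, margin + d_pos - d_neg).

    d_pos: distance between the anchor and a same-category instance.
    d_neg: distance between the anchor and a different-category instance.
    """
    return max(0.0, margin + d_pos - d_neg)


def activation(step, midpoint, steepness=1.0):
    """Assumed smooth schedule in [0, 1]: a logistic curve over training steps.

    Written with tanh so that it cannot overflow for large |step - midpoint|.
    """
    return 0.5 * (1.0 + math.tanh(0.5 * steepness * (step - midpoint)))


def scheduled_margin(static_margin, adaptive_margin, step, midpoint, steepness=1.0):
    """Assumed blend: static margin early in training, adaptive margin later."""
    a = activation(step, midpoint, steepness)
    return (1.0 - a) * static_margin + a * adaptive_margin


def sam_style_loss(d_pos, d_neg, static_margin, adaptive_margin,
                   step, midpoint, steepness=1.0):
    """Hinge triplet loss evaluated with the scheduled margin (illustrative only)."""
    margin = scheduled_margin(static_margin, adaptive_margin,
                              step, midpoint, steepness)
    return hinge_triplet_loss(d_pos, d_neg, margin)


if __name__ == "__main__":
    # Same triplet, evaluated early, midway, and late in a hypothetical schedule.
    for step in (0, 1000, 2000):
        loss = sam_style_loss(d_pos=0.5, d_neg=0.6,
                              static_margin=0.5, adaptive_margin=0.1,
                              step=step, midpoint=1000, steepness=0.05)
        print(step, round(loss, 4))
```

Early in this assumed schedule the loss behaves like the constant-margin hinge loss. As the activation approaches 1, the margin applied to each triplet moves toward its triplet-specific value, which is one plausible reading of a margin that is "smoothly activated".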

