Temporal Cross-Media Retrieval with Soft-Smoothing

10/10/2018
by   David Semedo, et al.
0

Multimedia information have strong temporal correlations that shape the way modalities co-occur over time. In this paper we study the dynamic nature of multimedia and social-media information, where the temporal dimension emerges as a strong source of evidence for learning the temporal correlations across visual and textual modalities. So far, cross-media retrieval models, explored the correlations between different modalities (e.g. text and image) to learn a common subspace, in which semantically similar instances lie in the same neighbourhood. Building on such knowledge, we propose a novel temporal cross-media neural architecture, that departs from standard cross-media methods, by explicitly accounting for the temporal dimension through temporal subspace learning. The model is softly-constrained with temporal and inter-modality constraints that guide the new subspace learning task by favouring temporal correlations between semantically similar and temporally close instances. Experiments on three distinct datasets show that accounting for time turns out to be important for cross-media retrieval. Namely, the proposed method outperforms a set of baselines on the task of temporal cross-media retrieval, demonstrating its effectiveness for performing temporal subspace learning.

READ FULL TEXT
research
03/16/2022

Scientific and Technological Information Oriented Semantics-adversarial and Media-adversarial Cross-media Retrieval

Cross-media retrieval of scientific and technological information is one...
research
04/17/2019

Adversarial Cross-Modal Retrieval via Learning and Transferring Single-Modal Similarities

Cross-modal retrieval aims to retrieve relevant data across different mo...
research
04/14/2017

Cross-media Similarity Metric Learning with Unified Deep Networks

As a highlighting research topic in the multimedia area, cross-media ret...
research
08/25/2022

Multimedia Generative Script Learning for Task Planning

Goal-oriented generative script learning aims to generate subsequent ste...
research
03/10/2018

Deep Cross-media Knowledge Transfer

Cross-media retrieval is a research hotspot in multimedia area, which ai...
research
09/30/2019

Cross-Modal Subspace Learning with Scheduled Adaptive Margin Constraints

Cross-modal embeddings, between textual and visual modalities, aim to or...
research
06/26/2022

Semantic Role Aware Correlation Transformer for Text to Video Retrieval

With the emergence of social media, voluminous video clips are uploaded ...

Please sign up or login with your details

Forgot password? Click here to reset