Domain Adaptation in Multi-View Embedding for Cross-Modal Video Retrieval

10/25/2021
by Jonathan Munro, et al.

Given a gallery of uncaptioned video sequences, this paper considers the task of retrieving videos based on their relevance to an unseen text query. To compensate for the lack of annotations, we rely instead on a related video gallery composed of video-caption pairs, termed the source gallery, albeit with a domain gap between its videos and those in the target gallery. We thus introduce the problem of Unsupervised Domain Adaptation for Cross-modal Video Retrieval, along with a new benchmark on fine-grained actions. We propose a novel iterative domain alignment method by means of pseudo-labelling target videos and cross-domain (i.e. source-target) ranking. Our approach adapts the embedding space to the target gallery, consistently outperforming source-only as well as marginal and conditional alignment methods.
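
The two ingredients named in the abstract, pseudo-labelling target videos and cross-domain ranking, can be pictured with a small NumPy sketch. This is an illustration under stated assumptions, not the authors' implementation: the function names, the `keep_fraction` confidence filter, the cosine similarity, and the hinge `margin` are all hypothetical choices made for the example, and the embeddings are assumed to come from some already trained video and text encoders.

```python
import numpy as np


def l2_normalize(x):
    """Scale each row to unit length so dot products are cosine similarities."""
    return x / np.linalg.norm(x, axis=1, keepdims=True)


def pseudo_label(target_video_emb, source_text_emb, keep_fraction=0.5):
    """Assign each target video its most similar source caption.

    Only the most confident fraction of assignments is kept
    (keep_fraction is an assumed hyperparameter, not from the paper).
    Returns the indices of the kept target videos and their pseudo-labels.
    """
    sims = l2_normalize(target_video_emb) @ l2_normalize(source_text_emb).T
    labels = sims.argmax(axis=1)       # best-matching source caption per video
    confidence = sims.max(axis=1)      # similarity to that caption
    n_keep = max(1, int(round(keep_fraction * len(labels))))
    kept = np.argsort(-confidence)[:n_keep]
    return kept, labels[kept]


def cross_domain_ranking_loss(target_video_emb, source_text_emb,
                              kept, labels, margin=0.2):
    """Hinge ranking loss between target videos and source captions.

    Each kept target video should be more similar to its pseudo-labelled
    source caption than to every other source caption, by at least `margin`.
    """
    sims = l2_normalize(target_video_emb[kept]) @ l2_normalize(source_text_emb).T
    rows = np.arange(len(kept))
    pos = sims[rows, labels][:, None]            # similarity to the pseudo-label
    hinge = np.maximum(0.0, margin + sims - pos)
    hinge[rows, labels] = 0.0                    # the positive pair is not a negative
    return hinge.sum(axis=1).mean()
```

In an iterative scheme of the kind the abstract describes, one would presumably alternate between re-computing pseudo-labels with the current embedding and updating the embedding to reduce a ranking loss such as this one; the exact schedule and loss used in the paper are not given in this summary.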

