DORi: Discovering Object Relationship for Moment Localization of a Natural-Language Query in Video

10/13/2020
by   Cristian Rodriguez Opazo, et al.
0

This paper studies the task of temporal moment localization in a long untrimmed video using natural language query. Given a query sentence, the goal is to determine the start and end of the relevant segment within the video. Our key innovation is to learn a video feature embedding through a language-conditioned message-passing algorithm suitable for temporal moment localization which captures the relationships between humans, objects and activities in the video. These relationships are obtained by a spatial sub-graph that contextualizes the scene representation using detected objects and human features conditioned in the language query. Moreover, a temporal sub-graph captures the activities within the video through time. Our method is evaluated on three standard benchmark datasets, and we also introduce YouCookII as a new benchmark for this task. Experiments show our method outperforms state-of-the-art methods on these datasets, confirming the effectiveness of our approach.

READ FULL TEXT

page 8

page 12

page 14

page 15

page 16

page 17

page 18

page 19

research
08/20/2019

Proposal-free Temporal Moment Localization of a Natural-Language Query in Video using Guided Attention

This paper studies the problem of temporal moment localization in a long...
research
06/28/2023

SpotEM: Efficient Video Search for Episodic Memory

The goal in episodic memory (EM) is to search a long egocentric video to...
research
08/11/2019

Exploiting Temporal Relationships in Video Moment Localization with Natural Language

We address the problem of video moment localization with natural languag...
research
09/01/2020

Uncovering Hidden Challenges in Query-Based Video Moment Retrieval

The query-based moment retrieval is a problem of localising a specific c...
research
08/19/2020

Generating Adjacency Matrix for Video-Query based Video Moment Retrieval

In this paper, we continue our work on Video-Query based Video Moment re...
research
07/20/2020

Graph Neural Network for Video-Query based Video Moment Retrieval

In this paper, we focus on Video Query based Video Moment Retrieval (VQ-...
research
04/01/2021

A Survey on Natural Language Video Localization

Natural language video localization (NLVL), which aims to locate a targe...

Please sign up or login with your details

Forgot password? Click here to reset