Learning Space-Time Semantic Correspondences

06/16/2023
by   Du Tran, et al.
0

We propose a new task of space-time semantic correspondence prediction in videos. Given a source video, a target video, and a set of space-time key-points in the source video, the task requires predicting a set of keypoints in the target video that are the semantic correspondences of the provided source keypoints. We believe that this task is important for fine-grain video understanding, potentially enabling applications such as activity coaching, sports analysis, robot imitation learning, and more. Our contributions in this paper are: (i) proposing a new task and providing annotations for space-time semantic correspondences on two existing benchmarks: Penn Action and Pouring; and (ii) presenting a comprehensive set of baselines and experiments to gain insights about the new problem. Our main finding is that the space-time semantic correspondence prediction problem is best approached jointly in space and time rather than in their decomposed sub-problems: time alignment and spatial correspondences.

READ FULL TEXT

page 1

page 3

page 11

research
03/15/2022

Learning Spatio-Temporal Downsampling for Effective Video Upscaling

Downsampling is one of the most basic image processing operations. Impro...
research
07/11/2016

Efficient Activity Detection in Untrimmed Video with Max-Subgraph Search

We propose an efficient approach for activity detection in video that un...
research
06/09/2021

Rethinking Space-Time Networks with Improved Memory Coverage for Efficient Video Object Segmentation

This paper presents a simple yet effective approach to modeling space-ti...
research
11/26/2018

Evolving Space-Time Neural Architectures for Videos

In this paper, we present a new method for evolving video CNN models to ...
research
08/20/2020

Causal Future Prediction in a Minkowski Space-Time

Estimating future events is a difficult task. Unlike humans, machine lea...
research
10/16/2012

Semantic Understanding of Professional Soccer Commentaries

This paper presents a novel approach to the problem of semantic parsing ...
research
04/06/2020

Deep Space-Time Video Upsampling Networks

Video super-resolution (VSR) and frame interpolation (FI) are traditiona...

Please sign up or login with your details

Forgot password? Click here to reset