Learning Video Representations from Correspondence Proposals

05/20/2019
by   Xingyu Liu, et al.
0

Correspondences between frames encode rich information about dynamic content in videos. However, it is challenging to effectively capture and learn those due to their irregular structure and complex dynamics. In this paper, we propose a novel neural network that learns video representations by aggregating information from potential correspondences. This network, named CPNet, can learn evolving 2D fields with temporal consistency. In particular, it can effectively learn representations for videos by mixing appearance and long-range motion with an RGB-only input. We provide extensive ablation experiments to validate our model. CPNet shows stronger performance than existing methods on Kinetics and achieves the state-of-the-art performance on Something-Something and Jester. We provide analysis towards the behavior of our model and show its robustness to errors in proposals.

READ FULL TEXT

page 4

page 8

page 14

page 15

page 16

research
12/16/2021

HODOR: High-level Object Descriptors for Object Re-segmentation in Video Learned from Static Images

Existing state-of-the-art methods for Video Object Segmentation (VOS) le...
research
05/02/2019

Self-supervised Learning for Video Correspondence Flow

The objective of this paper is self-supervised learning of feature embed...
research
08/23/2021

Recurrent Video Deblurring with Blur-Invariant Motion Estimation and Pixel Volumes

For the success of video deblurring, it is essential to utilize informat...
research
04/11/2022

Structure-Aware Motion Transfer with Deformable Anchor Model

Given a source image and a driving video depicting the same object type,...
research
07/16/2023

FourierHandFlow: Neural 4D Hand Representation Using Fourier Query Flow

Recent 4D shape representations model continuous temporal evolution of i...
research
04/10/2020

Stacked Convolutional Deep Encoding Network for Video-Text Retrieval

Existing dominant approaches for cross-modal video-text retrieval task a...
research
09/19/2022

NeuralMarker: A Framework for Learning General Marker Correspondence

We tackle the problem of estimating correspondences from a general marke...

Please sign up or login with your details

Forgot password? Click here to reset