PointOdyssey: A Large-Scale Synthetic Dataset for Long-Term Point Tracking

07/27/2023
by Yang Zheng, et al.

We introduce PointOdyssey, a large-scale synthetic dataset and data generation framework for the training and evaluation of long-term fine-grained tracking algorithms. Our goal is to advance the state of the art by placing emphasis on long videos with naturalistic motion. Toward the goal of naturalism, we animate deformable characters using real-world motion capture data, build 3D scenes to match the motion capture environments, and render camera viewpoints using trajectories mined via structure-from-motion on real videos. We create combinatorial diversity by randomizing character appearance, motion profiles, materials, lighting, 3D assets, and atmospheric effects. Our dataset currently includes 104 videos, averaging 2,000 frames each, with orders of magnitude more correspondence annotations than prior work. We show that existing methods can be trained from scratch on our dataset and outperform the published variants. Finally, we introduce modifications to the PIPs point tracking method that greatly widen its temporal receptive field, which improves its performance on PointOdyssey as well as on two real-world benchmarks. Our data and code are publicly available at: https://pointodyssey.com
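
To make the scale and the "combinatorial diversity" idea concrete, here is a minimal sketch, not the authors' generation code. The video count and average length come from the abstract; the option pools in FACTORS, and the function names, are hypothetical placeholders chosen only to show how independently randomized factors multiply into many distinct scene configurations.

```python
import random
from math import prod

# Figures stated in the abstract.
NUM_VIDEOS = 104
AVG_FRAMES_PER_VIDEO = 2_000


def approx_total_frames(num_videos=NUM_VIDEOS, avg_frames=AVG_FRAMES_PER_VIDEO):
    """Rough total frame count implied by the abstract's averages."""
    return num_videos * avg_frames


# Hypothetical option pools (assumed for illustration only), one per
# randomized factor named in the abstract.
FACTORS = {
    "character_appearance": ["outfit_a", "outfit_b", "outfit_c"],
    "motion_profile": ["walk", "dance", "sit"],
    "material": ["wood", "metal"],
    "lighting": ["indoor", "sunset"],
    "asset_set": ["kitchen", "office"],
    "atmosphere": ["clear", "fog"],
}


def num_combinations(factors):
    """Number of distinct configurations when each factor is chosen independently."""
    return prod(len(options) for options in factors.values())


def sample_scene(factors, rng):
    """Draw one configuration by picking a single option per factor."""
    return {name: rng.choice(options) for name, options in factors.items()}


if __name__ == "__main__":
    print(approx_total_frames())        # 104 * 2000 = 208000
    print(num_combinations(FACTORS))    # 3 * 3 * 2 * 2 * 2 * 2 = 144
    print(sample_scene(FACTORS, random.Random(0)))
```

Even with these small made-up pools, six independent factors give 144 configurations, which is the sense in which randomizing several factors at once yields diversity that grows multiplicatively.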


