Tracking People with 3D Representations

11/15/2021
by   Jathushan Rajasegaran, et al.
5

We present a novel approach for tracking multiple people in video. Unlike past approaches which employ 2D representations, we focus on using 3D representations of people, located in three-dimensional space. To this end, we develop a method, Human Mesh and Appearance Recovery (HMAR) which in addition to extracting the 3D geometry of the person as a SMPL mesh, also extracts appearance as a texture map on the triangles of the mesh. This serves as a 3D representation for appearance that is robust to viewpoint and pose changes. Given a video clip, we first detect bounding boxes corresponding to people, and for each one, we extract 3D appearance, pose, and location information using HMAR. These embedding vectors are then sent to a transformer, which performs spatio-temporal aggregation of the representations over the duration of the sequence. The similarity of the resulting representations is used to solve for associations that assigns each person to a tracklet. We evaluate our approach on the Posetrack, MuPoTs and AVA datasets. We find that 3D representations are more effective than 2D representations for tracking in these settings, and we obtain state-of-the-art performance. Code and results are available at: https://brjathu.github.io/T3DP.

READ FULL TEXT

page 3

page 5

page 9

research
12/08/2021

Tracking People by Predicting 3D Appearance, Location Pose

In this paper, we present an approach for tracking people in monocular v...
research
04/03/2023

On the Benefits of 3D Pose and Tracking for Human Action Recognition

In this work we study the benefits of using tracking and 3D poses for ac...
research
04/19/2018

Part-Aligned Bilinear Representations for Person Re-identification

We propose a novel network that learns a part-aligned representation for...
research
05/22/2018

Automatic Adaptation of Person Association for Multiview Tracking in Group Activities

Reliable markerless motion tracking of multiple people participating in ...
research
05/31/2023

Humans in 4D: Reconstructing and Tracking Humans with Transformers

We present an approach to reconstruct humans and track them over time. A...
research
11/09/2021

Video Text Tracking With a Spatio-Temporal Complementary Model

Text tracking is to track multiple texts in a video,and construct a traj...

Please sign up or login with your details

Forgot password? Click here to reset