BKinD-3D: Self-Supervised 3D Keypoint Discovery from Multi-View Videos

12/14/2022
by   Jennifer J. Sun, et al.

Quantifying motion in 3D is important for studying the behavior of humans and other animals, but manual pose annotations are expensive and time-consuming to obtain. Self-supervised keypoint discovery is a promising strategy for estimating 3D poses without annotations. However, current keypoint discovery approaches commonly process single 2D views and do not operate in 3D space. We propose a new method to perform self-supervised keypoint discovery in 3D from multi-view videos of behaving agents, without any keypoint or bounding box supervision in 2D or 3D. Our method uses an encoder-decoder architecture with a 3D volumetric heatmap. It is trained to reconstruct spatiotemporal differences across multiple views, with additional joint length constraints imposed on a learned 3D skeleton of the subject. In this way, we discover keypoints without requiring manual supervision in videos of humans and rats, demonstrating the potential of 3D keypoint discovery for studying behavior.
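
To make the idea of a joint length constraint concrete, here is a minimal sketch, not the authors' implementation. It assumes the constraint can be read as "each bone of the skeleton should keep roughly the same length across frames" and scores that with a mean squared deviation. The function names, the edge list, and the toy keypoints are all hypothetical and do not come from the paper.

```python
import math

def bone_lengths(keypoints, edges):
    """Euclidean length of each edge (i, j) for one frame of 3D keypoints."""
    return [math.dist(keypoints[i], keypoints[j]) for i, j in edges]

def length_consistency_penalty(frames, edges):
    """Average, over bones, of the variance of each bone's length across frames.

    Assumed form of the constraint (illustrative only): a skeleton whose
    bones keep a fixed length over time scores 0.
    """
    per_frame = [bone_lengths(kp, edges) for kp in frames]
    n_frames = len(per_frame)
    penalty = 0.0
    for e in range(len(edges)):
        lengths = [per_frame[t][e] for t in range(n_frames)]
        mean_len = sum(lengths) / n_frames
        penalty += sum((l - mean_len) ** 2 for l in lengths) / n_frames
    return penalty / len(edges)

# Hypothetical 3-keypoint skeleton with two bones: 0-1 and 1-2.
edges = [(0, 1), (1, 2)]

# Second frame is a pure translation of the first: bone lengths are unchanged.
rigid = [
    [(0.0, 0.0, 0.0), (1.0, 0.0, 0.0), (1.0, 1.0, 0.0)],
    [(2.0, 0.0, 0.0), (3.0, 0.0, 0.0), (3.0, 1.0, 0.0)],
]

# Second frame stretches bone 0-1 from length 1 to length 3.
stretched = [
    [(0.0, 0.0, 0.0), (1.0, 0.0, 0.0), (1.0, 1.0, 0.0)],
    [(0.0, 0.0, 0.0), (3.0, 0.0, 0.0), (3.0, 1.0, 0.0)],
]

print(length_consistency_penalty(rigid, edges))      # 0.0
print(length_consistency_penalty(stretched, edges))  # 0.5
```

In a training setup a term of this kind would typically be added to the reconstruction loss, so that discovered keypoints which move rigidly with a body part are preferred over ones that drift along it. How the paper actually formulates and weights its constraint is described in the full text.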

research
12/09/2021

Self-Supervised Keypoint Discovery in Behavioral Videos

We propose a method for learning the posture and structure of agents fro...
research
12/07/2019

Self-Supervised 3D Keypoint Learning for Ego-motion Estimation

Generating reliable illumination and viewpoint invariant keypoints is cr...
research
08/13/2020

3D Bird Reconstruction: a Dataset, Model, and Shape Recovery from a Single View

Automated capture of animal pose is transforming how we study neuroscien...
research
05/21/2022

AutoLink: Self-supervised Learning of Human Skeletons and Object Outlines by Linking Keypoints

Structured representations such as keypoints are widely used in pose tra...
research
09/13/2021

Vision-based system identification and 3D keypoint discovery using dynamics constraints

This paper introduces V-SysId, a novel method that enables simultaneous ...
research
05/15/2023

AutoRecon: Automated 3D Object Discovery and Reconstruction

A fully automated object reconstruction pipeline is crucial for digital ...
research
05/31/2018

MONET: Multiview Semi-supervised Keypoint via Epipolar Divergence

This paper presents MONET---an end-to-end semi-supervised learning frame...
