A Horse with no Labels: Self-Supervised Horse Pose Estimation from Unlabelled Images and Synthetic Prior

08/07/2023
by Jose Sosa, et al.

Obtaining labelled data to train deep learning methods for estimating animal pose is challenging. Recently, synthetic data has been widely used for pose estimation tasks, but most methods still rely on supervised learning paradigms that use synthetic images and labels. Can training be fully unsupervised? Is a tiny synthetic dataset sufficient? What are the minimum assumptions that we could make for estimating animal pose? Our proposal addresses these questions through a simple yet effective self-supervised method that only assumes the availability of unlabelled images and a small set of synthetic 2D poses. We completely remove the need for any 3D or 2D pose annotations (or complex 3D animal models), and, surprisingly, our approach can still learn accurate 3D and 2D poses simultaneously. We train our method with unlabelled images of horses, mainly collected from YouTube videos, and a prior consisting of synthetic 2D poses. The prior is three times smaller than the set of images needed for training. We test our method on a challenging set of horse images and evaluate the predicted 3D and 2D poses. We demonstrate that it is possible to learn accurate animal poses with assumptions as minimal as unlabelled images and a small set of 2D poses generated from synthetic data. Given these minimal requirements and the abundance of unlabelled data, our method could easily be deployed to other animals.

