MURAUER: Mapping Unlabeled Real Data for Label AUstERity

by Georg Poier, et al.

Data labeling for learning 3D hand pose estimation models is a huge effort. Readily available, accurately labeled synthetic data has the potential to reduce the effort. However, to successfully exploit synthetic data, current state-of-the-art methods still require a large amount of labeled real data. In this work, we remove this requirement by learning to map from the features of real data to the features of synthetic data, mainly using a large amount of synthetic and unlabeled real data. We exploit unlabeled data using two auxiliary objectives, which enforce that (i) the mapped representation is pose-specific and (ii) at the same time, the distributions of real and synthetic data are aligned. While pose specificity is enforced by a self-supervisory signal requiring that the representation is predictive for the appearance from different views, distributions are aligned by an adversarial term. In this way, we can significantly improve the results of the baseline system, which does not use unlabeled data, and outperform many recent approaches already with about 1% of the labeled real data. This presents a step towards faster deployment of learning-based hand pose estimation, making it accessible for a larger range of applications.
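The combination of objectives described in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation: the function names, the choice of squared error, L1 and log-loss terms, and the weights `w_view` and `w_adv` are all assumptions made for illustration, since the abstract does not specify the exact loss forms. The sketch only shows how a supervised pose term (available for labeled samples), a view-prediction term, and an adversarial alignment term might be summed, with the pose term skipped for unlabeled real data.

```python
import math


def mse(pred, target):
    """Mean squared error between two equal-length sequences of floats."""
    return sum((p - t) ** 2 for p, t in zip(pred, target)) / len(pred)


def l1(pred, target):
    """Mean absolute error between two equal-length sequences of floats."""
    return sum(abs(p - t) for p, t in zip(pred, target)) / len(pred)


def alignment_loss(disc_scores, eps=1e-12):
    """Adversarial term for the feature mapping (assumed log-loss form).

    disc_scores are a hypothetical discriminator's probabilities that mapped
    real features look synthetic; the loss is low when they are close to 1.
    """
    return -sum(math.log(max(s, eps)) for s in disc_scores) / len(disc_scores)


def total_loss(pose_pred, pose_label, view_pred, view_target, disc_scores,
               w_view=1.0, w_adv=0.1):
    """Weighted sum of the three terms; weights are illustrative assumptions."""
    loss = w_view * l1(view_pred, view_target)       # (i) pose specificity
    loss += w_adv * alignment_loss(disc_scores)      # (ii) distribution alignment
    if pose_label is not None:                       # only for labeled samples
        loss += mse(pose_pred, pose_label)
    return loss


# Unlabeled real sample: only the two auxiliary objectives contribute.
unlabeled = total_loss(None, None, [0.5, 0.5], [0.0, 1.0], [0.5])

# Labeled sample: the supervised pose term is added on top.
labeled = total_loss([1.0, 2.0], [1.0, 4.0], [0.5, 0.5], [0.0, 1.0], [0.5])
```

In this sketch an unlabeled sample still produces a training signal through the view-prediction and alignment terms, which is the property the abstract relies on to reduce the need for labeled real data.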




