Chasing the Tail in Monocular 3D Human Reconstruction with Prototype Memory

12/29/2020
by   Yu Rong, et al.
2

Deep neural networks have achieved great progress in single-image 3D human reconstruction. However, existing methods still fall short in predicting rare poses. The reason is that most of the current models perform regression based on a single human prototype, which is similar to common poses while far from the rare poses. In this work, we 1) identify and analyze this learning obstacle and 2) propose a prototype memory-augmented network, PM-Net, that effectively improves performances of predicting rare poses. The core of our framework is a memory module that learns and stores a set of 3D human prototypes capturing local distributions for either common poses or rare poses. With this formulation, the regression starts from a better initialization, which is relatively easier to converge. Extensive experiments on several widely employed datasets demonstrate the proposed framework's effectiveness compared to other state-of-the-art methods. Notably, our approach significantly improves the models' performances on rare poses while generating comparable results on other samples.

READ FULL TEXT

page 1

page 3

page 5

page 7

page 8

page 9

page 11

research
07/18/2020

SRNet: Improving Generalization in 3D Human Pose Estimation with a Split-and-Recombine Approach

Human poses that are rare or unseen in a training set are challenging fo...
research
08/16/2022

PoseTrans: A Simple Yet Effective Pose Transformation Augmentation for Human Pose Estimation

Human pose estimation aims to accurately estimate a wide variety of huma...
research
07/20/2022

3D Clothed Human Reconstruction in the Wild

Although much progress has been made in 3D clothed human reconstruction,...
research
08/14/2020

Rb-PaStaNet: A Few-Shot Human-Object Interaction Detection Based on Rules and Part States

Existing Human-Object Interaction (HOI) Detection approaches have achiev...
research
06/14/2020

Cascaded deep monocular 3D human pose estimation with evolutionary training data

End-to-end deep representation learning has achieved remarkable accuracy...
research
02/17/2023

Find Beauty in the Rare: Contrastive Composition Feature Clustering for Nontrivial Cropping Box Regression

Automatic image cropping algorithms aim to recompose images like human-b...

Please sign up or login with your details

Forgot password? Click here to reset