A-NeRF: Surface-free Human 3D Pose Refinement via Neural Rendering

02/11/2021
by   Shih-Yang Su, et al.
9

While deep learning has reshaped the classical motion capture pipeline, generative, analysis-by-synthesis elements are still in use to recover fine details if a high-quality 3D model of the user is available. Unfortunately, obtaining such a model for every user a priori is challenging, time-consuming, and limits the application scenarios. We propose a novel test-time optimization approach for monocular motion capture that learns a volumetric body model of the user in a self-supervised manner. To this end, our approach combines the advantages of neural radiance fields with an articulated skeleton representation. Our proposed skeleton embedding serves as a common reference that links constraints across time, thereby reducing the number of required camera views from traditionally dozens of calibrated cameras, down to a single uncalibrated one. As a starting point, we employ the output of an off-the-shelf model that predicts the 3D skeleton pose. The volumetric body shape and appearance is then learned from scratch, while jointly refining the initial pose estimate. Our approach is self-supervised and does not require any additional ground truth labels for appearance, pose, or 3D shape. We demonstrate that our novel combination of a discriminative pose estimation technique with surface-free analysis-by-synthesis outperforms purely discriminative monocular pose estimation approaches and generalizes well to multiple views.

READ FULL TEXT

page 1

page 5

page 8

page 11

page 12

page 13

research
08/17/2021

Self-Supervised 3D Human Pose Estimation with Multiple-View Geometry

We present a self-supervised learning algorithm for 3D human pose estima...
research
06/17/2022

TAVA: Template-free Animatable Volumetric Actors

Coordinate-based volumetric representations have the potential to genera...
research
12/04/2017

Self-supervised Learning of Motion Capture

Current state-of-the-art solutions for motion capture from a single came...
research
07/28/2016

General Automatic Human Shape and Motion Capture Using Volumetric Contour Cues

Markerless motion capture algorithms require a 3D body with properly per...
research
08/23/2023

Pose Modulated Avatars from Video

It is now possible to reconstruct dynamic human motion and shape from a ...
research
09/09/2023

Mirror-Aware Neural Humans

Human motion capture either requires multi-camera systems or is unreliab...
research
03/20/2023

Open-World Pose Transfer via Sequential Test-Time Adaption

Pose transfer aims to transfer a given person into a specified posture, ...

Please sign up or login with your details

Forgot password? Click here to reset