Semantic Estimation of 3D Body Shape and Pose using Minimal Cameras

08/08/2019
by   Andrew Gilbert, et al.
1

We present an approach to accurately estimate high fidelity markerless 3D pose and volumetric reconstruction of human performance using only a small set of camera views (∼ 2). Our method utilises a dual loss in a generative adversarial network that can yield improved performance in both reconstruction and pose estimate error. We use a deep prior implicitly learnt by the network trained over a dataset of view-ablated multi-view video footage of a wide range of subjects and actions. Uniquely we use a multi-channel symmetric 3D convolutional encoder-decoder with a dual loss to enforce the learning of a latent embedding that enforces skeletal joint positions and a deep volumetric reconstruction of the performer. An extensive evaluation is performed with state of the art performance reported on three datasets; Human 3.6M, TotalCapture and TotalCaptureOutdoor. The method opens the possibility of high-end volumetric and pose performance capture in on-set and prosumer scenarios where time or cost prohibit a high witness camera count.

READ FULL TEXT

page 11

page 14

research
07/05/2018

Volumetric performance capture from minimal camera viewpoints

We present a convolutional autoencoder that enables high fidelity volume...
research
07/04/2018

Deep Autoencoder for Combined Human Pose Estimation and body Model Upscaling

We present a method for simultaneously estimating 3D human pose and body...
research
02/26/2022

Accurate Human Body Reconstruction for Volumetric Video

In this work, we enhance a professional end-to-end volumetric video prod...
research
05/10/2022

KeypointNeRF: Generalizing Image-based Volumetric Avatars using Relative Spatial Encoding of Keypoints

Image-based volumetric avatars using pixel-aligned features promise gene...
research
04/05/2020

Lightweight Multi-View 3D Pose Estimation through Camera-Disentangled Representation

We present a lightweight solution to recover 3D pose from multi-view ima...
research
05/07/2022

Multi-View Video Coding with GAN Latent Learning

The introduction of multiple viewpoints inevitably increases the bitrate...
research
04/06/2020

Light3DPose: Real-time Multi-Person 3D PoseEstimation from Multiple Views

We present an approach to perform 3D pose estimation of multiple people ...

Please sign up or login with your details

Forgot password? Click here to reset