ORTexME: Occlusion-Robust Human Shape and Pose via Temporal Average Texture and Mesh Encoding

09/21/2023
by Yu Cheng, et al.

In 3D human shape and pose estimation from monocular video, models trained with limited labeled data do not generalize well to videos with occlusion, which is common in in-the-wild videos. Recent human neural rendering approaches, which focus on novel view synthesis and are initialized by off-the-shelf human shape and pose methods, have the potential to correct the initial human shape. However, existing methods have several drawbacks: they handle occlusion erroneously, they are sensitive to inaccurate human segmentation, and their loss computation is ineffective because the opacity field is not regularized. To address these problems, we introduce ORTexME, an occlusion-robust temporal method that uses temporal information from the input video to better regularize occluded body parts. ORTexME is based on NeRF; to determine the reliable regions for NeRF ray sampling, we use our novel average texture learning approach to learn the average appearance of a person and to infer a mask from that average texture. In addition, to guide the opacity-field updates in NeRF and suppress blur and noise, we propose using a human body mesh. Quantitative evaluation demonstrates that our method achieves a significant improvement on the challenging multi-person 3DPW dataset, reducing the P-MPJPE error by 1.8. In contrast, the SOTA rendering-based methods fail on the same dataset and increase the error by up to 5.6.
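
For readers unfamiliar with the metric quoted above, P-MPJPE is commonly defined as the mean per-joint position error measured after rigidly aligning the predicted joints to the ground truth with a similarity transform (Procrustes alignment). The sketch below illustrates that standard definition with NumPy; it is not the authors' evaluation code, and the function name `p_mpjpe` and the `(J, 3)` array layout are assumptions made for illustration.

```python
import numpy as np


def p_mpjpe(pred, gt):
    """Procrustes-aligned mean per-joint position error.

    pred, gt: arrays of shape (J, 3) holding J predicted and
    ground-truth 3D joints (layout assumed for this sketch).
    """
    pred = np.asarray(pred, dtype=float)
    gt = np.asarray(gt, dtype=float)

    # Remove translation by centering both joint sets.
    mu_pred = pred.mean(axis=0)
    mu_gt = gt.mean(axis=0)
    x = pred - mu_pred
    y = gt - mu_gt

    # Optimal rotation from the SVD of the 3x3 cross-covariance matrix.
    u, s, vt = np.linalg.svd(x.T @ y)

    # Flip the last axis if needed so the result is a proper rotation
    # (determinant +1) rather than a reflection.
    d = 1.0 if np.linalg.det(vt.T @ u.T) >= 0 else -1.0
    correction = np.array([1.0, 1.0, d])
    rot = vt.T @ np.diag(correction) @ u.T

    # Optimal isotropic scale for the similarity transform.
    scale = (s * correction).sum() / (x ** 2).sum()

    # Apply scale, rotation, and translation, then average joint errors.
    aligned = scale * x @ rot.T + mu_gt
    return float(np.linalg.norm(aligned - gt, axis=1).mean())
```

Because the alignment removes global scale, rotation, and translation, a prediction that differs from the ground truth only by such a transform scores approximately zero; the metric therefore isolates errors in the articulated pose itself.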

research
03/15/2023

Mesh Strikes Back: Fast and Efficient Human Reconstruction from RGB videos

Human reconstruction and synthesis from monocular RGB videos is a challe...
research
10/13/2020

Multi-Scale Networks for 3D Human Pose Estimation with Inference Stage Optimization

Estimating 3D human poses from a monocular video is still a challenging ...
research
08/01/2021

LASOR: Learning Accurate 3D Human Pose and Shape Via Synthetic Occlusion-Aware Data and Neural Mesh Rendering

A key challenge in the task of human pose and shape estimation is occlus...
research
11/23/2022

Hand Avatar: Free-Pose Hand Animation and Rendering from Monocular Video

We present HandAvatar, a novel representation for hand animation and ren...
research
05/30/2023

Scene restoration from scaffold occlusion using deep learning-based methods

The occlusion issues of computer vision (CV) applications in constructio...
research
08/07/2019

Relighting Humans: Occlusion-Aware Inverse Rendering for Full-Body Human Images

Relighting of human images has various applications in image synthesis. ...
research
01/14/2020

Neural Human Video Rendering by Learning Dynamic Textures and Rendering-to-Video Translation

Synthesizing realistic videos of humans using neural networks has been a...
