Progressive Multi-view Human Mesh Recovery with Self-Supervision

12/10/2022
by   Xuan Gong, et al.
0

To date, little attention has been given to multi-view 3D human mesh estimation, despite real-life applicability (e.g., motion capture, sport analysis) and robustness to single-view ambiguities. Existing solutions typically suffer from poor generalization performance to new settings, largely due to the limited diversity of image-mesh pairs in multi-view training data. To address this shortcoming, people have explored the use of synthetic images. But besides the usual impact of visual gap between rendered and target data, synthetic-data-driven multi-view estimators also suffer from overfitting to the camera viewpoint distribution sampled during training which usually differs from real-world distributions. Tackling both challenges, we propose a novel simulation-based training pipeline for multi-view human mesh recovery, which (a) relies on intermediate 2D representations which are more robust to synthetic-to-real domain gap; (b) leverages learnable calibration and triangulation to adapt to more diversified camera setups; and (c) progressively aggregates multi-view information in a canonical 3D space to remove ambiguities in 2D representations. Through extensive benchmarking, we demonstrate the superiority of the proposed solution especially for unseen in-the-wild scenarios.

READ FULL TEXT

page 2

page 4

page 7

research
10/04/2022

Multi-view Human Body Mesh Translator

Existing methods for human mesh recovery mainly focus on single-view fra...
research
09/10/2022

Self-supervised Human Mesh Recovery with Cross-Representation Alignment

Fully supervised human mesh recovery methods are data-hungry and have po...
research
04/22/2022

Leveraging Deepfakes to Close the Domain Gap between Real and Synthetic Images in Facial Capture Pipelines

We propose an end-to-end pipeline for both building and tracking 3D faci...
research
01/01/2022

Self-attention Multi-view Representation Learning with Diversity-promoting Complementarity

Multi-view learning attempts to generate a model with a better performan...
research
01/15/2023

Delving Deep into Pixel Alignment Feature for Accurate Multi-view Human Mesh Recovery

Regression-based methods have shown high efficiency and effectiveness fo...
research
12/17/2020

Human Mesh Recovery from Multiple Shots

Videos from edited media like movies are a useful, yet under-explored so...
research
07/04/2019

Probabilistic CCA with Implicit Distributions

Canonical Correlation Analysis (CCA) is a classic technique for multi-vi...

Please sign up or login with your details

Forgot password? Click here to reset