NeMo: 3D Neural Motion Fields from Multiple Video Instances of the Same Action

12/28/2022
by   Kuan-Chieh Wang, et al.
0

The task of reconstructing 3D human motion has wideranging applications. The gold standard Motion capture (MoCap) systems are accurate but inaccessible to the general public due to their cost, hardware and space constraints. In contrast, monocular human mesh recovery (HMR) methods are much more accessible than MoCap as they take single-view videos as inputs. Replacing the multi-view Mo- Cap systems with a monocular HMR method would break the current barriers to collecting accurate 3D motion thus making exciting applications like motion analysis and motiondriven animation accessible to the general public. However, performance of existing HMR methods degrade when the video contains challenging and dynamic motion that is not in existing MoCap datasets used for training. This reduces its appeal as dynamic motion is frequently the target in 3D motion recovery in the aforementioned applications. Our study aims to bridge the gap between monocular HMR and multi-view MoCap systems by leveraging information shared across multiple video instances of the same action. We introduce the Neural Motion (NeMo) field. It is optimized to represent the underlying 3D motions across a set of videos of the same action. Empirically, we show that NeMo can recover 3D motion in sports using videos from the Penn Action dataset, where NeMo outperforms existing HMR methods in terms of 2D keypoint detection. To further validate NeMo using 3D metrics, we collected a small MoCap dataset mimicking actions in Penn Action,and show that NeMo achieves better 3D reconstruction compared to various baselines.

READ FULL TEXT

page 1

page 6

page 7

research
08/18/2020

Motion Capture from Internet Videos

Recent advances in image-based human pose estimation make it possible to...
research
10/06/2017

CAMREP- Concordia Action and Motion Repository

Action recognition, motion classification, gait analysis and synthesis a...
research
04/23/2021

SportsCap: Monocular 3D Human Motion Capture and Fine-grained Understanding in Challenging Sports Videos

Markerless motion capture and understanding of professional non-daily hu...
research
10/24/2022

Monocular Dynamic View Synthesis: A Reality Check

We study the recent progress on dynamic view synthesis (DVS) from monocu...
research
08/31/2023

GHuNeRF: Generalizable Human NeRF from a Monocular Video

In this paper, we tackle the challenging task of learning a generalizabl...
research
12/03/2018

A Two-Stream Variational Adversarial Network for Video Generation

Video generation is an inherently challenging task, as it requires the m...
research
01/21/2022

Multi-view Monocular Depth and Uncertainty Prediction with Deep SfM in Dynamic Environments

3D reconstruction of depth and motion from monocular video in dynamic en...

Please sign up or login with your details

Forgot password? Click here to reset