Coarse-to-fine Animal Pose and Shape Estimation

11/16/2021
by   Chen Li, et al.
3

Most existing animal pose and shape estimation approaches reconstruct animal meshes with a parametric SMAL model. This is because the low-dimensional pose and shape parameters of the SMAL model makes it easier for deep networks to learn the high-dimensional animal meshes. However, the SMAL model is learned from scans of toy animals with limited pose and shape variations, and thus may not be able to represent highly varying real animals well. This may result in poor fittings of the estimated meshes to the 2D evidences, e.g. 2D keypoints or silhouettes. To mitigate this problem, we propose a coarse-to-fine approach to reconstruct 3D animal mesh from a single image. The coarse estimation stage first estimates the pose, shape and translation parameters of the SMAL model. The estimated meshes are then used as a starting point by a graph convolutional network (GCN) to predict a per-vertex deformation in the refinement stage. This combination of SMAL-based and vertex-based representations benefits from both parametric and non-parametric representations. We design our mesh refinement GCN (MRGCN) as an encoder-decoder structure with hierarchical feature representations to overcome the limited receptive field of traditional GCNs. Moreover, we observe that the global image feature used by existing animal mesh reconstruction works is unable to capture detailed shape information for mesh refinement. We thus introduce a local feature extractor to retrieve a vertex-level feature and use it together with the global feature as the input of the MRGCN. We test our approach on the StanfordExtra dataset and achieve state-of-the-art results. Furthermore, we test the generalization capacity of our approach on the Animal Pose and BADJA datasets. Our code is available at the project website.

READ FULL TEXT

page 2

page 5

page 9

page 14

research
11/09/2021

Monocular Human Shape and Pose with Dense Mesh-borne Local Image Features

We propose to improve on graph convolution based approaches for human sh...
research
11/14/2022

Conformal marked bisection for local refinement of n-dimensional unstructured simplicial meshes

We present an n-dimensional marked bisection method for unstructured con...
research
07/22/2020

Unsupervised Shape and Pose Disentanglement for 3D Meshes

Parametric models of humans, faces, hands and animals have been widely u...
research
10/25/2022

THOR-Net: End-to-end Graformer-based Realistic Two Hands and Object Reconstruction with Self-supervision

Realistic reconstruction of two hands interacting with objects is a new ...
research
10/24/2022

Multi-Person 3D Pose and Shape Estimation via Inverse Kinematics and Refinement

Estimating 3D poses and shapes in the form of meshes from monocular RGB ...
research
02/23/2021

Deep Deformation Detail Synthesis for Thin Shell Models

In physics-based cloth animation, rich folds and detailed wrinkles are a...
research
04/21/2022

Pixel2Mesh++: 3D Mesh Generation and Refinement from Multi-View Images

We study the problem of shape generation in 3D mesh representation from ...

Please sign up or login with your details

Forgot password? Click here to reset