PCLs: Geometry-aware Neural Reconstruction of 3D Pose with Perspective Crop Layers

11/27/2020
by   Frank Yu, et al.
6

Local processing is an essential feature of CNNs and other neural network architectures - it is one of the reasons why they work so well on images where relevant information is, to a large extent, local. However, perspective effects stemming from the projection in a conventional camera vary for different global positions in the image. We introduce Perspective Crop Layers (PCLs) - a form of perspective crop of the region of interest based on the camera geometry - and show that accounting for the perspective consistently improves the accuracy of state-of-the-art 3D pose reconstruction methods. PCLs are modular neural network layers, which, when inserted into existing CNN and MLP architectures, deterministically remove the location-dependent perspective effects while leaving end-to-end training and the number of parameters of the underlying neural network unchanged. We demonstrate that PCL leads to improved 3D human pose reconstruction accuracy for CNN architectures that use cropping operations, such as spatial transformer networks (STN), and, somewhat surprisingly, MLPs used for 2D-to-3D keypoint lifting. Our conclusion is that it is important to utilize camera calibration information when available, for classical and deep-learning-based computer vision alike. PCL offers an easy way to improve the accuracy of existing 3D reconstruction networks by making them geometry-aware.

READ FULL TEXT

page 2

page 3

page 4

page 5

page 6

page 7

page 11

page 12

research
09/18/2016

Learning camera viewpoint using CNN to improve 3D body pose estimation

The objective of this work is to estimate 3D human pose from a single RG...
research
05/09/2022

Single-Image 3D Face Reconstruction under Perspective Projection

In 3D face reconstruction, orthogonal projection has been widely employe...
research
12/06/2022

Perspective Fields for Single Image Camera Calibration

Geometric camera calibration is often required for applications that und...
research
03/21/2019

Weakly-Supervised Discovery of Geometry-Aware Representation for 3D Human Pose Estimation

Recent studies have shown remarkable advances in 3D human pose estimatio...
research
09/21/2023

Ego3DPose: Capturing 3D Cues from Binocular Egocentric Views

We present Ego3DPose, a highly accurate binocular egocentric 3D pose rec...
research
11/05/2020

Conflicting Bundles: Adapting Architectures Towards the Improved Training of Deep Neural Networks

Designing neural network architectures is a challenging task and knowing...
research
01/27/2020

Deep NRSfM++: Towards 3D Reconstruction in the Wild

The recovery of 3D shape and pose solely from 2D landmarks stemming from...

Please sign up or login with your details

Forgot password? Click here to reset