Numerical Coordinate Regression with Convolutional Neural Networks

01/23/2018
by   Aiden Nibali, et al.
0

We study deep learning approaches to inferring numerical coordinates for points of interest in an input image. Existing convolutional neural network-based solutions to this problem either take a heatmap matching approach or regress to coordinates with a fully connected output layer. Neither of these approaches is ideal, since the former is not entirely differentiable, and the latter lacks inherent spatial generalization. We propose our differentiable spatial to numerical transform (DSNT) to fill this gap. The DSNT layer adds no trainable parameters, is fully differentiable, and exhibits good spatial generalization. Unlike heatmap matching, DSNT works well with low heatmap resolutions, so it can be dropped in as an output layer for a wide range of existing fully convolutional architectures. Consequently, DSNT offers a better trade-off between inference speed and prediction accuracy compared to existing techniques. When used to replace the popular heatmap matching approach used in almost all state-of-the-art methods for pose estimation, DSNT gives better prediction accuracy for all model architectures tested.

READ FULL TEXT

page 3

page 4

page 5

page 6

research
11/29/2020

Improving Neural Network with Uniform Sparse Connectivity

Neural network forms the foundation of deep learning and numerous AI app...
research
05/10/2017

Efficient and Scalable View Generation from a Single Image using Fully Convolutional Networks

Single-image-based view generation (SIVG) is important for producing 3D ...
research
04/16/2020

Spatially Attentive Output Layer for Image Classification

Most convolutional neural networks (CNNs) for image classification use a...
research
05/24/2020

Deep Convolutional Neural Network-based Bernoulli Heatmap for Head Pose Estimation

Head pose estimation is a crucial problem for many tasks, such as driver...
research
04/25/2020

NullSpaceNet: Nullspace Convoluional Neural Network with Differentiable Loss Function

We propose NullSpaceNet, a novel network that maps from the pixel level ...
research
05/06/2019

Pixel-wise Regression: 3D Hand Pose Estimation via Spatial-form Representation and Differentiable Decoder

3D Hand pose estimation from a single depth image is an essential topic ...
research
06/08/2016

Convolution by Evolution: Differentiable Pattern Producing Networks

In this work we introduce a differentiable version of the Compositional ...

Please sign up or login with your details

Forgot password? Click here to reset