Confidence Adaptive Anytime Pixel-Level Recognition

04/01/2021
by   Zhuang Liu, et al.
0

Anytime inference requires a model to make a progression of predictions which might be halted at any time. Prior research on anytime visual recognition has mostly focused on image classification. We propose the first unified and end-to-end model approach for anytime pixel-level recognition. A cascade of "exits" is attached to the model to make multiple predictions and direct further computation. We redesign the exits to account for the depth and spatial resolution of the features for each exit. To reduce total computation, and make full use of prior predictions, we develop a novel spatially adaptive approach to avoid further computation on regions where early predictions are already sufficiently confident. Our full model with redesigned exit architecture and spatial adaptivity enables anytime inference, achieves the same level of final accuracy, and even significantly reduces total computation. We evaluate our approach on semantic segmentation and human pose estimation. On Cityscapes semantic segmentation and MPII human pose estimation, our approach enables anytime inference while also reducing the total FLOPs of its base models by 44.4 measure the anytime capability of deep equilibrium networks, a recent class of model that is intrinsically iterative, and we show that the accuracy-computation curve of our architecture strictly dominates it.

READ FULL TEXT

page 4

page 7

page 8

page 11

page 12

page 13

page 15

page 16

research
04/05/2019

Spatial Shortcut Network for Human Pose Estimation

Like many computer vision problems, human pose estimation is a challengi...
research
10/16/2020

HPERL: 3D Human Pose Estimation from RGB and LiDAR

In-the-wild human pose estimation has a huge potential for various field...
research
05/03/2022

Multitask Network for Joint Object Detection, Semantic Segmentation and Human Pose Estimation in Vehicle Occupancy Monitoring

In order to ensure safe autonomous driving, precise information about th...
research
09/22/2019

Pixel-Level Dense Prediction without Decoder

Pixel-level dense prediction tasks such as keypoint estimation are domin...
research
05/21/2022

Lightweight Human Pose Estimation Using Heatmap-Weighting Loss

Recent research on human pose estimation exploits complex structures to ...
research
06/05/2018

3D Human Pose Estimation with 2D Marginal Heatmaps

Automatically determining three-dimensional human pose from monocular RG...
research
12/06/2019

Dynamic Convolutions: Exploiting Spatial Sparsity for Faster Inference

Modern convolutional neural networks apply the same operations on every ...

Please sign up or login with your details

Forgot password? Click here to reset