Progress and limitations of deep networks to recognize objects in unusual poses

07/16/2022
by   Amro Abbas, et al.
0

Deep networks should be robust to rare events if they are to be successfully deployed in high-stakes real-world applications (e.g., self-driving cars). Here we study the capability of deep networks to recognize objects in unusual poses. We create a synthetic dataset of images of objects in unusual orientations, and evaluate the robustness of a collection of 38 recent and competitive deep networks for image classification. We show that classifying these images is still a challenge for all networks tested, with an average accuracy drop of 29.5 largely unaffected by various network design choices, such as training losses (e.g., supervised vs. self-supervised), architectures (e.g., convolutional networks vs. transformers), dataset modalities (e.g., images vs. image-text pairs), and data-augmentation schemes. However, networks trained on very large datasets substantially outperform others, with the best network testedx2014Noisy Student EfficentNet-L2 trained on JFT-300Mx2014showing a relatively small accuracy drop of only 14.5 on unusual poses. Nevertheless, a visual inspection of the failures of Noisy Student reveals a remaining gap in robustness with the human visual system. Furthermore, combining multiple object transformationsx20143D-rotations and scalingx2014further degrades the performance of all networks. Altogether, our results provide another measurement of the robustness of deep networks that is important to consider when using them in the real world. Code and datasets are available at https://github.com/amro-kamal/ObjectPose.

READ FULL TEXT

page 2

page 17

page 29

page 30

page 32

page 33

page 34

page 35

research
06/14/2021

Partial success in closing the gap between human and machine vision

A few years ago, the first CNN surpassed human performance on ImageNet. ...
research
08/28/2021

Self-supervised Neural Networks for Spectral Snapshot Compressive Imaging

We consider using untrained neural networks to solve the reconstruction ...
research
11/17/2022

EfficientTrain: Exploring Generalized Curriculum Learning for Training Visual Backbones

The superior performance of modern deep networks usually comes at the pr...
research
08/29/2018

DADA: Deep Adversarial Data Augmentation for Extremely Low Data Regime Classification

Deep learning has revolutionized the performance of classification, but ...
research
11/25/2021

Robust Equivariant Imaging: a fully unsupervised framework for learning to image from noisy and partial measurements

Deep networks provide state-of-the-art performance in multiple imaging i...
research
02/14/2022

Online-updated High-order Collaborative Networks for Single Image Deraining

Single image deraining is an important and challenging task for some dow...
research
10/23/2018

Brand > Logo: Visual Analysis of Fashion Brands

While lots of people may think branding begins and ends with a logo, fas...

Please sign up or login with your details

Forgot password? Click here to reset