On Learning Where To Look

04/24/2014
by   Marc'Aurelio Ranzato, et al.
0

Current automatic vision systems face two major challenges: scalability and extreme variability of appearance. First, the computational time required to process an image typically scales linearly with the number of pixels in the image, therefore limiting the resolution of input images to thumbnail size. Second, variability in appearance and pose of the objects constitute a major hurdle for robust recognition and detection. In this work, we propose a model that makes baby steps towards addressing these challenges. We describe a learning based method that recognizes objects through a series of glimpses. This system performs an amount of computation that scales with the complexity of the input rather than its number of pixels. Moreover, the proposed method is potentially more robust to changes in appearance since its parameters are learned in a data driven manner. Preliminary experiments on a handwritten dataset of digits demonstrate the computational advantages of this approach.

READ FULL TEXT

page 7

page 10

page 11

page 12

research
05/24/2018

You Only Look Twice: Rapid Multi-Scale Object Detection In Satellite Imagery

Detection of small objects in large swaths of imagery is one of the prim...
research
07/19/2017

Domain-adversarial neural networks to address the appearance variability of histopathology images

Preparing and scanning histopathology slides consists of several steps, ...
research
04/07/2021

Neural Articulated Radiance Field

We present Neural Articulated Radiance Field (NARF), a novel deformable ...
research
01/02/2022

Splicing ViT Features for Semantic Appearance Transfer

We present a method for semantically transferring the visual appearance ...
research
11/01/2021

Evaluation of Human and Machine Face Detection using a Novel Distinctive Human Appearance Dataset

Face detection is a long-standing challenge in the field of computer vis...
research
11/01/2020

HM4: Hidden Markov Model with Memory Management for Visual Place Recognition

Visual place recognition needs to be robust against appearance variabili...
research
12/21/2018

3D multirater RCNN for multimodal multiclass detection and characterisation of extremely small objects

Extremely small objects (ESO) have become observable on clinical routine...

Please sign up or login with your details

Forgot password? Click here to reset