Attentional Network for Visual Object Detection

02/06/2017
by   Kota Hara, et al.
0

We propose augmenting deep neural networks with an attention mechanism for the visual object detection task. As perceiving a scene, humans have the capability of multiple fixation points, each attended to scene content at different locations and scales. However, such a mechanism is missing in the current state-of-the-art visual object detection methods. Inspired by the human vision system, we propose a novel deep network architecture that imitates this attention mechanism. As detecting objects in an image, the network adaptively places a sequence of glimpses of different shapes at different locations in the image. Evidences of the presence of an object and its location are extracted from these glimpses, which are then fused for estimating the object class and bounding box coordinates. Due to lacks of ground truth annotations of the visual attention mechanism, we train our network using a reinforcement learning algorithm with policy gradients. Experiment results on standard object detection benchmarks show that the proposed network consistently outperforms the baseline networks that does not model the attention mechanism.

READ FULL TEXT

page 2

page 6

research
04/25/2020

Detective: An Attentive Recurrent Model for Sparse Object Detection

In this work, we present Detective - an attentive object detector that i...
research
07/05/2018

Detecting Visual Relationships Using Box Attention

In this paper we propose a new model for detecting visual relationships....
research
11/16/2017

Priming Neural Networks

Visual priming is known to affect the human visual system to allow detec...
research
09/26/2018

Pay attention! - Robustifying a Deep Visuomotor Policy through Task-Focused Attention

Several recent projects demonstrated the promise of end-to-end learned d...
research
12/20/2016

Action-Driven Object Detection with Top-Down Visual Attentions

A dominant paradigm for deep learning based object detection relies on a...
research
04/25/2019

HAR-Net: Joint Learning of Hybrid Attention for Single-stage Object Detection

Object detection has been a challenging task in computer vision. Althoug...
research
02/13/2020

Chaotic Phase Synchronization and Desynchronization in an Oscillator Network for Object Selection

Object selection refers to the mechanism of extracting objects of intere...

Please sign up or login with your details

Forgot password? Click here to reset