Learning Where to Fixate on Foveated Images

11/16/2018
by   Hanxiao Wang, et al.
12

Foveation, the ability to sequentially acquire high-acuity regions of a scene viewed initially at low-acuity, is a key property of biological vision systems. In a computer vision system, foveation is also desired to increase data efficiency and derive task-relevant features. Yet, most existing deep learning models lack the ability to foveate. In this paper, we propose a deep reinforcement learning-based foveation model, DRIFT, and apply it to challenging fine-grained classification tasks. Training of DRIFT requires only image-level category labels and encourages fixations to contain discriminative information while maintaining data efficiency. Specifically, we formulate foveation as a sequential decision-making process and train a foveation actor network with a novel Deep Deterministic Policy Gradient by Conditioned Critic and Coaching (DDPGC3) algorithm. In addition, we propose to shape the reward to provide informative feedback after each fixation to better guide the RL training. We demonstrate the effectiveness of our method on five fine-grained classification benchmark datasets, and show that the proposed approach achieves state-of-the-art performance using an order-of-magnitude fewer pixels.

READ FULL TEXT

page 1

page 2

page 6

page 7

page 8

research
07/09/2018

Video Summarisation by Classification with Deep Reinforcement Learning

Most existing video summarisation methods are based on either supervised...
research
02/20/2017

Learning to Repeat: Fine Grained Action Repetition for Deep Reinforcement Learning

Reinforcement Learning algorithms can learn complex behavioral patterns ...
research
03/22/2016

Fully Convolutional Attention Networks for Fine-Grained Recognition

Fine-grained recognition is challenging due to its subtle local inter-cl...
research
10/06/2020

Microscopic fine-grained instance classification through deep attention

Fine-grained classification of microscopic image data with limited sampl...
research
09/16/2018

Maximum-Entropy Fine-Grained Classification

Fine-Grained Visual Classification (FGVC) is an important computer visio...

Please sign up or login with your details

Forgot password? Click here to reset