Learning to predict where to look in interactive environments using deep recurrent q-learning

12/17/2016
by   Sajad Mousavi, et al.
0

Bottom-Up (BU) saliency models do not perform well in complex interactive environments where humans are actively engaged in tasks (e.g., sandwich making and playing the video games). In this paper, we leverage Reinforcement Learning (RL) to highlight task-relevant locations of input frames. We propose a soft attention mechanism combined with the Deep Q-Network (DQN) model to teach an RL agent how to play a game and where to look by focusing on the most pertinent parts of its visual input. Our evaluations on several Atari 2600 games show that the soft attention based model could predict fixation locations significantly better than bottom-up models such as Itti-Kochs saliency and Graph-Based Visual Saliency (GBVS) models.

READ FULL TEXT

page 5

page 6

research
08/11/2021

An Approach to Partial Observability in Games: Learning to Both Act and Observe

Reinforcement learning (RL) is successful at learning to play games wher...
research
07/25/2018

Attend Before you Act: Leveraging human visual attention for continual learning

When humans perform a task, such as playing a game, they selectively pay...
research
08/26/2020

Selective Particle Attention: Visual Feature-Based Attention in Deep Reinforcement Learning

The human brain uses selective attention to filter perceptual input so t...
research
12/29/2019

Speeding up reinforcement learning by combining attention and agency features

When playing video-games we immediately detect which entity we control a...
research
01/10/2023

Learning to Perceive in Deep Model-Free Reinforcement Learning

This work proposes a novel model-free Reinforcement Learning (RL) agent ...
research
06/17/2018

Task-Relevant Object Discovery and Categorization for Playing First-person Shooter Games

We consider the problem of learning to play first-person shooter (FPS) v...
research
12/23/2019

Explain Your Move: Understanding Agent Actions Using Focused Feature Saliency

As deep reinforcement learning (RL) is applied to more tasks, there is a...

Please sign up or login with your details

Forgot password? Click here to reset