Target-driven Visual Navigation in Indoor Scenes using Deep Reinforcement Learning

by   Yuke Zhu, et al.

Two less addressed issues of deep reinforcement learning are (1) lack of generalization capability to new target goals, and (2) data inefficiency i.e., the model requires several (and often costly) episodes of trial and error to converge, which makes it impractical to be applied to real-world scenarios. In this paper, we address these two issues and apply our model to the task of target-driven visual navigation. To address the first issue, we propose an actor-critic model whose policy is a function of the goal as well as the current state, which allows to better generalize. To address the second issue, we propose AI2-THOR framework, which provides an environment with high-quality 3D scenes and physics engine. Our framework enables agents to take actions and interact with objects. Hence, we can collect a huge number of training samples efficiently. We show that our proposed method (1) converges faster than the state-of-the-art deep reinforcement learning methods, (2) generalizes across targets and across scenes, (3) generalizes to a real robot scenario with a small amount of fine-tuning (although the model is trained in simulation), (4) is end-to-end trainable and does not need feature engineering, feature matching between frames or 3D reconstruction of the environment. The supplementary video can be accessed at the following link:


page 1

page 2

page 3

page 6

page 7


The AdobeIndoorNav Dataset: Towards Deep Reinforcement Learning based Real-world Indoor Robot Visual Navigation

Deep reinforcement learning (DRL) demonstrates its potential in learning...

Visual Sensor Network Reconfiguration with Deep Reinforcement Learning

We present an approach for reconfiguration of dynamic visual sensor netw...

Visual Transfer between Atari Games using Competitive Reinforcement Learning

This paper explores the use of deep reinforcement learning agents to tra...

Sequence-Agnostic Multi-Object Navigation

The Multi-Object Navigation (MultiON) task requires a robot to localize ...

Learning View and Target Invariant Visual Servoing for Navigation

The advances in deep reinforcement learning recently revived interest in...

Hierarchical Reinforcement Learning for Sensor-Based Navigation

Robotic systems are nowadays capable of solving complex navigation tasks...

Dynamic Routing for Navigation in Changing Unknown Maps Using Deep Reinforcement Learning

In this work, we propose an approach for an autonomous agent that learns...

Please sign up or login with your details

Forgot password? Click here to reset