Sim-Real Joint Reinforcement Transfer for 3D Indoor Navigation

04/08/2019
by Fengda Zhu, et al.

There has been increasing interest in 3D indoor navigation, where a robot in an environment moves to a target according to an instruction. To deploy a robot for navigation in the physical world, a large amount of training data is required to learn an effective policy. It is quite labour intensive to obtain sufficient real environment data for training robots, while synthetic data is much easier to construct by rendering. Though it is promising to utilize synthetic environments to facilitate navigation training in the real world, real environments are heterogeneous from synthetic environments in two aspects. First, the visual representations of the two environments have significant variances. Second, the houseplans of the two environments are quite different. Therefore two types of information, i.e. visual representation and policy behavior, need to be adapted in the reinforcement model. The learning procedure of visual representation and that of policy behavior are presumably reciprocal. We propose to jointly adapt visual representation and policy behavior to leverage the mutual impacts of environment and policy. Specifically, our method employs an adversarial feature adaptation model for visual representation transfer and a policy mimic strategy for policy behavior imitation. Experiments show that our method outperforms the baseline by 19.47% without any additional human annotations.
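To make the joint objective described in the abstract more concrete, here is a minimal sketch in plain Python. It is not the authors' implementation: the abstract does not give the loss functions, so the choice of binary cross-entropy for the adversarial term, KL divergence for the mimic term, the teacher/student direction (synthetic-environment policy as teacher), and every function and parameter name (`adversarial_feature_loss`, `policy_mimic_loss`, `joint_loss`, `weight`) are assumptions made for illustration only.

```python
import math


def softmax(logits):
    """Numerically stable softmax over a list of floats."""
    m = max(logits)
    exps = [math.exp(x - m) for x in logits]
    total = sum(exps)
    return [e / total for e in exps]


def kl_divergence(p, q, eps=1e-12):
    """KL(p || q) for two discrete distributions of equal length."""
    return sum(pi * math.log((pi + eps) / (qi + eps)) for pi, qi in zip(p, q))


def policy_mimic_loss(teacher_logits, student_logits):
    """Assumed mimic term: pull the student's action distribution
    toward the teacher's (here, the policy trained on synthetic data)."""
    return kl_divergence(softmax(teacher_logits), softmax(student_logits))


def binary_cross_entropy(prob, label, eps=1e-12):
    """Cross-entropy between a predicted probability and a 0/1 label."""
    return -(label * math.log(prob + eps)
             + (1 - label) * math.log(1 - prob + eps))


def adversarial_feature_loss(disc_prob_synthetic):
    """Assumed feature-extractor term: the loss is low when the domain
    discriminator believes real-image features are synthetic (label 1)."""
    return binary_cross_entropy(disc_prob_synthetic, 1.0)


def joint_loss(teacher_logits, student_logits, disc_prob_synthetic, weight=1.0):
    """Hypothetical joint objective: adversarial term plus weighted mimic term."""
    return (adversarial_feature_loss(disc_prob_synthetic)
            + weight * policy_mimic_loss(teacher_logits, student_logits))
```

In this sketch, calling `joint_loss(sim_logits, real_logits, d)` for one observation returns a scalar that shrinks when the real-environment features fool the discriminator and when the two policies agree on action probabilities, which is one plausible reading of adapting both kinds of information jointly.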


