Vision-based Navigation with Language-based Assistance via Imitation Learning with Indirect Intervention

12/10/2018
by   Khanh Nguyen, et al.
8

We present Vision-based Navigation with Language-based Assistance (VNLA), a grounded vision-language task where an agent with visual perception is guided via language to find objects in photorealistic indoor environments. The task emulates a real-world scenario in that (a) the requester may not know how to navigate to the target objects and thus makes requests by only specifying high-level endgoals, and (b) the agent is capable of sensing when it is lost and querying an advisor, who is more qualified at the task, to obtain language subgoals to make progress. To model language-based assistance, we develop a general framework termed Imitation Learning with Indirect Intervention (I3L), and propose a solution that is effective on the VNLA task. Empirical results show that this approach significantly improves the success rate of the learning agent over other baselines on both seen and unseen environments.

READ FULL TEXT

page 2

page 12

page 14

research
09/04/2019

Help, Anna! Visual Navigation with Natural Multimodal Assistance via Retrospective Curiosity-Encouraging Imitation Learning

Mobile agents that can leverage help from humans can potentially accompl...
research
04/17/2020

Approximate Inverse Reinforcement Learning from Vision-based Imitation Learning

In this work, we present a method for obtaining an implicit objective fu...
research
03/29/2022

ReIL: A Framework for Reinforced Intervention-based Imitation Learning

Compared to traditional imitation learning methods such as DAgger and DA...
research
10/04/2022

Learning Perception-Aware Agile Flight in Cluttered Environments

Recently, neural control policies have outperformed existing model-based...
research
01/24/2023

Language-guided Task Adaptation for Imitation Learning

We introduce a novel setting, wherein an agent needs to learn a task fro...
research
10/07/2022

Learning a Visually Grounded Memory Assistant

We introduce a novel interface for large scale collection of human memor...
research
06/15/2021

Causal Navigation by Continuous-time Neural Networks

Imitation learning enables high-fidelity, vision-based learning of polic...

Please sign up or login with your details

Forgot password? Click here to reset