Just Ask: An Interactive Learning Framework for Vision and Language Navigation

12/02/2019
by   Ta-Chung Chi, et al.

In the vision and language navigation task, the agent may encounter ambiguous situations that are hard to interpret by relying only on visual information and natural language instructions. We propose an interactive learning framework to endow the agent with the ability to ask for users' help in such situations. As part of this framework, we investigate multiple learning approaches for the agent with different levels of complexity. The simplest, model-confusion-based method lets the agent ask questions based on its confusion, relying on a predefined confidence threshold of a next-action prediction model. To build on this confusion-based method, the agent is expected to demonstrate more sophisticated reasoning such that it discovers the timing and locations to interact with a human. We achieve this goal using reinforcement learning (RL) with a proposed reward shaping term, which enables the agent to ask questions only when necessary. The success rate can be boosted by at least 15% with only one question asked on average during the navigation. Furthermore, we show that the RL agent is capable of adjusting dynamically to noisy human responses. Finally, we design a continual learning strategy, which can be viewed as a data augmentation method, for the agent to improve further by utilizing its interaction history with a human. We demonstrate that the proposed strategy is substantially more realistic and data-efficient compared to previously proposed pre-exploration techniques.
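
The confusion-based method described above can be illustrated with a minimal sketch. It assumes the next-action prediction model outputs one logit per candidate action and that "confidence" means the largest softmax probability; the abstract does not specify the exact confidence measure, so the function names and the threshold value of 0.5 are hypothetical choices for illustration only.

```python
import math
from typing import List, Sequence


def softmax(logits: Sequence[float]) -> List[float]:
    """Convert raw action logits into a probability distribution."""
    peak = max(logits)  # subtract the max for numerical stability
    exps = [math.exp(x - peak) for x in logits]
    total = sum(exps)
    return [e / total for e in exps]


def should_ask(action_logits: Sequence[float], threshold: float = 0.5) -> bool:
    """Return True when the agent is 'confused' and should ask for help.

    Assumption: confusion means the most likely next action has a
    probability below a predefined threshold (0.5 here is hypothetical).
    """
    probs = softmax(action_logits)
    return max(probs) < threshold


# One clearly preferred action: the agent proceeds without asking.
confident = should_ask([5.0, 0.0, 0.0])

# Three equally likely actions: the agent asks for help.
confused = should_ask([0.0, 0.0, 0.0])
```

A fixed threshold like this asks whenever the model is uncertain, regardless of whether a question is actually worthwhile at that step, which is the limitation the RL-based approach in the abstract is meant to address.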


research
06/06/2023

Active Sparse Conversations for Improved Audio-Visual Embodied Navigation

Efficient navigation towards an audio-goal necessitates an embodied agen...
research
06/20/2022

Good Time to Ask: A Learning Framework for Asking for Help in Embodied Visual Navigation

In reality, it is often more efficient to ask for help than to search th...
research
09/10/2022

Ask Before You Act: Generalising to Novel Environments by Asking Questions

Solving temporally-extended tasks is a challenge for most reinforcement ...
research
05/02/2020

RMM: A Recursive Mental Model for Dialog Navigation

Fluent communication requires understanding your audience. In the new co...
research
03/08/2019

Improving Skin Condition Classification with a Visual Symptom Checker trained using Reinforcement Learning

We present a visual symptom checker that combines a pre-trained Convolut...
research
04/18/2022

Learning to Execute Actions or Ask Clarification Questions

Collaborative tasks are ubiquitous activities where a form of communicat...
research
11/25/2017

Malaria Likelihood Prediction By Effectively Surveying Households Using Deep Reinforcement Learning

We build a deep reinforcement learning (RL) agent that can predict the l...
