Interactive Reinforcement Learning for Object Grounding via Self-Talking

12/02/2017
by   Yan Zhu, et al.
0

Humans are able to identify a referred visual object in a complex scene via a few rounds of natural language communications. Success communication requires both parties to engage and learn to adapt for each other. In this paper, we introduce an interactive training method to improve the natural language conversation system for a visual grounding task. During interactive training, both agents are reinforced by the guidance from a common reward function. The parametrized reward function also cooperatively updates itself via interactions, and contribute to accomplishing the task. We evaluate the method on GuessWhat?! visual grounding task, and significantly improve the task success rate. However, we observe language drifting problem during training and propose to use reward engineering to improve the interpretability for the generated conversations. Our result also indicates evaluating goal-ended visual conversation tasks require semantic relevant metrics beyond task success rate.

READ FULL TEXT
research
09/10/2019

Countering Language Drift via Visual Grounding

Emergent multi-agent communication protocols are very different from nat...
research
04/24/2019

Grounding Natural Language Commands to StarCraft II Game States for Narration-Guided Reinforcement Learning

While deep reinforcement learning techniques have led to agents that are...
research
09/21/2023

LLM-Grounder: Open-Vocabulary 3D Visual Grounding with Large Language Model as an Agent

3D visual grounding is a critical skill for household robots, enabling t...
research
09/13/2023

Self-Refined Large Language Model as Automated Reward Function Designer for Deep Reinforcement Learning in Robotics

Although Deep Reinforcement Learning (DRL) has achieved notable success ...
research
09/08/2023

Three Ways to Improve Verbo-visual Fusion for Dense 3D Visual Grounding

3D visual grounding is the task of localizing the object in a 3D scene w...
research
04/27/2018

Reward Learning from Narrated Demonstrations

Humans effortlessly "program" one another by communicating goals and des...
research
05/05/2023

Interactive Acquisition of Fine-grained Visual Concepts by Exploiting Semantics of Generic Characterizations in Discourse

Interactive Task Learning (ITL) concerns learning about unforeseen domai...

Please sign up or login with your details

Forgot password? Click here to reset