Multimodal Hierarchical Reinforcement Learning Policy for Task-Oriented Visual Dialog

05/08/2018
by   Jiaping Zhang, et al.
0

Creating an intelligent conversational system that understands vision and language is one of the ultimate goals in Artificial Intelligence (AI) winograd1972understanding. Extensive research has focused on vision-to-language generation, however, limited research has touched on combining these two modalities in a goal-driven dialog context. We propose a multimodal hierarchical reinforcement learning framework that dynamically integrates vision and language for task-oriented visual dialog. The framework jointly learns the multimodal dialog state representation and the hierarchical dialog policy to improve both dialog task success and efficiency. We also propose a new technique, state adaptation, to integrate context awareness in the dialog state representation. We evaluate the proposed framework and the state adaptation technique in an image guessing game and achieve promising results.

READ FULL TEXT
research
09/06/2019

Building Task-Oriented Visual Dialog Systems Through Alternative Optimization Between Dialog Policy and Language Generation

Reinforcement learning (RL) is an effective approach to learn an optimal...
research
06/08/2016

Towards End-to-End Learning for Dialog State Tracking and Management using Deep Reinforcement Learning

This paper presents an end-to-end framework for task-oriented dialog sys...
research
11/16/2020

Dialog Simulation with Realistic Variations for Training Goal-Oriented Conversational Systems

Goal-oriented dialog systems enable users to complete specific goals lik...
research
05/04/2023

An Asynchronous Updating Reinforcement Learning Framework for Task-oriented Dialog System

Reinforcement learning has been applied to train the dialog systems in m...
research
03/16/2022

Spot the Difference: A Cooperative Object-Referring Game in Non-Perfectly Co-Observable Scene

Visual dialog has witnessed great progress after introducing various vis...
research
11/22/2021

Building Goal-Oriented Dialogue Systems with Situated Visual Context

Most popular goal-oriented dialogue agents are capable of understanding ...
research
05/17/2023

Interactive Learning of Hierarchical Tasks from Dialog with GPT

We present a system for interpretable, symbolic, interactive task learni...

Please sign up or login with your details

Forgot password? Click here to reset