End-to-end optimization of goal-driven and visually grounded dialogue systems

03/15/2017
by   Florian Strub, et al.
0

End-to-end design of dialogue systems has recently become a popular research topic thanks to powerful tools such as encoder-decoder architectures for sequence-to-sequence learning. Yet, most current approaches cast human-machine dialogue management as a supervised learning problem, aiming at predicting the next utterance of a participant given the full history of the dialogue. This vision is too simplistic to render the intrinsic planning problem inherent to dialogue as well as its grounded nature, making the context of a dialogue larger than the sole history. This is why only chit-chat and question answering tasks have been addressed so far using end-to-end architectures. In this paper, we introduce a Deep Reinforcement Learning method to optimize visually grounded task-oriented dialogues, based on the policy gradient algorithm. This approach is tested on a dataset of 120k dialogues collected through Mechanical Turk and provides encouraging results at solving both the problem of generating natural dialogues and the task of discovering a specific object in a complex picture.

READ FULL TEXT

page 1

page 6

research
11/29/2017

End-to-End Optimization of Task-Oriented Dialogue Model with Deep Reinforcement Learning

In this paper, we present a neural network based task-oriented dialogue ...
research
09/20/2019

Designing dialogue systems: A mean, grumpy, sarcastic chatbot in the browser

In this work we explore a deep learning-based dialogue system that gener...
research
09/26/2019

GECOR: An End-to-End Generative Ellipsis and Co-reference Resolution Model for Task-Oriented Dialogue

Ellipsis and co-reference are common and ubiquitous especially in multi-...
research
04/15/2016

A Network-based End-to-End Trainable Task-oriented Dialogue System

Teaching machines to accomplish tasks by conversing naturally with human...
research
05/17/2018

Ask No More:Deciding when to guess in referential visual dialogue

Our goal is to explore how the abilities brought in by a dialogue manage...
research
08/15/2019

Towards End-to-End Learning for Efficient Dialogue Agent by Modeling Looking-ahead Ability

Learning an efficient manager of dialogue agent from data with little ma...
research
10/23/2019

TCT: A Cross-supervised Learning Method for Multimodal Sequence Representation

Multimodalities provide promising performance than unimodality in most t...

Please sign up or login with your details

Forgot password? Click here to reset