Improving Goal-Oriented Visual Dialog Agents via Advanced Recurrent Nets with Tempered Policy Gradient

07/02/2018
by   Rui Zhao, et al.
0

Learning goal-oriented dialogues by means of deep reinforcement learning has recently become a popular research topic. However, training text-generating agents efficiently is still a considerable challenge. Commonly used policy-based dialogue agents often end up focusing on simple utterances and suboptimal policies. To mitigate this problem, we propose a class of novel temperature-based extensions for policy gradient methods, which are referred to as Tempered Policy Gradients (TPGs). These methods encourage exploration with different temperature control strategies. We derive three variations of the TPGs and show their superior performance on a recently published AI-testbed, i.e., the GuessWhat?! game. On the testbed, we achieve significant improvements with two innovations. The first one is an extension of the state-of-the-art solutions with Seq2Seq and Memory Network structures that leads to an improvement of 9 methods, which improves the performance additionally by around 5 more importantly, helps produce more convincing utterances. TPG can easily be applied to any goal-oriented dialogue systems.

READ FULL TEXT
research
10/02/2018

Efficient Dialog Policy Learning via Positive Memory Retention

This paper is concerned with the training of recurrent neural networks a...
research
02/07/2020

I love your chain mail! Making knights smile in a fantasy game world: Open-domain goal-oriented dialogue agents

Dialogue research tends to distinguish between chit-chat and goal-orient...
research
02/07/2020

I love your chain mail! Making knights smile in a fantasy game world: Open-domain goal-orientated dialogue agents

Dialogue research tends to distinguish between chit-chat and goal-orient...
research
12/07/2017

End-to-End Offline Goal-Oriented Dialog Policy Learning via Policy Gradient

Learning a goal-oriented dialog policy is generally performed offline wi...
research
02/12/2018

Answerer in Questioner's Mind for Goal-Oriented Visual Dialogue

Goal-oriented dialogue has been paid attention for its numerous applicat...
research
06/03/2018

Building Advanced Dialogue Managers for Goal-Oriented Dialogue Systems

Goal-Oriented (GO) Dialogue Systems, colloquially known as goal oriented...
research
09/03/2018

Emergence of Communication in an Interactive World with Consistent Speakers

Training agents to communicate with one another given task-based supervi...

Please sign up or login with your details

Forgot password? Click here to reset