Collaborative Multi-Agent Dialogue Model Training Via Reinforcement Learning

07/11/2019
by   Alexandros Papangelis, et al.
0

We present the first complete attempt at concurrently training conversational agents that communicate only via self-generated language. Using DSTC2 as seed data, we trained natural language understanding (NLU) and generation (NLG) networks for each agent and let the agents interact online. We model the interaction as a stochastic collaborative game where each agent (player) has a role ("assistant", "tourist", "eater", etc.) and their own objectives, and can only interact via natural language they generate. Each agent, therefore, needs to learn to operate optimally in an environment with multiple sources of uncertainty (its own NLU and NLG, the other agent's NLU, Policy, and NLG). In our evaluation, we show that the stochastic-game agents outperform deep learning based supervised baselines.

READ FULL TEXT
research
07/20/2021

Toward Collaborative Reinforcement Learning Agents that Communicate Through Text-Based Natural Language

Communication between agents in collaborative multi-agent settings is in...
research
03/14/2023

CB2: Collaborative Natural Language Interaction Research Platform

CB2 is a multi-agent platform to study collaborative natural language in...
research
03/26/2022

Demonstrating CAT: Synthesizing Data-Aware Conversational Agents for Transactional Databases

Databases for OLTP are often the backbone for applications such as hotel...
research
07/28/2023

Dialogue Shaping: Empowering Agents through NPC Interaction

One major challenge in reinforcement learning (RL) is the large amount o...
research
03/20/2021

Overprotective Training Environments Fall Short at Testing Time: Let Models Contribute to Their Own Training

Despite important progress, conversational systems often generate dialog...
research
04/26/2018

Interactive Language Acquisition with One-shot Visual Concept Learning through a Conversational Game

Building intelligent agents that can communicate with and learn from hum...
research
07/09/2021

Integrating Planning, Execution and Monitoring in the presence of Open World Novelties: Case Study of an Open World Monopoly Solver

The game of monopoly is an adversarial multi-agent domain where there is...

Please sign up or login with your details

Forgot password? Click here to reset