Playing 20 Question Game with Policy-Based Reinforcement Learning

08/23/2018
by   Huang Hu, et al.
0

The 20 Questions (Q20) game is a well known game which encourages deductive reasoning and creativity. In the game, the answerer first thinks of an object such as a famous person or a kind of animal. Then the questioner tries to guess the object by asking 20 questions. In a Q20 game system, the user is considered as the answerer while the system itself acts as the questioner which requires a good strategy of question selection to figure out the correct object and win the game. However, the optimal policy of question selection is hard to be derived due to the complexity and volatility of the game environment. In this paper, we propose a novel policy-based Reinforcement Learning (RL) method, which enables the questioner agent to learn the optimal policy of question selection through continuous interactions with users. To facilitate training, we also propose to use a reward network to estimate the more informative reward. Compared to previous methods, our RL method is robust to noisy answers and does not rely on the Knowledge Base of objects. Experimental results show that our RL method clearly outperforms an entropy-based engineering system and has competitive performance in a noisy-free simulation environment.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
08/15/2019

Playing a Strategy Game with Knowledge-Based Reinforcement Learning

This paper presents Knowledge-Based Reinforcement Learning (KB-RL) as a ...
research
07/28/2023

Dialogue Shaping: Empowering Agents through NPC Interaction

One major challenge in reinforcement learning (RL) is the large amount o...
research
12/03/2022

Constrained Reinforcement Learning via Dissipative Saddle Flow Dynamics

In constrained reinforcement learning (C-RL), an agent seeks to learn fr...
research
06/21/2022

Finding Optimal Policy for Queueing Models: New Parameterization

Queueing systems appear in many important real-life applications includi...
research
09/06/2021

Enhancing Visual Dialog Questioner with Entity-based Strategy Learning and Augmented Guesser

Considering the importance of building a good Visual Dialog (VD) Questio...
research
06/17/2023

The RL Perceptron: Generalisation Dynamics of Policy Learning in High Dimensions

Reinforcement learning (RL) algorithms have proven transformative in a r...

Please sign up or login with your details

Forgot password? Click here to reset