Learning Open Domain Multi-hop Search Using Reinforcement Learning

05/30/2022
by   Enrique Noriega-Atala, et al.
0

We propose a method to teach an automated agent to learn how to search for multi-hop paths of relations between entities in an open domain. The method learns a policy for directing existing information retrieval and machine reading resources to focus on relevant regions of a corpus. The approach formulates the learning problem as a Markov decision process with a state representation that encodes the dynamics of the search process and a reward structure that minimizes the number of documents that must be processed while still finding multi-hop paths. We implement the method in an actor-critic reinforcement learning algorithm and evaluate it on a dataset of search problems derived from a subset of English Wikipedia. The algorithm finds a family of policies that succeeds in extracting the desired information while processing fewer documents compared to several baseline heuristic algorithms.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
02/18/2022

DARL1N: Distributed multi-Agent Reinforcement Learning with One-hop Neighbors

Most existing multi-agent reinforcement learning (MARL) methods are limi...
research
07/03/2021

Traffic Signal Control with Communicative Deep Reinforcement Learning Agents: a Case Study

In this work we analyze Multi-Agent Advantage Actor-Critic (MA2C) a rece...
research
09/30/2021

Decentralized Graph-Based Multi-Agent Reinforcement Learning Using Reward Machines

In multi-agent reinforcement learning (MARL), it is challenging for a co...
research
01/02/2021

Baleen: Robust Multi-Hop Reasoning at Scale via Condensed Retrieval

Multi-hop reasoning (i.e., reasoning across two or more documents) at sc...
research
07/04/2010

A Reinforcement Learning Model Using Neural Networks for Music Sight Reading Learning Problem

Music Sight Reading is a complex process in which when it is occurred in...
research
07/07/2011

Text Classification: A Sequential Reading Approach

We propose to model the text classification process as a sequential deci...
research
05/23/2019

Exploiting Cognitive Structure for Adaptive Learning

Adaptive learning, also known as adaptive teaching, relies on learning p...

Please sign up or login with your details

Forgot password? Click here to reset