Learning to Navigate Wikipedia by Taking Random Walks

10/31/2022
by   Manzil Zaheer, et al.
0

A fundamental ability of an intelligent web-based agent is seeking out and acquiring new information. Internet search engines reliably find the correct vicinity but the top results may be a few links away from the desired target. A complementary approach is navigation via hyperlinks, employing a policy that comprehends local content and selects a link that moves it closer to the target. In this paper, we show that behavioral cloning of randomly sampled trajectories is sufficient to learn an effective link selection policy. We demonstrate the approach on a graph version of Wikipedia with 38M nodes and 387M edges. The model is able to efficiently navigate between nodes 5 and 20 steps apart 96 embeddings and policy in downstream fact verification and question answering tasks where, in combination with basic TF-IDF search and ranking methods, they are competitive results to the state-of-the-art methods.

READ FULL TEXT

page 18

page 20

research
04/21/2020

A Deeper Investigation of the Importance of Wikipedia Links to the Success of Search Engines

A growing body of work has highlighted the important role that Wikipedia...
research
05/10/2018

Query for Architecture, Click through Military: Comparing the Roles of Search and Navigation on Wikipedia

As one of the richest sources of encyclopedic information on the Web, Wi...
research
03/05/2015

Studying the Wikipedia Hyperlink Graph for Relatedness and Disambiguation

Hyperlinks and other relations in Wikipedia are a extraordinary resource...
research
06/20/2019

Measuring the Importance of User-Generated Content to Search Engines

Search engines are some of the most popular and profitable intelligent t...
research
05/25/2021

Graph Based Link Prediction between Human Phenotypes and Genes

Background: The learning of genotype-phenotype associations and history ...
research
04/08/2019

Revisiting EmbodiedQA: A Simple Baseline and Beyond

In Embodied Question Answering (EmbodiedQA), an agent interacts with an ...
research
02/06/2016

End-to-End Goal-Driven Web Navigation

We propose a goal-driven web navigation as a benchmark task for evaluati...

Please sign up or login with your details

Forgot password? Click here to reset