Studying the Wikipedia Hyperlink Graph for Relatedness and Disambiguation

03/05/2015
by   Eneko Agirre, et al.
0

Hyperlinks and other relations in Wikipedia are a extraordinary resource which is still not fully understood. In this paper we study the different types of links in Wikipedia, and contrast the use of the full graph with respect to just direct links. We apply a well-known random walk algorithm on two tasks, word relatedness and named-entity disambiguation. We show that using the full graph is more effective than just direct links by a large margin, that non-reciprocal links harm performance, and that there is no benefit from categories and infoboxes, with coherent results on both tasks. We set new state-of-the-art figures for systems based on Wikipedia links, comparable to systems exploiting several information sources and/or supervised machine learning. Our approach is open source, with instruction to reproduce results, and amenable to be integrated with complementary text-based methods.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
02/12/2019

WikiLinkGraphs: A complete, longitudinal and multi-language dataset of the Wikipedia link networks

Wikipedia articles contain multiple links connecting a subject to other ...
research
11/02/2020

Analyzing Wikidata Transclusion on English Wikipedia

Wikidata is steadily becoming more central to Wikipedia, not just in mai...
research
03/15/2016

Evaluating the word-expert approach for Named-Entity Disambiguation

Named Entity Disambiguation (NED) is the task of linking a named-entity ...
research
10/31/2022

Learning to Navigate Wikipedia by Taking Random Walks

A fundamental ability of an intelligent web-based agent is seeking out a...
research
04/21/2020

A Deeper Investigation of the Importance of Wikipedia Links to the Success of Search Engines

A growing body of work has highlighted the important role that Wikipedia...
research
09/23/2020

Crosslingual Topic Modeling with WikiPDA

We present Wikipedia-based Polyglot Dirichlet Allocation (WikiPDA), a cr...
research
04/11/2017

Persian Wordnet Construction using Supervised Learning

This paper presents an automated supervised method for Persian wordnet c...

Please sign up or login with your details

Forgot password? Click here to reset