Recognizing Descriptive Wikipedia Categories for Historical Figures

04/24/2017
by Yanqing Chen, et al.

Wikipedia is a useful knowledge source that benefits many applications in language processing and knowledge representation. One of its important features is its category system: pages are assigned categories according to their contents, and these human-annotated labels can be used in information retrieval, ad hoc search improvement, entity ranking, and tag recommendation. However, important pages are usually assigned too many categories, which makes it difficult to recognize the ones that describe them best. In this paper, we propose an approach to recognize the most descriptive Wikipedia categories. We observe that historical figures in a precise category are presumably mutually similar, and that such categorical coherence can be evaluated via the texts or Wikipedia links of the category's members. We rank the descriptive level of Wikipedia categories according to their coherence, and our ranking yields an overall agreement of 88.27% with human wisdom.
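The coherence idea can be illustrated with a minimal sketch. The abstract does not specify the similarity measure, so this example assumes one plausible choice: the mean pairwise Jaccard similarity between the sets of Wikipedia links of a category's members. The function names, the category names, and the toy link sets are all hypothetical and are not taken from the paper.

```python
from itertools import combinations


def jaccard(a, b):
    """Jaccard similarity of two sets; 0.0 when both are empty."""
    union = a | b
    return len(a & b) / len(union) if union else 0.0


def category_coherence(members, links):
    """Mean pairwise Jaccard similarity of the members' link sets.

    Assumption: this stands in for the paper's coherence measure,
    which the abstract does not define.
    """
    pairs = list(combinations(members, 2))
    if not pairs:
        return 0.0  # a category with fewer than two members has no pairs
    return sum(jaccard(links[x], links[y]) for x, y in pairs) / len(pairs)


def rank_categories(categories, links):
    """Return (category, coherence) pairs, most coherent first."""
    scored = [
        (name, category_coherence(members, links))
        for name, members in categories.items()
    ]
    return sorted(scored, key=lambda item: item[1], reverse=True)


# Toy, hand-made link sets (hypothetical data, for illustration only).
links = {
    "Isaac Newton": {"Physics", "Calculus", "Royal Society"},
    "Gottfried Leibniz": {"Calculus", "Philosophy", "Royal Society"},
    "Napoleon": {"France", "Battle of Waterloo"},
}

# Hypothetical categories: one narrow, one broad.
categories = {
    "Co-inventors of calculus": ["Isaac Newton", "Gottfried Leibniz"],
    "Famous people": ["Isaac Newton", "Gottfried Leibniz", "Napoleon"],
}

ranked = rank_categories(categories, links)
for name, score in ranked:
    print(f"{name}: {score:.3f}")
```

In this toy data the narrow category scores higher than the broad one, which matches the intuition that members of a precise category resemble one another more closely. A text-based variant would replace the link sets with document vectors and the Jaccard score with, for example, cosine similarity.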


