Harnessing Historical Corrections to build Test Collections for Named Entity Disambiguation

08/27/2018
by Florian Reitz, et al.

Matching mentions of persons to the actual persons (the name disambiguation problem) is central to several digital library applications. Researchers have worked on algorithms for this matching for decades without finding a universal solution. One obstacle is that test collections for this problem are often small and specific to a single collection. In this work, we present an approach that creates large test collections from historical metadata at minimal extra cost. We apply this approach to the DBLP collection to generate two freely available test collections: one focuses on the properties of defects, the other on the evaluation of disambiguation algorithms.
