An Analysis of the Semantic Annotation Task on the Linked Data Cloud

11/13/2018
by Gagnon Michel, et al.

Semantic annotation, the process of identifying key phrases in text and linking them to concepts in a knowledge base, is an important basis for semantic information retrieval and for uptake of the Semantic Web. Despite the emergence of semantic annotation systems, very few comparative studies of their performance have been published. In this paper, we evaluate existing systems on three tasks: full semantic annotation, named entity recognition, and keyword detection. More specifically, spotting (recognizing relevant surface forms in text) is evaluated for all three tasks, whereas disambiguation (correctly associating a Wikipedia or DBpedia entity with each spotted surface form) is evaluated only for the first two. Our evaluation is twofold. First, we compute standard precision and recall on the output of semantic annotators over diverse datasets, each best suited to one of the identified tasks. Second, we build a statistical model using logistic regression to identify significant performance differences. Our results show that systems providing full annotation perform better than named entity annotators and keyword extractors on all three tasks. However, there is still much room for improvement in identifying the most relevant entities described in a text.
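
To make the distinction between spotting and disambiguation concrete, the following is a minimal sketch of how precision and recall might be computed for each. It is an illustration only, not the paper's evaluation code: the exact-match criterion, the `(start, end, entity)` tuple layout, the function names, and the toy annotations are all assumptions introduced here.

```python
from typing import Set, Tuple

# Hypothetical representation: (start_offset, end_offset, linked_entity).
Annotation = Tuple[int, int, str]


def precision_recall(predicted: Set, gold: Set) -> Tuple[float, float]:
    """Exact-match precision and recall between two sets of items."""
    true_positives = len(predicted & gold)
    precision = true_positives / len(predicted) if predicted else 0.0
    recall = true_positives / len(gold) if gold else 0.0
    return precision, recall


def spans(annotations: Set[Annotation]) -> Set[Tuple[int, int]]:
    """Drop the entity so that only the spotted surface-form offsets remain."""
    return {(start, end) for start, end, _ in annotations}


# Toy data (invented for illustration, not taken from any dataset).
gold = {
    (0, 5, "dbpedia:Paris"),
    (10, 16, "dbpedia:France"),
    (20, 25, "dbpedia:Seine"),
}
pred = {
    (0, 5, "dbpedia:Paris"),             # correct span, correct entity
    (10, 16, "dbpedia:French_language"), # correct span, wrong entity
    (30, 34, "dbpedia:Lyon"),            # span not in the gold standard
}

# Spotting: only the offsets have to match.
spot_p, spot_r = precision_recall(spans(pred), spans(gold))

# Full annotation: offsets and linked entity both have to match.
full_p, full_r = precision_recall(pred, gold)

print(f"spotting:        P={spot_p:.2f} R={spot_r:.2f}")
print(f"full annotation: P={full_p:.2f} R={full_r:.2f}")
```

In this toy case two of the three predicted spans match the gold standard, but only one also carries the right entity, so the full-annotation scores are lower than the spotting scores. A real evaluation might well use a looser criterion, such as partial span overlap, in place of exact match.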

