NorNE: Annotating Named Entities for Norwegian

11/27/2019
by   Fredrik Jørgensen, et al.
0

This paper presents NorNE, a manually annotated corpus of named entities which extends the annotation of the existing Norwegian Dependency Treebank. The corpus contains around 600,000 tokens taken from both varieties of written Norwegian (Bokmål and Nynorsk) and annotates a rich set of entity types including persons, organizations, locations, geo-political entities, products, and events, in addition a class corresponding to nominals derived from a name. We here present details on the annotation effort, guidelines, inter-annotator agreement and an experimental analysis of the corpus using a neural sequence labeling architecture.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
05/19/2022

Wojood: Nested Arabic Named Entity Corpus and Recognition using BERT

This paper presents Wojood, a corpus for Arabic nested Named Entity Reco...
research
05/22/2023

Aligning the Norwegian UD Treebank with Entity and Coreference Information

This paper presents a merged collection of entity and coreference annota...
research
08/12/2020

The Annotation Guideline of LST20 Corpus

This report presents the annotation guideline for LST20, a large-scale c...
research
01/23/2018

What did you Mention? A Large Scale Mention Detection Benchmark for Spoken and Written Text

We describe a large, high-quality benchmark for the evaluation of Mentio...
research
10/21/2022

NEREL-BIO: A Dataset of Biomedical Abstracts Annotated with Nested Named Entities

This paper describes NEREL-BIO – an annotation scheme and corpus of PubM...
research
04/07/2020

A Corpus Study and Annotation Schema for Named Entity Recognition and Relation Extraction of Business Products

Recognizing non-standard entity types and relations, such as B2B product...
research
06/07/2022

Guidelines and a Corpus for Extracting Biographical Events

Despite biographies are widely spread within the Semantic Web, resources...

Please sign up or login with your details

Forgot password? Click here to reset