NEREL: A Russian Dataset with Nested Named Entities, Relations and Events

08/30/2021
by   Natalia Loukachevitch, et al.
0

In this paper, we present NEREL, a Russian dataset for named entity recognition and relation extraction. NEREL is significantly larger than existing Russian datasets: to date it contains 56K annotated named entities and 39K annotated relations. Its important difference from previous datasets is annotation of nested named entities, as well as relations within nested entities and at the discourse level. NEREL can facilitate development of novel models that can extract relations between nested named entities, as well as relations on both sentence and document levels. NEREL also contains the annotation of events involving named entities and their roles in the events. The NEREL collection is available via https://github.com/nerel-ds/NEREL.

READ FULL TEXT

page 5

page 6

research
10/21/2022

NEREL-BIO: A Dataset of Biomedical Abstracts Annotated with Nested Named Entities

This paper describes NEREL-BIO – an annotation scheme and corpus of PubM...
research
11/05/2022

BEKG: A Built Environment Knowledge Graph

Practices in the built environment have become more digitalized with the...
research
11/29/2020

A Boundary Regressing Model for Nested Named Entity Recognition

Recognizing named entities (NEs) is commonly conducted as a classificati...
research
10/31/2022

Semantic Novelty Detection and Characterization in Factual Text Involving Named Entities

Much of the existing work on text novelty detection has been studied at ...
research
09/12/2016

Joint Extraction of Events and Entities within a Document Context

Events and entities are closely related; entities are often actors or pa...
research
04/27/2020

Automatic Textual Evidence Mining in COVID-19 Literature

We created this EVIDENCEMINER system for automatic textual evidence mini...
research
09/11/2021

Qualitative and Quantitative Analysis of Diversity in Cross-document Coreference Resolution Datasets

Cross-document coreference resolution (CDCR) datasets, such as ECB+, con...

Please sign up or login with your details

Forgot password? Click here to reset