RuCoCo: a new Russian corpus with coreference annotation

06/10/2022
by Vladimir Dobrovolskii, et al.

We present the Russian Coreference Corpus (RuCoCo), a new corpus with coreference annotation. The goal of RuCoCo is to obtain a large number of annotated texts while maintaining high inter-annotator agreement. RuCoCo contains news texts in Russian. Some of the texts were annotated from scratch; for the rest, machine-generated annotations were refined by human annotators. The corpus contains one million words and around 150,000 mentions. We make the corpus publicly available.

