Marmara Turkish Coreference Corpus and Coreference Resolution Baseline

06/06/2017
by   Peter Schüller, et al.
0

We describe the Marmara Turkish Coreference Corpus, which is an annotation of the whole METU-Sabanci Turkish Treebank with mentions and coreference chains. Collecting nine or more independent annotations for each document allowed for fully automatic adjudication. We provide a baseline system for Turkish mention detection and coreference resolution and evaluate it on the corpus.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
12/14/2022

Quotations, Coreference Resolution, and Sentiment Annotations in Croatian News Articles: An Exploratory Study

This paper presents a corpus annotated for the task of direct-speech ext...
research
10/07/2022

Longtonotes: OntoNotes with Longer Coreference Chains

Ontonotes has served as the most important benchmark for coreference res...
research
05/21/2021

CEREC: A Corpus for Entity Resolution in Email Conversations

We present the first large scale corpus for entity resolution in email c...
research
05/17/2020

LiSSS: A toy corpus of Spanish Literary Sentences for Emotions detection

In this work we present a new small data-set in Computational Creativity...
research
01/25/2022

The Text Anonymization Benchmark (TAB): A Dedicated Corpus and Evaluation Framework for Text Anonymization

We present a novel benchmark and associated evaluation metrics for asses...
research
10/12/2021

Anatomy of OntoGUM–Adapting GUM to the OntoNotes Scheme to Evaluate Robustness of SOTA Coreference Algorithms

SOTA coreference resolution produces increasingly impressive scores on t...
research
06/02/2021

OntoGUM: Evaluating Contextualized SOTA Coreference Resolution on 12 More Genres

SOTA coreference resolution produces increasingly impressive scores on t...

Please sign up or login with your details

Forgot password? Click here to reset