FRACAS: A FRench Annotated Corpus of Attribution relations in newS

09/19/2023
by   Ange Richard, et al.
0

Quotation extraction is a widely useful task both from a sociological and from a Natural Language Processing perspective. However, very little data is available to study this task in languages other than English. In this paper, we present a manually annotated corpus of 1676 newswire texts in French for quotation extraction and source attribution. We first describe the composition of our corpus and the choices that were made in selecting the data. We then detail the annotation guidelines and annotation process, as well as a few statistics about the final corpus and the obtained balance between quote types (direct, indirect and mixed, which are particularly challenging). We end by detailing our inter-annotator agreement between the 8 annotators who worked on manual labelling, which is substantially high for such a difficult linguistic phenomenon.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
06/10/2022

RuCoCo: a new Russian corpus with coreference annotation

We present a new corpus with coreference annotation, Russian Coreference...
research
05/08/2022

MASALA: Modelling and Analysing the Semantics of Adpositions in Linguistic Annotation of Hindi

We present a completed, publicly available corpus of annotated semantic ...
research
04/06/2020

An Annotated Corpus of Emerging Anglicisms in Spanish Newspaper Headlines

The extraction of anglicisms (lexical borrowings from English) is releva...
research
10/02/2017

HUMOR: A Crowd-Annotated Spanish Corpus for Humor Analysis

Computational Humor, as the name implies, studies humor from a computati...
research
04/08/2022

CrudeOilNews: An Annotated Crude Oil News Corpus for Event Extraction

In this paper, we present CrudeOilNews, a corpus of English Crude Oil ne...
research
11/28/2016

Developing a cardiovascular disease risk factor annotated corpus of Chinese electronic medical records

Cardiovascular disease (CVD) has become the leading cause of death in Ch...
research
02/28/2020

Automatic Section Recognition in Obituaries

Obituaries contain information about people's values across times and cu...

Please sign up or login with your details

Forgot password? Click here to reset