Quotations, Coreference Resolution, and Sentiment Annotations in Croatian News Articles: An Exploratory Study

12/14/2022
by   Jelena Sarajlić, et al.
0

This paper presents a corpus annotated for the task of direct-speech extraction in Croatian. The paper focuses on the annotation of the quotation, co-reference resolution, and sentiment annotation in SETimes news corpus in Croatian and on the analysis of its language-specific differences compared to English. From this, a list of the phenomena that require special attention when performing these annotations is derived. The generated corpus with quotation features annotations can be used for multiple tasks in the field of Natural Language Processing.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
07/04/2018

BCSAT : A Benchmark Corpus for Sentiment Analysis in Telugu Using Word-level Annotations

The presented work aims at generating a systematically annotated corpus ...
research
06/06/2017

Marmara Turkish Coreference Corpus and Coreference Resolution Baseline

We describe the Marmara Turkish Coreference Corpus, which is an annotati...
research
02/08/2023

NewsComp: Facilitating Diverse News Reading through Comparative Annotation

To support efficient, balanced news consumption, merging articles from d...
research
04/08/2022

CrudeOilNews: An Annotated Crude Oil News Corpus for Event Extraction

In this paper, we present CrudeOilNews, a corpus of English Crude Oil ne...
research
06/18/2020

AMALGUM – A Free, Balanced, Multilayer English Web Corpus

We present a freely available, genre-balanced English web corpus totalin...
research
05/05/2022

CATs are Fuzzy PETs: A Corpus and Analysis of Potentially Euphemistic Terms

Euphemisms have not received much attention in natural language processi...
research
08/13/2021

MIND - Mainstream and Independent News Documents Corpus

This paper presents and characterizes MIND, a new Portuguese corpus comp...

Please sign up or login with your details

Forgot password? Click here to reset