A Corpus for Sentence-level Subjectivity Detection on English News Articles

05/29/2023
by   Francesco Antici, et al.
0

We present a novel corpus for subjectivity detection at the sentence level. We develop new annotation guidelines for the task, which are not limited to language-specific cues, and apply them to produce a new corpus in English. The corpus consists of 411 subjective and 638 objective sentences extracted from ongoing coverage of political affairs from online news outlets. This new resource paves the way for the development of models for subjectivity detection in English and across other languages, without relying on language-specific tools like lexicons or machine translation. We evaluate state-of-the-art multilingual transformer-based models on the task, both in mono- and cross-lingual settings, the latter with a similar existing corpus in Italian language. We observe that enriching our corpus with resources in other languages improves the results on the task.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
05/14/2018

Bianet: A Parallel News Corpus in Turkish, Kurdish and English

We present a new open-source parallel corpus consisting of news articles...
research
04/18/2017

A Broad-Coverage Challenge Corpus for Sentence Understanding through Inference

This paper introduces the Multi-Genre Natural Language Inference (MultiN...
research
10/19/2022

Leveraging a New Spanish Corpus for Multilingual and Crosslingual Metaphor Detection

The lack of wide coverage datasets annotated with everyday metaphorical ...
research
03/18/2016

Readability-based Sentence Ranking for Evaluating Text Simplification

We propose a new method for evaluating the readability of simplified sen...
research
06/05/2023

UNIDECOR: A Unified Deception Corpus for Cross-Corpus Deception Detection

Verbal deception has been studied in psychology, forensics, and computat...
research
11/06/2021

Linguistic Cues of Deception in a Multilingual April Fools' Day Context

In this work we consider the collection of deceptive April Fools' Day(AF...
research
05/27/2021

Corpus-Level Evaluation for Event QA: The IndiaPoliceEvents Corpus Covering the 2002 Gujarat Violence

Automated event extraction in social science applications often requires...

Please sign up or login with your details

Forgot password? Click here to reset