Linguistic Cues of Deception in a Multilingual April Fools' Day Context

11/06/2021
by Katerina Papantoniou, et al.

In this work we consider the collection of deceptive April Fools' Day (AFD) news articles as a useful addition to existing datasets for deception detection tasks. Such collections have an established ground truth and are relatively easy to construct across languages. We therefore introduce a corpus of diachronic AFD and normal articles from Greek newspapers and news websites. On top of this corpus, we build a rich linguistic feature set, and analyze and compare its deception cues with those of the only AFD collection currently available, which is in English. Following a current research thread, we also discuss the individualism/collectivism dimension in deception with respect to these two datasets. Lastly, we build classifiers and test them in various monolingual and crosslingual settings. The results show that AFD datasets can be helpful in deception detection studies and align with the observations of other deception detection works.


