UNIDECOR: A Unified Deception Corpus for Cross-Corpus Deception Detection

06/05/2023
by   Aswathy Velutharambath, et al.
4

Verbal deception has been studied in psychology, forensics, and computational linguistics for a variety of reasons, like understanding behaviour patterns, identifying false testimonies, and detecting deception in online communication. Varying motivations across research fields lead to differences in the domain choices to study and in the conceptualization of deception, making it hard to compare models and build robust deception detection systems for a given language. With this paper, we improve this situation by surveying available English deception datasets which include domains like social media reviews, court testimonials, opinion statements on specific topics, and deceptive dialogues from online strategy games. We consolidate these datasets into a single unified corpus. Based on this resource, we conduct a correlation analysis of linguistic cues of deception across datasets to understand the differences and perform cross-corpus modeling experiments which show that a cross-domain generalization is challenging to achieve. The unified deception corpus (UNIDECOR) can be obtained from https://www.ims.uni-stuttgart.de/data/unidecor.

READ FULL TEXT

page 6

page 8

page 12

page 13

research
02/02/2019

Making a Case for Social Media Corpus for Detecting Depression

The social media platform provides an opportunity to gain valuable insig...
research
05/29/2023

A Corpus for Sentence-level Subjectivity Detection on English News Articles

We present a novel corpus for subjectivity detection at the sentence lev...
research
09/02/2015

Analysis of Communication Pattern with Scammers in Enron Corpus

This paper is an exploratory analysis into fraud detection taking Enron ...
research
11/30/2022

RAFT: Rationale adaptor for few-shot abusive language detection

Abusive language is a concerning problem in online social media. Past re...
research
09/15/2017

Creating and Characterizing a Diverse Corpus of Sarcasm in Dialogue

The use of irony and sarcasm in social media allows us to study them at ...
research
05/31/2023

Guiding Computational Stance Detection with Expanded Stance Triangle Framework

Stance detection determines whether the author of a piece of text is in ...

Please sign up or login with your details

Forgot password? Click here to reset