Matching Theory and Data with Personal-ITY: What a Corpus of Italian YouTube Comments Reveals About Personality

11/11/2020
by Elisa Bassignana, et al.

As a contribution to personality detection in languages other than English, we rely on distant supervision to create Personal-ITY, a novel corpus of YouTube comments in Italian in which authors are labelled with personality traits. The traits are derived from the Myers-Briggs Type Indicator (MBTI), one of the mainstream personality theories in psychology research. Using personality prediction experiments, we (i) study the task of personality prediction itself, both on our corpus and on TwiSty, a Twitter dataset also annotated with MBTI labels; and (ii) carry out an extensive, in-depth analysis of the features used by the classifier, viewing them specifically in the light of the theory that we used to create the corpus in the first place. We observe that no single model is best at personality detection, and that while some traits are easier than others to detect and to match back to theory, for other, less frequent traits the picture is much more blurred.


