The Trumpiest Trump? Identifying a Subject's Most Characteristic Tweets

09/09/2019
by Charuta Pethe, et al.

The sequence of documents produced by any given author varies in style and content, but some documents are more typical or representative of the source than others. We quantify the extent to which a given short text is characteristic of a specific person, using a dataset of tweets from fifteen celebrities. Such analysis is useful for generating excerpts of high-volume Twitter profiles, and understanding how representativeness relates to tweet popularity. We first consider the related task of binary author detection (is x the author of text T?), and report a test accuracy of 90.37% for the best of five approaches to this problem. We then use these models to compute characterization scores among all of an author's texts. A user study shows human evaluators agree with our characterization model for all 15 celebrities in our dataset, each with p-value < 0.05. We use these classifiers to show surprisingly strong correlations between characterization scores and the popularity of the associated texts. Indeed, we demonstrate a statistically significant correlation between this score and tweet popularity (likes/replies/retweets) for 13 of the 15 celebrities in our study.
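The idea of turning a binary author detector into a characterization score can be illustrated with a minimal sketch. The abstract does not say which five models were used, so this example swaps in a simple smoothed unigram naive Bayes classifier as a stand-in; the function names (`make_scorer`, `tokenize`) and the toy strings are hypothetical placeholders, not taken from the paper or its dataset.

```python
import math
from collections import Counter


def tokenize(text):
    # Naive whitespace tokenizer; an assumption made for this sketch only.
    return text.lower().split()


def count_tokens(texts):
    counts = Counter()
    for t in texts:
        counts.update(tokenize(t))
    return counts


def make_scorer(author_texts, other_texts, alpha=1.0):
    """Return a function mapping a text to P(author | text) under a
    unigram naive Bayes model with add-alpha smoothing (a stand-in model)."""
    a = count_tokens(author_texts)
    o = count_tokens(other_texts)
    vocab = set(a) | set(o)
    a_total = sum(a.values()) + alpha * len(vocab)
    o_total = sum(o.values()) + alpha * len(vocab)
    prior = math.log(len(author_texts) / len(other_texts))

    def score(text):
        log_odds = prior
        for tok in tokenize(text):
            if tok not in vocab:
                continue  # unseen tokens carry no evidence either way
            log_odds += math.log((a[tok] + alpha) / a_total)
            log_odds -= math.log((o[tok] + alpha) / o_total)
        # Clamp to avoid overflow in exp() for extreme log-odds.
        log_odds = max(-50.0, min(50.0, log_odds))
        return 1.0 / (1.0 + math.exp(-log_odds))

    return score


# Invented placeholder texts, not real tweets.
author_texts = ["great big win today", "big great rally tonight"]
other_texts = ["new album out friday", "tour dates out now"]

score = make_scorer(author_texts, other_texts)

# Rank the author's own texts from most to least characteristic.
ranked = sorted(author_texts, key=score, reverse=True)
```

Under this sketch, the detector's probability for the target author doubles as the characterization score, so ranking an author's texts by it surfaces the most typical ones. Any classifier that outputs a probability could be substituted for the naive Bayes stand-in.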


research
07/31/2020

Forensic Writer Identification Using Microblogging Texts

Establishing the authorship of online texts is a fundamental issue to co...
research
12/31/2018

Unary and Binary Classification Approaches and their Implications for Authorship Verification

Retrieving indexed documents, not by their topical content but their wri...
research
12/20/2021

Improved Topic modeling in Twitter through Community Pooling

Social networks play a fundamental role in propagation of information an...
research
03/17/2022

Short Text Topic Modeling: Application to tweets about Bitcoin

Understanding the semantic of a collection of texts is a challenging tas...
research
09/22/2019

Adapting Language Models for Non-Parallel Author-Stylized Rewriting

Given the recent progress in language modeling using Transformer-based n...
research
08/22/2019

Gender Prediction from Tweets: Improving Neural Representations with Hand-Crafted Features

Author profiling is the characterization of an author through some key a...
research
07/19/2022

Can You Fool AI by Doing a 180? – A Case Study on Authorship Analysis of Texts by Arata Osada

This paper is our attempt at answering a twofold question covering the a...
