Deep Neural Networks for Bot Detection

02/12/2018
by   Sneha Kudugunta, et al.
0

The problem of detecting bots, automated social media accounts governed by software but disguising as human users, has strong implications. For example, bots have been used to sway political elections by distorting online discourse, to manipulate the stock market, or to push anti-vaccine conspiracy theories that caused health epidemics. Most techniques proposed to date detect bots at the account level, by processing large amount of social media posts, and leveraging information from network structure, temporal dynamics, sentiment analysis, etc. In this paper, we propose a deep neural network based on contextual long short-term memory (LSTM) architecture that exploits both content and metadata to detect bots at the tweet level: contextual features are extracted from user metadata and fed as auxiliary input to LSTM deep nets processing the tweet text. Another contribution that we make is proposing a technique based on synthetic minority oversampling to generate a large labeled dataset, suitable for deep nets training, from a minimal amount of labeled data (roughly 3,000 examples of sophisticated Twitter bots). We demonstrate that, from just one single tweet, our architecture can achieve high classification accuracy (AUC > 96 separating bots from humans. We apply the same architecture to account-level bot detection, achieving nearly perfect classification accuracy (AUC > 99 previous state of the art while leveraging a small and interpretable set of features yet requiring minimal training data.

READ FULL TEXT

page 7

page 8

research
05/24/2019

Using Deep Networks and Transfer Learning to Address Disinformation

We apply an ensemble pipeline composed of a character-level convolutiona...
research
03/27/2021

LSTM Based Sentiment Analysis for Cryptocurrency Prediction

Recent studies in big data analytics and natural language processing dev...
research
02/14/2018

Generative Models for Spear Phishing Posts on Social Media

Historically, machine learning in computer security has prioritized defe...
research
10/14/2020

Learning Word Representations for Tunisian Sentiment Analysis

Tunisians on social media tend to express themselves in their local dial...
research
02/28/2020

RP-DNN: A Tweet level propagation context based deep neural networks for early rumor detection in Social Media

Early rumor detection (ERD) on social media platform is very challenging...
research
07/28/2019

Fusing location and text features for sentiment classification

Geo-tagged Twitter data has been used recently to infer insights on the ...
research
01/24/2019

Semantic Classification of Tabular Datasets via Character-Level Convolutional Neural Networks

A character-level convolutional neural network (CNN) motivated by applic...

Please sign up or login with your details

Forgot password? Click here to reset