Detecting Satire in the News with Machine Learning

10/01/2018
by Andreas Stöckl, et al.

We built models with logistic regression and linear support vector machines on a large dataset of regular news articles and articles from satirical websites. On a corpus of about 60,000 articles, such linear classifiers reach a precision of 98.7% on a random test set of the news. On the other hand, when the classifier is tested on "publication sources" that are completely unknown during training, it reaches an accuracy of only 88.2%. We also showed that the same algorithm can distinguish between news written by the news agency itself and paid articles from customers. Here the results had an accuracy of 99%.
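The gap between the two evaluation settings comes down to how the test set is drawn. The following is a minimal sketch in plain Python, not the authors' code: it contrasts a random article-level split with a split that holds out whole publication sources, and computes precision and accuracy from predicted labels. The article records, the source names, and the function names (`random_split`, `source_holdout_split`, `precision_and_accuracy`) are all hypothetical and chosen only for illustration.

```python
import random

# Hypothetical toy records; the real corpus has about 60,000 articles.
ARTICLES = [
    {"source": "news-a", "label": "news", "text": "Parliament passes budget."},
    {"source": "news-a", "label": "news", "text": "Storm closes main road."},
    {"source": "news-b", "label": "news", "text": "Team wins cup final."},
    {"source": "satire-a", "label": "satire", "text": "Cat elected mayor again."},
    {"source": "satire-a", "label": "satire", "text": "Moon files noise complaint."},
    {"source": "satire-b", "label": "satire", "text": "Man reads entire manual."},
]


def random_split(articles, test_fraction=0.2, seed=0):
    """Shuffle all articles and cut off a test portion, ignoring the source.

    Articles from the same source can land in both train and test sets.
    """
    rng = random.Random(seed)
    shuffled = list(articles)
    rng.shuffle(shuffled)
    n_test = max(1, int(len(shuffled) * test_fraction))
    return shuffled[n_test:], shuffled[:n_test]


def source_holdout_split(articles, held_out_sources):
    """Send every article from the held-out sources to the test set.

    The training set then contains no article from those sources.
    """
    held = set(held_out_sources)
    train = [a for a in articles if a["source"] not in held]
    test = [a for a in articles if a["source"] in held]
    return train, test


def precision_and_accuracy(y_true, y_pred, positive="satire"):
    """Return (precision for the positive class, overall accuracy)."""
    pairs = list(zip(y_true, y_pred))
    true_pos = sum(1 for t, p in pairs if t == positive and p == positive)
    false_pos = sum(1 for t, p in pairs if t != positive and p == positive)
    correct = sum(1 for t, p in pairs if t == p)
    predicted_pos = true_pos + false_pos
    precision = true_pos / predicted_pos if predicted_pos else 0.0
    accuracy = correct / len(pairs) if pairs else 0.0
    return precision, accuracy
```

With a random split, a classifier can pick up on source-specific vocabulary or style that also appears in the test set. Holding out whole sources removes that shortcut, which is one plausible reading of why the reported accuracy drops in the second setting.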
