ManiTweet: A New Benchmark for Identifying Manipulation of News on Social Media

05/23/2023
by Kung-Hsiang Huang, et al.

Considerable progress has been made in detecting the misrepresentation of information derived from reference articles, in both fact-checking and faithful summarization. One aspect remains unaddressed, however: identifying social media posts that manipulate information from their associated news articles. This task is challenging, primarily because personal opinions are prevalent in such posts. We present a novel task, identifying manipulation of news on social media, which aims to detect manipulation in social media posts and to identify the manipulated or inserted information. To study this task, we propose a data collection schema and curate a dataset called ManiTweet, consisting of 3.6K pairs of tweets and corresponding articles. Our analysis shows that the task is highly challenging, with large language models (LLMs) yielding unsatisfactory performance. We also develop a simple yet effective basic model that significantly outperforms LLMs on the ManiTweet dataset. Finally, we conduct an exploratory analysis of human-written tweets, which reveals connections between manipulation and the domain and factuality of news articles, and shows that manipulated sentences are more likely to encapsulate the main story or consequences of a news outlet.
