Stay On-Topic: Generating Context-specific Fake Restaurant Reviews

05/07/2018
by   Mika Juuti, et al.
0

Automatically generated fake restaurant reviews are a threat to online review systems. Recent research has shown that users have difficulties in detecting machine-generated fake reviews hiding among real restaurant reviews. The method used in this work (char-LSTM ) has one drawback: it has difficulties staying in context, i.e. when it generates a review for specific target entity, the resulting review may contain phrases that are unrelated to the target, thus increasing its detectability. In this work, we present and evaluate a more sophisticated technique based on neural machine translation (NMT) with which we can generate reviews that stay on-topic. We test multiple variants of our technique using native English speakers on Amazon Mechanical Turk. We demonstrate that reviews generated by the best variant have almost optimal undetectability (class-averaged F-score 47 skeptical users and show that our method evades detection more frequently compared to the state-of-the-art (average evasion 3.2/4 vs 1.5/4) with statistical significance, at level α = 1 effective detection tools and reach average F-score of 97 these. Although fake reviews are very effective in fooling people, effective automatic detection is still feasible.

READ FULL TEXT
research
09/09/2016

Detecting Singleton Review Spammers Using Semantic Similarity

Online reviews have increasingly become a very important resource for co...
research
12/18/2022

Impact of Sentiment Analysis in Fake Review Detection

Fake review identification is an important topic and has gained the inte...
research
07/22/2019

Generating Sentiment-Preserving Fake Online Reviews Using Neural Language Models and Their Human- and Machine-based Detection

Advanced neural language models (NLMs) are widely used in sequence gener...
research
04/24/2022

Subgroup Fairness in Graph-based Spam Detection

Fake reviews are prevalent on review websites such as Amazon and Yelp. G...
research
11/29/2016

Context-aware Natural Language Generation with Recurrent Neural Networks

This paper studied generating natural languages at particular contexts o...
research
10/29/2020

Fact or Factitious? Contextualized Opinion Spam Detection

In this paper we perform an analytic comparison of a number of technique...
research
06/27/2023

Shilling Black-box Review-based Recommender Systems through Fake Review Generation

Review-Based Recommender Systems (RBRS) have attracted increasing resear...

Please sign up or login with your details

Forgot password? Click here to reset