Bengali Fake Reviews: A Benchmark Dataset and Detection System

08/03/2023
by   G. M. Shahariar, et al.
0

The proliferation of fake reviews on various online platforms has created a major concern for both consumers and businesses. Such reviews can deceive customers and cause damage to the reputation of products or services, making it crucial to identify them. Although the detection of fake reviews has been extensively studied in English language, detecting fake reviews in non-English languages such as Bengali is still a relatively unexplored research area. This paper introduces the Bengali Fake Review Detection (BFRD) dataset, the first publicly available dataset for identifying fake reviews in Bengali. The dataset consists of 7710 non-fake and 1339 fake food-related reviews collected from social media posts. To convert non-Bengali words in a review, a unique pipeline has been proposed that translates English words to their corresponding Bengali meaning and also back transliterates Romanized Bengali to Bengali. We have conducted rigorous experimentation using multiple deep learning and pre-trained transformer language models to develop a reliable detection system. Finally, we propose a weighted ensemble model that combines four pre-trained transformers: BanglaBERT, BanglaBERT Base, BanglaBERT Large, and BanglaBERT Generator . According to the experiment results, the proposed ensemble model obtained a weighted F1-score of 0.9843 on 13390 reviews, including 1339 actual fake reviews and 5356 augmented fake reviews generated with the nlpaug library. The remaining 6695 reviews were randomly selected from the 7710 non-fake instances. The model achieved a 0.9558 weighted F1-score when the fake reviews were augmented using the bnaug library.

READ FULL TEXT

page 26

page 29

research
12/29/2021

Fake or Genuine? Contextualised Text Representation for Fake Review Detection

Online reviews have a significant influence on customers' purchasing dec...
research
02/10/2023

Combat AI With AI: Counteract Machine-Generated Fake Restaurant Reviews on Social Media

Recent advances in generative models such as GPT may be used to fabricat...
research
04/19/2023

Catch Me If You Can: Identifying Fraudulent Physician Reviews with Large Language Models Using Generative Pre-Trained Transformers

The proliferation of fake reviews of doctors has potentially detrimental...
research
04/05/2023

Bengali Fake Review Detection using Semi-supervised Generative Adversarial Networks

This paper investigates the potential of semi-supervised Generative Adve...
research
01/09/2023

Online Fake Review Detection Using Supervised Machine Learning And BERT Model

Online shopping stores have grown steadily over the past few years. Due ...
research
02/26/2020

Fake Review Detection Using Behavioral and Contextual Features

User reviews reflect significant value of product in the world of e-mark...
research
01/08/2023

Mitigating Human and Computer Opinion Fraud via Contrastive Learning

We introduce the novel approach towards fake text reviews detection in c...

Please sign up or login with your details

Forgot password? Click here to reset