The 2021 Urdu Fake News Detection Task using Supervised Machine Learning and Feature Combinations

04/06/2022
by   Muhammad Humayoun, et al.

This paper describes the system submitted to the FIRE Shared Task "The 2021 Fake News Detection in the Urdu Language". The challenge aims at automatically identifying fake news written in Urdu. Our submitted results ranked fifth in the competition. After the competition results were announced, however, we obtained better results than those we had submitted. The best macro F1 score achieved by one of our models is 0.6674, which is higher than the second-best score in the competition. This result was obtained with a Support Vector Machine (polynomial kernel of degree 1), with stopwords removed and lemmatization applied, using the 20K best features selected from a total of 1.557 million features (produced by word n-grams with n = 1, 2, 3, 4 and character n-grams with n = 2, 3, 4, 5, 6). The code is made available for reproducibility.
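The feature combination named in the abstract can be illustrated with a minimal sketch. It is not the authors' released code: it assumes plain whitespace tokenization, skips stopword removal and lemmatization, and does not cover feature selection or the SVM. The function names and the `w:` / `c:` prefixes are hypothetical, chosen only to keep word and character n-grams apart in one feature dictionary.

```python
from collections import Counter

# n-gram orders taken from the abstract; everything else here is illustrative.
WORD_N = (1, 2, 3, 4)
CHAR_N = (2, 3, 4, 5, 6)


def word_ngrams(tokens, n_values=WORD_N):
    """Yield word n-grams as space-joined strings, prefixed with 'w:'."""
    for n in n_values:
        for i in range(len(tokens) - n + 1):
            yield "w:" + " ".join(tokens[i:i + n])


def char_ngrams(text, n_values=CHAR_N):
    """Yield character n-grams (spaces included), prefixed with 'c:'."""
    for n in n_values:
        for i in range(len(text) - n + 1):
            yield "c:" + text[i:i + n]


def extract_features(text):
    """Count word and character n-grams of one document in a single Counter."""
    tokens = text.split()  # assumption: whitespace tokenization
    counts = Counter(word_ngrams(tokens))
    counts.update(char_ngrams(text))
    return counts


if __name__ == "__main__":
    features = extract_features("یہ خبر جعلی ہے")
    print(len(features), "distinct n-gram features in the sample sentence")
```

Applied to a whole corpus, the union of such feature names is what grows into a very large vocabulary (1.557 million features in the paper), which is why a selection step that keeps only the highest-scoring features precedes the classifier.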


