Wikipedia Vandalism Detection Through Machine Learning: Feature Review and New Proposals: Lab Report for PAN at CLEF 2010

10/19/2012
by Santiago M. Mola-Velasco, et al.

Wikipedia is an online encyclopedia that anyone can edit. In this open model, some people edit with the intent of harming the integrity of Wikipedia; this is known as vandalism. We extend the framework presented in (Potthast, Stein, and Gerling, 2008) for Wikipedia vandalism detection. In this approach, several vandalism-indicating features are extracted from edits in a vandalism corpus and fed to a supervised learning algorithm. The best-performing classifiers were LogitBoost and Random Forest. Our classifier, a Random Forest, obtained an AUC of 0.92236, ranking first in the PAN'10 Wikipedia vandalism detection task.
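
The pipeline described above (extract features from each edit, score the edit, evaluate by AUC) can be illustrated with a minimal Python sketch. It is not the authors' implementation: the three features, the `VULGAR_WORDS` list, and the function names are hypothetical stand-ins chosen for illustration, and no Random Forest is trained here. Only the AUC computation follows the standard definition, namely the probability that a randomly chosen vandalism edit receives a higher score than a randomly chosen regular edit, with ties counted as one half.

```python
from typing import Dict, List

# Hypothetical word list, for illustration only (not taken from the paper).
VULGAR_WORDS = {"stupid", "idiot", "sucks"}


def extract_features(old_text: str, new_text: str) -> Dict[str, float]:
    """Compute a few illustrative features for one edit (old -> new revision)."""
    old_tokens = set(old_text.split())
    # Tokens present in the new revision but absent from the old one.
    inserted = [w for w in new_text.split() if w not in old_tokens]
    letters = [c for w in inserted for c in w if c.isalpha()]
    upper = [c for c in letters if c.isupper()]
    vulgar = sum(1 for w in inserted if w.lower().strip(".,!?") in VULGAR_WORDS)
    return {
        # Share of upper-case letters among inserted letters (0 if none inserted).
        "upper_ratio": len(upper) / len(letters) if letters else 0.0,
        # Number of inserted tokens found in the hypothetical word list.
        "vulgar_count": float(vulgar),
        # Size of the new revision relative to the old one (+1 avoids division by zero).
        "size_ratio": (len(new_text) + 1) / (len(old_text) + 1),
    }


def auc(labels: List[int], scores: List[float]) -> float:
    """Area under the ROC curve via pairwise comparison; label 1 = vandalism."""
    pos = [s for l, s in zip(labels, scores) if l == 1]
    neg = [s for l, s in zip(labels, scores) if l == 0]
    if not pos or not neg:
        raise ValueError("AUC needs at least one positive and one negative example")
    wins = 0.0
    for p in pos:
        for n in neg:
            if p > n:
                wins += 1.0
            elif p == n:
                wins += 0.5
    return wins / (len(pos) * len(neg))
```

In a real system the feature dictionaries would be turned into vectors and passed to a supervised learner such as a Random Forest, and `auc` would be applied to the classifier's predicted vandalism scores on held-out edits.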


research
07/19/2011

Edit wars in Wikipedia

We present a new, efficient method for automatically detecting severe co...
research
08/27/2016

Random Forest for Label Ranking

Label ranking aims to learn a mapping from instances to rankings over a ...
research
03/24/2022

Random Forest Regression for continuous affect using Facial Action Units

In this paper we describe our approach to the arousal and valence track ...
research
09/27/2020

Machine Learning for Searching the Dark Energy Survey for Trans-Neptunian Objects

In this paper we investigate how implementing machine learning could imp...
research
04/22/2016

Detecting state of aggression in sentences using CNN

In this article we study verbal expression of aggression and its detecti...
research
06/15/2023

Wikibio: a Semantic Resource for the Intersectional Analysis of Biographical Events

Biographical event detection is a relevant task for the exploration and ...
research
08/26/2017

Implementation and Evaluation of a Framework to calculate Impact Measures for Wikipedia Authors

Wikipedia, an open collaborative website, can be edited by anyone, even ...
