Overview of the Wikidata Vandalism Detection Task at WSDM Cup 2017

12/16/2017
by Stefan Heindorf, et al.

We report on the Wikidata vandalism detection task at the WSDM Cup 2017. The task received five submissions; this paper describes their evaluation and compares them to state-of-the-art baselines. Unlike previous work, we recast Wikidata vandalism detection as an online learning problem, requiring participant software to predict vandalism in near real time. The best-performing approach achieves a ROC-AUC of 0.947 at a PR-AUC of 0.458. The task was organized as a software submission task: to maximize reproducibility and to foster future research and development on this task, participants were asked to submit their working software to the TIRA experimentation platform along with the source code for open source release.
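
The two reported metrics can diverge sharply when the positive class is rare, which is why a high ROC-AUC can sit alongside a much lower PR-AUC. The sketch below shows one common way to compute both from scratch. It is an illustration, not the task's official evaluation code: the labels and scores are made-up toy data, the function names are hypothetical, and PR-AUC is approximated here by average precision, which may differ from the exact definition used in the paper.

```python
def roc_auc(labels, scores):
    """ROC-AUC as the probability that a random positive is scored
    above a random negative (ties count as half a win)."""
    pos = [s for y, s in zip(labels, scores) if y == 1]
    neg = [s for y, s in zip(labels, scores) if y == 0]
    wins = sum((p > n) + 0.5 * (p == n) for p in pos for n in neg)
    return wins / (len(pos) * len(neg))


def average_precision(labels, scores):
    """Average precision: mean of the precision values measured at the
    rank of each positive, a step-wise estimate of the PR curve area."""
    ranked = sorted(zip(scores, labels), key=lambda pair: -pair[0])
    hits, total = 0, 0.0
    for rank, (_, y) in enumerate(ranked, start=1):
        if y == 1:
            hits += 1
            total += hits / rank
    return total / hits


# Toy data (assumed for illustration): 1 = vandalism, 0 = regular edit.
labels = [0, 0, 1, 0, 1, 0, 0, 0]
scores = [0.10, 0.40, 0.35, 0.20, 0.80, 0.05, 0.30, 0.15]

print("ROC-AUC:", roc_auc(labels, scores))
print("Average precision:", average_precision(labels, scores))
```

On this toy data a single regular edit ranked above a vandalism edit lowers average precision more than it lowers ROC-AUC, because precision is computed only over the top of the ranking while ROC-AUC averages over all positive-negative pairs.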

