Can Who-Edits-What Predict Edit Survival?

01/12/2018
by   Ali Batuhan Yardım, et al.
0

The Internet has enabled the emergence of massive online collaborative projects. As the number of contributors to these projects grows, it becomes increasingly important to understand and predict whether the edits that users make will eventually impact the project positively. Existing solutions either rely on a user reputation system or consist of a highly-specialized predictor tailored to a specific peer-production system. In this work, we explore a different point in the solution space, which does not involve any content-based feature of the edits. To this end, we formulate a statistical model of edit outcomes. We view each edit as a game between the editor and the component of the project. We posit that the probability of a positive outcome is a function of the editor's skill, of the difficulty of editing the component and of a user-component interaction term. Our model is broadly applicable, as it only requires observing data about who makes an edit, what the edit affects and whether the edit survives or not. Then, we consider Wikipedia and the Linux kernel, two examples of large-scale collaborative projects, and we seek to understand whether this simple model can effectively predict edit survival: in both cases, we provide a positive answer. Our approach significantly outperforms those based solely on user reputation and bridges the gap with specialized predictors that use content-based features. Furthermore, inspecting the model parameters enables us to discover interesting structure in the data. Our method is simple to implement, computationally inexpensive, and it produces interpretable results; as such, we believe that it is a valuable tool to analyze collaborative systems.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
06/03/2017

Wikipedia Vandal Early Detection: from User Behavior to User Embedding

Wikipedia is the largest online encyclopedia that allows anyone to edit ...
research
07/19/2011

Edit wars in Wikipedia

We present a new, efficient method for automatically detecting severe co...
research
10/27/2018

Fabrik: An Online Collaborative Neural Network Editor

We present Fabrik, an online neural network editor that provides tools t...
research
07/25/2022

Overwatch: Learning Patterns in Code Edit Sequences

Integrated Development Environments (IDEs) provide tool support to autom...
research
03/10/2018

Learning Quick Fixes from Code Repositories

Code analyzers such as ErrorProne and FindBugs detect code patterns symp...
research
10/07/2022

Automatic Prediction of Rejected Edits in Stack Overflow

The content quality of shared knowledge in Stack Overflow (SO) is crucia...
research
01/17/2018

Interactive in-base street model edit: how common GIS software and a database can serve as a custom Graphical User Interface

Our modern world produces an increasing quantity of data, and especially...

Please sign up or login with your details

Forgot password? Click here to reset