Proceedings of the WSDM Cup 2017: Vandalism Detection and Triple Scoring

by Martin Potthast, et al.

The WSDM Cup 2017 was a data mining challenge held in conjunction with the 10th International Conference on Web Search and Data Mining (WSDM). It addressed two key challenges of today's knowledge bases: quality assurance and entity search. For quality assurance, we tackle the task of vandalism detection, based on a dataset of more than 82 million user-contributed revisions of the Wikidata knowledge base, all of which are annotated as to whether or not they are vandalism. For entity search, we tackle the task of triple scoring, using a dataset that comprises relevance scores for triples from type-like relations, including occupation and country of citizenship, based on about 10,000 human relevance judgements. For the sake of reproducibility, participants were asked to submit their software to TIRA, a cloud-based evaluation platform, and were incentivized to share their approaches as open source.


