Efficient Knowledge Graph Accuracy Evaluation

07/23/2019
by   Junyang Gao, et al.

Estimating the accuracy of a large-scale knowledge graph (KG) often requires humans to annotate samples from the graph. How to obtain statistically meaningful estimates for accuracy evaluation while keeping human annotation costs low is a problem critical to the development cycle of a KG and its practical applications. Surprisingly, this challenging problem has largely been ignored in prior research. To address the problem, this paper proposes an efficient sampling and evaluation framework, which aims to provide quality accuracy evaluation with strong statistical guarantees while minimizing human effort. Motivated by the properties of the annotation cost function observed in practice, we propose the use of cluster sampling to reduce the overall cost. We further apply weighted and two-stage sampling as well as stratification for better sampling designs. We also extend our framework to enable efficient incremental evaluation on evolving KGs, introducing two solutions based on stratified sampling and a weighted variant of reservoir sampling. Extensive experiments on real-world datasets demonstrate the effectiveness and efficiency of our proposed solution. Compared to baseline approaches, our best solutions can provide up to 60% cost reduction on static KG evaluation and up to 80% cost reduction on evolving KG evaluation, without loss of evaluation quality.
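To make the sampling idea concrete, here is a minimal Python sketch of one way weighted, two-stage cluster sampling could be used to estimate KG accuracy. It is an illustration under stated assumptions, not the paper's implementation: triples are assumed to be grouped into clusters by subject entity, clusters are drawn with replacement with probability proportional to their size, at most `m` triples are annotated per drawn cluster, and the interval uses a normal approximation. The names `twcs_estimate`, `is_correct`, `n_clusters`, and `m` are hypothetical; `is_correct` stands in for a human annotator.

```python
import math
import random
from typing import Callable, Dict, List, Tuple

Triple = Tuple[str, str, str]  # (subject, predicate, object)


def twcs_estimate(
    clusters: Dict[str, List[Triple]],
    is_correct: Callable[[Triple], bool],
    n_clusters: int,
    m: int,
    rng: random.Random,
    z: float = 1.96,
) -> Tuple[float, float]:
    """Return (accuracy estimate, margin of error) for a KG.

    Stage 1: draw n_clusters entity clusters with replacement, with
             probability proportional to cluster size.
    Stage 2: annotate a simple random sample of at most m triples
             from each drawn cluster.
    """
    if n_clusters < 2:
        raise ValueError("need at least two cluster draws to estimate variance")

    ids = list(clusters)
    sizes = [len(clusters[cid]) for cid in ids]
    drawn = rng.choices(ids, weights=sizes, k=n_clusters)

    cluster_means = []
    for cid in drawn:
        triples = clusters[cid]
        sample = rng.sample(triples, min(m, len(triples)))
        correct = sum(1 for t in sample if is_correct(t))
        cluster_means.append(correct / len(sample))

    # With size-proportional draws, the plain mean of the per-cluster
    # sample accuracies is an unbiased estimate of overall accuracy.
    estimate = sum(cluster_means) / n_clusters
    variance = sum((x - estimate) ** 2 for x in cluster_means) / (n_clusters - 1)
    margin = z * math.sqrt(variance / n_clusters)
    return estimate, margin


# Toy usage: a tiny KG with one known-wrong triple (illustrative data only).
toy_kg = {
    "e1": [("e1", "bornIn", "x"), ("e1", "worksAt", "y"), ("e1", "livesIn", "z")],
    "e2": [("e2", "bornIn", "x")],
    "e3": [("e3", "worksAt", "y"), ("e3", "livesIn", "q")],
}
wrong = {("e3", "livesIn", "q")}
estimate, margin = twcs_estimate(
    toy_kg, lambda t: t not in wrong, n_clusters=30, m=2, rng=random.Random(0)
)
```

Annotating several triples about the same entity tends to be cheaper than annotating unrelated triples, which is the cost property that motivates drawing clusters first. Capping the second stage at `m` triples keeps very large entities from dominating the annotation budget.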


