From Appearance to Essence: Comparing Truth Discovery Methods without Using Ground Truth

08/07/2017
by   Xiu Susie Fang, et al.
0

Truth discovery has been widely studied in recent years as a fundamental means for resolving the conflicts in multi-source data. Although many truth discovery methods have been proposed based on different considerations and intuitions, investigations show that no single method consistently outperforms the others. To select the right truth discovery method for a specific application scenario, it becomes essential to evaluate and compare the performance of different methods. A drawback of current research efforts is that they commonly assume the availability of certain ground truth for the evaluation of methods. However, the ground truth may be very limited or even out-of-reach in practice, rendering the evaluation biased by the small ground truth or even unfeasible. In this paper, we present CompTruthHyp, a general approach for comparing the performance of truth discovery methods without using ground truth. In particular, our approach calculates the probability of observations in a dataset based on the output of different methods. The probability is then ranked to reflect the performance of these methods. We review and compare twelve existing truth discovery methods and consider both single-valued and multi-valued objects. Empirical studies on both real-world and synthetic datasets demonstrate the effectiveness of our approach for comparing truth discovery methods.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
08/07/2017

SmartMTD: A Graph-Based Approach for Effective Multi-Truth Discovery

The Big Data era features a huge amount of data that are contributed by ...
research
04/26/2019

Synthetic Ground Truth Generation for Evaluating Generative Policy Models

Generative Policy-based Models aim to enable a coalition of systems, be ...
research
05/30/2018

Rehabilitating the Color Checker Dataset for Illuminant Estimation

In a previous work, it was shown that there is a curious problem with th...
research
01/19/2022

CPTAM: Constituency Parse Tree Aggregation Method

Diverse Natural Language Processing tasks employ constituency parsing to...
research
11/07/2016

Truth Discovery with Memory Network

Truth discovery is to resolve conflicts and find the truth from multiple...
research
10/03/2021

Unsupervised paradigm for information extraction from transcripts using BERT

Audio call transcripts are one of the valuable sources of information fo...
research
08/14/2020

Continuous Optimization Benchmarks by Simulation

Benchmark experiments are required to test, compare, tune, and understan...

Please sign up or login with your details

Forgot password? Click here to reset