Improving the Results of Machine-based Entity Resolution with Limited Human Effort: A Risk Perspective

05/31/2018
by   Zhaoqiang Chen, et al.
0

Pure machine-based solutions usually struggle in challenging classification tasks such as entity resolution (ER). To alleviate this problem, a recent trend is to involve humans in the resolution process, most notably the crowdsourcing approach. However, it remains very challenging to find a solution that can effectively improve the quality of entity resolution with limited human effort. In this position paper, we investigate the problem of human and machine cooperation for ER from a risk perspective. We propose to select for manual verification the machine-labeled results at high risk of being mislabeled. We present a risk model for this task that takes into consideration the human-labeled results as well as the output of the machine's resolution. Finally, our experiments on real data demonstrate that the proposed risk model picks up the mislabeled instances with considerably higher accuracy than the existing alternatives. Provided with the same human cost budget, it also achieves consistently better resolution quality than the state-of-the-art approach based on active learning.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
05/31/2018

Improving Machine-based Entity Resolution with Limited Human Effort: A Risk Perspective

Pure machine-based solutions usually struggle in the challenging classif...
research
03/15/2018

r-HUMO: A Risk-Aware Human-Machine Cooperation Framework for Entity Resolution with Quality Guarantees

Even though many approaches have been proposed for entity resolution (ER...
research
12/06/2019

Towards Interpretable and Learnable Risk Analysis for Entity Resolution

Machine-learning-based entity resolution has been widely studied. Howeve...
research
12/23/2020

Active Deep Learning on Entity Resolution by Risk Sampling

While the state-of-the-art performance on entity resolution (ER) has bee...
research
10/29/2018

Gradual Machine Learning for Entity Resolution

Usually considered as a classification problem, entity resolution can be...
research
09/30/2017

Enabling Quality Control for Entity Resolution: A Human and Machine Cooperative Framework

Even though many machine algorithms have been proposed for entity resolu...
research
03/15/2018

i-HUMO: An Interactive Human and Machine Cooperation Framework for Entity Resolution with Quality Guarantees

Even though many approaches have been proposed for entity resolution (ER...

Please sign up or login with your details

Forgot password? Click here to reset