Bipartite Graph Matching Algorithms for Clean-Clean Entity Resolution: An Empirical Evaluation

12/28/2021
by George Papadakis, et al.

Entity Resolution (ER) is the task of finding records that refer to the same real-world entities. A common scenario arises when entities across two clean sources need to be resolved, which we refer to as Clean-Clean ER. In this paper, we perform an extensive empirical evaluation of 8 bipartite graph matching algorithms that take a bipartite similarity graph as input and produce a set of matched entities as output. We consider a wide range of matching algorithms, including algorithms that have not previously been applied to ER or that have been evaluated only in other ER settings. We assess the relative performance of the algorithms with respect to accuracy and time efficiency over 10 established, real datasets, from which we extract more than 700 different similarity graphs. Our results provide insights into the relative performance of these algorithms and guidelines for choosing the best one, depending on the data at hand.
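
To make the input and output described above concrete, the following is a minimal sketch of one simple strategy: a greedy one-to-one matching over a weighted bipartite similarity graph. It is an illustration only and is not claimed to be any of the 8 algorithms evaluated in the paper. The function name `greedy_one_to_one_matching`, the edge-list representation, the `threshold` parameter and the toy similarity scores are all assumptions made for this example.

```python
from typing import Dict, List, Tuple

# Hypothetical edge representation: (left_id, right_id, similarity in [0, 1]).
Edge = Tuple[str, str, float]


def greedy_one_to_one_matching(edges: List[Edge], threshold: float = 0.5) -> Dict[str, str]:
    """Greedily pair left and right entities, highest similarity first.

    Each entity is matched at most once, which reflects the Clean-Clean
    assumption that each source is duplicate-free.
    """
    matched_left: Dict[str, str] = {}
    used_right = set()
    # Visit edges in decreasing order of similarity.
    for left, right, sim in sorted(edges, key=lambda e: e[2], reverse=True):
        if sim < threshold:
            break  # all remaining edges are below the threshold
        if left in matched_left or right in used_right:
            continue  # one endpoint is already matched
        matched_left[left] = right
        used_right.add(right)
    return matched_left


# Toy similarity graph between two clean sources (made-up scores).
edges = [
    ("a1", "b1", 0.90),
    ("a1", "b2", 0.80),
    ("a2", "b1", 0.85),
    ("a2", "b2", 0.70),
    ("a3", "b3", 0.30),
]
matches = greedy_one_to_one_matching(edges, threshold=0.5)
print(matches)
```

A greedy pass like this is cheap (dominated by sorting the edges) but does not guarantee a maximum-weight matching; trade-offs of this kind between accuracy and run time are what an empirical comparison of matching algorithms is meant to expose.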

research
08/23/2022

FlexER: Flexible Entity Resolution for Multiple Intents

Entity resolution, a longstanding problem of data cleaning and integrati...
research
02/03/2014

Principled Graph Matching Algorithms for Integrating Multiple Data Sources

This paper explores combinatorial optimization for problems of max-weigh...
research
10/21/2021

Online Bipartite Matching with Predicted Degrees

We propose a model for online graph problems where algorithms are given ...
research
05/15/2019

MinoanER: Schema-Agnostic, Non-Iterative, Massively Parallel Resolution of Web Entities

Entity Resolution (ER) aims to identify different descriptions in variou...
research
08/08/2023

A Benchmarking Study of Matching Algorithms for Knowledge Graph Entity Alignment

How to identify those equivalent entities between knowledge graphs (KGs)...
research
03/11/2018

Entity Resolution and Federated Learning get a Federated Resolution

Consider two data providers, each maintaining records of different featu...
research
12/01/2022

xEM: Explainable Entity Matching in Customer 360

Entity matching in Customer 360 is the task of determining if multiple r...