MultiEM: Efficient and Effective Unsupervised Multi-Table Entity Matching

08/02/2023
by   Xiaocan Zeng, et al.
0

Entity Matching (EM), which aims to identify all entity pairs referring to the same real-world entity from relational tables, is one of the most important tasks in real-world data management systems. Due to the labeling process of EM being extremely labor-intensive, unsupervised EM is more applicable than supervised EM in practical scenarios. Traditional unsupervised EM assumes that all entities come from two tables; however, it is more common to match entities from multiple tables in practical applications, that is, multi-table entity matching (multi-table EM). Unfortunately, effective and efficient unsupervised multi-table EM remains under-explored. To fill this gap, this paper formally studies the problem of unsupervised multi-table entity matching and proposes an effective and efficient solution, termed as MultiEM. MultiEM is a parallelable pipeline of enhanced entity representation, table-wise hierarchical merging, and density-based pruning. Extensive experimental results on six real-world benchmark datasets demonstrate the superiority of MultiEM in terms of effectiveness and efficiency.

READ FULL TEXT
research
07/11/2022

PromptEM: Prompt-tuning for Low-resource Generalized Entity Matching

Entity Matching (EM), which aims to identify whether two entity records ...
research
06/15/2021

Machamp: A Generalized Entity Matching Benchmark

Entity Matching (EM) refers to the problem of determining whether two di...
research
05/12/2022

Bridging the Gap between Reality and Ideality of Entity Matching: A Revisiting and Benchmark Re-Construction

Entity matching (EM) is the most critical step for entity resolution (ER...
research
07/06/2023

Through the Fairness Lens: Experimental Analysis and Evaluation of Entity Matching

Entity matching (EM) is a challenging problem studied by different commu...
research
06/08/2021

Interpretable and Low-Resource Entity Matching via Decoupling Feature Learning from Decision Making

Entity Matching (EM) aims at recognizing entity records that denote the ...
research
11/13/2022

Ground Truth Inference for Weakly Supervised Entity Matching

Entity matching (EM) refers to the problem of identifying pairs of data ...
research
06/10/2022

Machop: an End-to-End Generalized Entity Matching Framework

Real-world applications frequently seek to solve a general form of the E...

Please sign up or login with your details

Forgot password? Click here to reset