ClusterEA: Scalable Entity Alignment with Stochastic Training and Normalized Mini-batch Similarities

05/20/2022
by   Yunjun Gao, et al.
5

Entity alignment (EA) aims at finding equivalent entities in different knowledge graphs (KGs). Embedding-based approaches have dominated the EA task in recent years. Those methods face problems that come from the geometric properties of embedding vectors, including hubness and isolation. To solve these geometric problems, many normalization approaches have been adopted for EA. However, the increasing scale of KGs renders it hard for EA models to adopt the normalization processes, thus limiting their usage in real-world applications. To tackle this challenge, we present ClusterEA, a general framework that is capable of scaling up EA models and enhancing their results by leveraging normalization methods on mini-batches with a high entity equivalent rate. ClusterEA contains three components to align entities between large-scale KGs, including stochastic training, ClusterSampler, and SparseFusion. It first trains a large-scale Siamese GNN for EA in a stochastic fashion to produce entity embeddings. Based on the embeddings, a novel ClusterSampler strategy is proposed for sampling highly overlapped mini-batches. Finally, ClusterEA incorporates SparseFusion, which normalizes local and global similarity and then fuses all similarity matrices to obtain the final similarity matrix. Extensive experiments with real-life datasets on EA benchmarks offer insight into the proposed framework, and suggest that it is capable of outperforming the state-of-the-art scalable EA framework by up to 8 times in terms of Hits@1.

READ FULL TEXT

page 3

page 4

page 5

page 6

page 7

page 8

page 9

page 10

research
08/11/2021

LargeEA: Aligning Entities for Large-scale Knowledge Graphs

Entity alignment (EA) aims to find equivalent entities in different know...
research
06/06/2019

Multi-view Knowledge Graph Embedding for Entity Alignment

We study the problem of embedding-based entity alignment between knowled...
research
03/10/2020

A Benchmarking Study of Embedding-based Entity Alignment for Knowledge Graphs

Entity alignment seeks to find entities in different knowledge graphs (K...
research
07/12/2023

An Effective and Efficient Time-aware Entity Alignment Framework via Two-aspect Three-view Label Propagation

Entity alignment (EA) aims to find the equivalent entity pairs between d...
research
03/07/2022

Deep Reinforcement Learning for Entity Alignment

Embedding-based methods have attracted increasing attention in recent en...
research
04/10/2023

Investigating Graph Structure Information for Entity Alignment with Dangling Cases

Entity alignment (EA) aims to discover the equivalent entities in differ...
research
08/22/2022

High-quality Task Division for Large-scale Entity Alignment

Entity Alignment (EA) aims to match equivalent entities that refer to th...

Please sign up or login with your details

Forgot password? Click here to reset