Evaluating Language Models for Knowledge Base Completion

03/20/2023
by   Blerta Veseli, et al.
0

Structured knowledge bases (KBs) are a foundation of many intelligent applications, yet are notoriously incomplete. Language models (LMs) have recently been proposed for unsupervised knowledge base completion (KBC), yet, despite encouraging initial results, questions regarding their suitability remain open. Existing evaluations often fall short because they only evaluate on popular subjects, or sample already existing facts from KBs. In this work, we introduce a novel, more challenging benchmark dataset, and a methodology tailored for a realistic assessment of the KBC potential of LMs. For automated assessment, we curate a dataset called WD-KNOWN, which provides an unbiased random sample of Wikidata, containing over 3.9 million facts. In a second step, we perform a human evaluation on predictions that are not yet in the KB, as only this provides real insights into the added value over existing KBs. Our key finding is that biases in dataset conception of previous benchmarks lead to a systematic overestimate of LM performance for KBC. However, our results also reveal strong areas of LMs. We could, for example, perform a significant completion of Wikidata on the relations nativeLanguage, by a factor of  21 (from 260k to 5.8M) at 82 2.1M to 6.6M) at 82 5.3M) at 90 generalization capabilities: even on relations where most facts were not directly observed in LM training, prediction quality can be high.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
10/17/2018

Do Embedding Models Perform Well for Knowledge Base Completion?

In this work, we put into question the effectiveness of the evaluation m...
research
08/30/2021

Knowledge Base Completion Meets Transfer Learning

The aim of knowledge base completion is to predict unseen facts from exi...
research
05/14/2023

FactKB: Generalizable Factuality Evaluation using Language Models Enhanced with Factual Knowledge

Evaluating the factual consistency of automatically generated summaries ...
research
11/13/2022

mOKB6: A Multilingual Open Knowledge Base Completion Benchmark

Automated completion of open knowledge bases (KBs), which are constructe...
research
11/14/2016

Traversing Knowledge Graph in Vector Space without Symbolic Space Guidance

Recent studies on knowledge base completion, the task of recovering miss...
research
05/10/2023

ANALOGYKB: Unlocking Analogical Reasoning of Language Models with A Million-scale Knowledge Base

Analogical reasoning is a fundamental cognitive ability of humans. Howev...
research
06/30/2023

Knowledge Base Completion for Long-Tail Entities

Despite their impressive scale, knowledge bases (KBs), such as Wikidata,...

Please sign up or login with your details

Forgot password? Click here to reset