Head-to-Tail: How Knowledgeable are Large Language Models (LLM)? A.K.A. Will LLMs Replace Knowledge Graphs?

08/20/2023
by   Kai Sun, et al.
0

Since the recent prosperity of Large Language Models (LLMs), there have been interleaved discussions regarding how to reduce hallucinations from LLM responses, how to increase the factuality of LLMs, and whether Knowledge Graphs (KGs), which store the world knowledge in a symbolic form, will be replaced with LLMs. In this paper, we try to answer these questions from a new angle: How knowledgeable are LLMs? To answer this question, we constructed Head-to-Tail, a benchmark that consists of 18K question-answer (QA) pairs regarding head, torso, and tail facts in terms of popularity. We designed an automated evaluation method and a set of metrics that closely approximate the knowledge an LLM confidently internalizes. Through a comprehensive evaluation of 14 publicly available LLMs, we show that existing LLMs are still far from being perfect in terms of their grasp of factual knowledge, especially for facts of torso-to-tail entities.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
03/03/2023

Answering Questions Over Knowledge Graphs Using Logic Programming Along with Language Models

Question Answering over Knowledge Graphs (KGQA) is the task of answering...
research
07/13/2023

Towards Populating Generalizable Engineering Design Knowledge

Aiming to populate generalizable engineering design knowledge, we propos...
research
11/15/2022

Empowering Language Models with Knowledge Graph Reasoning for Question Answering

Answering open-domain questions requires world knowledge about in-contex...
research
11/15/2022

Large Language Models Struggle to Learn Long-Tail Knowledge

The internet contains a wealth of knowledge – from the birthdays of hist...
research
02/15/2020

Open Knowledge Enrichment for Long-tail Entities

Knowledge bases (KBs) have gradually become a valuable asset for many AI...
research
12/02/2020

How Can We Know When Language Models Know?

Recent works have shown that language models (LM) capture different type...
research
05/21/2022

An Empirical Investigation of Commonsense Self-Supervision with Knowledge Graphs

Self-supervision based on the information extracted from large knowledge...

Please sign up or login with your details

Forgot password? Click here to reset