Developing a Scalable Benchmark for Assessing Large Language Models in Knowledge Graph Engineering

08/31/2023
by   Lars-Peter Meyer, et al.

As the field of Large Language Models (LLMs) evolves at an accelerated pace, the need to assess and monitor their performance becomes critical. We introduce a benchmarking framework focused on knowledge graph engineering (KGE), accompanied by three challenges addressing syntax and error correction, fact extraction, and dataset generation. We show that LLMs, while useful tools, are not yet fit to assist in knowledge graph generation with zero-shot prompting. Our LLM-KG-Bench framework therefore provides automatic evaluation and storage of LLM responses, as well as statistical data and visualization tools, to support tracking of prompt engineering and model performance.

research
06/12/2021

Engineering Knowledge Graph from Patent Database

We propose a large, scalable engineering knowledge graph, comprising set...
research
07/03/2023

Iterative Zero-Shot LLM Prompting for Knowledge Graph Construction

In the current digitalization era, capturing and effectively representin...
research
08/26/2021

Patent-KG: Patent Knowledge Graph Use for Engineering Design

To facilitate knowledge reuse in engineering design, several dataset app...
research
07/14/2023

Using Large Language Models for Zero-Shot Natural Language Generation from Knowledge Graphs

In any system that uses structured knowledge graph (KG) data as its unde...
research
07/13/2023

LLM-assisted Knowledge Graph Engineering: Experiments with ChatGPT

Knowledge Graphs (KG) provide us with a structured, flexible, transparen...
research
02/03/2023

A Case Study for Compliance as Code with Graphs and Language Models: Public release of the Regulatory Knowledge Graph

The paper presents a study on using language models to automate the cons...
research
08/01/2023

Prompts Matter: Insights and Strategies for Prompt Engineering in Automated Software Traceability

Large Language Models (LLMs) have the potential to revolutionize automat...
