Prompting as Probing: Using Language Models for Knowledge Base Construction

08/23/2022
by   Dimitrios Alivanistos, et al.
0

Language Models (LMs) have proven to be useful in various downstream applications, such as summarisation, translation, question answering and text classification. LMs are becoming increasingly important tools in Artificial Intelligence, because of the vast quantity of information they can store. In this work, we present ProP (Prompting as Probing), which utilizes GPT-3, a large Language Model originally proposed by OpenAI in 2020, to perform the task of Knowledge Base Construction (KBC). ProP implements a multi-step approach that combines a variety of prompting techniques to achieve this. Our results show that manual prompt curation is essential, that the LM must be encouraged to give answer sets of variable lengths, in particular including empty answer sets, that true/false questions are a useful device to increase precision on suggestions generated by the LM, that the size of the LM is a crucial factor, and that a dictionary of entity aliases improves the LM score. Our evaluation study indicates that these proposed techniques can substantially enhance the quality of the final predictions: ProP won track 2 of the LM-KBC competition, outperforming the baseline by 36.4 percentage points. Our implementation is available on https://github.com/HEmile/iswc-challenge.

READ FULL TEXT

page 23

page 24

research
05/02/2023

Few-shot In-context Learning for Knowledge Base Question Answering

Question answering over knowledge bases is considered a difficult proble...
research
09/15/2023

Using Large Language Models for Knowledge Engineering (LLMKE): A Case Study on Wikidata

In this work, we explore the use of Large Language Models (LLMs) for kno...
research
09/06/2021

General-Purpose Question-Answering with Macaw

Despite the successes of pretrained language models, there are still few...
research
08/09/2023

ADMUS: A Progressive Question Answering Framework Adaptable to Multiple Knowledge Sources

With the introduction of deep learning models, semantic parsingbased kno...
research
03/06/2023

Two-stage Pipeline for Multilingual Dialect Detection

Dialect Identification is a crucial task for localizing various Large La...
research
10/29/2022

Two is Better than Many? Binary Classification as an Effective Approach to Multi-Choice Question Answering

We propose a simple refactoring of multi-choice question answering (MCQA...
research
04/19/2017

Using Contexts and Constraints for Improved Geotagging of Human Trafficking Webpages

Extracting geographical tags from webpages is a well-motivated applicati...

Please sign up or login with your details

Forgot password? Click here to reset