Evaluation of medium-large Language Models at zero-shot closed book generative question answering

05/19/2023
by   René Peinl, et al.

Large language models (LLMs) have garnered significant attention, but the definition of "large" lacks clarity. This paper focuses on medium-sized language models (MLMs), defined as having at least six billion parameters but fewer than 100 billion. The study evaluates MLMs on zero-shot generative question answering, which requires models to provide elaborate answers without external document retrieval. The paper introduces its own test dataset and presents results from human evaluation. Results show that combining the best answers from different MLMs yielded an overall correct answer rate of 82.7%, which is better than the 60.9% [...] 7B parameters, which highlights the importance of using appropriate training data for fine-tuning rather than solely relying on the number of parameters. More fine-grained feedback should be used to further improve the quality of answers.
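
The abstract does not say how the combined correct answer rate is computed. One plausible reading, shown here only as a minimal sketch, is that a question counts as correct if at least one model's answer was judged correct by the human evaluators. The function names, model names, and judgments below are hypothetical and are not taken from the paper.

```python
from typing import Dict, List


def correct_rate(judgments: List[bool]) -> float:
    """Percentage of questions whose answer was judged correct."""
    return 100.0 * sum(judgments) / len(judgments)


def combined_correct_rate(per_model: Dict[str, List[bool]]) -> float:
    """Percentage of questions answered correctly by at least one model.

    Assumption: "combining the best answers" means taking, per question,
    the best judgment across all models. All lists must be aligned by
    question index.
    """
    per_question = zip(*per_model.values())
    best = [any(votes) for votes in per_question]
    return correct_rate(best)


# Made-up human judgments for four questions (illustrative only).
judgments = {
    "model_a": [True, False, False, True],
    "model_b": [False, True, False, True],
    "model_c": [False, False, False, True],
}

for name, votes in judgments.items():
    print(f"{name}: {correct_rate(votes):.1f}%")
print(f"combined: {combined_correct_rate(judgments):.1f}%")
```

Under this reading, the combined rate is an upper bound on what any single model in the pool achieves, which is why it can exceed the score of every individual model.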

research
06/07/2023

Knowledge-Augmented Language Model Prompting for Zero-Shot Knowledge Graph Question Answering

Large Language Models (LLMs) are capable of performing zero-shot closed-...
research
03/10/2022

Internet-augmented language models through few-shot prompting for open-domain question answering

In this work, we aim to capitalize on the unique few-shot capabilities o...
research
05/24/2021

Few-Shot Upsampling for Protest Size Detection

We propose a new task and dataset for a common problem in social science...
research
11/17/2022

Data-Efficient Autoregressive Document Retrieval for Fact Verification

Document retrieval is a core component of many knowledge-intensive natur...
research
06/07/2020

Language Models as Fact Checkers?

Recent work has suggested that language models (LMs) store both common-s...
research
05/16/2022

Heroes, Villains, and Victims, and GPT-3: Automated Extraction of Character Roles Without Training Data

This paper shows how to use large-scale pre-trained language models to e...
research
08/02/2023

Teaching Smaller Language Models To Generalise To Unseen Compositional Questions

We equip a smaller Language Model to generalise to answering challenging...
