Halo: Estimation and Reduction of Hallucinations in Open-Source Weak Large Language Models

08/22/2023
by   Mohamed Elaraby, et al.
0

Large Language Models (LLMs) have revolutionized Natural Language Processing (NLP). Although convenient for research and practical applications, open-source LLMs with fewer parameters often suffer from severe hallucinations compared to their larger counterparts. This paper focuses on measuring and reducing hallucinations in BLOOM 7B, a representative of such weaker open-source LLMs that are publicly available for research and commercial applications. We introduce HaloCheck, a lightweight BlackBox knowledge-free framework designed to quantify the severity of hallucinations in LLMs. Additionally, we explore techniques like knowledge injection and teacher-student approaches to alleviate hallucinations in low-parameter LLMs. Our experiments effectively demonstrate the reduction of hallucinations in challenging domains for these LLMs.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
07/05/2023

Open-Source Large Language Models Outperform Crowd Workers and Approach ChatGPT in Text-Annotation Tasks

This study examines the performance of open-source Large Language Models...
research
02/26/2022

A Systematic Evaluation of Large Language Models of Code

Large language models (LMs) of code have recently shown tremendous promi...
research
07/17/2023

Mini-Giants: "Small" Language Models and Open Source Win-Win

ChatGPT is phenomenal. However, it is prohibitively expensive to train a...
research
06/28/2023

ChatLaw: Open-Source Legal Large Language Model with Integrated External Knowledge Bases

Large Language Models (LLMs) have shown the potential to revolutionize n...
research
08/19/2023

Open, Closed, or Small Language Models for Text Classification?

Recent advancements in large language models have demonstrated remarkabl...
research
07/31/2023

HouYi: An open-source large language model specially designed for renewable energy and carbon neutrality field

Renewable energy is important for achieving carbon neutrality goal. With...
research
05/25/2023

On the Tool Manipulation Capability of Open-source Large Language Models

Recent studies on software tool manipulation with large language models ...

Please sign up or login with your details

Forgot password? Click here to reset