Baize: An Open-Source Chat Model with Parameter-Efficient Tuning on Self-Chat Data

04/03/2023
by   Canwen Xu, et al.
0

Chat models, such as ChatGPT, have shown impressive capabilities and have been rapidly adopted across numerous domains. However, these models are only accessible through a restricted API, creating barriers for new research and progress in the field. We propose a pipeline that can automatically generate a high-quality multi-turn chat corpus by leveraging ChatGPT to engage in a conversation with itself. Subsequently, we employ parameter-efficient tuning to enhance LLaMA, an open-source large language model. The resulting model, named Baize, demonstrates good performance in multi-turn dialogues with guardrails that minimize potential risks. The Baize models and data are released for research purposes only at https://github.com/project-baize/baize. An online demo is also available at https://huggingface.co/spaces/project-baize/baize-lora-7B.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
05/23/2023

Enhancing Chat Language Models by Scaling High-quality Instructional Conversations

Fine-tuning on instruction data has been widely validated as an effectiv...
research
08/25/2023

SoTaNa: The Open-Source Software Development Assistant

Software development plays a crucial role in driving innovation and effi...
research
08/16/2023

MemoChat: Tuning LLMs to Use Memos for Consistent Long-Range Open-Domain Conversation

We propose MemoChat, a pipeline for refining instructions that enables l...
research
06/28/2023

ChatLaw: Open-Source Legal Large Language Model with Integrated External Knowledge Bases

Large Language Models (LLMs) have shown the potential to revolutionize n...
research
08/05/2023

EduChat: A Large-Scale Language Model-based Chatbot System for Intelligent Education

EduChat (https://www.educhat.top/) is a large-scale language model (LLM)...
research
05/24/2023

RefGPT: Reference -> Truthful Customized Dialogues Generation by GPTs and for GPTs

General chat models, like ChatGPT, have attained impressive capability t...
research
10/23/2017

"Birds in the Clouds": Adventures in Data Engineering

Leveraging their eBird crowdsourcing project, the Cornell Lab of Ornitho...

Please sign up or login with your details

Forgot password? Click here to reset