Prompt2Model: Generating Deployable Models from Natural Language Instructions

08/23/2023
by   Vijay Viswanathan, et al.

Large language models (LLMs) enable system builders today to create competent NLP systems through prompting, where they only need to describe the task in natural language and provide a few examples. However, in other ways, LLMs are a step backward from traditional special-purpose NLP models: they require extensive computational resources for deployment and can be gated behind APIs. In this paper, we propose Prompt2Model, a general-purpose method that takes a natural language task description, like the prompts provided to LLMs, and uses it to train a special-purpose model that is conducive to deployment. This is done through a multi-step process: retrieval of existing datasets and pretrained models, dataset generation using LLMs, and supervised fine-tuning on the retrieved and generated datasets. Over three tasks, we demonstrate that, given the same few-shot prompt as input, Prompt2Model trains models that outperform a strong LLM, gpt-3.5-turbo, by an average of 20% while being up to 700 times smaller. We also show that the generated data can be used to obtain reliable estimates of model performance, enabling model developers to assess model reliability before deployment. Prompt2Model is available open-source at https://github.com/neulab/prompt2model.
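
The multi-step process described in the abstract (retrieve, generate, fine-tune) can be illustrated with a toy sketch. This is not the Prompt2Model API: every name below (`Prompt`, `retrieve_dataset`, `generate_dataset`, `fine_tune`, `prompt_to_model`) is hypothetical, retrieval is approximated by word overlap, LLM generation is replaced by repeating the few-shot demonstrations, and "fine-tuning" is a lookup table. It only shows how the three stages might feed one another.

```python
from dataclasses import dataclass, field


@dataclass
class Example:
    input_text: str
    output_text: str


@dataclass
class Prompt:
    instruction: str
    demonstrations: list = field(default_factory=list)


def retrieve_dataset(prompt, catalog):
    """Toy retrieval: pick the dataset whose description shares the most words
    with the instruction (assumes a non-empty catalog)."""
    query = set(prompt.instruction.lower().split())

    def overlap(name):
        return len(query & set(catalog[name]["description"].lower().split()))

    best = max(catalog, key=overlap)
    return list(catalog[best]["examples"])


def generate_dataset(prompt, n):
    """Stand-in for LLM-based generation: cycles through the few-shot demonstrations."""
    demos = prompt.demonstrations
    return [demos[i % len(demos)] for i in range(n)] if demos else []


def fine_tune(train_set):
    """Stand-in for supervised fine-tuning: memorizes input -> output pairs."""
    table = {ex.input_text: ex.output_text for ex in train_set}
    return lambda text: table.get(text, "")


def prompt_to_model(prompt, catalog, n_generated=4):
    """Chain the three stages: retrieved + generated data -> trained model."""
    retrieved = retrieve_dataset(prompt, catalog)
    generated = generate_dataset(prompt, n_generated)
    return fine_tune(retrieved + generated)


# Hypothetical catalog and prompt, made up for illustration only.
catalog = {
    "translation": {
        "description": "translate french words to english",
        "examples": [Example("chat", "cat")],
    },
    "sentiment": {
        "description": "classify review sentiment",
        "examples": [Example("great film", "positive")],
    },
}
prompt = Prompt("Translate French to English", [Example("bonjour", "hello")])
model = prompt_to_model(prompt, catalog)
print(model("chat"), model("bonjour"))  # prints "cat hello"
```

In this sketch the resulting model answers from both the retrieved example ("chat") and the demonstration-derived example ("bonjour"), mirroring the idea of training on the union of retrieved and generated data.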

research
04/18/2021

Natural Instructions: Benchmarking Generalization to New Tasks from Natural Language Instructions

Can we enable NLP models to appropriately respond to instructional promp...
research
05/23/2023

Generating Data for Symbolic Language with Large Language Models

While large language models (LLMs) bring not only performance but also c...
research
09/22/2021

Small-Bench NLP: Benchmark for small single GPU trained models in Natural Language Processing

Recent progress in the Natural Language Processing domain has given us s...
research
05/03/2022

SparCAssist: A Model Risk Assessment Assistant Based on Sparse Generated Counterfactuals

We introduce SparcAssist, a general-purpose risk assessment tool for the...
research
02/02/2022

PromptSource: An Integrated Development Environment and Repository for Natural Language Prompts

PromptSource is a system for creating, sharing, and using natural langua...
research
05/03/2023

Robust Natural Language Watermarking through Invariant Features

Recent years have witnessed a proliferation of valuable original natural...
research
08/10/2023

RTLLM: An Open-Source Benchmark for Design RTL Generation with Large Language Model

Inspired by the recent success of large language models (LLMs) like Chat...
