OpenICL: An Open-Source Framework for In-context Learning

03/06/2023
by Zhenyu Wu, et al.

In recent years, in-context learning (ICL) has gained increasing attention and emerged as the new paradigm for large language model (LLM) evaluation. Unlike traditional fine-tuning methods, ICL adapts pre-trained models to unseen tasks without any parameter updates. However, implementing ICL is complex because of the diverse retrieval and inference methods involved, as well as the varying pre-processing requirements of different models, datasets, and tasks. A unified and flexible framework for ICL is urgently needed to ease the implementation of these components. To facilitate ICL research, we introduce OpenICL, an open-source toolkit for ICL and LLM evaluation. OpenICL is research-friendly, with a highly flexible architecture in which users can easily combine different components to suit their needs. It also provides various state-of-the-art retrieval and inference methods to streamline the process of adapting ICL to cutting-edge research. The effectiveness of OpenICL has been validated on a wide range of NLP tasks, including classification, QA, machine translation, and semantic parsing. As a by-product, we found OpenICL to be an efficient yet robust tool for LLM evaluation. OpenICL is released at https://github.com/Shark-NLP/OpenICL
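
To make the retrieval and inference split concrete, the following is a minimal sketch of an ICL pipeline in plain Python. It is not the OpenICL API: the function names (`retrieve`, `build_prompt`), the word-overlap (Jaccard) retriever, the "Review/Sentiment" template, and the toy data are all assumptions chosen for illustration. The sketch stops at prompt construction; in practice the prompt would be passed to a frozen LLM, whose parameters are never updated.

```python
from typing import List, Tuple

Example = Tuple[str, str]  # (input text, label); hypothetical data layout


def tokenize(text: str) -> set:
    """Lowercase whitespace tokenization; a stand-in for real pre-processing."""
    return set(text.lower().split())


def jaccard(a: str, b: str) -> float:
    """Word-overlap similarity between two strings, in [0, 1]."""
    ta, tb = tokenize(a), tokenize(b)
    if not ta or not tb:
        return 0.0
    return len(ta & tb) / len(ta | tb)


def retrieve(query: str, pool: List[Example], k: int) -> List[Example]:
    """Retrieval step: pick the k pool examples most similar to the query."""
    ranked = sorted(pool, key=lambda ex: jaccard(query, ex[0]), reverse=True)
    return ranked[:k]


def build_prompt(query: str, demos: List[Example]) -> str:
    """Inference-side step: place demonstrations before the unanswered query."""
    blocks = [f"Review: {x}\nSentiment: {y}" for x, y in demos]
    blocks.append(f"Review: {query}\nSentiment:")
    return "\n\n".join(blocks)


# Toy demonstration pool (assumed data, not from any benchmark).
POOL: List[Example] = [
    ("the movie was great fun", "positive"),
    ("a dull and boring plot", "negative"),
    ("great acting and a great script", "positive"),
    ("cold food and slow service", "negative"),
]

query = "the acting was great"
demos = retrieve(query, POOL, k=2)
prompt = build_prompt(query, demos)
print(prompt)  # this string would be sent to a frozen LLM for completion
```

Swapping the retriever (for example, random selection or embedding similarity) or the template changes only one function each, which is the kind of component-level flexibility the abstract describes.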
