CooK: Empowering General-Purpose Language Models with Modular and Collaborative Knowledge

05/17/2023
by   Shangbin Feng, et al.
0

Large language models (LLMs) are increasingly adopted for knowledge-intensive tasks and contexts. Existing approaches improve the knowledge capabilities of general-purpose LLMs through retrieval or generated knowledge prompting, but they fall short of reflecting two key properties of knowledge-rich models: knowledge should be modular, ever-growing, sourced from diverse domains; knowledge acquisition and production should be a collaborative process, where diverse stakeholders contribute new information. To this end, we propose CooK, a novel framework to empower general-purpose large language models with modular and collaboratively sourced knowledge. We first introduce specialized language models, autoregressive models trained on corpora from a wide range of domains and sources. These specialized LMs serve as parametric knowledge repositories that are later prompted to generate background knowledge for general-purpose LLMs. We then propose three knowledge filters to dynamically select and retain information in generated documents by controlling for relevance, brevity, and factuality. Finally, we propose bottom-up and top-down knowledge integration approaches to augment general-purpose LLMs with the curated (relevant, factual) knowledge from community-driven specialized LMs that enable multi-domain knowledge synthesis and on-demand knowledge requests. Through extensive experiments, we demonstrate that CooK achieves state-of-the-art performance on six benchmark datasets. Our results highlight the potential of enriching general-purpose LLMs with evolving and modular knowledge – relevant knowledge that can be continuously updated through the collective efforts of the research community.

READ FULL TEXT
research
05/12/2023

PALR: Personalization Aware LLMs for Recommendation

Large language models (LLMs) have recently received significant attentio...
research
06/02/2023

ChatGPT for Zero-shot Dialogue State Tracking: A Solution or an Opportunity?

Recent research on dialogue state tracking (DST) focuses on methods that...
research
09/17/2023

Performance of the Pre-Trained Large Language Model GPT-4 on Automated Short Answer Grading

Automated Short Answer Grading (ASAG) has been an active area of machine...
research
06/13/2022

Language Models are General-Purpose Interfaces

Foundation models have received much attention due to their effectivenes...
research
05/24/2017

How a General-Purpose Commonsense Ontology can Improve Performance of Learning-Based Image Retrieval

The knowledge representation community has built general-purpose ontolog...
research
06/01/2021

SBML2Modelica: integrating biochemical models within open-standard simulation ecosystems

Motivation: SBML is the most widespread language for the definition of b...
research
05/02/2023

A General Static Binary Rewriting Framework for WebAssembly

Binary rewriting is a widely adopted technique in software analysis. Web...

Please sign up or login with your details

Forgot password? Click here to reset