A Survey on Model Compression for Large Language Models

08/15/2023
by   Xunyu Zhu, et al.
0

Large Language Models (LLMs) have revolutionized natural language processing tasks with remarkable success. However, their formidable size and computational demands present significant challenges for practical deployment, especially in resource-constrained environments. As these challenges become increasingly pertinent, the field of model compression has emerged as a pivotal research area to alleviate these limitations. This paper presents a comprehensive survey that navigates the landscape of model compression techniques tailored specifically for LLMs. Addressing the imperative need for efficient deployment, we delve into various methodologies, encompassing quantization, pruning, knowledge distillation, and more. Within each of these techniques, we highlight recent advancements and innovative approaches that contribute to the evolving landscape of LLM research. Furthermore, we explore benchmarking strategies and evaluation metrics that are essential for assessing the effectiveness of compressed LLMs. By providing insights into the latest developments and practical implications, this survey serves as an invaluable resource for both researchers and practitioners. As LLMs continue to evolve, this survey aims to facilitate enhanced efficiency and real-world applicability, establishing a foundation for future advancements in the field.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
12/19/2022

Large Language Models Meet NL2Code: A Survey

The task of generating code from a natural language description, or NL2C...
research
12/30/2021

Automatic Mixed-Precision Quantization Search of BERT

Pre-trained language models such as BERT have shown remarkable effective...
research
05/22/2023

Interactive Natural Language Processing

Interactive Natural Language Processing (iNLP) has emerged as a novel pa...
research
08/11/2023

Large Language Models for Telecom: Forthcoming Impact on the Industry

Large Language Models (LLMs) have emerged as a transformative force, rev...
research
05/12/2023

Digital Forensics in the Age of Smart Environments: A Survey of Recent Advancements and Challenges

Digital forensics in smart environments is an emerging field that deals ...
research
05/17/2023

Compress, Then Prompt: Improving Accuracy-Efficiency Trade-off of LLM Inference with Transferable Prompt

Large Language Models (LLMs), armed with billions of parameters, exhibit...
research
07/06/2023

A Survey on Evaluation of Large Language Models

Large language models (LLMs) are gaining increasing popularity in both a...

Please sign up or login with your details

Forgot password? Click here to reset