DeepSpeed-Chat: Easy, Fast and Affordable RLHF Training of ChatGPT-like Models at All Scales

08/02/2023
by   Zhewei Yao, et al.
0

ChatGPT-like models have revolutionized various applications in artificial intelligence, from summarization and coding to translation, matching or even surpassing human performance. However, the current landscape lacks an accessible, efficient, and cost-effective end-to-end RLHF (Reinforcement Learning with Human Feedback) training pipeline for these powerful models, particularly when training at the scale of billions of parameters. This paper introduces DeepSpeed-Chat, a novel system that democratizes RLHF training, making it accessible to the AI community. DeepSpeed-Chat offers three key capabilities: an easy-to-use training and inference experience for ChatGPT-like models, a DeepSpeed-RLHF pipeline that replicates the training pipeline from InstructGPT, and a robust DeepSpeed-RLHF system that combines various optimizations for training and inference in a unified way. The system delivers unparalleled efficiency and scalability, enabling training of models with hundreds of billions of parameters in record time and at a fraction of the cost. With this development, DeepSpeed-Chat paves the way for broader access to advanced RLHF training, even for data scientists with limited resources, thereby fostering innovation and further development in the field of AI.

READ FULL TEXT

page 10

page 11

research
05/08/2019

AI Enabling Technologies: A Survey

Artificial Intelligence (AI) has the opportunity to revolutionize the wa...
research
06/02/2023

Accelerating science with human-aware artificial intelligence

Artificial intelligence (AI) models trained on published scientific find...
research
01/21/2023

The Pipeline for the Continuous Development of Artificial Intelligence Models – Current State of Research and Practice

Companies struggle to continuously develop and deploy AI models to compl...
research
06/09/2023

EfficientBioAI: Making Bioimaging AI Models Efficient in Energy, Latency and Representation

Artificial intelligence (AI) has been widely used in bioimage image anal...
research
03/16/2020

SeegaAI : Deep Reinforcement Learning in Seega

This research paper introduces SeegaAI, a research project to develop a ...
research
05/11/2023

Large Language Models Can Be Used To Effectively Scale Spear Phishing Campaigns

Recent progress in artificial intelligence (AI), particularly in the dom...

Please sign up or login with your details

Forgot password? Click here to reset