Rawn Henry | DeepAI

Featured Co-authors

Hany Hassan Awadalla
16 publications
Young Jin Kim
13 publications
Raffy Fahim
2 publications

research ∙ 08/16/2023

FineQuant: Unlocking Efficiency with Fine-Grained Weight-Only Quantization for LLMs

Large Language Models (LLMs) have achieved state-of-the-art performance ...

Young Jin Kim, et al.

research ∙ 11/18/2022

Who Says Elephants Can't Run: Bringing Large Scale MoE Models into Cloud Scale Production

Mixture of Experts (MoE) models with conditional execution of sparsely a...

Young Jin Kim, et al.
