An Overview on Generative AI at Scale with Edge-Cloud Computing

06/02/2023
by Yun-Cheng Wang, et al.

As a specific category of artificial intelligence (AI), generative artificial intelligence (GenAI) generates new content that resembles what is created by humans. The rapid development of GenAI systems has created a huge amount of new data on the Internet, posing new challenges to current computing and communication frameworks. Currently, GenAI services rely on the traditional cloud computing framework because they need large computation resources. However, such services will encounter high latency because of data transmission and a high volume of requests. Edge-cloud computing, by contrast, can provide adequate computation power and low latency at the same time through collaboration between edges and the cloud. Thus, it is attractive to build GenAI systems at scale by leveraging the edge-cloud computing paradigm. In this overview paper, we review recent developments in both GenAI and edge-cloud computing. Then, we use two exemplary GenAI applications to discuss technical challenges in scaling up their solutions using edge-cloud collaborative systems. Finally, we list design considerations for training and deploying GenAI systems at scale and point out future research directions.

research
01/20/2020

Distributed Vehicular Computing at the Dawn of 5G: a Survey

Recent advances in information technology have revolutionized the automo...
research
11/11/2021

Edge-Cloud Polarization and Collaboration: A Comprehensive Survey

Influenced by the great success of deep learning via cloud computing and...
research
04/26/2023

Scalable, Distributed AI Frameworks: Leveraging Cloud Computing for Enhanced Deep Learning Performance and Efficiency

In recent years, the integration of artificial intelligence (AI) and clo...
research
03/28/2023

Unleashing the Power of Edge-Cloud Generative AI in Mobile Networks: A Survey of AIGC Services

Artificial Intelligence-Generated Content (AIGC) is an automated method ...
research
07/12/2023

NetGPT: A Native-AI Network Architecture Beyond Provisioning Personalized Generative Services

Large language models (LLMs) have triggered tremendous success to empowe...
research
03/08/2023

KubeEdge-Sedna v0.3: Towards Next-Generation Automatically Customized AI Engineering Scheme

The scale of the global edge AI market continues to grow. The current te...
research
02/15/2018

Cloud No Longer a Silver Bullet, Edge to the Rescue

This paper takes the position that, while cognitive computing today reli...
