Sequential Semantic Generative Communication for Progressive Text-to-Image Generation

09/08/2023
by   Hyelin Nam, et al.

This paper proposes a new communication system framework that leverages the generation capabilities of multi-modal generative models. In today's smart applications, communication can succeed by conveying perceptual meaning, which we represent as a text prompt. Text is a suitable semantic representation of image data because it has evolved to describe an image or to generate one through multi-modal techniques, being interpreted in a manner similar to human cognition. Using text also reduces the overhead compared with transmitting the intact data itself. The transmitter converts the target image to text through a multi-modal generation process, and the receiver reconstructs the image using the reverse process. Each word in the sentence has its own syntactic role and is responsible for a particular piece of the information the text contains. To further reduce the communication load, the transmitter sends words sequentially, prioritizing those that carry the most information, until communication succeeds. Our primary focus is therefore the design of a communication system based on image-to-text transformation, together with the proposed schemes for sequentially transmitting word tokens. We expect this work to pave the way for applying state-of-the-art generative models to real communication systems.
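
The sequential word transmission described in the abstract can be illustrated with a minimal sketch. This is not the authors' scheme: the per-word `priority` scores and the `is_success` stopping check are hypothetical stand-ins (the abstract does not say how word importance or communication success is measured), and the image-to-text and text-to-image models are omitted entirely. The sketch only shows the control flow of sending the highest-priority words first and stopping once the receiver's check passes.

```python
from typing import Callable, List, Sequence, Tuple


def sequential_transmit(
    words: Sequence[str],
    priority: Sequence[float],
    is_success: Callable[[List[str]], bool],
) -> Tuple[List[str], int]:
    """Send words one at a time, highest priority first.

    `priority` and `is_success` are assumptions made for illustration;
    they are not defined in the abstract.
    Returns the words received (in original sentence order) and the
    number of words that were transmitted.
    """
    # Transmission order: indices sorted by descending priority.
    order = sorted(range(len(words)), key=lambda i: priority[i], reverse=True)

    sent: List[int] = []
    for i in order:
        sent.append(i)
        # The receiver restores the original word order before checking.
        received = [words[j] for j in sorted(sent)]
        if is_success(received):
            break

    received = [words[j] for j in sorted(sent)]
    return received, len(sent)


# Toy usage: success means the two key nouns have arrived.
caption = ["a", "dog", "runs", "on", "the", "beach"]
scores = [0.1, 0.9, 0.6, 0.2, 0.1, 0.8]
result = sequential_transmit(caption, scores, lambda r: {"dog", "beach"} <= set(r))
```

In a real system the stopping check would presumably compare the image reconstructed from the partial text against some quality target, which is where the generative models would enter.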


research
09/20/2023

Language-Oriented Communication with Semantic Coding and Knowledge Distillation for Text-to-Image Generation

By integrating recent advances in large language models (LLMs) and gener...
research
07/11/2023

Diffusion idea exploration for art generation

Cross-Modal learning tasks have picked up pace in recent times. With ple...
research
04/15/2021

Data-QuestEval: A Referenceless Metric for Data to Text Semantic Evaluation

In this paper, we explore how QuestEval, which is a Text-vs-Text metric,...
research
05/18/2023

Rate-Adaptive Coding Mechanism for Semantic Communications With Multi-Modal Data

Recently, the ever-increasing demand for bandwidth in multi-modal commun...
research
06/20/2023

Align, Adapt and Inject: Sound-guided Unified Image Generation

Text-guided image generation has witnessed unprecedented progress due to...
research
09/27/2022

What Does DALL-E 2 Know About Radiology?

Generative models such as DALL-E 2 could represent a promising future to...
research
09/07/2023

T2IW: Joint Text to Image Watermark Generation

Recent developments in text-conditioned image generative models have rev...
