Diffusion idea exploration for art generation

07/11/2023
by   Nikhil Verma, et al.
0

Cross-Modal learning tasks have picked up pace in recent times. With plethora of applications in diverse areas, generation of novel content using multiple modalities of data has remained a challenging problem. To address the same, various generative modelling techniques have been proposed for specific tasks. Novel and creative image generation is one important aspect for industrial application which could help as an arm for novel content generation. Techniques proposed previously used Generative Adversarial Network(GAN), autoregressive models and Variational Autoencoders (VAE) for accomplishing similar tasks. These approaches are limited in their capability to produce images guided by either text instructions or rough sketch images decreasing the overall performance of image generator. We used state of the art diffusion models to generate creative art by primarily leveraging text with additional support of rough sketches. Diffusion starts with a pattern of random dots and slowly converts that pattern into a design image using the guiding information fed into the model. Diffusion models have recently outperformed other generative models in image generation tasks using cross modal data as guiding information. The initial experiments for this task of novel image generation demonstrated promising qualitative results.

READ FULL TEXT

page 7

page 17

page 18

page 20

page 22

research
08/18/2023

DiffDis: Empowering Generative Diffusion Model with Cross-Modal Discrimination Capability

Recently, large-scale diffusion models, e.g., Stable diffusion and DallE...
research
03/30/2023

DiffCollage: Parallel Generation of Large Content with Diffusion Models

We present DiffCollage, a compositional diffusion model that can generat...
research
03/01/2022

Variational Autoencoders Without the Variation

Variational autoencdoers (VAE) are a popular approach to generative mode...
research
10/01/2017

Video Generation From Text

Generating videos from text has proven to be a significant challenge for...
research
09/06/2023

My Art My Choice: Adversarial Protection Against Unruly AI

Generative AI is on the rise, enabling everyone to produce realistic con...
research
09/08/2023

Sequential Semantic Generative Communication for Progressive Text-to-Image Generation

This paper proposes new framework of communication system leveraging pro...

Please sign up or login with your details

Forgot password? Click here to reset