Draw Your Art Dream: Diverse Digital Art Synthesis with Multimodal Guided Diffusion

09/27/2022
by Nisha Huang, et al.

Digital art synthesis is receiving increasing attention in the multimedia community because it engages the public with art effectively. Current digital art synthesis methods usually use single-modality inputs as guidance, which limits both the expressiveness of the model and the diversity of the generated results. To solve this problem, we propose the multimodal guided artwork diffusion (MGAD) model, a diffusion-based digital artwork generation approach that uses multimodal prompts as guidance to control the classifier-free diffusion model. Additionally, the contrastive language-image pretraining (CLIP) model is used to unify the text and image modalities. Extensive experimental results on the quality and quantity of the generated digital art paintings confirm the effectiveness of combining the diffusion model with multimodal guidance. Code is available at https://github.com/haha-lisa/MGAD-multimodal-guided-artwork-diffusion.
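The abstract does not spell out how the text and image prompts are combined, so the following is only a minimal sketch of one plausible form of multimodal guidance, not the authors' implementation. It assumes that the current sample, the text prompt, and the image prompt have already been mapped into a shared embedding space (for example by a CLIP-style encoder), and it scores the sample with a weighted sum of cosine distances. The function names and the `text_weight` and `image_weight` parameters are hypothetical.

```python
import math


def cosine_distance(a, b):
    """Return 1 - cos(a, b) for two equal-length, non-zero vectors."""
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    return 1.0 - dot / (norm_a * norm_b)


def multimodal_guidance_loss(sample_emb, text_emb, image_emb,
                             text_weight=1.0, image_weight=1.0):
    """Weighted sum of distances from the sample to each prompt embedding.

    All three embeddings are assumed to live in the same space; the
    weights are illustrative knobs, not values taken from the paper.
    """
    text_term = cosine_distance(sample_emb, text_emb)
    image_term = cosine_distance(sample_emb, image_emb)
    return text_weight * text_term + image_weight * image_term


# Toy 2-D embeddings: the sample matches the text prompt exactly and is
# orthogonal to the image prompt.
loss = multimodal_guidance_loss([1.0, 0.0], [1.0, 0.0], [0.0, 1.0])
```

In a guided sampler, the gradient of such a loss with respect to the sample would typically steer each denoising step; that part is omitted here because it depends on the specific diffusion model.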


research
02/14/2023

Universal Guidance for Diffusion Models

Typical diffusion models are trained to accept a particular form of cond...
research
08/15/2023

SGDiff: A Style Guided Diffusion Model for Fashion Synthesis

This paper reports on the development of a novel style guided diffusion ...
research
12/27/2021

Multimodal Image Synthesis and Editing: A Survey

As information exists in various modalities in real world, effective int...
research
11/30/2022

High-Fidelity Guided Image Synthesis with Latent Diffusion Models

Controllable image synthesis with user scribbles has gained huge public ...
research
04/25/2023

Contrastive Energy Prediction for Exact Energy-Guided Diffusion Sampling in Offline Reinforcement Learning

Guided sampling is a vital approach for applying diffusion models in rea...
research
05/19/2023

Any-to-Any Generation via Composable Diffusion

We present Composable Diffusion (CoDi), a novel generative model capable...
research
02/12/2021

Multimodal data visualization, denoising and clustering with integrated diffusion

We propose a method called integrated diffusion for combining multimodal...
