Style Generation: Image Synthesis based on Coarsely Matched Texts

09/08/2023
by   Mengyao Cui, et al.
0

Previous text-to-image synthesis algorithms typically use explicit textual instructions to generate/manipulate images accurately, but they have difficulty adapting to guidance in the form of coarsely matched texts. In this work, we attempt to stylize an input image using such coarsely matched text as guidance. To tackle this new problem, we introduce a novel task called text-based style generation and propose a two-stage generative adversarial network: the first stage generates the overall image style with a sentence feature, and the second stage refines the generated style with a synthetic feature, which is produced by a multi-modality style synthesis module. We re-filter one existing dataset and collect a new dataset for the task. Extensive experiments and ablation studies are conducted to validate our framework. The practical potential of our work is demonstrated by various applications such as text-image alignment and story visualization. Our datasets are published at https://www.kaggle.com/datasets/mengyaocui/style-generation.

READ FULL TEXT

page 2

page 5

page 6

page 8

page 10

page 11

page 12

page 13

research
08/15/2023

SGDiff: A Style Guided Diffusion Model for Fashion Synthesis

This paper reports on the development of a novel style guided diffusion ...
research
04/27/2022

Self-Supervised Text Erasing with Controllable Image Synthesis

Recent efforts on scene text erasing have shown promising results. Howev...
research
03/02/2020

Style Example-Guided Text Generation using Generative Adversarial Transformers

We introduce a language generative model framework for generating a styl...
research
09/13/2023

DreamStyler: Paint by Style Inversion with Text-to-Image Diffusion Models

Recent progresses in large-scale text-to-image models have yielded remar...
research
05/18/2022

3D Segmentation Guided Style-based Generative Adversarial Networks for PET Synthesis

Potential radioactive hazards in full-dose positron emission tomography ...
research
11/14/2022

Learning to Model Multimodal Semantic Alignment for Story Visualization

Story visualization aims to generate a sequence of images to narrate eac...
research
06/22/2022

A Fast Text-Driven Approach for Generating Artistic Content

In this work, we propose a complete framework that generates visual art....

Please sign up or login with your details

Forgot password? Click here to reset