CLIPDraw: Exploring Text-to-Drawing Synthesis through Language-Image Encoders

06/28/2021
by   Kevin Frans, et al.
0

This work presents CLIPDraw, an algorithm that synthesizes novel drawings based on natural language input. CLIPDraw does not require any training; rather a pre-trained CLIP language-image encoder is used as a metric for maximizing similarity between the given description and a generated drawing. Crucially, CLIPDraw operates over vector strokes rather than pixel images, a constraint that biases drawings towards simpler human-recognizable shapes. Results compare between CLIPDraw and other synthesis-through-optimization methods, as well as highlight various interesting behaviors of CLIPDraw, such as satisfying ambiguous text in multiple ways, reliably producing drawings in diverse artistic styles, and scaling from simple to complex visual representations as stroke count is increased. Code for experimenting with the method is available at: https://colab.research.google.com/github/kvfrans/clipdraw/blob/main/clipdraw.ipynb

READ FULL TEXT

page 1

page 3

page 4

page 5

page 6

page 7

research
11/04/2021

StyleCLIPDraw: Coupling Content and Style in Text-to-Drawing Synthesis

Generating images that fit a given text description using machine learni...
research
02/24/2022

StyleCLIPDraw: Coupling Content and Style in Text-to-Drawing Translation

Generating images that fit a given text description using machine learni...
research
10/31/2019

Text-to-image synthesis method evaluation based on visual patterns

A commonly used evaluation metric for text-to-image synthesis is the Inc...
research
12/21/2022

3D Highlighter: Localizing Regions on 3D Shapes via Text Descriptions

We present 3D Highlighter, a technique for localizing semantic regions o...
research
10/06/2020

From Language Games to Drawing Games

We attempt to automate various artistic processes by inventing a set of ...
research
03/03/2023

Ask and You Shall Receive (a Graph Drawing): Testing ChatGPT's Potential to Apply Graph Layout Algorithms

Large language models (LLMs) have recently taken the world by storm. The...
research
03/25/2023

Indonesian Text-to-Image Synthesis with Sentence-BERT and FastGAN

Currently, text-to-image synthesis uses text encoder and image generator...

Please sign up or login with your details

Forgot password? Click here to reset