The Five-Dollar Model: Generating Game Maps and Sprites from Sentence Embeddings

08/08/2023
by   Timothy Merino, et al.
0

The five-dollar model is a lightweight text-to-image generative architecture that generates low dimensional images from an encoded text prompt. This model can successfully generate accurate and aesthetically pleasing content in low dimensional domains, with limited amounts of training data. Despite the small size of both the model and datasets, the generated images are still able to maintain the encoded semantic meaning of the textual prompt. We apply this model to three small datasets: pixel art video game maps, video game sprite images, and down-scaled emoji images and apply novel augmentation strategies to improve the performance of our model on these limited datasets. We evaluate our models performance using cosine similarity score between text-image pairs generated by the CLIP VIT-B/32 model.

READ FULL TEXT

page 2

page 3

page 5

page 6

page 7

research
10/05/2018

CanvasGAN: A simple baseline for text to image generation by incrementally patching a canvas

We propose a new recurrent generative model for generating images from t...
research
01/30/2023

PromptMix: Text-to-image diffusion models enhance the performance of lightweight networks

Many deep learning tasks require annotations that are too time consuming...
research
08/14/2018

Text-to-Image-to-Text Translation using Cycle Consistent Adversarial Networks

Text-to-Image translation has been an active area of research in the rec...
research
04/18/2021

The Intrinsic Dimension of Images and Its Impact on Learning

It is widely believed that natural image data exhibits low-dimensional s...
research
12/18/2017

Synthesizing Novel Pairs of Image and Text

Generating novel pairs of image and text is a problem that combines comp...
research
11/02/2022

Unsupervised Syntactically Controlled Paraphrase Generation with Abstract Meaning Representations

Syntactically controlled paraphrase generation has become an emerging re...
research
05/31/2023

Boosting Text-to-Image Diffusion Models with Fine-Grained Semantic Rewards

Recent advances in text-to-image diffusion models have achieved remarkab...

Please sign up or login with your details

Forgot password? Click here to reset