Sketch-pix2seq: a Model to Generate Sketches of Multiple Categories

09/13/2017
by   Yajing Chen, et al.

Sketching is an important medium through which humans communicate ideas, and it reflects the superiority of human intelligence. Studies on sketches fall roughly into two areas: recognition and generation. Existing image recognition models have failed to achieve satisfactory performance on sketch classification. For sketch generation, a recent study proposed sketch-rnn, a sequence-to-sequence variational auto-encoder (VAE) model that can generate sketches based on human input. That model achieved impressive results when trained on a single category of object, such as an animal or a vehicle, but its performance dropped when multiple categories were fed into it. Here, we propose a model called sketch-pix2seq that can learn and draw sketches of multiple categories. It makes two modifications to sketch-rnn: the first replaces the bidirectional recurrent neural network (BRNN) encoder with a convolutional neural network (CNN); the second removes the Kullback-Leibler (KL) divergence term from the VAE objective function. Experimental results showed that models with CNN encoders outperformed those with RNN encoders in generating human-style sketches. Visualization of the latent space showed that removing the KL divergence led the encoder to learn a posterior over the latent space that reflected the features of the different categories. Moreover, the combination of the CNN encoder and the removal of the KL divergence, i.e., the sketch-pix2seq model, performed better at learning and generating sketches of multiple categories and showed promising results in creativity tasks.
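The second modification can be illustrated with a minimal sketch. The abstract does not state the objective explicitly, so the code below assumes the standard VAE form, a reconstruction loss plus a weighted KL term, with the closed-form KL divergence between a diagonal Gaussian posterior and a standard normal prior, averaged over latent dimensions. The function names, the `kl_weight` parameter, and the numeric values are hypothetical and chosen only for illustration; setting the weight to zero corresponds to dropping the KL term.

```python
import math


def kl_to_standard_normal(mu, log_var):
    """KL(N(mu, exp(log_var)) || N(0, I)), averaged over latent dimensions.

    Assumption: diagonal Gaussian posterior, standard normal prior.
    """
    terms = [1.0 + lv - m * m - math.exp(lv) for m, lv in zip(mu, log_var)]
    return -0.5 * sum(terms) / len(terms)


def vae_objective(reconstruction_loss, mu, log_var, kl_weight):
    """Reconstruction loss plus a weighted KL term (hypothetical helper)."""
    return reconstruction_loss + kl_weight * kl_to_standard_normal(mu, log_var)


# Illustrative values only, not taken from the paper.
mu = [0.5, -1.0, 2.0]
log_var = [0.0, 0.2, -0.3]
reconstruction_loss = 1.25  # placeholder for the decoder's reconstruction loss

with_kl = vae_objective(reconstruction_loss, mu, log_var, kl_weight=1.0)
without_kl = vae_objective(reconstruction_loss, mu, log_var, kl_weight=0.0)

print("objective with KL term:   ", with_kl)
print("objective without KL term:", without_kl)
```

With the weight at zero, the objective reduces to the reconstruction loss alone, so nothing pulls the posterior toward the prior. Under the assumptions above, this is consistent with the abstract's observation that the learned latent space reflects category-specific features.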


