CoSE: Compositional Stroke Embeddings

06/17/2020
by   Emre Aksan, et al.
7

We present a generative model for stroke-based drawing tasks which is able to model complex free-form structures. While previous approaches rely on sequence-based models for drawings of basic objects or handwritten text, we propose a model that treats drawings as a collection of strokes that can be composed into complex structures such as diagrams (e.g., flow-charts). At the core of the approach lies a novel auto-encoder that projects variable-length strokes into a latent space of fixed dimension. This representation space allows a relational model, operating in latent space, to better capture the relationship between strokes and to predict subsequent strokes. We demonstrate qualitatively and quantitatively that our proposed approach is able to model the appearance of individual strokes, as well as the compositional structure of larger diagram drawings. Our approach is suitable for interactive use cases such as auto-completing diagrams.

READ FULL TEXT

page 7

page 14

research
08/18/2023

MATLABER: Material-Aware Text-to-3D via LAtent BRDF auto-EncodeR

Based on powerful text-to-image diffusion models, text-to-3D generation ...
research
02/24/2022

Learning Multi-Object Dynamics with Compositional Neural Radiance Fields

We present a method to learn compositional predictive models from image ...
research
01/22/2020

From abstract items to latent spaces to observed data and back: Compositional Variational Auto-Encoder

Conditional Generative Models are now acknowledged an essential tool in ...
research
05/13/2021

PassFlow: Guessing Passwords with Generative Flows

Recent advances in generative machine learning models rekindled research...
research
04/06/2021

OodGAN: Generative Adversarial Network for Out-of-Domain Data Generation

Detecting an Out-of-Domain (OOD) utterance is crucial for a robust dialo...
research
08/17/2020

OCEAN: Online Task Inference for Compositional Tasks with Context Adaptation

Real-world tasks often exhibit a compositional structure that contains a...
research
10/05/2018

Learning to Encode Text as Human-Readable Summaries using Generative Adversarial Networks

Auto-encoders compress input data into a latent-space representation and...

Please sign up or login with your details

Forgot password? Click here to reset