ChiroDiff: Modelling chirographic data with Diffusion Models

04/07/2023
by   Ayan Das, et al.
2

Generative modelling over continuous-time geometric constructs, a.k.a such as handwriting, sketches, drawings etc., have been accomplished through autoregressive distributions. Such strictly-ordered discrete factorization however falls short of capturing key properties of chirographic data – it fails to build holistic understanding of the temporal concept due to one-way visibility (causality). Consequently, temporal data has been modelled as discrete token sequences of fixed sampling rate instead of capturing the true underlying concept. In this paper, we introduce a powerful model-class namely "Denoising Diffusion Probabilistic Models" or DDPMs for chirographic data that specifically addresses these flaws. Our model named "ChiroDiff", being non-autoregressive, learns to capture holistic concepts and therefore remains resilient to higher temporal sampling rate up to a good extent. Moreover, we show that many important downstream utilities (e.g. conditional sampling, creative mixing) can be flexibly implemented using ChiroDiff. We further show some unique use-cases like stochastic vectorization, de-noising/healing, abstraction are also possible with this model-class. We perform quantitative and qualitative evaluation of our framework on relevant datasets and found it to be better or on par with competing approaches.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
02/10/2021

Argmax Flows and Multinomial Diffusion: Towards Non-Autoregressive Language Models

The field of language modelling has been largely dominated by autoregres...
research
11/28/2022

Continuous diffusion for categorical data

Diffusion models have quickly become the go-to paradigm for generative m...
research
09/09/2022

Improved Masked Image Generation with Token-Critic

Non-autoregressive generative transformers recently demonstrated impress...
research
10/04/2022

Diffusion Models for Graphs Benefit From Discrete State Spaces

Denoising diffusion probabilistic models and score matching models have ...
research
08/19/2022

Vector Quantized Diffusion Model with CodeUnet for Text-to-Sign Pose Sequences Generation

Sign Language Production (SLP) aims to translate spoken languages into s...
research
02/01/2022

Partial Directed Coherence and the Vector Autoregressive Modelling Myth and a Caveat

Here we dispel the lingering myth that Partial Directed Coherence is a V...

Please sign up or login with your details

Forgot password? Click here to reset