Learning to Jump: Thinning and Thickening Latent Counts for Generative Modeling

05/28/2023
by   Tianqi Chen, et al.
0

Learning to denoise has emerged as a prominent paradigm to design state-of-the-art deep generative models for natural images. How to use it to model the distributions of both continuous real-valued data and categorical data has been well studied in recently proposed diffusion models. However, it is found in this paper to have limited ability in modeling some other types of data, such as count and non-negative continuous data, that are often highly sparse, skewed, heavy-tailed, and/or overdispersed. To this end, we propose learning to jump as a general recipe for generative modeling of various types of data. Using a forward count thinning process to construct learning objectives to train a deep neural network, it employs a reverse count thickening process to iteratively refine its generation through that network. We demonstrate when learning to jump is expected to perform comparably to learning to denoise, and when it is expected to perform better. For example, learning to jump is recommended when the training data is non-negative and exhibits strong sparsity, skewness, heavy-tailedness, and/or heterogeneity.

READ FULL TEXT

page 3

page 9

research
01/10/2020

Review of Probability Distributions for Modeling Count Data

Count data take on non-negative integer values and are challenging to pr...
research
10/31/2017

Flexible Prior Distributions for Deep Generative Models

We consider the problem of training generative models with deep neural n...
research
06/21/2020

VAEM: a Deep Generative Model for Heterogeneous Mixed Type Data

Deep generative models often perform poorly in real-world applications d...
research
05/02/2019

Deep Generative Models for Sparse, High-dimensional, and Overdispersed Discrete Data

Many applications, such as text modelling, high-throughput sequencing, a...
research
06/06/2023

Protecting the Intellectual Property of Diffusion Models by the Watermark Diffusion Process

Diffusion models have emerged as state-of-the-art deep generative archit...
research
07/10/2018

Handling Incomplete Heterogeneous Data using VAEs

Variational autoencoders (VAEs), as well as other generative models, hav...
research
12/10/2019

Representational Rényi heterogeneity

A discrete system's heterogeneity is measured by the Rényi heterogeneity...

Please sign up or login with your details

Forgot password? Click here to reset