Prediction of rare feature combinations in population synthesis: Application of deep generative modelling

09/17/2019
by   Sergio Garrido, et al.
0

In population synthesis applications, when considering populations with many attributes, a fundamental problem is the estimation of rare combinations of feature attributes. Unsurprisingly, it is notably more difficult to reliably representthe sparser regions of such multivariate distributions and in particular combinations of attributes which are absent from the original sample. In the literature this is commonly known as sampling zeros for which no systematic solution has been proposed so far. In this paper, two machine learning algorithms, from the family of deep generative models,are proposed for the problem of population synthesis and with particular attention to the problem of sampling zeros. Specifically, we introduce the Wasserstein Generative Adversarial Network (WGAN) and the Variational Autoencoder(VAE), and adapt these algorithms for a large-scale population synthesis application. The models are implemented on a Danish travel survey with a feature-space of more than 60 variables. The models are validated in a cross-validation scheme and a set of new metrics for the evaluation of the sampling-zero problem is proposed. Results show how these models are able to recover sampling zeros while keeping the estimation of truly impossible combinations, the structural zeros, at a comparatively low level. Particularly, for a low dimensional experiment, the VAE, the marginal sampler and the fully random sampler generate 5 26 WGAN, while for a high dimensional case, these figures escalate to 44 and 170440 agent-based systems and in particular cases where detailed socio-economic or geographical representations are required.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
08/21/2018

Scalable Population Synthesis with Deep Generative Modeling

Population synthesis is concerned with the generation of synthetic yet r...
research
08/01/2022

A Deep Generative Model for Feasible and Diverse Population Synthesis

An ideal synthetic population, a key input to activity-based models, mim...
research
11/23/2022

Robustness Analysis of Deep Learning Models for Population Synthesis

Deep generative models have become useful for synthetic data generation,...
research
04/15/2020

Composite Travel Generative Adversarial Networks for Tabular and Sequential Population Synthesis

Agent-based transportation modelling has become the standard to simulate...
research
11/13/2020

Population synthesis for urban resident modeling using deep generative models

The impacts of new real estate developments are strongly associated to i...
research
08/21/2023

Feature Extraction Using Deep Generative Models for Bangla Text Classification on a New Comprehensive Dataset

The selection of features for text classification is a fundamental task ...
research
09/09/2020

Multilinear Latent Conditioning for Generating Unseen Attribute Combinations

Deep generative models rely on their inductive bias to facilitate genera...

Please sign up or login with your details

Forgot password? Click here to reset