Synthpop++: A Hybrid Framework for Generating A Country-scale Synthetic Population

04/24/2023
by   Bhavesh Neekhra, et al.
0

Population censuses are vital to public policy decision-making. They provide insight into human resources, demography, culture, and economic structure at local, regional, and national levels. However, such surveys are very expensive (especially for low and middle-income countries with high populations, such as India), time-consuming, and may also raise privacy concerns, depending upon the kinds of data collected. In light of these issues, we introduce SynthPop++, a novel hybrid framework, which can combine data from multiple real-world surveys (with different, partially overlapping sets of attributes) to produce a real-scale synthetic population of humans. Critically, our population maintains family structures comprising individuals with demographic, socioeconomic, health, and geolocation attributes: this means that our “fake” people live in realistic locations, have realistic families, etc. Such data can be used for a variety of purposes: we explore one such use case, Agent-based modelling of infectious disease in India. To gauge the quality of our synthetic population, we use both machine learning and statistical metrics. Our experimental results show that synthetic population can realistically simulate the population for various administrative units of India, producing real-scale, detailed data at the desired level of zoom – from cities, to districts, to states, eventually combining to form a country-scale synthetic population.

READ FULL TEXT
research
09/20/2022

Generating Synthetic Population

In this paper, we provide a method to generate synthetic population at v...
research
06/14/2021

High-resolution population estimation using household survey data and building footprints

The national census is an essential data source to support decision-maki...
research
11/14/2022

A deep learning framework to generate realistic population and mobility data

Census and Household Travel Survey datasets are regularly collected from...
research
08/18/2020

Building a large synthetic population from Australian census data

We present work on creating a synthetic population from census data for ...
research
09/29/2018

Which country epitomizes the world? A study from the perspective of demographic composition

Demographic indicators are an essential element in considering various p...
research
04/16/2019

SynC: A Unified Framework for Generating Synthetic Population with Gaussian Copula

Synthetic population generation is the process of combining multiple soc...

Please sign up or login with your details

Forgot password? Click here to reset