Magnify Your Population: Statistical Downscaling to Augment the Spatial Resolution of Socioeconomic Census Data

06/23/2020
by Giulia Carella, et al.

Fine-resolution estimates of demographic and socioeconomic attributes are crucial for planning and policy development. While several efforts have been made to produce fine-scale gridded population estimates, socioeconomic features are typically not available at scales finer than Census units, which may hide local heterogeneity and disparity. In this paper we present a new statistical downscaling approach to derive fine-scale estimates of key socioeconomic attributes. The method leverages demographic and geographical extensive covariates available at multiple scales, together with additional Census covariates available only at coarse resolution, which are included in the model hierarchically within a "forward learning" approach. For each selected socioeconomic variable, a Random Forest model is trained on the source Census units and then used to generate fine-scale gridded predictions, which are then adjusted to ensure the best possible consistency with the coarser Census data. As a case study, we apply this method to Census data in the United States, downscaling the selected socioeconomic variables available at the block group level to a grid of 300 spatial resolution. The accuracy of the method is assessed at both spatial scales: first by computing a pseudo cross-validation coefficient of determination for the predictions at the block group level, and then, for extensive variables only, by computing the same score for the (unadjusted) predicted counts summed by block group. Based on these scores and on the inspection of the downscaled maps, we conclude that our method is able to provide accurate, smoother, and more detailed socioeconomic estimates than the available Census data.
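To make the final adjustment step concrete, the following is a minimal sketch of one way fine-scale predictions for an extensive variable (a count) could be made consistent with coarse totals. It assumes simple proportional rescaling within each coarse unit, which is not necessarily the adjustment used in the paper. The function name `rescale_to_unit_totals`, the unit IDs, and all numbers are hypothetical and for illustration only; the unadjusted predictions are taken as given, from any model.

```python
from collections import defaultdict


def rescale_to_unit_totals(cell_units, cell_predictions, unit_totals):
    """Scale grid-cell predictions so they sum to each coarse unit's known total.

    cell_units[i]       -- ID of the coarse unit containing cell i (hypothetical IDs)
    cell_predictions[i] -- unadjusted, non-negative model prediction for cell i
    unit_totals         -- maps each unit ID to its published count

    Assumption: proportional rescaling, chosen here for simplicity.
    """
    predicted_sums = defaultdict(float)
    cells_per_unit = defaultdict(int)
    for unit, pred in zip(cell_units, cell_predictions):
        predicted_sums[unit] += pred
        cells_per_unit[unit] += 1

    adjusted = []
    for unit, pred in zip(cell_units, cell_predictions):
        total = unit_totals[unit]
        if predicted_sums[unit] > 0:
            # Keep the model's relative pattern, match the unit total.
            adjusted.append(pred * total / predicted_sums[unit])
        else:
            # The model predicted zero everywhere: spread the total evenly.
            adjusted.append(total / cells_per_unit[unit])
    return adjusted


# Illustrative inputs: five grid cells in two coarse units.
cell_units = ["A", "A", "A", "B", "B"]
cell_predictions = [10.0, 30.0, 60.0, 0.0, 0.0]
unit_totals = {"A": 50.0, "B": 8.0}

adjusted = rescale_to_unit_totals(cell_units, cell_predictions, unit_totals)
print(adjusted)
```

After rescaling, the cells of each unit sum to that unit's total while keeping the relative spatial pattern of the unadjusted predictions. Intensive variables such as rates or medians would need a different consistency constraint, which this sketch does not cover.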


