A comparison of statistical and machine learning methods for creating national daily maps of ambient PM_2.5 concentration

04/17/2019
by   Veronica J. Berrocal, et al.
0

A typical problem in air pollution epidemiology is exposure assessment for individuals for which health data are available. Due to the sparsity of monitoring sites and the limited temporal frequency with which measurements of air pollutants concentrations are collected (for most pollutants, once every 3 or 6 days), epidemiologists have been moving away from characterizing ambient air pollution exposure solely using measurements. In the last few years, substantial research efforts have been placed in developing statistical methods or machine learning techniques to generate estimates of air pollution at finer spatial and temporal scales (daily, usually) with complete coverage. Some of these methods include: geostatistical techniques, such as kriging; spatial statistical models that use the information contained in air quality model outputs (statistical downscaling models); linear regression modeling approaches that leverage the information in GIS covariates (land use regression); or machine learning methods that mine the information contained in relevant variables (neural network and deep learning approaches). Although some of these exposure modeling approaches have been used in several air pollution epidemiological studies, it is not clear how much the predicted exposures generated by these methods differ, and which method generates more reliable estimates. In this paper, we aim to address this gap by evaluating a variety of exposure modeling approaches, comparing their predictive performance and computational difficulty. Using PM_2.5 in year 2011 over the continental U.S. as case study, we examine the methods' performances across seasons, rural vs urban settings, and levels of PM_2.5 concentrations (low, medium, high).

READ FULL TEXT

page 5

page 17

page 18

page 24

page 25

research
05/30/2020

Random Spatial Forests

A fundamental problem in environmental epidemiology studies on the assoc...
research
08/28/2018

National PM2.5 and NO2 Exposure Models for China Based on Land Use Regression, Satellite Measurements, and Universal Kriging

Outdoor air pollution is a major killer worldwide and the fourth largest...
research
05/19/2020

Scalable penalized spatiotemporal land-use regression for ground-level nitrogen dioxide

Nitrogen dioxide (NO_2) is a primary constituent of traffic-related air ...
research
09/03/2022

Evaluation of Model-Based PM_2.5 Estimates for Exposure Assessment During Wildfire Smoke Episodes in the Western U.S

Investigating the health impacts of wildfire smoke requires data on peop...
research
08/07/2018

A Bayesian Downscaler Model to Estimate Daily PM2.5 levels in the Continental US

There has been growing interest in extending the coverage of ground PM2....
research
05/06/2021

Calibration of Spatial Forecasts from Citizen Science Urban Air Pollution Data with Sparse Recurrent Neural Networks

With their continued increase in coverage and quality, data collected fr...
research
01/04/2019

Spatiotemporal Calibration of Atmospheric Nitrogen Dioxide Concentration Estimates From an Air Quality Model for Connecticut

A spatiotemporal calibration and resolution refinement model was fitted ...

Please sign up or login with your details

Forgot password? Click here to reset