pg-Causality: Identifying Spatiotemporal Causal Pathways for Air Pollutants with Urban Big Data

10/22/2016
by   Julie Yixuan Zhu, et al.
0

Many countries are suffering from severe air pollution. Understanding how different air pollutants accumulate and propagate is critical to making relevant public policies. In this paper, we use urban big data (air quality data and meteorological data) to identify the spatiotemporal (ST) causal pathways for air pollutants. This problem is challenging because: (1) there are numerous noisy and low-pollution periods in the raw air quality data, which may lead to unreliable causality analysis, (2) for large-scale data in the ST space, the computational complexity of constructing a causal structure is very high, and (3) the ST causal pathways are complex due to the interactions of multiple pollutants and the influence of environmental factors. Therefore, we present p-Causality, a novel pattern-aided causality analysis approach that combines the strengths of pattern mining and Bayesian learning to efficiently and faithfully identify the ST causal pathways. First, Pattern mining helps suppress the noise by capturing frequent evolving patterns (FEPs) of each monitoring sensor, and greatly reduce the complexity by selecting the pattern-matched sensors as "causers". Then, Bayesian learning carefully encodes the local and ST causal relations with a Gaussian Bayesian network (GBN)-based graphical model, which also integrates environmental influences to minimize biases in the final results. We evaluate our approach with three real-world data sets containing 982 air quality sensors, in three regions of China from 01-Jun-2013 to 19-Dec-2015. Results show that our approach outperforms the traditional causal structure learning methods in time efficiency, inference accuracy and interpretability.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
03/27/2020

AirRL: A Reinforcement Learning Approach to Urban Air Quality Inference

Urban air pollution has become a major environmental problem that threat...
research
04/05/2018

Real-time Air Pollution prediction model based on Spatiotemporal Big data

Air pollution is one of the most concerns for urban areas. Many countrie...
research
09/25/2018

A Survey of Learning Causality with Data: Problems and Methods

The era of big data provides researchers with convenient access to copio...
research
11/16/2019

Spatiotemporal large-scale networks shaped by air mass movements

The movement of atmospheric air masses can be seen as a continuous and g...
research
09/09/2021

Deciphering Environmental Air Pollution with Large Scale City Data

Out of the numerous hazards posing a threat to sustainable environmental...
research
12/20/2021

Spatiotemporal Motion Synchronization for Snowboard Big Air

During the training for snowboard big air, one of the most popular winte...
research
01/31/2021

Information fusion between knowledge and data in Bayesian network structure learning

Bayesian Networks (BNs) have become a powerful technology for reasoning ...

Please sign up or login with your details

Forgot password? Click here to reset