Scalable Bayesian inference for self-excitatory stochastic processes applied to big American gunfire data

05/13/2020
by   Andrew J. Holbrook, et al.
0

The Hawkes process and its extensions effectively model self-excitatory phenomena including earthquakes, viral pandemics, financial transactions, neural spike trains and the spread of memes through social networks. The usefulness of these stochastic process models within a host of economic sectors and scientific disciplines is undercut by the processes' computational burden: complexity of likelihood evaluations grows quadratically in the number of observations for both the temporal and spatiotemporal Hawkes processes. We show that, with care, one may parallelize these calculations using both central and graphics processing unit implementations to achieve over 100-fold speedups over single-core processing. Using a simple adaptive Metropolis-Hastings scheme, we apply our high-performance computing framework to a Bayesian analysis of big gunshot data generated in Washington D.C. between the years of 2006 and 2019, thereby extending a past analysis of the same data from under 10,000 to over 85,000 observations. To encourage wide-spread use, we provide hpHawkes, an open-source R package, and discuss high-level implementation and program design for leveraging aspects of computational hardware that become necessary in a big data setting.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
05/11/2019

Massive parallelization boosts big Bayesian multidimensional scaling

Big Bayes is the computationally intensive co-application of big data an...
research
11/24/2014

Big Learning with Bayesian Methods

Explosive growth in data and availability of cheap computing resources h...
research
07/24/2019

New frontiers in Bayesian modeling using the INLA package in R

The INLA package provides a tool for computationally efficient Bayesian ...
research
05/16/2022

binspp: An R Package for Bayesian Inference for Neyman-Scott Point Processes with Complex Inhomogeneity Structure

The Neyman-Scott point process is a widely used point process model whic...
research
09/14/2016

Quick and energy-efficient Bayesian computing of binocular disparity using stochastic digital signals

Reconstruction of the tridimensional geometry of a visual scene using th...
research
10/17/2016

High-performance K-means Implementation based on a Simplified Map-Reduce Architecture

The k-means algorithm is one of the most common clustering algorithms an...
research
10/26/2018

On the Identifiability of the Influence Model for Stochastic Spatiotemporal Spread Processes

The influence model is a discrete-time stochastic model that succinctly ...

Please sign up or login with your details

Forgot password? Click here to reset