Fast Spatial Autocorrelation

10/17/2020
by   Anar Amgalan, et al.
0

Physical or geographic location proves to be an important feature in many data science models, because many diverse natural and social phenomenon have a spatial component. Spatial autocorrelation measures the extent to which locally adjacent observations of the same phenomenon are correlated. Although statistics like Moran's I and Geary's C are widely used to measure spatial autocorrelation, they are slow: all popular methods run in Ω(n^2) time, rendering them unusable for large data sets, or long time-courses with moderate numbers of points. We propose a new S_A statistic based on the notion that the variance observed when merging pairs of nearby clusters should increase slowly for spatially autocorrelated variables. We give a linear-time algorithm to calculate S_A for a variable with an input agglomeration order (available at https://github.com/aamgalan/spatial_autocorrelation). For a typical dataset of n ≈ 63,000 points, our S_A autocorrelation measure can be computed in 1 second, versus 2 hours or more for Moran's I and Geary's C. Through simulation studies, we demonstrate that S_A identifies spatial correlations in variables generated with spatially-dependent model half an order of magnitude earlier than either Moran's I or Geary's C. Finally, we prove several theoretical properties of S_A: namely that it behaves as a true correlation statistic, and is invariant under addition or multiplication by a constant.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
09/01/2022

A Bayesian shared-frailty spatial scan statistic model for time-to-event data

Spatial scan statistics are well known and widely used methods for the d...
research
11/03/2020

Spatially Clustered Regression

Spatial regression or geographically weighted regression models have bee...
research
01/08/2022

Bayesian Changepoint Estimation for Spatially Indexed Functional Time Series

We propose a Bayesian hierarchical model to simultaneously estimate mean...
research
01/07/2016

Measuring and Discovering Correlations in Large Data Sets

In this paper, a class of statistics named ART (the alternant recursive ...
research
03/29/2023

Measuring spatial association and testing spatial independence based on short time course data

Spatial association measures for univariate static spatial data are wide...
research
04/05/2023

A Class of Models for Large Zero-inflated Spatial Data

Spatially correlated data with an excess of zeros, usually referred to a...
research
11/26/2019

The spatiotemporal tau statistic: a review

Introduction The tau statistic is a recent second-order correlation fu...

Please sign up or login with your details

Forgot password? Click here to reset