Statistically-Robust Clustering Techniques for Mapping Spatial Hotspots: A Survey

03/22/2021
by Yiqun Xie, et al.

Mapping of spatial hotspots, i.e., regions with significantly higher rates or probability density of generating certain events (e.g., disease or crime cases), is an important task in diverse societal domains, including public health, public safety, transportation, agriculture, and environmental science. The clustering techniques required by these domains differ from traditional clustering methods because spurious results (e.g., false alarms of crime clusters) carry high economic and social costs. As a result, statistical rigor is needed to explicitly control the rate of spurious detections. To address this challenge, techniques for statistically-robust clustering have been extensively studied by the data mining and statistics communities. In this survey, we present an up-to-date and detailed review of the models and algorithms developed in this field. We first present a general taxonomy of the clustering process with statistical rigor, covering the key steps of data and statistical modeling, region enumeration and maximization, significance testing, and data update. We then discuss the different paradigms and methods within each of the key steps. Finally, we highlight research gaps and potential future directions, which may serve as a stepping stone for generating new ideas in this growing field and beyond.
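
The key steps named in the abstract (statistical modeling, region enumeration and maximization, significance testing) can be illustrated with a minimal sketch. It is not taken from the survey: it assumes a Kulldorff-style Poisson likelihood ratio as the statistical model, rectangular windows on a small grid as the candidate regions, and a Monte Carlo test under a null in which cases fall in cells in proportion to population. All function names, parameters, and the toy data are hypothetical, and every cell is assumed to have a positive population.

```python
import math
import random


def poisson_llr(obs, exp, total):
    """Log likelihood ratio of a region with `obs` observed and `exp` expected
    cases out of `total`; only elevated rates count (assumed Poisson model)."""
    if obs <= exp:
        return 0.0
    rest = total - obs
    outside = rest * math.log(rest / (total - exp)) if rest > 0 else 0.0
    return obs * math.log(obs / exp) + outside


def best_window(cases, pop, max_size=3):
    """Enumerate rectangular windows up to max_size x max_size and return
    (highest score, (row, col, height, width)) for the maximizing window."""
    rows, cols = len(cases), len(cases[0])
    total_cases = sum(map(sum, cases))
    total_pop = sum(map(sum, pop))
    best = (0.0, None)
    for r0 in range(rows):
        for c0 in range(cols):
            for h in range(1, min(max_size, rows - r0) + 1):
                for w in range(1, min(max_size, cols - c0) + 1):
                    cells = [(r, c) for r in range(r0, r0 + h)
                             for c in range(c0, c0 + w)]
                    obs = sum(cases[r][c] for r, c in cells)
                    exp = total_cases * sum(pop[r][c] for r, c in cells) / total_pop
                    score = poisson_llr(obs, exp, total_cases)
                    if score > best[0]:
                        best = (score, (r0, c0, h, w))
    return best


def monte_carlo_p_value(cases, pop, n_sim=199, max_size=3, seed=0):
    """Compare the observed maximum score with maxima from simulated null
    datasets; return (observed score, window, p-value)."""
    rng = random.Random(seed)
    rows, cols = len(cases), len(cases[0])
    total_cases = sum(map(sum, cases))
    cell_ids = [(r, c) for r in range(rows) for c in range(cols)]
    weights = [pop[r][c] for r, c in cell_ids]
    observed, window = best_window(cases, pop, max_size)
    exceed = 0
    for _ in range(n_sim):
        # Null model: redistribute all cases in proportion to population.
        sim = [[0] * cols for _ in range(rows)]
        for r, c in rng.choices(cell_ids, weights=weights, k=total_cases):
            sim[r][c] += 1
        if best_window(sim, pop, max_size)[0] >= observed:
            exceed += 1
    return observed, window, (exceed + 1) / (n_sim + 1)


# Toy data (made up): a 5 x 5 grid with uniform population and an
# elevated 2 x 2 block of cases starting at row 1, column 1.
pop = [[100] * 5 for _ in range(5)]
cases = [[2] * 5 for _ in range(5)]
for r in (1, 2):
    for c in (1, 2):
        cases[r][c] = 12

score, window, p_value = monte_carlo_p_value(cases, pop)
print(window, round(score, 2), p_value)
```

Taking the maximum over all windows inside each simulated dataset is what keeps the rate of spurious detections under control: the observed best window is compared against the best window that chance alone produces, rather than against a single pre-chosen region.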

