ExaGeoStatR: A Package for Large-Scale Geostatistics in R

07/23/2019
by   Sameh Abdulah, et al.
0

Parallel computing in Gaussian process calculation becomes a necessity for avoiding computational and memory restrictions associated with Geostatistics applications. The evaluation of the Gaussian log-likelihood function requires O(n^2) storage and O(n^3) operations where n is the number of geographical locations. In this paper, we present ExaGeoStatR, a package for large-scale Geostatistics in R that supports parallel computation of the maximum likelihood function on shared memory, GPU, and distributed systems. The parallelization depends on breaking down the numerical linear algebra operations into a set of tasks and rendering them for a task-based programming model. ExaGeoStatR supports several maximum likelihood computation variants such as exact, Diagonal Super Tile (DST), and Tile Low-Rank (TLR) approximation besides providing a tool to generate large-scale synthetic datasets which can be used to test and compare different approximations methods. The package can be used directly through the R environment without any C, CUDA, or MPIknowledge. Here, we demonstrate the ExaGeoStatR package by illustrating its implementation details, analyzing its performance on various parallel architectures, and assessing its accuracy using both synthetic datasets and a sea surface temperature dataset. The performance evaluation involves spatial datasets with up to 250K observations.

READ FULL TEXT
research
04/24/2018

Tile Low-Rank Approximation of Large-Scale Maximum Likelihood Estimation on Manycore Architectures

Maximum likelihood estimation is an important statistical technique for ...
research
04/24/2018

Parallel Approximation of the Maximum Likelihood Estimation for the Prediction of Large-Scale Geostatistics Simulations

Maximum likelihood estimation is an important statistical technique for ...
research
08/03/2020

High Performance Multivariate Spatial Modeling for Geostatistical Data on Manycore Systems

Modeling and inferring spatial relationships and predicting missing valu...
research
11/20/2018

Parallel Matrix Condensation for Calculating Log-Determinant of Large Matrix

Calculating the log-determinant of a matrix is useful for statistical co...
research
09/08/2020

GPU Parallel Computation of Morse-Smale Complexes

The Morse-Smale complex is a well studied topological structure that rep...
research
03/06/2022

GPU Accelerated Maximum Likelihood Analysis for Phylogenetic Inference

With the advancement of biology and computer science, the amount of DNA ...
research
04/10/2020

A multi-resolution approximation via linear projection for large spatial datasets

Recent technical advances in collecting spatial data have been increasin...

Please sign up or login with your details

Forgot password? Click here to reset