Raptor Zonal Statistics: Fully Distributed Zonal Statistics of Big Raster + Vector Data [Pre-Print]

10/13/2020
by   Samriddhi Singla, et al.
0

Recent advancements in remote sensing technology have resulted in petabytes of data in raster format. This data is often processed in combination with high resolution vector data that represents, for example, city boundaries. One of the common operations that combine big raster and vector data is the zonal statistics which computes some statistics for each polygon in the vector dataset. This paper models the zonal statistics problem as a join problem and proposes a novel distributed system that can scale to petabytes of raster and vector data. The proposed method does not require any preprocessing or indexing which makes it perfect for ad-hoc queries that scientists usually want to run. We devise a theoretical cost model that proves the efficiency of our algorithm over the baseline method. Furthermore, we run an extensive experimental evaluation on large scale satellite data with up-to a trillion pixels, and big vector data with up-to hundreds of millions of edges, and we show that our method can perfectly scale to big data with up-to two orders of magnitude performance gain over Rasdaman and Google Earth Engine.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
05/11/2021

An improved tile-based scalable distributed management model of massive high-resolution satellite images

The amount of remote sensing (RS) data has increased at an unexpected sc...
research
03/17/2021

Big Plastic Masses Detection using Sentinel 2 Images

This communication describes a preliminary research on detection of big ...
research
10/01/2013

Hopping over Big Data: Accelerating Ad-hoc OLAP Queries with Grasshopper Algorithms

This paper presents a family of algorithms for fast subset filtering wit...
research
05/16/2022

A Data Cube of Big Satellite Image Time-Series for Agriculture Monitoring

The modernization of the Common Agricultural Policy (CAP) requires the l...
research
05/06/2017

DeepDeath: Learning to Predict the Underlying Cause of Death with Big Data

Multiple cause-of-death data provides a valuable source of information t...
research
07/24/2020

Locality-Aware Rotated Ship Detection in High-Resolution Remote Sensing Imagery Based on Multi-Scale Convolutional Network

Ship detection has been an active and vital topic in the field of remote...

Please sign up or login with your details

Forgot password? Click here to reset