GeoRocket: A scalable and cloud-based data store for big geospatial files

01/25/2020
by   Michel Krämer, et al.
0

We present GeoRocket, a software for the management of very large geospatial datasets in the cloud. GeoRocket employs a novel way to handle arbitrarily large datasets by splitting them into chunks that are processed individually. The software has a modern reactive architecture and makes use of existing services including Elasticsearch and storage back ends such as MongoDB or Amazon S3. GeoRocket is schema-agnostic and supports a wide range of heterogeneous geospatial file formats. It is also format-preserving and does not alter imported data in any way. The main benefits of GeoRocket are its performance, scalability, and usability, which make it suitable for a number of scientific and commercial use cases dealing with very high data volumes, complex datasets, and high velocity (Big Data). GeoRocket also provides many opportunities for further research in the area of geospatial data management.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
03/22/2022

BigBird: Big Data Storage and Analytics at Scale in Hybrid Cloud

Implementing big data storage at scale is a complex and arduous task tha...
research
06/11/2018

A Cost-based Storage Format Selector for Materialization in Big Data Frameworks

Modern big data frameworks (such as Hadoop and Spark) allow multiple use...
research
09/24/2021

User-Defined Functions for HDF5

Scientific datasets are known for their challenging storage demands and ...
research
04/20/2018

Analyzing astronomical data with Apache Spark

We investigate the performances of Apache Spark, a cluster computing fra...
research
07/26/2018

CloudMe Forensics: A Case of Big-Data Investigation

The issue of increasing volume, variety and velocity of has been an area...
research
09/22/2021

ProvLet: A Provenance Management Service for Long Tail Microscopy Data

Provenance management must be present to enhance the overall security an...
research
12/01/2021

Efficient loading of reduced data ensembles produced at ORNL SNS/HFIR neutron time-of-flight facilities

We present algorithmic improvements to the loading operations of certain...

Please sign up or login with your details

Forgot password? Click here to reset