Data-Intensive Supercomputing in the Cloud: Global Analytics for Satellite Imagery

02/13/2017
by   Michael S. Warren, et al.
0

We present our experiences using cloud computing to support data-intensive analytics on satellite imagery for commercial applications. Drawing from our background in high-performance computing, we draw parallels between the early days of clustered computing systems and the current state of cloud computing and its potential to disrupt the HPC market. Using our own virtual file system layer on top of cloud remote object storage, we demonstrate aggregate read bandwidth of 230 gigabytes per second using 512 Google Compute Engine (GCE) nodes accessing a USA multi-region standard storage bucket. This figure is comparable to the best HPC storage systems in existence. We also present several of our application results, including the identification of field boundaries in Ukraine, and the generation of a global cloud-free base layer from Landsat imagery.

READ FULL TEXT

page 2

page 6

page 7

research
11/02/2020

10 Years Later: Cloud Computing is Closing the Performance Gap

Can cloud computing infrastructures provide HPC-competitive performance ...
research
07/06/2018

Exploring Scientific Application Performance Using Large Scale Object Storage

One of the major performance and scalability bottlenecks in large scient...
research
07/30/2021

Cloud to Ground Secured Computing: User Experiences on the Transition from Cloud-Based to Locally-Sited Hardware

The application of high-performance computing (HPC) processes, tools, an...
research
04/26/2019

A Benchmarking Study to Evaluate Apache Spark on Large-Scale Supercomputers

As dataset sizes increase, data analysis tasks in high performance compu...
research
07/23/2018

From Bare Metal to Virtual: Lessons Learned when a Supercomputing Institute Deploys its First Cloud

As primary provider for research computing services at the University of...
research
05/16/2023

Accelerating Communications in Federated Applications with Transparent Object Proxies

Advances in networks, accelerators, and cloud services encourage program...
research
06/08/2018

Intelligently-automated facilities expansion with the HEPCloud Decision Engine

The next generation of High Energy Physics experiments are expected to g...

Please sign up or login with your details

Forgot password? Click here to reset