Apache Spark Accelerated Deep Learning Inference for Large Scale Satellite Image Analytics

08/08/2019
by   Dalton Lunga, et al.
11

The shear volumes of data generated from earth observation and remote sensing technologies continue to make major impact; leaping key geospatial applications into the dual data and compute intensive era. As a consequence, this rapid advancement poses new computational and data processing challenges. We implement a novel remote sensing data flow (RESFlow) for advanced machine learning and computing with massive amounts of remotely sensed imagery. The core contribution is partitioning massive amount of data based on the spectral and semantic characteristics for distributed imagery analysis. RESFlow takes advantage of both a unified analytics engine for large-scale data processing and the availability of modern computing hardware to harness the acceleration of deep learning inference on expansive remote sensing imagery. The framework incorporates a strategy to optimize resource utilization across multiple executors assigned to a single worker. We showcase its deployment across computationally and data-intensive on pixel-level labeling workloads. The pipeline invokes deep learning inference at three stages; during deep feature extraction, deep metric mapping, and deep semantic segmentation. The tasks impose compute intensive and GPU resource sharing challenges motivating for a parallelized pipeline for all execution steps. By taking advantage of Apache Spark, Nvidia DGX1, and DGX2 computing platforms, we demonstrate unprecedented compute speed-ups for deep learning inference on pixel labeling workloads; processing 21,028 Terrabytes of imagery data and delivering an output maps at area rate of 5.245sq.km/sec, amounting to 453,168 sq.km/day - reducing a 28 day workload to 21 hours.

READ FULL TEXT

page 3

page 4

page 5

page 6

page 8

page 14

research
04/01/2018

EarthMapper: A Tool Box for the Semantic Segmentation of Remote Sensing Imagery

Deep learning continues to push state-of-the-art performance for the sem...
research
03/13/2021

A review of machine learning in processing remote sensing data for mineral exploration

As a primary step in mineral exploration, a variety of features are mapp...
research
04/15/2013

GPU Acclerated Automated Feature Extraction from Satellite Images

The availability of large volumes of remote sensing data insists on high...
research
06/18/2019

SEN12MS -- A Curated Dataset of Georeferenced Multi-Spectral Sentinel-1/2 Imagery for Deep Learning and Data Fusion

The availability of curated large-scale training data is a crucial facto...
research
11/20/2020

Enhancing Poaching Predictions for Under-Resourced Wildlife Conservation Parks Using Remote Sensing Imagery

Illegal wildlife poaching is driving the loss of biodiversity. To combat...
research
12/02/2019

Large-scale text processing pipeline with Apache Spark

In this paper, we evaluate Apache Spark for a data-intensive machine lea...

Please sign up or login with your details

Forgot password? Click here to reset