SDN helps Big Data to optimize access to data

12/30/2020
by   Yuankun Fu, et al.
0

This chapter introduces the state-of-the-art in the emerging area of combining High Performance Computing (HPC) with Big Data Analysis. To understand the new area, the chapter first surveys the existing approaches to integrating HPC with Big Data. Next, the chapter introduces several optimization solutions that focus on how to minimize the data transfer time from computation-intensive applications to analysis-intensive applications as well as minimizing the end-to-end time-to-solution. The solutions utilize SDN to adaptively use both high speed interconnect network and high performance parallel file systems to optimize the application performance. A computational framework called DataBroker is designed and developed to enable a tight integration of HPC with data analysis. Multiple types of experiments have been conducted to show different performance issues in both message passing and parallel file systems and to verify the effectiveness of the proposed research approaches.

READ FULL TEXT

page 5

page 8

page 10

page 11

page 12

page 15

page 19

page 20

research
07/29/2019

Geospatial Big Data Handling with High Performance Computing: Current Approaches and Future Directions

Geospatial big data plays a major role in the era of big data, as most d...
research
12/23/2019

Parallel Computing With R: A Brief Review

Parallel computing has established itself as another standard method for...
research
12/08/2017

OneDataShare: A Vision for Cloud-hosted Data Transfer Scheduling and Optimization as a Service

Fast, reliable, and efficient data transmission across wide-area network...
research
07/04/2022

Sea: A lightweight data-placement library for Big Data scientific computing

The recent influx of open scientific data has contributed to the transit...
research
02/14/2020

Big Data Staging with MPI-IO for Interactive X-ray Science

New techniques in X-ray scattering science experiments produce large dat...
research
03/22/2018

SCISPACE: A Scientific Collaboration Workspace for File Systems in Geo-Distributed HPC Data Centers

Future terabit networks are committed to dramatically improving big data...
research
06/30/2017

From Big Data to Big Displays: High-Performance Visualization at Blue Brain

Blue Brain has pushed high-performance visualization (HPV) to complement...

Please sign up or login with your details

Forgot password? Click here to reset