Memory-Disaggregated In-Memory Object Store Framework for Big Data Applications

04/27/2022
by Robin Abrahamse, et al.

Memory disaggregation has recently been gaining traction in research. With memory disaggregation, data center compute nodes can directly access memory on adjacent nodes and can therefore overcome local memory restrictions, introducing a new data management paradigm for distributed computing. This paper proposes and demonstrates a memory-disaggregated in-memory object store framework for big data applications, built on the newly introduced ThymesisFlow memory disaggregation system. The framework extends the pre-existing Apache Arrow Plasma object store framework to distributed systems by enabling clients to easily and efficiently produce and consume data objects across multiple compute nodes. This allows big data applications to make greater use of parallel processing at reduced development cost. In addition, the paper includes latency and throughput measurements indicating that remote disaggregated memory access incurs only a modest performance penalty compared with local access (6.5 vs 5.75 GiB/s). The results can be used to guide the design of future systems that leverage memory disaggregation as well as the newly presented framework. This work is open-source and publicly accessible at https://doi.org/10.5281/zenodo.6368998.
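To put the quoted throughput figures in perspective, the relative penalty can be worked out with a few lines of Python. This is only an illustrative sketch: it assumes that 6.5 GiB/s is the local figure and 5.75 GiB/s the remote one (the abstract does not label them explicitly, but a remote penalty implies the lower number is remote), and the function name `relative_penalty` is hypothetical, not part of the framework.

```python
def relative_penalty(local_gib_s: float, remote_gib_s: float) -> float:
    """Return the fraction of local throughput lost on remote access."""
    if local_gib_s <= 0:
        raise ValueError("local throughput must be positive")
    return (local_gib_s - remote_gib_s) / local_gib_s


# Figures quoted in the abstract; which one is local is an assumption.
LOCAL_GIB_S = 6.5    # assumed: local memory throughput
REMOTE_GIB_S = 5.75  # assumed: remote disaggregated memory throughput

penalty = relative_penalty(LOCAL_GIB_S, REMOTE_GIB_S)
print(f"Relative throughput penalty: {penalty:.1%}")
```

Under that assumption the penalty comes to roughly 11.5% of local throughput, which is consistent with the abstract's description of it as modest.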

research
08/26/2016

When to use 3D Die-Stacked Memory for Bandwidth-Constrained Big Data Workloads

Response time requirements for big data processing systems are shrinking...
research
11/04/2021

Revisiting Active Object Stores: Bringing Data Locality to the Limit With NVM

Object stores are widely used software stacks that achieve excellent sca...
research
10/11/2018

A Comparative Study of Consistent Snapshot Algorithms for Main-Memory Database Systems

In-memory databases (IMDBs) are gaining increasing popularity in big dat...
research
03/09/2018

ROLP: Runtime Object Lifetime Profiling for Big Data Memory Management

Low latency services such as credit-card fraud detection and website tar...
research
03/11/2023

Design and Evaluation of a Rack-Scale Disaggregated Memory Architecture For Data Centers

Memory disaggregation is being considered as a strong alternative to tra...
research
08/03/2021

Optimally Hiding Object Sizes with Constrained Padding

Among the most challenging traffic-analysis attacks to confound are thos...
research
07/30/2019

Learning over inherently distributed data

The recent decades have seen a surge of interests in distributed computi...
