Transactional Indexes on (RDMA or CXL-based) Disaggregated Memory with Repairable Transaction

07/21/2023
by Xingda Wei, et al.

Failure-atomic and isolated execution of client operations is a default requirement for a system that serves multiple loosely coupled clients at a server. However, disaggregated memory (DM) breaks this requirement in remote indexes because a client operation is decomposed into multiple remote reads and writes. Current DM indexes focus on performance improvements and largely ignore tolerating client failures. We argue that a practical DM index should be transactional: each index operation should be failure atomic and failure isolated in addition to being concurrency isolated. We present repairable transaction (rTX), a lightweight primitive to execute DM index operations. Each rTX can detect other failed rTXes on-the-fly with the help of concurrency control. Upon detection, it repairs their non-atomic updates online with the help of logging, thus hiding their failures from healthy clients. By further removing unnecessary logging and delegating concurrency control to existing carefully tuned index algorithms, we show that transactional indexes can be built at a low performance overhead on disaggregated memory. We have refactored two state-of-the-art DM indexes, RaceHashing and Sherman (B+Tree), with rTX. Evaluations show that rTX is 1.2x to 2x faster than alternatives such as distributed transactions. Meanwhile, its overhead is up to 42% compared with non-fault-tolerant indexes.
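
The detect-and-repair idea can be illustrated with a small in-process simulation. This is a minimal sketch under stated assumptions, not the paper's implementation: `RemoteMemory`, `run_rtx`, `repair`, the single lock word, the per-client undo log, and the `failed_clients` set (standing in for some failure detector) are all hypothetical names and simplifications chosen for illustration. The abstract only states that detection relies on concurrency control and repair relies on logging; the choice of undo logging here is an assumption.

```python
from dataclasses import dataclass, field
from typing import Dict, List, Optional, Set, Tuple


@dataclass
class RemoteMemory:
    """Stand-in for a memory node: data slots, one lock word, per-client undo logs."""
    slots: Dict[str, int] = field(default_factory=dict)
    lock_owner: Optional[str] = None
    undo_logs: Dict[str, List[Tuple[str, Optional[int]]]] = field(default_factory=dict)


def repair(mem: RemoteMemory, failed: str) -> None:
    """Undo the failed client's partial writes (newest first), then free its lock."""
    for key, old in reversed(mem.undo_logs.pop(failed, [])):
        if old is None:
            mem.slots.pop(key, None)   # the key did not exist before the write
        else:
            mem.slots[key] = old
    mem.lock_owner = None


def run_rtx(mem: RemoteMemory, client: str, writes: List[Tuple[str, int]],
            failed_clients: Set[str], crash_after: Optional[int] = None) -> bool:
    """Apply `writes` atomically; return False if the caller must retry or crashed.

    `failed_clients` is an assumed failure oracle. On real hardware the lock
    check-and-acquire would be a single atomic compare-and-swap.
    """
    owner = mem.lock_owner
    if owner is not None:
        if owner not in failed_clients:
            return False               # healthy holder: back off and retry
        repair(mem, owner)             # failed holder: repair its updates online
    mem.lock_owner = client
    log = mem.undo_logs.setdefault(client, [])
    for i, (key, value) in enumerate(writes):
        if crash_after is not None and i == crash_after:
            return False               # simulated crash: lock and log stay behind
        log.append((key, mem.slots.get(key)))   # log the old value before writing
        mem.slots[key] = value
    mem.undo_logs.pop(client, None)    # commit: discard the log, release the lock
    mem.lock_owner = None
    return True
```

In this sketch, a client that crashes after its first write leaves the lock held and one entry in its undo log. The next healthy client that encounters the lock rolls the partial update back before running its own operation, so it never observes the half-applied state.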

