COLE: A Column-based Learned Storage for Blockchain Systems

06/19/2023
by   Ce Zhang, et al.
0

Blockchain systems suffer from high storage costs as every node needs to store and maintain the entire blockchain data. After investigating Ethereum's storage, we find that the storage cost mostly comes from the index, i.e., Merkle Patricia Trie (MPT), that is used to guarantee data integrity and support provenance queries. To reduce the index storage overhead, an initial idea is to leverage the emerging learned index technique, which has been shown to have a smaller index size and more efficient query performance. However, directly applying it to the blockchain storage results in even higher overhead owing to the blockchain's persistence requirement and the learned index's large node size. Meanwhile, existing learned indexes are designed for in-memory databases, whereas blockchain systems require disk-based storage and feature frequent data updates. To address these challenges, we propose COLE, a novel column-based learned storage for blockchain systems. We follow the column-based database design to contiguously store each state's historical values, which are indexed by learned models to facilitate efficient data retrieval and provenance queries. We develop a series of write-optimized strategies to realize COLE in disk environments. Extensive experiments are conducted to validate the performance of the proposed COLE system. Compared with MPT, COLE reduces the storage size by up to 94

READ FULL TEXT

page 1

page 2

page 3

page 4

research
12/06/2018

vChain: Enabling Verifiable Boolean Range Queries over Blockchain Databases

Blockchains have recently been under the spotlight due to the boom of cr...
research
02/14/2018

ForkBase: An Efficient Storage Engine for Blockchain and Forkable Applications

Existing data storage systems offer a wide range of functionalities to a...
research
06/05/2023

A Simple Yet High-Performing On-disk Learned Index: Can We Have Our Cake and Eat it Too?

While in-memory learned indexes have shown promising performance as comp...
research
03/04/2023

Building a Modal-balanced BlockChain with Semantic Reconstruction

The current large blockchain systems (BTC Lightning network, Ethereum, e...
research
04/24/2023

An Efficient Built-in Temporal Support in MVCC-based Graph Databases

Real-world graphs are often dynamic and evolve over time. To trace the e...
research
08/29/2023

SALI: A Scalable Adaptive Learned Index Framework based on Probability Models

The growth in data storage capacity and the increasing demands for high ...
research
03/03/2021

Integrating Column-Oriented Storage and Query Processing Techniques Into Graph Database Management Systems

We revisit column-oriented storage and query processing techniques in th...

Please sign up or login with your details

Forgot password? Click here to reset