Automatic Storage Structure Selection for Hybrid Workload

08/15/2020
by Hongzhi Wang, et al.

In database systems, the design of the storage engine and data model directly affects query performance. Users therefore need to select the storage engine and design the data model according to the workload they encounter. In a hybrid workload, however, the query set changes dynamically, and so does the optimal storage structure. Motivated by this, we propose an automatic storage structure selection system based on a learned cost model, which dynamically selects the optimal storage structure of the database under hybrid workloads. In the system, we introduce a machine learning method to build a cost model for the storage engine, together with a column-oriented data layout generation algorithm. Experimental results show that the proposed system can choose the optimal combination of storage engine and data model for the current workload, greatly improving performance over the default storage structure. The system is also designed to be compatible with different storage engines for easy use in practical applications.
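The idea of choosing a storage structure from a learned cost model can be illustrated with a minimal sketch. It is not the authors' method: it swaps in a one-variable least-squares fit as a stand-in for the machine learning model, and the structure names, the single workload feature (fraction of analytical queries), and all cost figures are hypothetical values made up for illustration.

```python
# Hypothetical training data: (fraction of analytical queries, measured cost)
# for each candidate storage structure. All numbers are invented.
OBSERVATIONS = {
    "row_store": [(0.0, 1.0), (0.5, 3.0), (1.0, 5.0)],
    "column_store": [(0.0, 4.0), (0.5, 3.0), (1.0, 2.0)],
}


def fit_line(samples):
    """Ordinary least-squares fit of cost = slope * x + intercept."""
    n = len(samples)
    mean_x = sum(x for x, _ in samples) / n
    mean_y = sum(y for _, y in samples) / n
    sxx = sum((x - mean_x) ** 2 for x, _ in samples)
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in samples)
    slope = sxy / sxx
    return slope, mean_y - slope * mean_x


def fit_cost_models(observations):
    """Fit one linear cost model per candidate storage structure."""
    return {name: fit_line(samples) for name, samples in observations.items()}


def select_structure(models, analytical_fraction):
    """Return the structure with the lowest predicted cost for the workload."""
    def predicted_cost(name):
        slope, intercept = models[name]
        return slope * analytical_fraction + intercept

    return min(models, key=predicted_cost)


models = fit_cost_models(OBSERVATIONS)
print(select_structure(models, 0.1))  # mostly transactional workload
print(select_structure(models, 0.9))  # mostly analytical workload
```

Re-running `select_structure` whenever the observed query mix changes is one simple way to approximate dynamic selection; a real system would use richer workload features and a more expressive model.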

research
06/16/2020

Index Selection for NoSQL Database with Deep Reinforcement Learning

We propose a new approach of NoSQL database index selection. For differe...
research
03/08/2019

Deductive Optimization of Relational Data Storage

Optimizing the physical data storage and retrieval of data are two key d...
research
04/11/2020

Adaptive HTAP through Elastic Resource Scheduling

Modern Hybrid Transactional/Analytical Processing (HTAP) systems use an ...
research
10/26/2021

Endure: A Robust Tuning Paradigm for LSM Trees Under Workload Uncertainty

Log-Structured Merge trees (LSM trees) are increasingly used as the stor...
research
08/22/2017

Towards a Holistic Integration of Spreadsheets with Databases: A Scalable Storage Engine for Presentational Data Management

Spreadsheet software is the tool of choice for interactive ad-hoc data m...
research
02/25/2023

TS-Cabinet: Hierarchical Storage for Cloud-Edge-End Time-series Database

Hierarchical data storage is crucial for cloud-edge-end time-series data...
research
06/26/2018

A Tensor Based Data Model for Polystore: An Application to Social Networks Data

In this article, we show how the mathematical object tensor can be used ...
