Coo: Rethink Data Anomalies In Databases

09/14/2021
by Haixiang Li, et al.

Transaction processing technology rests on three key components: data anomalies, isolation levels, and concurrency control algorithms. Concurrency control algorithms eliminate some or all data anomalies at different isolation levels to ensure data consistency. Isolation levels in the current ANSI standard are defined by disallowing certain kinds of data anomalies. Yet the definitions of data anomalies in the ANSI standard are controversial. On the one hand, the definitions lack a mathematical formalization and are open to ambiguous interpretation. On the other hand, the definitions are made case by case, so that even a senior DBA cannot have infallible knowledge of data anomalies, because their nature is not fully understood. While revised definitions in the existing literature propose various mathematical formalizations to address the former problem, how to address the latter remains an open problem. In this paper, we present a general framework called Coo that can systematically define data anomalies. Under this framework, we show that the data anomalies reported so far are only a small portion of all data anomalies. We theoretically prove that Coo is complete in mathematically formalizing data anomalies, and we employ a novel method to classify the infinitely many data anomalies. In addition, we use this framework to define new isolation levels and to quantitatively describe the concurrency and rollback rate of mainstream concurrency control algorithms. These results show that the C and I of ACID can be quantitatively analyzed based on all data anomalies.
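To make concrete the statement that ANSI isolation levels are defined by disallowing certain data anomalies, the following minimal sketch encodes the familiar ANSI SQL-92 table of four levels and three named anomalies. It illustrates the existing standard only, not the Coo framework or its new isolation levels; the names `Anomaly`, `ANSI_DISALLOWED`, `permits`, and `weakest_level_disallowing` are hypothetical helpers introduced here for illustration.

```python
from enum import Enum


class Anomaly(Enum):
    """The three anomalies named in the ANSI SQL-92 isolation table."""
    DIRTY_READ = "dirty read"
    NON_REPEATABLE_READ = "non-repeatable read"
    PHANTOM = "phantom"


# Each ANSI level, weakest to strongest, mapped to the anomalies it disallows.
# Assumption: this mirrors the commonly cited SQL-92 table; it says nothing
# about anomalies outside these three.
ANSI_DISALLOWED = {
    "READ UNCOMMITTED": frozenset(),
    "READ COMMITTED": frozenset({Anomaly.DIRTY_READ}),
    "REPEATABLE READ": frozenset(
        {Anomaly.DIRTY_READ, Anomaly.NON_REPEATABLE_READ}
    ),
    "SERIALIZABLE": frozenset(Anomaly),
}


def permits(level, anomaly):
    """Return True if the given level does not disallow the anomaly."""
    return anomaly not in ANSI_DISALLOWED[level]


def weakest_level_disallowing(anomalies):
    """Return the weakest ANSI level that disallows every listed anomaly."""
    required = frozenset(anomalies)
    for level, disallowed in ANSI_DISALLOWED.items():  # insertion order
        if required <= disallowed:
            return level
    raise ValueError("no ANSI level disallows all of the given anomalies")
```

For example, `weakest_level_disallowing([Anomaly.PHANTOM])` selects `SERIALIZABLE`, since no weaker level in this table disallows phantoms. A table of this shape is case by case by construction: an anomaly that is not one of the three listed entries cannot be expressed in it at all, which is the limitation the abstract points to.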


