Gap-Measure Tests with Applications to Data Integrity Verification

06/03/2019
by   Truc Le, et al.
0

In this paper we propose and examine gap statistics for assessing uniform distribution hypotheses. We provide examples relevant to data integrity testing for which max-gap statistics provide greater sensitivity than chi-square (χ^2), thus allowing the new test to be used in place of or as a complement to χ^2 testing for purposes of distinguishing a larger class of deviations from uniformity. We establish that the proposed max-gap test has the same sequential and parallel computational complexity as χ^2 and thus is applicable for Big Data analytics and integrity verification.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
11/03/2018

Fast Integrity Verification for High-Speed File Transfers

The amount of data generated by scientific and commercial applications i...
research
02/21/2020

Practical Verification of MapReduce Computation Integrity via Partial Re-execution

Big data processing is often outsourced to powerful, but untrusted cloud...
research
07/01/2022

AUDITEM: Toward an Automated and Efficient Data Integrity Verification Model Using Blockchain

Data tampering is often considered a severe problem in industrial applic...
research
10/23/2018

Goodness-of-Fit Tests for Large Datasets

Nowadays, data analysis in the world of Big Data is connected typically ...
research
02/05/2023

Simulation-Driven Automated End-to-End Test and Oracle Inference

This is the first work to report on inferential testing at scale in indu...
research
07/31/2022

Locating modifications in signed data for partial data integrity

We consider the problem of detecting and locating modifications in signe...
research
10/25/2019

Embracing a mechanized formalization gap

If a code base is so big and complicated that complete mechanical verifi...

Please sign up or login with your details

Forgot password? Click here to reset