2000 character limit reached
Gap-Measure Tests with Applications to Data Integrity Verification (1906.01465v1)
Published 3 Jun 2019 in stat.ME and cs.CR
Abstract: In this paper we propose and examine gap statistics for assessing uniform distribution hypotheses. We provide examples relevant to data integrity testing for which max-gap statistics provide greater sensitivity than chi-square ($\chi2$), thus allowing the new test to be used in place of or as a complement to $\chi2$ testing for purposes of distinguishing a larger class of deviations from uniformity. We establish that the proposed max-gap test has the same sequential and parallel computational complexity as $\chi2$ and thus is applicable for Big Data analytics and integrity verification.