Independence Testing for Temporal Data (1908.06486v5)
Abstract: Temporal data are increasingly prevalent in modern data science. A fundamental question is whether two time series are related or not. Existing approaches often have limitations, such as relying on parametric assumptions, detecting only linear associations, and requiring multiple tests and corrections. While many non-parametric and universally consistent dependence measures have recently been proposed, directly applying them to temporal data can inflate the p-value and result in an invalid test. To address these challenges, this paper introduces the temporal dependence statistic with block permutation to test independence between temporal data. Under proper assumptions, the proposed procedure is asymptotically valid and universally consistent for testing independence between stationary time series, and capable of estimating the optimal dependence lag that maximizes the dependence. Moreover, it is compatible with a rich family of distance and kernel based dependence measures, eliminates the need for multiple testing, and exhibits excellent testing power in various simulation settings.
- Graspy: Graph statistics in python. Journal of Machine Learning Research, 20(158):1–7.
- A kernel independence test for random processes. In 31st International Conference on Machine Learning, pages 1422–1430.
- A wild bootstrap for degenerate kernel tests.
- A multi-modal parcellation of human cerebral cortex. Nature, 536:171–178.
- Good, P. (2005). Permutation, Parametric, and Bootstrap Tests of Hypotheses. Springer.
- A kernel two-sample test. Journal of Machine Learning Research, 13:723–773.
- Consistent nonparametric tests of independence. Journal of Machine Learning Research, 11:1391–1423.
- Kernel methods for measuring independence. Journal of Machine Learning Research, 6:2075–2129.
- Dismantling the mantel tests. Methods in Ecology and Evolution, 4(4):336–344.
- New hsic-based tests for independence between two stationary multivariate time series. Statistica Sinica, 31(1):269–300.
- A consistent multivariate test of association based on ranks of distances. Biometrika, 100(2):503–510.
- Network dependence testing via diffusion maps and distance-based correlations. Biometrika, 106(4):857–873.
- On a measure of a lack of fit in time series models. Biometrika, 65(2):297–303.
- Ball covariance: A generic measure of dependence in banach space. Journal of the American Statistical Association, 115(529):307–317.
- hyppo: A comprehensive multivariate hypothesis testing python package.
- Politis, D. N. (2003). The impact of bootstrap methods on time series analysis. Statistical Science, 18(2):219–230.
- Equivalence of distance-based and rkhs-based statistics in hypothesis testing. Annals of Statistics, 41(5):2263–2291.
- The chi-square test of distance correlation. Journal of Computational and Graphical Statistics, 31(1):254–262.
- From distance correlation to multiscale graph correlation. Journal of the American Statistical Association, 115(529):280–291.
- The exact equivalence of distance and kernel methods in hypothesis testing. AStA Advances in Statistical Analysis, 105(3):385–403.
- Brownian distance covariance. Annals of Applied Statistics, 3(4):1233–1303.
- Partial distance correlation with methods for dissimilarities. Annals of Statistics, 42(6):2382–2412.
- Measuring and testing independence by correlation of distances. Annals of Statistics, 35(6):2769–2794.
- Discovering and deciphering relationships across disparate data modalities. eLife, 8:e41690.
- Large-scale kernel methods for independence testing. Statistics and Computing, 28(1):113–130.
- Projection correlation between two random vectors. Biometrika, 104(4):829–843.