Causal Change Point Detection and Localization (2403.12677v1)
Abstract: Detecting and localizing change points in sequential data is of interest in many areas of application. Various notions of change points have been proposed, such as changes in mean, variance, or the linear regression coefficient. In this work, we consider settings in which a response variable $Y$ and a set of covariates $X=(X1,\ldots,X{d+1})$ are observed over time and aim to find changes in the causal mechanism generating $Y$ from $X$. More specifically, we assume $Y$ depends linearly on a subset of the covariates and aim to determine at what time points either the dependency on the subset or the subset itself changes. We call these time points causal change points (CCPs) and show that they form a subset of the commonly studied regression change points. We propose general methodology to both detect and localize CCPs. Although motivated by causality, we define CCPs without referencing an underlying causal model. The proposed definition of CCPs exploits a notion of invariance, which is a purely observational quantity but -- under additional assumptions -- has a causal meaning. For CCP localization, we propose a loss function that can be combined with existing multiple change point algorithms to localize multiple CCPs efficiently. We evaluate and illustrate our methods on simulated datasets.
- J. Aldrich. Autonomy. Oxford Economic Papers, 41(1):15–34, 1989.
- D. W. Andrews. Tests for parameter instability and structural change with unknown change point. Econometrica: Journal of the Econometric Society, 61(4):821–856, 1993.
- A. Aue and L. Horváth. Structural breaks in time series. Journal of Time Series Analysis, 34(1):1–16, 2013.
- J. Bai. Testing for parameter constancy in linear regressions: an empirical distribution function approach. Econometrica: Journal of the Econometric Society, 64(3):597–622, 1996.
- J. Bai. Estimating multiple breaks one at a time. Econometric theory, 13(3):315–352, 1997a.
- J. Bai. Estimation of a change point in multiple regression models. Review of Economics and Statistics, 79(4):551–563, 1997b.
- J. Bai and P. Perron. Estimating and testing linear models with multiple structural changes. Econometrica: Journal of the Econometric Society, 66(1):47–78, 1998.
- J. Bai and P. Perron. Computation and analysis of multiple structural change models. Journal of applied econometrics, 18(1):1–22, 2003.
- Narrowest-over-threshold detection of multiple change points and change-point-like features. Journal of the Royal Statistical Society Series B: Statistical Methodology, 1, 2019.
- B. Brodsky and B. Darkhovsky. Non-Parametric Methods in Change-Point Problems. Kluwer Academic Publishers, 1993.
- G. C. Chow. Tests of equality between sets of coefficients in two linear regressions. Econometrica: Journal of the Econometric Society, 28(3):591–605, 1960.
- P. Fryzlewicz. Wild binary segmentation for multiple change-point detection. The Annals of Statistics, 42(6):2243–2281, 2014.
- T. Haavelmo. The probability approach in econometrics. Econometrica: Journal of the Econometric Society, 12:iii–115, 1944.
- B. E. Hansen. Testing for structural change in conditional models. Journal of Econometrics, 97(1):93–115, 2000.
- D. M. Hawkins. Point estimation of the parameters of piecewise regression models. Journal of the Royal Statistical Society Series C: Applied Statistics, 25(1):51–57, 1976.
- Causal discovery from heterogeneous/nonstationary data. The Journal of Machine Learning Research, 21(1):3482–3534, 2020.
- Optimal detection of changepoints with a linear computational cost. Journal of the American Statistical Association, 107(500):1590–1598, 2012.
- Seeded binary segmentation: a general methodology for fast and optimal changepoint detection. Biometrika, 110(1), 2023.
- F. Leonardi and P. Bühlmann. Computationally efficient change point detection for high-dimensional regression. arXiv preprint arXiv:1601.03704, 2016.
- L. Orváth and P. Kokoszka. Change-point detection with non-parametric regression. Statistics: A Journal of Theoretical and Applied Statistics, 36(1):9–31, 2002.
- E. Page. A test for a change in a parameter occurring at an unknown point. Biometrika, 42(3/4):523–527, 1955.
- E. S. Page. Continuous inspection schemes. Biometrika, 41(1/2):100–115, 1954.
- J. Pearl. Causality. Cambridge university press, 2009.
- Testing jointly for structural changes in the error variance and coefficients of a linear regression model. Quantitative Economics, 11(3):1019–1057, 2020.
- Causal inference by using invariant prediction: identification and confidence intervals. Journal of the Royal Statistical Society: Series B (Statistical Methodology), 78(5):947–1012, 2016.
- Invariant causal prediction for sequential data. Journal of the American Statistical Association, 114(527):1264–1276, 2019.
- Selective review of offline change point detection methods. Signal Processing, 167:107299, 2020.
- L. Y. Vostrikova. Detecting “disorder” in multidimensional random processes. In Doklady akademii nauk, volume 259, pages 270–274. Russian Academy of Sciences, 1981.
- Statistically and computationally efficient change point localization in regression settings. The Journal of Machine Learning Research, 22(1):11255–11300, 2021.
- Testing and dating of structural changes in practice. Computational Statistics & Data Analysis, 44(1-2):109–123, 2003.