Multiscale Quantile Regression with Local Error Control
Abstract: For robust and efficient detection of change points, we introduce a novel methodology MUSCLE (multiscale quantile segmentation controlling local error) that partitions serial data into multiple segments, each sharing a common quantile. It leverages multiple tests for quantile changes over different scales and locations, and variational estimation. Unlike the often adopted global error control, MUSCLE focuses on local errors defined on individual segments, significantly improving detection power in finding change points. Meanwhile, due to the built-in model complexity penalty, it enjoys the finite sample guarantee that its false discovery rate (or the expected proportion of falsely detected change points) is upper bounded by its unique tuning parameter. Further, we obtain the consistency and the localisation error rates in estimating change points, under mild signal-to-noise-ratio conditions. Both match (up to log factors) the minimax optimality results in the Gaussian setup. All theories hold under the only distributional assumption of serial independence. Incorporating the wavelet tree data structure, we develop an efficient dynamic programming algorithm for computing MUSCLE. Extensive simulations as well as real data applications in electrophysiology and geophysics demonstrate its competitiveness and effectiveness. An implementation via R package muscle is available from GitHub.
- Highly parallel computing. Benjamin-Cummings Publishing Co., Inc.
- Detection of an anomalous cluster in a network. Ann. Statist., 39(1):278–304.
- Narrowest-over-threshold detection of multiple change points and change-point-like features. J. R. Stat. Soc. Ser. B. Stat. Methodol., 81(3):649–672.
- Global ecosystem thresholds driven by aridity. Science, 367(6479):787–790.
- Sparse high-dimensional regression: exact scalable algorithms and phase transitions. Ann. Statist., 48(1):300–323.
- Outlier detection: Methods, models, and classification. ACM Comput. Surv., 53(3):1–37.
- Consistencies and rates of convergence of jump-penalized least squares estimators. Ann. Statist., 37(1):157–183.
- Towards optimal range medians. Theor. Comput. Sci., 412(24):2588–2601.
- Wavelet trees for competitive programming. Olymp. Inform., 10:19–37.
- Introduction to algorithms. MIT Press, Cambridge, MA, third edition.
- Inferring change points in the spread of covid-19 reveals the effectiveness of interventions. Science, 369(6500):eabb9789.
- Multiscale change point detection for dependent data. Scand. J. Stat., 47(4):1243–1274.
- Ideal spatial adaptation by wavelet shrinkage. Biometrika, 81(3):425–455.
- Stepwise signal extraction via marginal likelihood. J. Amer. Statist. Assoc., 111(513):314–330.
- On the limit distribution of multiscale test statistics for nonparametric curve estimation. Math. Methods Statist., 15(1):20–25.
- Multiscale testing of qualitative hypotheses. Ann. Statist., 29(1):124–152.
- A MOSUM procedure for the estimation of multiple random change points. Bernoulli, 24(1):526–564.
- Window size selection in unsupervised time series analytics: A review and benchmark. In Guyet, T., Ifrim, G., Malinowski, S., Bagnall, A., Shafer, P., and Lemaire, V., editors, Advanced Analytics and Learning on Temporal Data, pages 83–101, Cham. Springer International Publishing.
- Identifying localized changes in large systems: Change-point detection for biomolecular simulations. Proc. Natl. Acad. Sci. U.S.A., 112(24):7454–7459.
- Segmentation and estimation of change-point models: false positive control and confidence regions. Ann. Statist., 48(3):1615–1647.
- Fearnhead, P. (2006). Exact and efficient Bayesian inference for multiple changepoint problems. Stat. Comput., 16(2):203–213.
- Changepoint detection in the presence of outliers. J. Amer. Statist. Assoc., 114(525):169–183.
- Multiscale change point inference. J. R. Stat. Soc. Ser. B. Stat. Methodol., 76(3):495–580.
- Fryzlewicz, P. (2014). Wild binary segmentation for multiple change-point detection. Ann. Statist., 42(6):2243–2281.
- Fryzlewicz, P. (2024). Robust narrowest significance pursuit: Inference for multiple change-points in the median. J. Bus. Econ. Stat. To appear.
- Multiscale DNA partitioning: statistical evidence for segments. Bioinformatics, 30(16):2255–2262.
- Mathematical foundations of infinite-dimensional statistical models. Cambridge University Press, New York.
- High-order entropy-compressed text indexes. In Proceedings of the Fourteenth Annual ACM-SIAM Symposium on Discrete Algorithms, page 841–850. Society for Industrial and Applied Mathematics.
- A variational view on statistical multiscale estimation. Annu. Rev. Stat. Appl., 9:343–372.
- Multiple change-point estimation with a total variation penalty. J. Amer. Statist. Assoc., 105(492):1480–1493.
- A computationally efficient nonparametric approach for changepoint detection. Stat. Comput., 27(5):1293–1305.
- Monitoring for a change point in a sequence of distributions. Ann. Statist., 49(4):2271–2291.
- Kabluchko, Z. (2011). Extremes of the standardized Gaussian noise. Stochastic Process. Appl., 121(3):515–533.
- Optimal detection of changepoints with a linear computational cost. J. Amer. Statist. Assoc., 107(500):1590–1598.
- Koenker, R. (2005). Quantile Regression. Cambridge University Press.
- Seeded binary segmentation: a general methodology for fast and optimal changepoint detection. Biometrika, 110(1):249–256.
- Multiscale change-point segmentation: beyond step functions. Electron. J. Stat., 13(2):3254–3296.
- FDR-control in multiscale change-point segmentation. Electron. J. Stat., 10(1):918–959.
- Change-point detection in time-series data by relative density-ratio estimation. Neural Netw., 43:72–83.
- Optimal nonparametric change point analysis. Electron. J. Stat., 15(1):1154–1201.
- On optimal multiple changepoint algorithms for large data. Stat. Comput., 27(2):519–533.
- A nonparametric approach for multiple change point analysis of multivariate data. J. Amer. Statist. Assoc., 109(505):334–345.
- Navarro, G. (2014). Wavelet trees for all. J. Discrete Algorithms, 25:2–20.
- Multiple change-point detection: a selective overview. Statist. Sci., 31(4):611–623.
- Norris, J. R. (1998). Markov chains, volume 2. Cambridge University Press, Cambridge.
- Page, E. S. (1955). A test for a change in a parameter occurring at an unknown point. Biometrika, 42:523–527.
- Heterogeneous idealization of ion channel recordings–open channel noise. IEEE Trans. Nanobioscience, 20(1):57–78.
- Heterogeneous change point inference. J. R. Stat. Soc. Ser. B. Stat. Methodol., 79(4):1207–1227.
- Fully automatic multiresolution idealization for filtered ion channel recordings: flickering event detection. IEEE Trans. Nanobioscience, 17(3):300–320.
- V-measure: A conditional entropy-based external cluster evaluation measure. In Proceedings of the 2007 joint conference on empirical methods in natural language processing and computational natural language learning (EMNLP-CoNLL), pages 410–420.
- Numerical Bayesian methods applied to signal processing. Springer Science & Business Media.
- Breaks and the statistical process of inflation: the case of estimating the ‘modern’long-run phillips curve. Empir. Econ., 56:1455–1475.
- Sakmann, B. (2013). Single-channel recording. Springer Science & Business Media.
- Selective review of offline change point detection methods. Signal Process., 167:107299.
- Tsybakov, A. B. (2009). Introduction to nonparametric estimation. Springer Series in Statistics. Springer, New York.
- Multiscale quantile segmentation. J. Amer. Statist. Assoc., 117(539):1384–1397.
- Identification of hidden markov models for ion channel currents. II. State-dependent excess noise. IEEE Trans. Signal Process., 46(7):1916–1929.
- Optimal change-point detection and localization. Ann. Statist., 51(4):1586–1610.
- Vostrikova, L. Y. (1981). Detecting “disorder” in multidimensional random processes. In Doklady akademii nauk, volume 259, pages 270–274. Russian Academy of Sciences.
- Wald, A. (1945). Sequential tests of statistical hypotheses. Ann. Math. Statist., 16:117–186.
- Calibrating the scan statistic: finite sample performance versus asymptotics. J. R. Stat. Soc. Ser. B. Stat. Methodol., 84(5):1608–1639.
- Wu, W. B. (2005). Nonlinear system theory: another look at dependence. Proc. Natl. Acad. Sci. USA, 102(40):14150–14154.
- Approximate simulation-free Bayesian inference for multiple changepoint models with dependence within segments. Bayesian Anal., 6(4):501–527.
- Nonparametric maximum likelihood approach to multiple change-point problems. Ann. Statist., 42(3):970–1002.
- Composite quantile regression and the oracle model selection theory. Ann. Statist., 36(3):1108–1126.
Paper Prompts
Sign up for free to create and run prompts on this paper using GPT-5.
Top Community Prompts
Collections
Sign up for free to add this paper to one or more collections.