Adaptive Bayesian Multivariate Spline Knot Inference with Prior Specifications on Model Complexity (2405.13353v1)

Published 22 May 2024 in stat.ME and stat.ML

Abstract: In multivariate spline regression, the number and locations of knots strongly influence both performance and interpretability. However, due to non-differentiability and varying dimensions, no satisfactory frequentist method exists for inference on knots. In this article, we propose a fully Bayesian approach for knot inference in multivariate spline regression. Existing Bayesian methods often use BIC to calculate the posterior, but BIC is too liberal and heavily overestimates the knot number when the candidate model space is large. We specify a new prior on the knot number that takes into account the complexity of the model space and derive an analytic formula in the normal model. In non-normal cases, we use the extended Bayesian information criterion to approximate the posterior density. Samples are simulated in the space of differing dimensions via reversible jump Markov chain Monte Carlo. We apply the proposed method to knot inference and manifold denoising. Experiments demonstrate the strong performance of the algorithm, especially in fitting functions with jump discontinuities.
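
To make the contrast between BIC and the extended Bayesian information criterion (EBIC) concrete, here is a minimal Python sketch. It assumes a Gaussian model scored by its residual sum of squares and uses the standard EBIC form, which adds 2γ log C(p, k) to BIC when k knots are chosen from p candidate locations. The function names, the `base_params` default, and the numbers in the usage lines are hypothetical illustrations; this is not the paper's prior or its analytic formula for the normal model.

```python
import math


def log_binom(p, k):
    """Log of the binomial coefficient C(p, k), computed via log-gamma."""
    return math.lgamma(p + 1) - math.lgamma(k + 1) - math.lgamma(p - k + 1)


def gaussian_ebic(rss, n, num_knots, num_candidates, gamma=1.0, base_params=2):
    """EBIC for a Gaussian spline fit (illustrative sketch, not the paper's formula).

    rss            -- residual sum of squares of the fitted model
    n              -- number of observations
    num_knots      -- number of knots k in the model
    num_candidates -- number of candidate knot locations p (assumed finite grid)
    gamma          -- EBIC weight; gamma = 0 recovers ordinary BIC
    base_params    -- coefficients present even with zero knots (assumption)
    """
    df = base_params + num_knots
    # -2 * profile log-likelihood of a Gaussian model, up to an additive constant
    bic = n * math.log(rss / n) + df * math.log(n)
    # Extra penalty for the number of models that use exactly num_knots knots
    return bic + 2.0 * gamma * log_binom(num_candidates, num_knots)


# Hypothetical fits: RSS shrinks only slightly as knots are added.
n, p = 200, 100
for k, rss in [(1, 52.0), (2, 50.5), (3, 50.0)]:
    bic = gaussian_ebic(rss, n, k, p, gamma=0.0)
    ebic = gaussian_ebic(rss, n, k, p, gamma=1.0)
    print(f"k={k}: BIC={bic:.2f}, EBIC={ebic:.2f}")
```

With γ = 0 the function returns ordinary BIC. For γ > 0 the added term grows with the number of ways to place k knots among the p candidates, which is the sense in which the penalty reflects the size of the model space.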
