Score Matching for Truncated Density Estimation on a Manifold (2206.14668v2)
Abstract: When observations are truncated, we are limited to an incomplete picture of our dataset. Recent methods propose to use score matching for truncated density estimation, where the access to the intractable normalising constant is not required. We present a novel extension of truncated score matching to a Riemannian manifold with boundary. Applications are presented for the von Mises-Fisher and Kent distributions on a two dimensional sphere in $\mathbb{R}3$, as well as a real-world application of extreme storm observations in the USA. In simulated data experiments, our score matching estimator is able to approximate the true parameter values with a low estimation error and shows improvements over a naive maximum likelihood estimator.
- A kernel test of goodness of fit. In Balcan, M. F. and Weinberger, K. Q. (eds.), Proceedings of The 33rd International Conference on Machine Learning, volume 48 of Proceedings of Machine Learning Research, pp. 2606–2615, New York, New York, USA, 20–22 Jun 2016. PMLR.
- Hyvärinen, A. Estimation of non-normalized statistical models by score matching. Journal of Machine Learning Research, 6(Apr):695–709, 2005.
- Hyvärinen, A. Some extensions of score matching. Computational statistics & data analysis, 51(5):2499–2512, 2007.
- Jost, J. Riemannian geometry and geometric analysis, volume 42005. Springer, 2008.
- Kent, J. T. The fisher-bingham distribution on the sphere. Journal of the Royal Statistical Society: Series B (Methodological), 44(1):71–80, 1982.
- Atlantic hurricane database uncertainty and presentation of a new database format, 2013. Mon. Wea. Rev., 141, 3576-3592. Date accessed: 10th May 2022.
- Lee, J. M. Smooth manifolds. In Introduction to Smooth Manifolds, pp. 1–31. Springer, 2013.
- Li, S. Z. Markov random field modeling in image analysis. Springer Science & Business Media, 2009.
- A kernelized stein discrepancy for goodness-of-fit tests. In International conference on machine learning, pp. 276–284. PMLR, 2016.
- Estimating density models with truncation boundaries using score matching. Journal of Machine Learning Research, 23(186):1–38, 2022. URL http://jmlr.org/papers/v23/21-0218.html.
- Directional statistics, 2009.
- Score matching estimators for directional distributions. arXiv preprint arXiv:1604.08470, 2016.
- Ncdc storm events database, 2022. Date accessed: 10th May 2022.
- Oliver, J. E. Encyclopedia of world climatology. Springer Science & Business Media, 2008.
- Rfast: A Collection of Efficient and Extremely Fast R Functions, 2021. URL https://CRAN.R-project.org/package=Rfast. R package version 2.0.3.
- R Core Team. R: A Language and Environment for Statistical Computing. R Foundation for Statistical Computing, Vienna, Austria, 2021. URL https://www.R-project.org/.
- Calculus: early transcendentals. Cengage Learning, 2020.
- Energy-based models for sparse overcomplete representations. Journal of Machine Learning Research, 4(Dec):1235–1260, 2003.
- Directional: A Collection of R Functions for Directional Data Analysis, 2021. URL https://CRAN.R-project.org/package=Directional. R package version 5.0.
- Wickham, H. ggplot2: Elegant Graphics for Data Analysis. Springer-Verlag New York, 2016. ISBN 978-3-319-24277-4. URL https://ggplot2.tidyverse.org.
- Wood, A. T. Simulation of the von mises fisher distribution. Communications in statistics-simulation and computation, 23(1):157–164, 1994.
- Score-based hypothesis testing for unnormalized models. IEEE Access, 10:71936–71950, 2022.
- Graphical models for non-negative data using generalized score matching. In Proceedings of the Twenty-First International Conference on Artificial Intelligence and Statistics, volume 84 of Proceedings of Machine Learning Research, pp. 1781–1790. PMLR, 09–11 Apr 2018.
- Generalized score matching for non-negative data. Journal of Machine Learning Research, 20(76):1–70, 2019a. URL http://jmlr.org/papers/v20/18-278.html.
- Generalized score matching for non-negative data. Journal of Machine Learning Research, 20(76):1–70, 2019b. URL http://jmlr.org/papers/v20/18-278.html.
- Generalized score matching for general domains. Information and Inference: A Journal of the IMA, 01 2021a.
- Generalized score matching for general domains. Information and Inference: A Journal of the IMA, 01 2021b. ISSN 2049-8772. doi: 10.1093/imaiai/iaaa041. URL https://doi.org/10.1093/imaiai/iaaa041. iaaa041.