Universal Functional Regression with Neural Operator Flows (2404.02986v3)
Abstract: Regression on function spaces is typically limited to models with Gaussian process priors. We introduce the notion of universal functional regression, in which we aim to learn a prior distribution over non-Gaussian function spaces that remains mathematically tractable for functional regression. To do this, we develop Neural Operator Flows (OpFlow), an infinite-dimensional extension of normalizing flows. OpFlow is an invertible operator that maps the (potentially unknown) data function space into a Gaussian process, allowing for exact likelihood estimation of functional point evaluations. OpFlow enables robust and accurate uncertainty quantification via drawing posterior samples of the Gaussian process and subsequently mapping them into the data function space. We empirically study the performance of OpFlow on regression and generation tasks with data generated from Gaussian processes with known posterior forms and non-Gaussian processes, as well as real-world earthquake seismograms with an unknown closed-form distribution.
- Carl Edward Rasmussen and Christopher K. I. Williams. Gaussian processes for machine learning. Adaptive computation and machine learning. MIT Press, Cambridge, Mass, 2006. ISBN 978-0-262-18253-9. OCLC: ocm61285753.
- Geostatistics: modeling spatial uncertainty, volume 713. John Wiley & Sons, 2012. ISBN 0-470-18315-2.
- Gaussian Process Regression for Materials and Molecules. Chemical Reviews, 121(16):10073–10141, August 2021. ISSN 0009-2665. doi: 10.1021/acs.chemrev.1c00022. URL https://doi.org/10.1021/acs.chemrev.1c00022. Publisher: American Chemical Society.
- Gaussian Process Regression for Astronomical Time Series. Annual Review of Astronomy and Astrophysics, 61(1):329–371, 2023. doi: 10.1146/annurev-astro-052920-103508. URL https://doi.org/10.1146/annurev-astro-052920-103508. _eprint: https://doi.org/10.1146/annurev-astro-052920-103508.
- Generative Adversarial Neural Operators, October 2022a. URL http://arxiv.org/abs/2205.03017. arXiv:2205.03017 [cs, math].
- Score-based Diffusion Models in Function Space, November 2023. URL http://arxiv.org/abs/2302.07400. arXiv:2302.07400 [cs, math, stat].
- Variational Autoencoding Neural Operators, February 2023. URL http://arxiv.org/abs/2302.10351. arXiv:2302.10351 [cs, stat].
- Diffusion Generative Models in Infinite Dimensions, February 2023. URL http://arxiv.org/abs/2212.00886. arXiv:2212.00886 [cs, stat].
- Infinite-Dimensional Diffusion Models, February 2023. URL https://arxiv.org/abs/2302.10130v2.
- Conditional score-based diffusion models for Bayesian inference in infinite dimensions. Advances in Neural Information Processing Systems, 36:24262–24290, December 2023. URL https://proceedings.neurips.cc/paper_files/paper/2023/hash/4c79c359b3c5f077c0b955f93cb0f53e-Abstract-Conference.html.
- Multilevel Diffusion: Infinite Dimensional Score-Based Diffusion Models for Image Generation, March 2023. URL https://arxiv.org/abs/2303.04772v3.
- Bayesian learning via stochastic gradient Langevin dynamics. pages 681–688. Citeseer, 2011.
- Neural Operator: Learning Maps Between Function Spaces, April 2023. URL http://arxiv.org/abs/2108.08481. arXiv:2108.08481 [cs, math].
- Neural Operator: Graph Kernel Network for Partial Differential Equations, March 2020. URL http://arxiv.org/abs/2003.03485. arXiv:2003.03485 [cs, math, stat].
- Fourier Neural Operator for Parametric Partial Differential Equations, May 2021. URL http://arxiv.org/abs/2010.08895. arXiv:2010.08895 [cs, math].
- Neural Operators for Accelerating Scientific Simulations and Design, January 2024. URL http://arxiv.org/abs/2309.15325. arXiv:2309.15325 [physics].
- Neural Operators with Localized Integral and Differential Kernels, February 2024. URL http://arxiv.org/abs/2402.16845. arXiv:2402.16845 [cs, math] version: 1.
- Generative Adversarial Networks, June 2014. URL http://arxiv.org/abs/1406.2661. arXiv:1406.2661 [cs, stat].
- SCORE-BASED GENERATIVE MODELING THROUGH STOCHASTIC DIFFERENTIAL EQUATIONS. 2021.
- Auto-Encoding Variational Bayes, December 2022. URL http://arxiv.org/abs/1312.6114. arXiv:1312.6114 [cs, stat].
- Development of Synthetic Ground-Motion Records through Generative Adversarial Neural Operators. pages 105–113, February 2024a. doi: 10.1061/9780784485316.012. URL https://ascelibrary.org/doi/10.1061/9780784485316.012. Publisher: American Society of Civil Engineers.
- Broadband Ground-Motion Synthesis via Generative Adversarial Neural Operators: Development and Validation. Bulletin of the Seismological Society of America, March 2024b. ISSN 0037-1106. doi: 10.1785/0120230207. URL https://doi.org/10.1785/0120230207.
- Seismic Wave Propagation and Inversion with Neural Operators. The Seismic Record, 1(3):126–134, November 2021. ISSN 2694-4006. doi: 10.1785/0320210026. URL https://doi.org/10.1785/0320210026.
- Neural Encoding and Decoding With a Flow-Based Invertible Generative Model. IEEE Transactions on Cognitive and Developmental Systems, 15(2):724–736, June 2023. ISSN 2379-8920, 2379-8939. doi: 10.1109/TCDS.2022.3176977. URL https://ieeexplore.ieee.org/document/9780264/.
- Globally injective and bijective neural operators, June 2023. URL http://arxiv.org/abs/2306.03982. arXiv:2306.03982 [cs, stat].
- Neural Processes, July 2018. URL http://arxiv.org/abs/1807.01622. arXiv:1807.01622 [cs, stat].
- Neural Diffusion Processes, June 2022. URL http://arxiv.org/abs/2206.03992. arXiv:2206.03992 [cs, stat].
- Generative Models as Distributions of Functions, February 2022. URL http://arxiv.org/abs/2102.04776. arXiv:2102.04776 [cs, stat].
- Glow: Generative Flow with Invertible 1x1 Convolutions, July 2018. URL http://arxiv.org/abs/1807.03039. arXiv:1807.03039 [cs, stat].
- Density estimation using Real NVP, February 2017. URL http://arxiv.org/abs/1605.08803. arXiv:1605.08803 [cs, stat].
- Neural Ordinary Differential Equations, December 2019. URL http://arxiv.org/abs/1806.07366. arXiv:1806.07366 [cs, stat].
- Normalizing Flows: An Introduction and Review of Current Methods. IEEE Transactions on Pattern Analysis and Machine Intelligence, 43(11):3964–3979, November 2021. ISSN 0162-8828, 2160-9292, 1939-3539. doi: 10.1109/TPAMI.2020.2992934. URL http://arxiv.org/abs/1908.09257. arXiv:1908.09257 [cs, stat].
- WaveFlow: A Compact Flow-based Model for Raw Audio. In Proceedings of the 37th International Conference on Machine Learning, pages 7706–7716. PMLR, November 2020. URL https://proceedings.mlr.press/v119/ping20a.html. ISSN: 2640-3498.
- Leveraging exploration in off-policy algorithms via normalizing flows. In Proceedings of the Conference on Robot Learning, pages 430–444. PMLR, May 2020. URL https://proceedings.mlr.press/v100/mazoure20a.html. ISSN: 2640-3498.
- DeepGEM: Generalized Expectation-Maximization for Blind Inversion. In Advances in Neural Information Processing Systems, volume 34, pages 11592–11603. Curran Associates, Inc., 2021. URL https://proceedings.neurips.cc/paper_files/paper/2021/hash/606c90a06173d69682feb83037a68fec-Abstract.html.
- Composing Normalizing Flows for Inverse Problems. In Proceedings of the 38th International Conference on Machine Learning, pages 11158–11169. PMLR, July 2021. URL https://proceedings.mlr.press/v139/whang21b.html. ISSN: 2640-3498.
- SRFlow: Learning the Super-Resolution Space with Normalizing Flow, July 2020. URL http://arxiv.org/abs/2006.14200. arXiv:2006.14200 [cs, eess].
- The Bayesian Approach to Inverse Problems. In Roger Ghanem, David Higdon, and Houman Owhadi, editors, Handbook of Uncertainty Quantification, pages 311–428. Springer International Publishing, Cham, 2017. ISBN 978-3-319-12384-4 978-3-319-12385-1. doi: 10.1007/978-3-319-12385-1_7. URL http://link.springer.com/10.1007/978-3-319-12385-1_7.
- Bayesian Nonparametric Models. Encyclopedia of machine learning, 1:81–89, 2010.
- Daniel Barry. Nonparametric Bayesian Regression. The Annals of Statistics, 14(3):934–953, 1986. ISSN 0090-5364. URL https://www.jstor.org/stable/3035551. Publisher: Institute of Mathematical Statistics.
- Larry Wasserman. All of nonparametric statistics. Springer Science & Business Media, 2006. ISBN 0-387-30623-4.
- Augmented Normalizing Flows: Bridging the Gap Between Generative Flows and Latent Variable Models, February 2020. URL http://arxiv.org/abs/2002.07101. arXiv:2002.07101 [cs, stat].
- A survey on Image Data Augmentation for Deep Learning. Journal of Big Data, 6(1):60, July 2019. ISSN 2196-1115. doi: 10.1186/s40537-019-0197-0. URL https://doi.org/10.1186/s40537-019-0197-0.
- FourCastNet: A Global Data-driven High-resolution Weather Model using Adaptive Fourier Neural Operators, February 2022. URL http://arxiv.org/abs/2202.11214. arXiv:2202.11214 [physics].
- The ERA5 global reanalysis. Quarterly Journal of the Royal Meteorological Society, 146(730):1999–2049, 2020. ISSN 0035-9009. Publisher: Wiley Online Library.
- Learning from uncertain curves: The 2-Wasserstein metric for Gaussian processes. In Advances in Neural Information Processing Systems, volume 30. Curran Associates, Inc., 2017. URL https://proceedings.neurips.cc/paper_files/paper/2017/hash/7a006957be65e608e863301eb98e1808-Abstract.html.
- PaCMO: Partner Dependent Human Motion Generation in Dyadic Human Activity using Neural Operators, November 2022b. URL http://arxiv.org/abs/2211.16210. arXiv:2211.16210 [cs].
- A class of Wasserstein metrics for probability distributions. Michigan Mathematical Journal, 31(2):231–240, 1984. ISSN 0026-2285. Publisher: University of Michigan, Department of Mathematics.
- Fast Matrix Square Roots with Applications to Gaussian Processes and Bayesian Optimization. In Advances in Neural Information Processing Systems, volume 33, pages 22268–22281. Curran Associates, Inc., 2020. URL https://proceedings.neurips.cc/paper_files/paper/2020/hash/fcf55a303b71b84d326fb1d06e332a26-Abstract.html.
- Non-Gaussian Process Regression, September 2022. URL http://arxiv.org/abs/2209.03117. arXiv:2209.03117 [cs, stat].
- Sungjin Ahn. Stochastic gradient mcmc: Algorithms and applications. University of California, Irvine, 2015. ISBN 1-339-12430-0.
- Optimal Thinning of MCMC Output. Journal of the Royal Statistical Society Series B: Statistical Methodology, 84(4):1059–1081, September 2022. ISSN 1369-7412, 1467-9868. doi: 10.1111/rssb.12503. URL https://academic.oup.com/jrsssb/article/84/4/1059/7073269.
- A Survey of Constrained Gaussian Process Regression: Approaches and Implementation Challenges. Journal of Machine Learning for Modeling and Computing, 1(2):119–156, 2020. ISSN 2689-3967. doi: 10.1615/JMachLearnModelComput.2020035155. URL http://arxiv.org/abs/2006.09319. arXiv:2006.09319 [cs, math, stat].
- Monte Carlo statistical methods, volume 2. Springer, 1999.
- NIED. Strong-Motion Seismograph Networks (K-NET, KiK-Net). National Research Institute for Earth Science and Disaster Prevention, 2013.
- Peter M. Shearer. Introduction to Seismology. Cambridge University Press, May 2019. ISBN 978-1-107-18447-3. Google-Books-ID: 08aVDwAAQBAJ.
- SciPy 1.0: fundamental algorithms for scientific computing in Python. Nature methods, 17(3):261–272, 2020. ISSN 1548-7105. Publisher: Nature Publishing Group.
- The Random Feature Model for Input-Output Maps between Banach Spaces. SIAM Journal on Scientific Computing, 43(5):A3212–A3243, January 2021. ISSN 1064-8275, 1095-7197. doi: 10.1137/20M133957X. URL http://arxiv.org/abs/2005.10224. arXiv:2005.10224 [physics, stat].
- An Image is Worth 16x16 Words: Transformers for Image Recognition at Scale, June 2021. URL http://arxiv.org/abs/2010.11929. arXiv:2010.11929 [cs].
- Blending Anti-Aliasing into Vision Transformer. In Advances in Neural Information Processing Systems, volume 34, pages 5416–5429. Curran Associates, Inc., 2021. URL https://proceedings.neurips.cc/paper/2021/hash/2b3bf3eee2475e03885a110e9acaab61-Abstract.html.