Neural FIM for learning Fisher Information Metrics from point cloud data (2306.06062v2)
Abstract: Although data diffusion embeddings are ubiquitous in unsupervised learning and have proven to be a viable technique for uncovering the underlying intrinsic geometry of data, diffusion embeddings are inherently limited due to their discrete nature. To this end, we propose neural FIM, a method for computing the Fisher information metric (FIM) from point cloud data - allowing for a continuous manifold model for the data. Neural FIM creates an extensible metric space from discrete point cloud data such that information from the metric can inform us of manifold characteristics such as volume and geodesics. We demonstrate Neural FIM's utility in selecting parameters for the PHATE visualization method as well as its ability to obtain information pertaining to local volume illuminating branching points and cluster centers embeddings of a toy dataset and two single-cell datasets of IPSC reprogramming and PBMCs (immune cells).
- 10x Genomics. Pbmcs from c57bl/6 mice (v1, 150x150), single cell immune profiling dataset by cell ranger 3.1.0, 2019.
- Amari, S.-i. Information geometry and its applications, volume 194. Springer, 2016.
- Information geometry. Springer, 2008.
- Laplacian eigenmaps for dimensionality reduction and data representation. Neural computation, 15(6):1373–1396, 2003.
- Out-of-sample extensions for lle, isomap, mds, eigenmaps, and spectral clustering. Advances in neural information processing systems, 16, 2003.
- Diffusion curvature for estimating local curvature in high dimensional data. arXiv preprint arXiv:2206.03977, 2022.
- Geometric deep learning: going beyond euclidean data. IEEE Signal Processing Magazine, 34(4):18–42, 2017.
- Stochastic neighbor embedding (sne) for dimension reduction and visualization using arbitrary divergences. Neurocomputing, 90:23–45, 2012.
- Neural ordinary differential equations. Advances in neural information processing systems, 31, 2018.
- On the diffusion geometry of graph laplacians and applications. Applied and Computational Harmonic Analysis, 46(3):674–688, 2019.
- Diffusion maps. Applied and Computational Harmonic Analysis, 21(1):5–30, July 2006. ISSN 10635203. doi: 10.1016/j.acha.2006.04.006.
- Elements of information theory second edition solutions to problems. Internet Access, pp. 19–20, 2006.
- Crooks, G. E. Measuring thermodynamic length. Physical Review Letters, 99(10):100602, 2007.
- Cybenko, G. V. Approximation by superpositions of a sigmoidal function. Mathematics of Control, Signals and Systems, 2:303–314, 1989.
- Out-of-sample extension for dimensionality reduction of noisy time series. IEEE Transactions on Image Processing, 26(11):5435–5446, 2017.
- De Domenico, M. Diffusion geometry unravels the emergence of functional clusters in collective phenomena. Physical review letters, 118(16):168301, 2017.
- Diffusion pseudotime robustly reconstructs lineage branching. Nature methods, 13(10):845–848, 2016.
- Intrinsic dimensionality estimation based on manifold assumption. Journal of Visual Communication and Image Representation, 25(5):740–747, 2014.
- Manifold interpolating optimal-transport flows for trajectory inference. In NeurIPS, 2022.
- Diffusion kernels on statistical manifolds. Journal of Machine Learning Research, 6(1), 2005.
- Lauritzen, S. L. Statistical manifolds. Differential geometry in statistical inference, 10:163–216, 1987.
- Incremental nonlinear dimensionality reduction by manifold learning. IEEE transactions on pattern analysis and machine intelligence, 28(3):377–391, 2006.
- On a prior based on the wasserstein information matrix. Statistics & Probability Letters, 190:109645, 2022.
- Wasserstein proximal of gans. In Geometric Science of Information: 5th International Conference, GSI 2021, Paris, France, July 21–23, 2021, Proceedings, pp. 524–533. Springer, 2021.
- Riemannian manifold learning. IEEE transactions on pattern analysis and machine intelligence, 30(5):796–809, 2008.
- Umap: Uniform manifold approximation and projection for dimension reduction, 2018.
- Kernel pca and de-noising in feature spaces. Advances in neural information processing systems, 11, 1998.
- Visualizing structure and transitions in high-dimensional biological data. Nat Biotechnol, 37(12):1482–1492, 2019.
- Nielsen, F. An elementary introduction to information geometry. Entropy, 22(10):1100, 2020.
- Noguchi, M. Geometry of statistical manifolds. Differential Geometry and its Applications, 2(3):197–222, 1992.
- Error metrics for learning reliable manifolds from streaming data. In Proceedings of the 2017 SIAM International Conference on Data Mining, pp. 750–758. SIAM, 2017.
- The shape of data: Intrinsic distance for data distributions. arXiv preprint arXiv:1905.11141, 2019.
- Visualizing data using t-sne. Journal of Machine Learning Research, 2008.
- A continuous molecular roadmap to ipsc reprogramming through progression analysis of single-cell mass cytometry. Cell stem cell, 16(3):323–337, 2015.