Meta Flow Matching: Integrating Vector Fields on the Wasserstein Manifold (2408.14608v2)
Abstract: Numerous biological and physical processes can be modeled as systems of interacting entities evolving continuously over time, e.g. the dynamics of communicating cells or physical particles. Learning the dynamics of such systems is essential for predicting the temporal evolution of populations across novel samples and unseen environments. Flow-based models allow for learning these dynamics at the population level - they model the evolution of the entire distribution of samples. However, current flow-based models are limited to a single initial population and a set of predefined conditions which describe different dynamics. We argue that multiple processes in natural sciences have to be represented as vector fields on the Wasserstein manifold of probability densities. That is, the change of the population at any moment in time depends on the population itself due to the interactions between samples. In particular, this is crucial for personalized medicine where the development of diseases and their respective treatment response depend on the microenvironment of cells specific to each patient. We propose Meta Flow Matching (MFM), a practical approach to integrate along these vector fields on the Wasserstein manifold by amortizing the flow model over the initial populations. Namely, we embed the population of samples using a Graph Neural Network (GNN) and use these embeddings to train a Flow Matching model. This gives MFM the ability to generalize over the initial distributions, unlike previously proposed methods. We demonstrate the ability of MFM to improve the prediction of individual treatment responses on a large-scale multi-patient single-cell drug screen dataset.
- Task2vec: Task embedding for meta-learning. In Proceedings of the IEEE/CVF international conference on computer vision (ICCV), pages 6430–6439.
- Building normalizing flows with stochastic interpolants. arXiv preprint arXiv:2209.15571.
- Gradient flows: in metric spaces and in the space of probability measures. Springer Science & Business Media.
- Meta optimal transport. arXiv preprint arXiv:2206.05262.
- Amos, B. et al. (2023). Tutorial on amortized optimization. Foundations and Trends® in Machine Learning, 16(5):592–732.
- Input convex neural networks. In International conference on machine learning, pages 146–155. PMLR.
- Deciphering cell–cell interactions and communication from gene expression. Nature Reviews Genetics, 22(2):71–88.
- Benamou, J.-D. (2003). Numerical resolution of an “unbalanced” mass transport problem. ESAIM: Mathematical Modelling and Numerical Analysis, 37(5):851–868.
- Understanding the tumor immune microenvironment (time) for effective therapy. Nature Medicine, 24(5):541–550.
- Learning single-cell perturbation responses using neural optimal transport. Nature Methods, 20(11):1759–1768.
- Generative flows on discrete state-spaces: Enabling multimodal flows with applications to protein co-design. arXiv preprint arXiv:2402.04997.
- Dissecting heterogeneous cell populations across drug and disease conditions with popalign. Proceedings of the National Academy of Sciences, 117(46):28784–28794.
- Learning to optimize: A primer and a benchmark. Journal of Machine Learning Research, 23(189):1–59.
- Unbalanced optimal transport: Dynamic and kantorovich formulations. Journal of Functional Analysis, 274(11):3090–3123.
- Single-cell rna-seq enables comprehensive tumour and immune cell profiling in primary breast cancer. Nature Communications, 8(1).
- Flow matching in latent space. arXiv preprint arXiv:2307.08698.
- Diffusion schrödinger bridge with applications to score-based generative modeling. Advances in Neural Information Processing Systems, 34:17695–17709.
- Multimodal pooled perturb-cite-seq screens in patient models define mechanisms of cancer immune evasion. Nature Genetics, 53(3):332–341.
- Neural message passing for quantum chemistry. In International conference on machine learning, pages 1263–1272. PMLR.
- Gap junctions. Cold Spring Harb Perspect Biol, 1(1):a002576.
- A kernel two-sample test. The Journal of Machine Learning Research, 13(1):723–773.
- Single-cell transcriptional diversity is a hallmark of developmental potential. Science, 367(6476):405–411.
- Learning population-level diffusions with generative recurrent networks. In Proceedings of the 33rd International Conference on Machine Learning, pages 2417–2426.
- Predicting cellular responses to novel drug perturbations at a single-cell resolution. In Koyejo, S., Mohamed, S., Agarwal, A., Belgrave, D., Cho, K., and Oh, A., editors, Advances in Neural Information Processing Systems, volume 35, pages 26711–26722. Curran Associates, Inc.
- Meta-learning in neural networks: A survey. IEEE transactions on pattern analysis and machine intelligence, 44(9):5149–5169.
- Manifold interpolating optimal-transport flows for trajectory inference.
- Geodesic sinkhorn: Optimal transport for high-dimensional datasets. In IEEE MLSP.
- Extended flow matching: a method of conditional generation with generalized continuity equation. arXiv preprint arXiv:2402.18839.
- Mean field limit and propagation of chaos for vlasov systems with bounded forces. Journal of Functional Analysis, 271(12):3588–3627.
- Machine learning for perturbational single-cell omics. Cell Systems, 12(6):522–537.
- Neural lagrangian schr\"odinger bridge. In ICLR.
- Flow matching for generative modeling. In The Eleventh International Conference on Learning Representations.
- I2sb: Image-to-image schrödinger bridge. In ICML.
- Flow straight and fast: Learning to generate and transfer data with rectified flow. arXiv preprint arXiv:2209.03003.
- Deep generative modeling for single-cell transcriptomics. Nature Methods, 15(12):1053–1058.
- scgen predicts single-cell perturbation responses. Nature Methods, 16(8):715–721.
- Optimal transport mapping via input convex neural networks. In ICML.
- A single cell characterisation of human embryogenesis identifies pluripotency transitions and putative anterior hypoblast centre. Nature Communications, 12(1).
- A computational framework for solving wasserstein lagrangian flows. arXiv preprint arXiv:2310.10649.
- Action matching: A variational method for learning stochastic dynamics from samples.
- Otto, F. (2001). The geometry of dissipative evolution equations: the porous medium equation.
- scperturb: harmonized single-cell perturbation data. Nature Methods, pages 1–10.
- Computational Optimal Transport. arXiv:1803.00567.
- Multisample flow matching: Straightening flows with minibatch couplings. arXiv preprint arXiv:2304.14772.
- Gencast: Diffusion-based ensemble forecasting for medium-range weather. arXiv preprint arXiv:2312.15796.
- Trellis tree-based analysis reveals stromal regulation of patient-derived organoid drug responses. Cell, 186(25):5606–5619.e24.
- Single-cell topological rna-seq analysis reveals insights into cellular differentiation and development. Nature Biotechnology, 35(6):551–560.
- High-resolution image synthesis with latent diffusion models. In Proceedings of the IEEE/CVF conference on computer vision and pattern recognition, pages 10684–10695.
- Palette: Image-to-image diffusion models. In ACM SIGGRAPH 2022 conference proceedings, pages 1–10.
- Photorealistic text-to-image diffusion models with deep language understanding. Advances in neural information processing systems, 35:36479–36494.
- Learning to simulate complex physics with graph networks. In International conference on machine learning, pages 8459–8468. PMLR.
- Optimal-transport analysis of single-cell gene expression identifies developmental trajectories in reprogramming. Cell, 176(4):928–943.
- Aligned diffusion schröodinger bridges. In UAI.
- Exponential scaling of single-cell rna-seq in the past decade. Nature protocols, 13(4):599–604.
- Improving and generalizing flow-based generative models with minibatch optimal transport. Transactions on Machine Learning Research. Expert Certification.
- Trajectorynet: A dynamic optimal transport network for modeling cellular dynamics. In International conference on machine learning, pages 9526–9536. PMLR.
- Simulation-free schrödinger bridges via score and flow matching. In International Conference on Artificial Intelligence and Statistics, pages 1279–1287. PMLR.
- The monge gap: A regularizer to learn all transport maps. In Krause, A., Brunskill, E., Cho, K., Engelhardt, B., Sabato, S., and Scarlett, J., editors, Proceedings of the 40th International Conference on Machine Learning, volume 202 of Proceedings of Machine Learning Research, pages 34709–34733. PMLR.
- Climode: Climate and weather forecasting with physics-informed neural odes. arXiv preprint arXiv:2404.10024.
- Villani, C. (2009). Optimal transport: old and new, volume 338. Springer.
- Fundamental limits on dynamic inference from single-cell snapshots. 115(10):E2467–E2476.
- Scalable unbalanced optimal transport using generative adversarial networks. In 7th International Conference on Learning Representations, page 20.
- Deep sets. Advances in neural information processing systems, 30.
- Single-cell rna sequencing-based computational analysis to describe disease heterogeneity. Frontiers in Genetics, 10.
- Guided flows for generative modeling and decision making. arXiv preprint arXiv:2311.13443.