Papers
Topics
Authors
Recent
Gemini 2.5 Flash
Gemini 2.5 Flash
149 tokens/sec
GPT-4o
7 tokens/sec
Gemini 2.5 Pro Pro
45 tokens/sec
o3 Pro
4 tokens/sec
GPT-4.1 Pro
38 tokens/sec
DeepSeek R1 via Azure Pro
28 tokens/sec
2000 character limit reached

A Novel Skip Orthogonal List for Dynamic Optimal Transport Problem (2310.18446v5)

Published 27 Oct 2023 in cs.DS, cs.AI, cs.CG, and math.OC

Abstract: Optimal transport is a fundamental topic that has attracted a great amount of attention from the optimization community in the past decades. In this paper, we consider an interesting discrete dynamic optimal transport problem: can we efficiently update the optimal transport plan when the weights or the locations of the data points change? This problem is naturally motivated by several applications in machine learning. For example, we often need to compute the optimal transport cost between two different data sets; if some changes happen to a few data points, should we re-compute the high complexity cost function or update the cost by some efficient dynamic data structure? We are aware that several dynamic maximum flow algorithms have been proposed before, however, the research on dynamic minimum cost flow problem is still quite limited, to the best of our knowledge. We propose a novel 2D Skip Orthogonal List together with some dynamic tree techniques. Although our algorithm is based on the conventional simplex method, it can efficiently find the variable to pivot within expected $O(1)$ time, and complete each pivoting operation within expected $O(|V|)$ time where $V$ is the set of all supply and demand nodes. Since dynamic modifications typically do not introduce significant changes, our algorithm requires only a few simplex iterations in practice. So our algorithm is more efficient than re-computing the optimal transport cost that needs at least one traversal over all $|E| = O(|V|2)$ variables, where $|E|$ denotes the number of edges in the network. Our experiments demonstrate that our algorithm significantly outperforms existing algorithms in the dynamic scenarios.

Definition Search Book Streamline Icon: https://streamlinehq.com
References (32)
  1. Screening sinkhorn algorithm for regularized optimal transport. Advances in Neural Information Processing Systems, 32, 2019.
  2. Geometric dataset distances via optimal transport. In NeurIPS 2020. ACM, February 2020. URL https://www.microsoft.com/en-us/research/publication/geometric-dataset-distances-via-optimal-transport/.
  3. Jon Louis Bentley. Multidimensional binary search trees used for associative searching. Communications of the ACM, 18(9):509–517, 1975.
  4. Displacement Interpolation Using Lagrangian Mass Transport. ACM Transactions on Graphics (SIGGRAPH ASIA 2011), 30(6), 2011.
  5. A dictionary of computer science. Oxford University Press, 2016.
  6. Maximum flow and minimum-cost flow in almost-linear time. In 2022 IEEE 63rd Annual Symposium on Foundations of Computer Science (FOCS), pages 612–623. IEEE, 2022.
  7. Dynamical wasserstein barycenters for time-series modeling. Advances in Neural Information Processing Systems, 34:27991–28003, 2021.
  8. William H Cunningham. A network simplex method. Mathematical Programming, 11:105–116, 1976.
  9. Marco Cuturi. Sinkhorn distances: Lightspeed computation of optimal transport. Advances in neural information processing systems, 26, 2013.
  10. The generalized simplex method for minimizing a linear form under linear inequality restraints. Pacific Journal of Mathematics, 5(2):183–195, 1955.
  11. Theoretical improvements in algorithmic efficiency for network flow problems. Journal of the ACM (JACM), 19(2):248–264, 1972.
  12. The skip quadtree: a simple dynamic data structure for multidimensional data. In Proceedings of the twenty-first annual symposium on Computational geometry, pages 296–305, 2005.
  13. Pot: Python optimal transport. The Journal of Machine Learning Research, 22(1):3571–3578, 2021.
  14. Fast optimal transport averaging of neuroimaging data. In Information Processing in Medical Imaging: 24th International Conference, IPMI 2015, Sabhal Mor Ostaig, Isle of Skye, UK, June 28-July 3, 2015, Proceedings 24, pages 261–272. Springer, 2015.
  15. Optimal mass transport for registration and warping. International Journal of computer vision, 60:225–240, 2004.
  16. pybind11 – seamless operability between c++11 and python. ttps://github.com/pybind/pybind11, 2017. Accessed: 2023-05-11.
  17. Group level meg/eeg source imaging via optimal transport: minimum wasserstein estimates. In Information Processing in Medical Imaging: 26th International Conference, IPMI 2019, Hong Kong, China, June 2–7, 2019, Proceedings 26, pages 743–754. Springer, 2019.
  18. Mnist handwritten digit database. http://yann.lecun.com/exdb/mnist, 2010. Accessed: 2022-07-29.
  19. Measuring the misfit between seismograms using an optimal transport distance: Application to full waveform inversion. Geophysical Supplements to the Monthly Notices of the Royal Astronomical Society, 205(1):345–377, 2016.
  20. Bradford G. Nickerson. Skip list data structures for multidimensional data. Technical report, University of Maryland at College Park, USA, 1994.
  21. James B Orlin. A polynomial time primal network simplex algorithm for minimum cost flows. Mathematical Programming, 78:109–129, 1997.
  22. Fast and robust earth mover’s distances. In 2009 IEEE 12th international conference on computer vision, pages 460–467. IEEE, 2009.
  23. Computational optimal transport: With applications to data science. Foundations and Trends® in Machine Learning, 11(5-6):355–607, 2019.
  24. William Pugh. Skip lists: a probabilistic alternative to balanced trees. Communications of the ACM, 33(6):668–676, 1990.
  25. Jonah Sherman. Generalized preconditioning and undirected minimum-cost flow. In Proceedings of the Twenty-Eighth Annual ACM-SIAM Symposium on Discrete Algorithms, pages 772–780. SIAM, 2017.
  26. A data structure for dynamic trees. In Proceedings of the thirteenth annual ACM symposium on Theory of computing, pages 114–122, 1981.
  27. Self-adjusting binary search trees. Journal of the ACM (JACM), 32(3):652–686, 1985.
  28. Smoothed analysis of algorithms: Why the simplex algorithm usually takes polynomial time. Journal of the ACM (JACM), 51(3):385–463, 2004.
  29. Robert E Tarjan. Dynamic trees as search trees via euler tours, applied to the network simplex algorithm. Mathematical Programming, 78(2):169–177, 1997.
  30. Overrelaxed sinkhorn–knopp algorithm for regularized optimal transport. Algorithms, 14(5):143, 2021.
  31. A survey on optimal transport for machine learning: Theory and applications. arXiv preprint arXiv:2106.01963, 2021.
  32. Minimum cost flows, mdps, and ℓℓ\ellroman_ℓ1-regression in nearly linear time for dense instances. In Proceedings of the 53rd Annual ACM SIGACT Symposium on Theory of Computing, pages 859–869, 2021.

Summary

We haven't generated a summary for this paper yet.