Papers
Topics
Authors
Recent
Search
2000 character limit reached

Box Filtration

Published 8 Apr 2024 in cs.CG and math.AT | (2404.05859v3)

Abstract: We define a new framework that unifies the filtration and mapper approaches from TDA, and present efficient algorithms to compute it. Termed the box filtration of a PCD, we grow boxes (hyperrectangles) that are not necessarily centered at each point (in place of balls centered at points). We grow the boxes non-uniformly and asymmetrically in different dimensions based on the distribution of points. We present two approaches to handle the boxes: a point cover where each point is assigned its own box at start, and a pixel cover that works with a pixelization of the space of the PCD. Any box cover in either setting automatically gives a mapper of the PCD. We show that the persistence diagrams generated by the box filtration using both point and pixel covers satisfy the classical stability based on the Gromov-Hausdorff distance. Using boxes also implies that the box filtration is identical for pairwise or higher order intersections whereas the VR and Cech filtration are not the same. Growth in each dimension is computed by solving a linear program (LP) that optimizes a cost functional balancing the cost of expansion and benefit of including more points in the box. The box filtration algorithm runs in $O(m|U(0)|\log(mn\pi)L(q))$ time, where $m$ is number of steps of increments considered for box growth, $|U(0)|$ is the number of boxes in the initial cover ($\leq$ number of points), $\pi$ is the step length for increasing each box dimension, each LP is solved in $O(L(q))$ time, $n$ is the PCD dimension, and $q = n \times |X|$. We demonstrate through multiple examples that the box filtration can produce more accurate results to summarize the topology of the PCD than VR and distance-to-measure (DTM) filtrations. Software for our implementation is available at https://github.com/pragup/Box-Filteration.

Definition Search Book Streamline Icon: https://streamlinehq.com
References (28)
  1. DTM-Based Filtrations. In Gill Barequet and Yusu Wang, editors, 35th International Symposium on Computational Geometry, SoCG 2019, June 18-21, 2019, Portland, Oregon, USA, volume 129 of LIPIcs, pages 58:1–58:15. Schloss Dagstuhl - Leibniz-Zentrum für Informatik, 2019.
  2. Robust statistics, hypothesis testing, and confidence intervals for persistent homology on metric measure spaces. Foundations of Computational Mathematics, 14(4):745–789, 2014.
  3. Topological consistency via kernel estimation. Bernoulli, 23(1):288–328, 2017.
  4. Efficient and robust persistent homology for measures. Computational Geometry, 58:70–96, 2016.
  5. Gunnar Carlsson. Topology and data. Bulletin of the American Mathematical Society, 46(2):255–308, January 2009.
  6. Statistical analysis and parameter selection for Mapper. Journal of Machine Learning Research, 19(12):1–39, 2018. arXiv:1706.00204.
  7. Structure and stability of the one-dimensional Mapper. Foundations of Computational Mathematics, 18:1333–1396, Oct 2018. arXiv:1511.05823.
  8. Geometric inference for probability measures. Foundations of Computational Mathematics, 11:733–751, 12 2011.
  9. The Structure and Stability of Persistence Modules. SpringerBriefs in Mathematics. Springer Cham, 1 edition, 2016.
  10. Persistence stability for geometric complexes. Geometriae Dedicata, 173:193–214, 2014.
  11. Scalar Field Analysis over Point Cloud Data. Discrete and Computational Geometry, 46(4):743–775, December 2011.
  12. Persistence-Based Clustering in Riemannian Manifolds. Journal of the ACM, 60(6), 2013.
  13. Finding and characterizing tunnels in macromolecules with application to ion channels and pores. Biophysical Journal, 96(2):632–645, 2009.
  14. Computing the Multicover Bifiltration. In Kevin Buchin and Éric Colin de Verdière, editors, 37th International Symposium on Computational Geometry (SoCG 2021), volume 189 of Leibniz International Proceedings in Informatics (LIPIcs), pages 27:1–27:17, Dagstuhl, Germany, 2021. Schloss Dagstuhl – Leibniz-Zentrum für Informatik.
  15. Multiscale Mapper: Topological summarization via codomain covers. In Proceedings of the Twenty-Seventh Annual ACM-SIAM Symposium on Discrete Algorithms, SODA ’16, pages 997–1013, Philadelphia, PA, USA, 2016. Society for Industrial and Applied Mathematics. arXiv:1504.03763.
  16. Paweł Dłotko. Ball Mapper: A shape summary for topological data analysis, 2019. arxiv:1901.07410.
  17. Computational Topology An Introduction. American Mathematical Society, December 2009.
  18. Persistent Homology: Theory and Practice. Lawrence Berkeley National Laboratory eScholarship, 2013.
  19. Witnessed K-Distance. In Proceedings of the Twenty-Seventh Annual Symposium on Computational Geometry, SoCG ’11, pages 57–64, New York, NY, USA, 2011. Association for Computing Machinery.
  20. Algorithm AS 136: A K-Means Clustering Algorithm. Journal of the Royal Statistical Society. Series C (Applied Statistics), 28(1):100–108, 1979.
  21. Finding the homology of manifolds using ellipsoids, 2020. arxiv:2006.09194.
  22. The Localized Union-Of-Balls Bifiltration. In Erin W. Chambers and Joachim Gudmundsson, editors, 39th International Symposium on Computational Geometry (SoCG 2023), volume 258 of Leibniz International Proceedings in Informatics (LIPIcs), pages 45:1–45:19, Dagstuhl, Germany, 2023. Schloss Dagstuhl – Leibniz-Zentrum für Informatik. Full version: arXiv:2303.07002.
  23. Extracting insights from the shape of complex data using topology. Scientific Reports, 3(1236), 2013.
  24. Geometric Inference on Kernel Density Estimates. In Lars Arge and János Pach, editors, 31st International Symposium on Computational Geometry (SoCG 2015), volume 34 of Leibniz International Proceedings in Informatics (LIPIcs), pages 857–871, Dagstuhl, Germany, 2015. Schloss Dagstuhl–Leibniz-Zentrum fuer Informatik.
  25. Alexander Schrijver. Theory of Linear and Integer Programming. Wiley-Interscience Series in Discrete Mathematics. John Wiley & Sons Ltd., Chichester, 1986.
  26. Topological Methods for the Analysis of High Dimensional Data Sets and 3D Object Recognition. In M. Botsch, R. Pajarola, B. Chen, and M. Zwicker, editors, Proceedings of the Symposium on Point Based Graphics, pages 91–100, Prague, Czech Republic, 2007. Eurographics Association.
  27. Deformation transfer for triangle meshes. ACM Transactions on Graphics, 23(3):399–405, aug 2004.
  28. Jan van den Brand. A Deterministic Linear Program Solver in Current Matrix Multiplication Time. In Proceedings of the 2020 ACM-SIAM Symposium on Discrete Algorithms (SODA ’20), pages 259–278, 2020.

Summary

No one has generated a summary of this paper yet.

Paper to Video (Beta)

No one has generated a video about this paper yet.

Whiteboard

No one has generated a whiteboard explanation for this paper yet.

Open Problems

We haven't generated a list of open problems mentioned in this paper yet.

Continue Learning

We haven't generated follow-up questions for this paper yet.

Collections

Sign up for free to add this paper to one or more collections.

Tweets

Sign up for free to view the 3 tweets with 0 likes about this paper.