Papers
Topics
Authors
Recent
Gemini 2.5 Flash
Gemini 2.5 Flash
149 tokens/sec
GPT-4o
7 tokens/sec
Gemini 2.5 Pro Pro
45 tokens/sec
o3 Pro
4 tokens/sec
GPT-4.1 Pro
38 tokens/sec
DeepSeek R1 via Azure Pro
28 tokens/sec
2000 character limit reached

Learning rigid-body simulators over implicit shapes for large-scale scenes and vision (2405.14045v1)

Published 22 May 2024 in cs.LG and cs.CV

Abstract: Simulating large scenes with many rigid objects is crucial for a variety of applications, such as robotics, engineering, film and video games. Rigid interactions are notoriously hard to model: small changes to the initial state or the simulation parameters can lead to large changes in the final state. Recently, learned simulators based on graph networks (GNNs) were developed as an alternative to hand-designed simulators like MuJoCo and PyBullet. They are able to accurately capture dynamics of real objects directly from real-world observations. However, current state-of-the-art learned simulators operate on meshes and scale poorly to scenes with many objects or detailed shapes. Here we present SDF-Sim, the first learned rigid-body simulator designed for scale. We use learned signed-distance functions (SDFs) to represent the object shapes and to speed up distance computation. We design the simulator to leverage SDFs and avoid the fundamental bottleneck of the previous simulators associated with collision detection. For the first time in literature, we demonstrate that we can scale the GNN-based simulators to scenes with hundreds of objects and up to 1.1 million nodes, where mesh-based approaches run out of memory. Finally, we show that SDF-Sim can be applied to real world scenes by extracting SDFs from multi-view images.

Definition Search Book Streamline Icon: https://streamlinehq.com
References (42)
  1. Learning rigid dynamics with face interaction graph networks. In International Conference on Learning Representations, 2023.
  2. Neural implicit surfaces for efficient and accurate collisions in physically based simulations. ArXiv, abs/2110.01614, 2021. URL https://api.semanticscholar.org/CorpusID:238354360.
  3. Neural rgb-d surface reconstruction. In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), pages 6290–6301, June 2022.
  4. Mip-NeRF 360: Unbounded anti-aliased neural radiance fields. CVPR, 2022.
  5. Relational inductive biases, deep learning, and graph networks. arXiv preprint arXiv:1806.01261, 2018.
  6. SDFusion: Multimodal 3d shape completion, reconstruction, and generation. In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, pages 4456–4465, 2023.
  7. Diffusion-sdf: Conditional generative modeling of signed distance functions. 2023.
  8. James H. Clark. Hierarchical geometric models for visible surface algorithms. Commun. ACM, 19(10):547–554, oct 1976. ISSN 0001-0782. doi: 10.1145/360349.360354. URL https://doi.org/10.1145/360349.360354.
  9. Differentiable physics simulation of dynamics-augmented neural objects. IEEE Robotics and Automation Letters, 8:2780–2787, 2022. URL https://api.semanticscholar.org/CorpusID:252967901.
  10. Erwin Coumans. Bullet physics simulation. In ACM SIGGRAPH 2015 Courses, page 7, 2015.
  11. Learning models as functionals of signed-distance fields for manipulation planning. In 5th Annual Conference on Robot Learning, 2021. URL https://openreview.net/forum?id=FS30JeiGG3h.
  12. Learning multi-object dynamics with compositional neural radiance fields. In 6th Annual Conference on Robot Learning, 2022. URL https://openreview.net/forum?id=qUvTmyGpnm7.
  13. Klaus Greff et al. Kubric: a scalable dataset generator. 2022.
  14. Propagation networks for model-based control under partial observation. In 2019 International Conference on Robotics and Automation (ICRA), pages 1205–1211. IEEE, 2019.
  15. Scaling face interaction graph networks to real world scenes, 2024.
  16. Marching cubes: A high resolution 3d surface construction algorithm. In Seminal graphics: pioneering efforts that shaped the field, pages 347–353. 1998.
  17. Neural-pull: Learning signed distance function from point clouds by learning to pull space onto surface. In Marina Meila and Tong Zhang, editors, Proceedings of the 38th International Conference on Machine Learning, volume 139 of Proceedings of Machine Learning Research, pages 7246–7257. PMLR, 18–24 Jul 2021. URL https://proceedings.mlr.press/v139/ma21b.html.
  18. Learning signed distance functions from noisy 3d point clouds via noise to noise mapping. In International Conference on Machine Learning (ICML), 2023.
  19. Surfsup: Learning fluid simulation for novel surfaces. arXiv preprint arXiv:2303.08128, 2023.
  20. A-sdf: Learning disentangled signed distance functions for articulated shape representation. pages 12981–12991, 2021.
  21. Meshless deformations based on shape matching. ACM transactions on graphics (TOG), 24(3):471–478, 2005.
  22. CabiNet: Scaling neural collision detection for object rearrangement with procedural scene generation. In Proceedings of the IEEE International Conference on Robotics and Automation (ICRA), May 2023. URL https://arxiv.org/abs/2304.09302.
  23. isdf: Real-time neural signed distance fields for robot perception. In Robotics: Science and Systems, 2022.
  24. Deepsdf: Learning continuous signed distance functions for shape representation. In The IEEE Conference on Computer Vision and Pattern Recognition (CVPR), June 2019.
  25. Learning mesh-based simulation with graph networks. In International Conference on Learning Representations, 2021.
  26. Dynamic mesh-aware radiance fields. ICCV, 2023.
  27. Learning to simulate complex physics with graph networks. In International Conference on Machine Learning, pages 8459–8468. PMLR, 2020.
  28. Structure-from-motion revisited. In Proceedings of the IEEE conference on computer vision and pattern recognition, pages 4104–4113, 2016.
  29. Spelunking the deep: Guaranteed queries on general neural implicit surfaces via range analysis. ACM Trans. Graph., 41(4), jul 2022. ISSN 0730-0301. doi: 10.1145/3528223.3530155. URL https://doi.org/10.1145/3528223.3530155.
  30. Diffusion-based signed distance fields for 3d shape generation. In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), pages 20887–20897, June 2023.
  31. Diffsdfsim: Differentiable rigid-body dynamics with implicit shapes. In 3DV, pages 96–105, 2021. URL https://doi.org/10.1109/3DV53792.2021.00020.
  32. Mujoco: A physics engine for model-based control. In 2012 IEEE/RSJ international conference on intelligent robots and systems, pages 5026–5033. IEEE, 2012.
  33. Fast-grasp’d: Dexterous multi-finger grasp generation through differentiable simulation. In ICRA, 2023.
  34. 3d neural sculpting (3dns): Editing neural signed distance functions, 2023.
  35. Neus: Learning neural implicit surfaces by volume rendering for multi-view reconstruction. NeurIPS, 2021.
  36. BundleSDF: Neural 6-DoF tracking and 3D reconstruction of unknown objects. CVPR, 2023.
  37. Learning 3d particle-based simulators from rgb-d videos, 2023.
  38. Virdo: Visio-tactile implicit representations of deformable objects. 2022 International Conference on Robotics and Automation (ICRA), pages 3583–3590, 2022. URL https://api.semanticscholar.org/CorpusID:246473027.
  39. Volume rendering of neural implicit surfaces. In Thirty-Fifth Conference on Neural Information Processing Systems, 2021.
  40. Monosdf: Exploring monocular geometric cues for neural implicit surface reconstruction. Advances in Neural Information Processing Systems (NeurIPS), 2022.
  41. Dynamic neural garments. ACM Transactions on Graphics (TOG), 40:1 – 15, 2021. URL https://api.semanticscholar.org/CorpusID:232013515.
  42. Open3D: A modern library for 3D data processing. arXiv:1801.09847, 2018.
Citations (2)

Summary

We haven't generated a summary for this paper yet.

X Twitter Logo Streamline Icon: https://streamlinehq.com