Papers
Topics
Authors
Recent
Gemini 2.5 Flash
Gemini 2.5 Flash
166 tokens/sec
GPT-4o
7 tokens/sec
Gemini 2.5 Pro Pro
42 tokens/sec
o3 Pro
4 tokens/sec
GPT-4.1 Pro
38 tokens/sec
DeepSeek R1 via Azure Pro
28 tokens/sec
2000 character limit reached

Self-supervised Learning of Implicit Shape Representation with Dense Correspondence for Deformable Objects (2308.12590v2)

Published 24 Aug 2023 in cs.CV

Abstract: Learning 3D shape representation with dense correspondence for deformable objects is a fundamental problem in computer vision. Existing approaches often need additional annotations of specific semantic domain, e.g., skeleton poses for human bodies or animals, which require extra annotation effort and suffer from error accumulation, and they are limited to specific domain. In this paper, we propose a novel self-supervised approach to learn neural implicit shape representation for deformable objects, which can represent shapes with a template shape and dense correspondence in 3D. Our method does not require the priors of skeleton and skinning weight, and only requires a collection of shapes represented in signed distance fields. To handle the large deformation, we constrain the learned template shape in the same latent space with the training shapes, design a new formulation of local rigid constraint that enforces rigid transformation in local region and addresses local reflection issue, and present a new hierarchical rigid constraint to reduce the ambiguity due to the joint learning of template shape and correspondences. Extensive experiments show that our model can represent shapes with large deformations. We also show that our shape representation can support two typical applications, such as texture transfer and shape editing, with competitive performance. The code and models are available at https://iscas3dv.github.io/deformshape

Definition Search Book Streamline Icon: https://streamlinehq.com
References (45)
  1. As-rigid-as-possible shape interpolation. In Siggraph, pages 157–164, 2000.
  2. Least-squares fitting of two 3-d point sets. IEEE Transactions on Pattern Analysis and Machine Intelligence, (5):698–700, 1987.
  3. Sal: Sign agnostic learning of shapes from raw data. In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), pages 2565–2574, 2020.
  4. Loopreg: Self-supervised learning of implicit surface correspondences, pose and shape for 3d human mesh registration. Advances in Neural Information Processing Systems, 33:12909–12922, 2020.
  5. Dynamic FAUST: Registering human bodies in motion. In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), July 2017.
  6. Snarf: Differentiable forward skinning for animating non-rigid neural implicit shapes. arXiv preprint arXiv:2104.03953, 2021.
  7. Learning implicit fields for generative shape modeling. In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, pages 5939–5948, 2019.
  8. Neural unsigned distance fields for implicit function learning. Advances in Neural Information Processing Systems, 33:21638–21652, 2020.
  9. Neural articulated shape approximation. In The European Conference on Computer Vision (ECCV). Springer, August 2020.
  10. Deformed implicit field: Modeling 3d shapes with learned dense correspondence. In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), pages 10286–10296, June 2021.
  11. Neuromorph: Unsupervised shape interpolation and correspondence in one go. In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, pages 7473–7483, 2021.
  12. Deep shells: Unsupervised shape correspondence with optimal transport. Advances in Neural information processing systems, 33:10491–10502, 2020.
  13. 3d-coded : 3d correspondences by deep deformation. In The European Conference on Computer Vision (ECCV), 2018.
  14. Unsupervised learning of dense shape correspondence. In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), pages 4370–4379, 2019.
  15. As-rigid-as-possible shape manipulation. ACM Transactions on Graphics (TOG), 24(3):1134–1141, 2005.
  16. Learning compositional representation for 4d captures with neural ode. In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), pages 5340–5350, 2021.
  17. John M Lee. Smooth manifolds. In Introduction to smooth manifolds, pages 1–31. Springer, 2013.
  18. An analysis of svd for deep rotation estimation. Advances in Neural Information Processing Systems, 33:22554–22565, 2020.
  19. Lbs autoencoder: Self-supervised fitting of articulated meshes to point clouds. In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, pages 11967–11976, 2019.
  20. 4dcomplete: Non-rigid motion estimation beyond the observable surface. Proceedings of the IEEE/CVF International Conference on Computer Vision (ICCV), 2021.
  21. Deep functional maps: Structured prediction for dense shape correspondence. Proceedings of the IEEE/CVF International Conference on Computer Vision (ICCV), 2017.
  22. Regularized deep signed distance fields for reactive motion generation. In 2022 IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS), 2022.
  23. Smpl: A skinned multi-person linear model. ACM Transactions on Graphics (TOG), 34(6):1–16, 2015.
  24. Marching cubes: A high resolution 3d surface construction algorithm. Siggraph, 21(4):163–169, 1987.
  25. Neural point-based shape modeling of humans in challenging clothing. In International Conference on 3D Vision (3DV) 2022, Sept. 2022.
  26. The power of points for modeling humans in clothing. In Proceedings of the IEEE/CVF International Conference on Computer Vision (ICCV), pages 10974–10984, October 2021.
  27. Occupancy networks: Learning 3d reconstruction in function space. In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), pages 4460–4470, 2019.
  28. Leap: Learning articulated occupancy of people. In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), pages 10461–10471, June 2021.
  29. Dynamicfusion: Reconstruction and tracking of non-rigid scenes in real-time. In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), pages 343–352, 2015.
  30. Occupancy flow: 4d reconstruction by learning particle dynamics. In Proceedings of the IEEE/CVF International Conference on Computer Vision (ICCV), pages 5379–5389, 2019.
  31. Deepsdf: Learning continuous signed distance functions for shape representation. In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), pages 165–174, 2019.
  32. Nerfies: Deformable neural radiance fields. Proceedings of the IEEE/CVF International Conference on Computer Vision (ICCV), 2021.
  33. Anr: Articulated neural rendering for virtual avatars. In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), pages 3722–3731, 2021.
  34. Embodied hands: Modeling and capturing hands and bodies together. arXiv preprint arXiv:2201.02610, 2022.
  35. Pifu: Pixel-aligned implicit function for high-resolution clothed human digitization. In Proceedings of the IEEE/CVF International Conference on Computer Vision (ICCV), pages 2304–2314, 2019.
  36. Neural descriptor fields: Se (3)-equivariant object representations for manipulation. In 2022 International Conference on Robotics and Automation (ICRA), pages 6394–6400. IEEE, 2022.
  37. Implicit neural representations with periodic activation functions. Advances in Neural Information Processing Systems, 33:7462–7473, 2020.
  38. Scene representation networks: Continuous 3d-structure-aware neural scene representations. Advances in Neural Information Processing Systems, 32, 2019.
  39. As-rigid-as-possible surface modeling. In Symposium on Geometry Processing, volume 4, pages 109–116, 2007.
  40. Least-squares rigid motion using svd. Computing, 1(1):1–5, 2017.
  41. Implicit field supervision for robust non-rigid shape matching. The European Conference on Computer Vision (ECCV), 2022.
  42. Shape registration in the time of transformers. Advances in Neural Information Processing Systems, 34:5731–5744, 2021.
  43. Shinji Umeyama. Least-squares estimation of transformation parameters between two point patterns. IEEE Transactions on Pattern Analysis and Machine Intelligence, 13(04):376–380, 1991.
  44. The stitched puppet: A graphical model of 3d human shape and pose. In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, pages 3537–3546, 2015.
  45. 3D menagerie: Modeling the 3D shape and pose of animals. In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), July 2017.
Citations (5)

Summary

We haven't generated a summary for this paper yet.