3DMambaIPF: A State Space Model for Iterative Point Cloud Filtering via Differentiable Rendering (2404.05522v1)
Abstract: Noise is an inevitable aspect of point cloud acquisition, making filtering a fundamental task in 3D vision. Existing learning-based filtering methods have shown promising capabilities on small-scale synthetic and real-world datasets. However, their effectiveness is limited when dealing with large volumes of point cloud data. This limitation stems primarily from their limited denoising capability on large-scale point clouds and their tendency to generate noisy outliers after denoising. The recent introduction of State Space Models (SSMs) for long-sequence modeling in NLP presents a promising solution for handling large-scale data. Inspired by iterative point cloud filtering methods, we introduce 3DMambaIPF, which for the first time incorporates the Mamba (Selective SSM) architecture to sequentially process extensive point clouds from large scenes, capitalizing on its strengths in selective input processing and long-sequence modeling. Additionally, we integrate a robust and fast differentiable rendering loss to constrain the noisy points around the surface. In contrast to previous methods, this differentiable rendering loss enhances the visual realism of denoised geometric structures and aligns point cloud boundaries more closely with those observed in real-world objects. Extensive evaluation on datasets comprising small-scale synthetic and real-world models (typically with up to 50K points) demonstrates that our method achieves state-of-the-art results. Moreover, we showcase the superior scalability and efficiency of our method on large-scale models with about 500K points, which the majority of existing learning-based denoising methods are unable to handle.
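To make the appeal of SSMs for long sequences concrete, the following is a minimal toy sketch of a scalar state space recurrence. It is not the 3DMambaIPF architecture or the Mamba implementation. The function names (`ssm_scan`, `selective_scan`), the scalar parameters `a`, `b`, `c`, and the weight `w_delta` are all hypothetical and chosen for illustration. The first function runs a fixed linear recurrence in a single pass, so cost grows linearly with sequence length. The second makes the step size depend on the current input, which is a simplified stand-in for the "selective" idea.

```python
import math
from typing import List, Sequence


def ssm_scan(xs: Sequence[float], a: float, b: float, c: float) -> List[float]:
    """Fixed linear recurrence: h_t = a * h_{t-1} + b * x_t, y_t = c * h_t."""
    h = 0.0
    ys = []
    for x in xs:  # one pass over the sequence: O(len(xs)) time, O(1) state
        h = a * h + b * x
        ys.append(c * h)
    return ys


def selective_scan(
    xs: Sequence[float],
    a: float = -1.0,       # continuous-time decay rate (assumed negative)
    b: float = 1.0,        # input weight (hypothetical)
    c: float = 1.0,        # output weight (hypothetical)
    w_delta: float = 1.0,  # maps the input to a step size (hypothetical)
) -> List[float]:
    """Toy input-dependent variant: the step size delta is computed from x_t."""
    h = 0.0
    ys = []
    for x in xs:
        delta = math.log1p(math.exp(w_delta * x))  # softplus keeps delta > 0
        a_bar = math.exp(delta * a)                # discretized state decay
        b_bar = delta * b                          # simplified discretized input weight
        h = a_bar * h + b_bar * x
        ys.append(c * h)
    return ys


if __name__ == "__main__":
    print(ssm_scan([1.0, 0.0, 0.0], a=0.5, b=1.0, c=1.0))
    print(selective_scan([1.0, 0.0, 0.0]))
```

In this sketch, an input that produces a large step size both forgets more of the old state and writes more of the new input, while a small step size leaves the state nearly untouched. A real model would apply such a recurrence to learned multi-channel point features, not to raw scalars.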