NeCGS: Neural Compression for 3D Geometry Sets (2405.15034v2)
Abstract: We present NeCGS, the first neural compression paradigm, which can compress a geometry set encompassing thousands of detailed and diverse 3D mesh models by up to 900 times with high accuracy and preservation of detailed geometric structures. Specifically, we first propose TSDF-Def, a new implicit representation that accurately represents irregular 3D mesh models with various structures as regular 4D tensors of uniform and compact size, from which 3D surfaces can be extracted through deformable marching cubes. We then construct a quantization-aware auto-decoder network that regresses these 4D tensors, exploiting the local geometric similarity within each shape and across different shapes to remove redundancy. This yields more compact representations: an embedded feature of smaller size associated with each 3D model, and network parameters shared by all models. Finally, we encode the resulting features and network parameters into bitstreams through entropy coding. NeCGS also handles the dynamic scenario well, in which new 3D models are constantly added to a compressed set. Extensive experiments and ablation studies demonstrate the significant advantages of NeCGS over state-of-the-art methods, both quantitatively and qualitatively. The source code is available at https://github.com/rsy6318/NeCGS.
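The last stage described in the abstract, quantizing the embedded features and entropy-coding them, can be illustrated with a minimal sketch. This is not the authors' implementation: the feature length, the Gaussian feature values, the step size `step`, and the helper names `quantize`, `dequantize`, and `empirical_entropy_bits` are all assumptions chosen for illustration. The empirical Shannon entropy stands in for a real entropy coder, as an estimate of the average code length it could approach.

```python
import math
import random
from collections import Counter


def quantize(values, step):
    """Uniform scalar quantization: map each float to an integer symbol."""
    return [round(v / step) for v in values]


def dequantize(symbols, step):
    """Map integer symbols back to reconstructed float values."""
    return [s * step for s in symbols]


def empirical_entropy_bits(symbols):
    """Shannon entropy (bits per symbol) of the empirical symbol distribution."""
    counts = Counter(symbols)
    n = len(symbols)
    return -sum((c / n) * math.log2(c / n) for c in counts.values())


# Hypothetical embedded feature for one model (assumption: 4096 Gaussian values).
random.seed(0)
feature = [random.gauss(0.0, 1.0) for _ in range(4096)]

step = 0.25  # hypothetical quantization step size
symbols = quantize(feature, step)
recon = dequantize(symbols, step)

# Estimated size after ideal entropy coding versus raw 32-bit floats.
bits_per_symbol = empirical_entropy_bits(symbols)
estimated_bits = bits_per_symbol * len(symbols)
raw_bits = 32 * len(feature)

# Uniform quantization bounds the reconstruction error by half a step.
max_err = max(abs(a - b) for a, b in zip(feature, recon))

print(f"bits/symbol: {bits_per_symbol:.2f}")
print(f"estimated ratio vs. float32: {raw_bits / estimated_bits:.1f}x")
print(f"max reconstruction error: {max_err:.4f}")
```

A coarser step lowers the entropy, and therefore the estimated bitstream size, at the cost of a larger reconstruction error. That trade-off is presumably why the network is trained to be quantization-aware rather than quantized after the fact.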