ECM-OPCC: Efficient Context Model for Octree-based Point Cloud Compression (2211.10916v4)
Abstract: Recently, deep learning methods have shown promising results in point cloud compression. For octree-based point cloud compression, previous works show that the information of ancestor nodes and sibling nodes are equally important for predicting current node. However, those works either adopt insufficient context or bring intolerable decoding complexity (e.g. >600s). To address this problem, we propose a sufficient yet efficient context model and design an efficient deep learning codec for point clouds. Specifically, we first propose a window-constrained multi-group coding strategy to exploit the autoregressive context while maintaining decoding efficiency. Then, we propose a dual transformer architecture to utilize the dependency of current node on its ancestors and siblings. We also propose a random-masking pre-train method to enhance our model. Experimental results show that our approach achieves state-of-the-art performance for both lossy and lossless point cloud compression. Moreover, our multi-group coding strategy saves 98% decoding time compared with previous octree-based compression method.
- Semantickitti: A dataset for semantic scene understanding of lidar sequences. IEEE/CVF International Conference on Computer Vision (ICCV), page 9296–9306, 2019.
- Muscle: Multi sweep compression of lidar using deep entropy models. In H. Larochelle, M. Ranzato, R. Hadsell, M.F. Balcan, and H. Lin, editors, Advances in Neural Information Processing Systems, volume 33, pages 22170–22181. Curran Associates, Inc., 2020.
- 3d point cloud compression: A survey. In The 24th International Conference on 3D Web Technology, pages 1–9, 2019.
- Microsoft voxelized upper bodies - a voxelized point cloud dataset. ISO/IEC JTC1/SC29 Joint WG11/WG1 (MPEG/JPEG) input document m38673/M7201, 2016.
- Point cloud compression with sibling context and surface priors. In Computer Vision – ECCV 2022: 17th European Conference, Tel Aviv, Israel, October 23–27, 2022, Proceedings, Part XXXVIII, page 744–759, Berlin, Heidelberg, 2022. Springer-Verlag.
- Bert: Pre-training of deep bidirectional transformers for language understanding, 2018.
- Real-time spatio-temporal lidar point cloud compression. In 2020 IEEE/RSJ international conference on intelligent robots and systems (IROS), pages 10766–10773. IEEE, 2020.
- Octattention: Octree-based large-scale contexts model for point cloud compression. arXiv preprint arXiv:2202.06028, 2022.
- Mask-predict: Parallel decoding of conditional masked language models. arXiv preprint arXiv:1904.09324, 2019.
- Non-autoregressive neural machine translation. arXiv preprint arXiv:1711.02281, 2017.
- Elic: Efficient learned image compression with unevenly grouped space-channel contextual adaptive coding. In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, pages 5718–5727, 2022.
- Checkerboard context model for efficient learned image compression. In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, pages 14771–14780, 2021.
- Density-preserving deep point cloud compression. In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, pages 2333–2342, 2022.
- On the learning of non-autoregressive transformers. In Kamalika Chaudhuri, Stefanie Jegelka, Le Song, Csaba Szepesvari, Gang Niu, and Sivan Sabato, editors, Proceedings of the 39th International Conference on Machine Learning, volume 162 of Proceedings of Machine Learning Research, pages 9356–9376. PMLR, 17–23 Jul 2022.
- Octsqueeze: Octree-structured entropy model for lidar compression. In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), June 2020.
- 3d point cloud geometry compression on deep learning. In Proceedings of the 27th ACM international conference on multimedia, pages 890–898, 2019.
- 8i voxelized full bodies - a voxelized point cloud dataset. ISO/IEC JTC1/SC29 Joint WG11/WG1 (MPEG/JPEG) input document WG11M40059/WG1M74006, 2017.
- Hybrid spatial-temporal entropy modelling for neural video compression. arXiv preprint arXiv:2207.05894, 2022.
- Transpcc: Towards deep point cloud compression via transformers. 2022.
- MPEG. https://github.com/MPEGGroup/mpeg-pcc-tmc13, 2021.
- Learning-based lossless compression of 3d point cloud geometry. In ICASSP 2021-2021 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), pages 4220–4224. IEEE, 2021.
- Lossless coding of point cloud geometry using a deep generative model. IEEE Transactions on Circuits and Systems for Video Technology, 31(12):4617–4629, 2021.
- Multiscale deep context modeling for lossless point cloud geometry compression. In 2021 IEEE International Conference on Multimedia & Expo Workshops (ICMEW), pages 1–6. IEEE, 2021.
- Learning convolutional transforms for lossy point cloud geometry compression. In 2019 IEEE international conference on image processing (ICIP), pages 4320–4324. IEEE, 2019.
- Improved deep point cloud geometry compression. In 2020 IEEE 22nd International Workshop on Multimedia Signal Processing (MMSP), pages 1–6. IEEE, 2020.
- Voxelcontext-net: An octree based framework for point cloud compression. In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, pages 6042–6051, 2021.
- Parallel multiscale autoregressive density estimation. In Doina Precup and Yee Whye Teh, editors, Proceedings of the 34th International Conference on Machine Learning, volume 70 of Proceedings of Machine Learning Research, pages 2912–2921. PMLR, 06–11 Aug 2017.
- A novel point cloud compression algorithm based on clustering. IEEE Robotics and Automation Letters, 4(2):2132–2139, 2019.
- Point cloud compression for 3d lidar sensor using recurrent neural network with residual blocks. In 2019 International Conference on Robotics and Automation (ICRA), pages 3274–3280. IEEE, 2019.
- Sparse tensor-based multiscale representation for point cloud geometry compression. arXiv preprint arXiv:2111.10633, 2021.
- Multiscale point cloud geometry compression. In 2021 Data Compression Conference (DCC), pages 73–82. IEEE, 2021.
- Point cloud compression with range image-based entropy model for autonomous driving. In Computer Vision–ECCV 2022: 17th European Conference, Tel Aviv, Israel, October 23–27, 2022, Proceedings, Part XXII, pages 323–340. Springer, 2022.
- Patch-based deep autoencoder for point cloud geometry compression. In ACM Multimedia Asia, pages 1–7. 2021.
- Real-time scene-aware lidar point cloud compression using semantic prior representation. IEEE Transactions on Circuits and Systems for Video Technology, 2022.
- Riddle: Lidar data compression with range image deep delta encoding, 2022.