Papers
Topics
Authors
Recent
Gemini 2.5 Flash
Gemini 2.5 Flash
166 tokens/sec
GPT-4o
7 tokens/sec
Gemini 2.5 Pro Pro
42 tokens/sec
o3 Pro
4 tokens/sec
GPT-4.1 Pro
38 tokens/sec
DeepSeek R1 via Azure Pro
28 tokens/sec
2000 character limit reached

Activating Frequency and ViT for 3D Point Cloud Quality Assessment without Reference (2312.05972v1)

Published 10 Dec 2023 in cs.CV, cs.LG, cs.MM, and eess.IV

Abstract: Deep learning-based quality assessments have significantly enhanced perceptual multimedia quality assessment, however it is still in the early stages for 3D visual data such as 3D point clouds (PCs). Due to the high volume of 3D-PCs, such quantities are frequently compressed for transmission and viewing, which may affect perceived quality. Therefore, we propose no-reference quality metric of a given 3D-PC. Comparing to existing methods that mostly focus on geometry or color aspects, we propose integrating frequency magnitudes as indicator of spatial degradation patterns caused by the compression. To map the input attributes to quality score, we use a light-weight hybrid deep model; combined of Deformable Convolutional Network (DCN) and Vision Transformers (ViT). Experiments are carried out on ICIP20 [1], PointXR [2] dataset, and a new big dataset called BASICS [3]. The results show that our approach outperforms state-of-the-art NR-PCQA measures and even some FR-PCQA on PointXR. The implementation code can be found at: https://github.com/o-messai/3D-PCQA

Definition Search Book Streamline Icon: https://streamlinehq.com
References (21)
  1. “Quality evaluation of static point clouds encoded using mpeg codecs,” in 2020 IEEE International Conference on Image Processing (ICIP). IEEE, 2020, pp. 3428–3432.
  2. “Pointxr: A toolbox for visualization and subjective evaluation of point clouds in virtual reality,” in 2020 Twelfth International Conference on Quality of Multimedia Experience (QoMEX). IEEE, 2020, pp. 1–6.
  3. “Basics: Broad quality assessment of static point clouds in compression scenarios,” arXiv preprint arXiv:2302.04796, 2023.
  4. “Frustum pointnets for 3d object detection from rgb-d data,” in Proceedings of the IEEE conference on computer vision and pattern recognition, 2018, pp. 918–927.
  5. “Adaboost neural network and cyclopean view for no-reference stereoscopic image quality assessment,” Signal Processing: Image Communication, vol. 82, pp. 115772, 2020.
  6. “3d saliency guided deep quality predictor for no-reference stereoscopic images,” Neurocomputing, 2022.
  7. Oussama Messai and Chetouani, “End-to-end deep multi-score model for no-reference stereoscopic image quality assessment,” in 2022 IEEE International Conference on Image Processing (ICIP). IEEE, 2022, pp. 2721–2725.
  8. “Evaluation criteria for pcc (point cloud compression),” 2016.
  9. “Geometric distortion metrics for point cloud compression,” in 2017 IEEE International Conference on Image Processing (ICIP). IEEE, 2017, pp. 3460–3464.
  10. “Point cloud quality assessment metric based on angular similarity,” in 2018 IEEE International Conference on Multimedia and Expo (ICME). IEEE, 2018, pp. 1–6.
  11. “Towards a point cloud quality assessment model using local binary patterns,” in 2020 Twelfth International Conference on Quality of Multimedia Experience (QoMEX). IEEE, 2020, pp. 1–6.
  12. “Pcqm: A full-reference quality metric for colored 3d point clouds,” in 2020 Twelfth International Conference on Quality of Multimedia Experience (QoMEX). IEEE, 2020, pp. 1–6.
  13. “Blind projection-based 3d point cloud quality assessment method using a convolutional neural network.,” in VISIGRAPP (4: VISAPP), 2022, pp. 518–525.
  14. “Deep learning-based quality assessment of 3d point clouds without reference,” in 2021 IEEE International Conference on Multimedia & Expo Workshops (ICMEW). IEEE, 2021, pp. 1–6.
  15. “No-reference quality assessment for 3d colored point cloud and mesh models,” IEEE Transactions on Circuits and Systems for Video Technology, vol. 32, no. 11, pp. 7618–7631, 2022.
  16. “Representation learning optimization for 3d point cloud quality assessment without reference,” in 2022 IEEE International Conference on Image Processing (ICIP). IEEE, 2022, pp. 3702–3706.
  17. “Pointnet++: Deep hierarchical feature learning on point sets in a metric space,” Advances in neural information processing systems, vol. 30, 2017.
  18. “Gpa-net: No-reference point cloud quality assessment with multi-task graph convolutional network,” arXiv preprint arXiv:2210.16478, 2022.
  19. “Mm-pcqa: Multi-modal learning for no-reference point cloud quality assessment,” arXiv preprint arXiv:2209.00244, 2022.
  20. “A survey on vision transformer,” IEEE transactions on pattern analysis and machine intelligence, vol. 45, no. 1, pp. 87–110, 2022.
  21. “Coatnet: Marrying convolution and attention for all data sizes,” Advances in Neural Information Processing Systems, vol. 34, pp. 3965–3977, 2021.
Citations (2)

Summary

We haven't generated a summary for this paper yet.