Papers
Topics
Authors
Recent
Gemini 2.5 Flash
Gemini 2.5 Flash
169 tokens/sec
GPT-4o
7 tokens/sec
Gemini 2.5 Pro Pro
45 tokens/sec
o3 Pro
4 tokens/sec
GPT-4.1 Pro
38 tokens/sec
DeepSeek R1 via Azure Pro
28 tokens/sec
2000 character limit reached

Contrastive Pre-Training with Multi-View Fusion for No-Reference Point Cloud Quality Assessment (2403.10066v3)

Published 15 Mar 2024 in cs.CV and cs.MM

Abstract: No-reference point cloud quality assessment (NR-PCQA) aims to automatically evaluate the perceptual quality of distorted point clouds without available reference, which have achieved tremendous improvements due to the utilization of deep neural networks. However, learning-based NR-PCQA methods suffer from the scarcity of labeled data and usually perform suboptimally in terms of generalization. To solve the problem, we propose a novel contrastive pre-training framework tailored for PCQA (CoPA), which enables the pre-trained model to learn quality-aware representations from unlabeled data. To obtain anchors in the representation space, we project point clouds with different distortions into images and randomly mix their local patches to form mixed images with multiple distortions. Utilizing the generated anchors, we constrain the pre-training process via a quality-aware contrastive loss following the philosophy that perceptual quality is closely related to both content and distortion. Furthermore, in the model fine-tuning stage, we propose a semantic-guided multi-view fusion module to effectively integrate the features of projected images from multiple perspectives. Extensive experiments show that our method outperforms the state-of-the-art PCQA methods on popular benchmarks. Further investigations demonstrate that CoPA can also benefit existing learning-based PCQA models.

Definition Search Book Streamline Icon: https://streamlinehq.com
References (43)
  1. Crosspoint: Self-supervised cross-modal contrastive learning for 3d point cloud understanding. In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, pages 9902–9912, 2022.
  2. Towards a point cloud structural similarity metric. In ICMEW, pages 1–6, 2020.
  3. Contrastive self-supervised pre-training for video quality assessment. IEEE Transactions on Image Processing, 31:458–471, 2021.
  4. A simple framework for contrastive learning of visual representations. In International conference on machine learning, pages 1597–1607. PMLR, 2020.
  5. Imagenet: A large-scale hierarchical image database. In 2009 IEEE Conference on Computer Vision and Pattern Recognition, pages 248–255. Ieee, 2009.
  6. An image is worth 16x16 words: Transformers for image recognition at scale. arXiv preprint arXiv:2010.11929, 2020.
  7. A no-reference quality assessment metric for point cloud based on captured video sequences. In 2022 IEEE 24th International Workshop on Multimedia Signal Processing (MMSP), pages 1–5. IEEE, 2022.
  8. Deep residual learning for image recognition. In Proceedings of the IEEE conference on computer vision and pattern recognition, pages 770–778, 2016.
  9. Momentum contrast for unsupervised visual representation learning. In Proceedings of the IEEE/CVF conference on computer vision and pattern recognition, pages 9729–9738, 2020.
  10. Adam: A method for stochastic optimization. arXiv preprint arXiv:1412.6980, 2014.
  11. Pqa-net: Deep no reference point cloud quality assessment via multi-view projection. IEEE Transactions on Circuits and Systems for Video Technology, 31(12):4645–4660, 2021a.
  12. Perceptual quality assessment of colored 3d point clouds. IEEE Transactions on Visualization and Computer Graphics, 2022.
  13. Point cloud quality assessment: Dataset construction and learning-based no-reference metric. ACM Transactions on Multimedia Computing, Communications and Applications, 19(2s):1–26, 2023.
  14. Swin transformer: Hierarchical vision transformer using shifted windows. In Proceedings of the IEEE/CVF international conference on computer vision, pages 10012–10022, 2021b.
  15. Image quality assessment using contrastive learning. IEEE Transactions on Image Processing, 31:4149–4161, 2022.
  16. Evaluation criteria for point cloud compression. ISO/IEC MPEG, (16332), 2016.
  17. Pcqm: A full-reference quality metric for colored 3d point clouds. In QoMEX, pages 1–6, 2020.
  18. Multi-view aggregation transformer for no-reference point cloud quality assessment. Displays, 78:102450, 2023.
  19. Representation learning with contrastive predictive coding. arXiv preprint arXiv:1807.03748, 2018.
  20. Pytorch: An imperative style, high-performance deep learning library. Advances in neural information processing systems, 32, 2019.
  21. Pointnet: Deep learning on point sets for 3d classification and segmentation. In Proceedings of the IEEE conference on computer vision and pattern recognition, pages 652–660, 2017.
  22. Accelerating 3d deep learning with pytorch3d. arXiv preprint arXiv:2007.08501, 2020.
  23. Re-iqa: Unsupervised learning for image quality assessment in the wild. In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, pages 5846–5855, 2023.
  24. Gpa-net: No-reference point cloud quality assessment with multi-task graph convolutional network. IEEE Transactions on Visualization and Computer Graphics, 2023.
  25. A deep learning based no-reference quality assessment model for ugc videos. In Proceedings of the 30th ACM International Conference on Multimedia, pages 856–865, 2022.
  26. On the importance of initialization and momentum in deep learning. In International conference on machine learning, pages 1139–1147. PMLR, 2013.
  27. Point cloud projection and multi-scale feature fusion network based blind quality assessment for colored point clouds. In Proceedings of the 29th ACM International Conference on Multimedia, pages 5266–5272, 2021.
  28. Geometric distortion metrics for point cloud compression. In IEEE ICIP, pages 3460–3464, 2017.
  29. Representation learning optimization for 3d point cloud quality assessment without reference. In 2022 IEEE International Conference on Image Processing (ICIP), pages 3702–3706. IEEE, 2022.
  30. Pcqa-graphpoint: Efficient deep-based graph metric for point cloud quality assessment. In ICASSP 2023-2023 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), pages 1–5. IEEE, 2023.
  31. A novel methodology for quality assessment of voxelized point clouds. In Applications of Digital Image Processing XLI, pages 174–190, 2018.
  32. Laurens Van der Maaten and Geoffrey Hinton. Visualizing data using t-sne. Journal of machine learning research, 9(11), 2008.
  33. ψ𝜓\psiitalic_ψ-net: Point structural information network for no-reference point cloud quality assessment. In ICASSP 2023-2023 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), pages 1–5. IEEE, 2023.
  34. Predicting the perceptual quality of point cloud: A 3d-to-2d projection-based exploration. IEEE Transactions on Multimedia, 23:3877–3891, 2020a.
  35. Inferring point cloud quality via graph similarity. IEEE transactions on pattern analysis and machine intelligence, 44(6):3015–3029, 2020b.
  36. No-reference point cloud quality assessment via domain adaptation. In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, pages 21179–21188, 2022a.
  37. Mped: Quantifying point cloud distortion based on multiscale potential energy discrepancy. IEEE Transactions on Pattern Analysis and Machine Intelligence, 45(5):6037–6054, 2022b.
  38. Pointclip: Point cloud understanding by clip. In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, pages 8552–8562, 2022a.
  39. Ms-graphsim: Inferring point cloud quality via multiscale graph similarity. In Proceedings of the 29th ACM International Conference on Multimedia, pages 1230–1238, 2021.
  40. No-reference quality assessment for 3d colored point cloud and mesh models. IEEE Transactions on Circuits and Systems for Video Technology, 32(11):7618–7631, 2022b.
  41. Mm-pcqa: Multi-modal learning for no-reference point cloud quality assessment. arXiv preprint arXiv:2209.00244, 2022c.
  42. Quality-aware pre-trained models for blind image quality assessment. In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, pages 22302–22313, 2023.
  43. Blind quality assessment of 3d dense point clouds with structure guided resampling. arXiv preprint arXiv:2208.14603, 2022.
Citations (7)

Summary

We haven't generated a summary for this paper yet.

X Twitter Logo Streamline Icon: https://streamlinehq.com