Introduction
Artificial Intelligence Generated Content (AIGC) is transforming numerous domains by enabling the creation of dynamic content such as images, text, audio, and video. A critical component within this domain is assessing the quality of AI-generated images (AIGIs) from a human perception standpoint. AIGC Image Quality Assessment (AIGCIQA) is an evolving area within computer vision intending to solve this. The traditional methods followed to evaluate AIGIs often suffer from limitations like not considering the variations in images or dealing with information loss due to preprocessing steps like resizing and cropping. To combat these challenges, a new approach called Patches Sampling-based Contrastive Regression (PSCR) for image quality assessment is proposed.
Methodology
The suggested framework, PSCR, revolutionizes the traditional ways of assessing AIGI quality by addressing two primary concerns - the lack of comparative analysis between different images and the loss of image quality due to geometric distortion from pre-processing. Introducing a contrastive regression framework changes how the model learns by focusing on the differences among AIGIs and predicting scores based on these distinctions. Moreover, to prevent information loss, the method incorporates a patches sampling strategy. This strategy employs a sliding window to capture overlapping portions of images, retaining maximum detail and avoiding geometric distortions.
Experimental Validation
The validation of the PSCR framework was conducted on three leading AIGCIQA databases, demonstrating significant improvements over baseline models and previous approaches. It delivers a robust comparative across metrics like the Spearman rank correlation coefficient (SRCC) and Pearson linear correlation coefficient (PLCC). The gains achieved indicate the method's effectiveness in better accommodating the nuances of AIGI quality assessment by preserving detail and capturing a fine-grained understanding through direct contrasts across different AI-generated images.
Conclusion and Future Work
The PSCR stands out as a comprehensive framework tackling AIGCIQA with a nuanced understanding and comparative analysis of image differences. Its integration into current methodologies offers substantial performance improvements, showcased across different AIGCIQA datasets. There are areas for future exploration, such as minimizing information redundancy in the patches sampling process and refining the balance between preserving detail and avoiding global information loss. The code is made available, thus encouraging further research and development in this promising direction for AIGC image quality assessment.