Papers
Topics
Authors
Recent
Gemini 2.5 Flash
Gemini 2.5 Flash
80 tokens/sec
GPT-4o
59 tokens/sec
Gemini 2.5 Pro Pro
43 tokens/sec
o3 Pro
7 tokens/sec
GPT-4.1 Pro
50 tokens/sec
DeepSeek R1 via Azure Pro
28 tokens/sec
2000 character limit reached

A Probabilistic Quality Representation Approach to Deep Blind Image Quality Prediction (1708.08190v2)

Published 28 Aug 2017 in cs.CV

Abstract: Blind image quality assessment (BIQA) remains a very challenging problem due to the unavailability of a reference image. Deep learning based BIQA methods have been attracting increasing attention in recent years, yet it remains a difficult task to train a robust deep BIQA model because of the very limited number of training samples with human subjective scores. Most existing methods learn a regression network to minimize the prediction error of a scalar image quality score. However, such a scheme ignores the fact that an image will receive divergent subjective scores from different subjects, which cannot be adequately represented by a single scalar number. This is particularly true on complex, real-world distorted images. Moreover, images may broadly differ in their distributions of assigned subjective scores. Recognizing this, we propose a new representation of perceptual image quality, called probabilistic quality representation (PQR), to describe the image subjective score distribution, whereby a more robust loss function can be employed to train a deep BIQA model. The proposed PQR method is shown to not only speed up the convergence of deep model training, but to also greatly improve the achievable level of quality prediction accuracy relative to scalar quality score regression methods. The source code is available at https://github.com/HuiZeng/BIQA_Toolbox.

A Probabilistic Quality Representation Approach to Deep Blind Image Quality Prediction

Overview

The paper entitled "A Probabilistic Quality Representation Approach to Deep Blind Image Quality Prediction" addresses significant challenges in blind image quality assessment (BIQA) by introducing a novel approach: Probabilistic Quality Representation (PQR). The authors, Zeng, Zhang, and Bovik, highlight the limitations associated with traditional scalar regression models used in BIQA, particularly their inadequacy in capturing the subjective diversity of perceived image quality among different viewers.

Technical Approach and Methodology

The core novelty of the proposed PQR approach lies in its probabilistic representation of subjective image scores, offering a more comprehensive depiction of the perceptual variability inherent in human image quality assessments. Unlike conventional methods that regress a scalar quality score, the PQR model maps image quality scores into a probabilistic space defined by a pre-selected set of quality anchors distributed across the score range.

To achieve this, the authors define the concept of quality anchors, which segment the quality score range into discrete levels. Each image score is then transformed into a probability distribution over these anchors, capturing the likelihood of an image attaining each anchor score. This distribution is generated using a soft-mapping function that ensures smoothness and robustness against small perceptual variations, essentially regularizing the learning process by minimizing the divergence between predicted and actual quality distributions.

Results and Performance

The implementation of the PQR framework considerably accelerates the convergence of deep model training and enhances prediction accuracy over scalar quality representation methods. The paper reports improved Spearman Rank Order Correlation Coefficient (SRCC) and Pearson Linear Correlation Coefficient (PLCC) across multiple image quality assessment databases, with noticeable performance improvements seen in challenging datasets, such as the LIVE Challenge database containing authentic distortions.

Implications

By employing a probabilistic approach, the PQR methodology not only advances the state-of-the-art in BIQA by capturing the subjective nature of image quality more effectively but also suggests broader implications for neural network training in problems where subjectivity and variability are prominent. In practical applications, this probabilistic representation could lead to more robust BIQA systems capable of providing more granular feedback in subjective contexts, such as user experience quality monitoring in multimedia environments.

Future Directions

The introduction of probabilistic quality representation opens several avenues for future exploration. Further research could investigate optimizing the selection of quality anchors or developing more sophisticated methods for anchor determination based on subjective data. Additionally, extending this approach to other subjective AI tasks, such as audio and video quality assessment, could yield substantial benefits. Moreover, combining PQR with recent developments in deep learning architectures might further improve the generalization capabilities of BIQA models across larger and more diverse datasets.

In conclusion, the paper significantly contributes to moving blind image quality assessment toward more robust and representative solutions by leveraging probabilistic modeling, thus aligning computational predictions more closely with human perceptions of quality.

User Edit Pencil Streamline Icon: https://streamlinehq.com
Authors (3)
  1. Hui Zeng (41 papers)
  2. Lei Zhang (1689 papers)
  3. Alan C. Bovik (83 papers)
Citations (72)
Github Logo Streamline Icon: https://streamlinehq.com