Assessing Image Quality Using a Simple Generative Representation (2404.18178v1)
Abstract: Perceptual image quality assessment (IQA) is the task of predicting the visual quality of an image as perceived by a human observer. Current state-of-the-art techniques are based on deep representations trained in discriminative manner. Such representations may ignore visually important features, if they are not predictive of class labels. Recent generative models successfully learn low-dimensional representations using auto-encoding and have been argued to preserve better visual features. Here we leverage existing auto-encoders and propose VAE-QA, a simple and efficient method for predicting image quality in the presence of a full-reference. We evaluate our approach on four standard benchmarks and find that it significantly improves generalization across datasets, has fewer trainable parameters, a smaller memory footprint and faster run time.
- A novel image quality assessment with globally and locally consilient visual quality perception. IEEE Transactions on Image Processing, 25(5):2392–2406, 2016.
- Deep neural networks for no-reference and full-reference image quality assessment. IEEE Transactions on Image Processing, 27(1):206–219, January 2018. ISSN 1941-0042.
- Multi-stage variational auto-encoders for coarse-to-fine image generation, 2017.
- Perceptual image quality assessment with transformers, 2021.
- Image quality assessment: Unifying structure and texture similarity. IEEE Transactions on Pattern Analysis and Machine Intelligence, pp. 1–1, 2020. ISSN 1939-3539.
- Falcon, W. and The PyTorch Lightning team. PyTorch Lightning, 2019. Version 1.4.
- Improving neural networks by preventing co-adaptation of feature detectors, 2012.
- Denoising diffusion probabilistic models, 2020.
- Deep learning of human visual sensitivity in image quality assessment framework. In 2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR), pp. 1969–1977, 2017.
- Adam: A method for stochastic optimization, 2017.
- Auto-encoding variational bayes. CoRR, abs/1312.6114, 2013.
- The open images dataset v4: Unified image classification, object detection, and visual relationship detection at scale. International Journal of Computer Vision, 128(7):1956–1981, March 2020. ISSN 1573-1405.
- Attentions help cnns see better: Attention-based hybrid image quality assessment network, 2022.
- Perceptual image quality assessment using a normalized laplacian pyramid. In Human Vision and Electronic Imaging, 2016.
- Most apparent distortion: full-reference image quality assessment and the role of strategy. Journal of Electronic Imaging, 19(1):011006, 2010.
- Dreamteacher: Pretraining image backbones with deep generative models, 2023.
- Kadid-10k: A large-scale artificially distorted iqa database. In 2019 Eleventh International Conference on Quality of Multimedia Experience (QoMEX), pp. 1–3. IEEE, 2019.
- Sgdr: Stochastic gradient descent with warm restarts, 2017.
- High fidelity image synthesis with deep vaes in latent space, 2023.
- Subjective and objective quality assessment of image: A survey, 2014.
- Pytorch: An imperative style, high-performance deep learning library, 2019.
- Image quality assessment using human visual dog model fused with random forest. IEEE Transactions on Image Processing, 24(11):3282–3292, 2015.
- Image database tid2013: Peculiarities, results and perspectives. Signal Processing: Image Communication, 30:57–77, 2015.
- Pieapp: Perceptual image-error assessment through pairwise preference. In 2018 IEEE/CVF Conference on Computer Vision and Pattern Recognition, pp. 1808–1817, 2018.
- Generating diverse high-fidelity images with vq-vae-2, 2019.
- High-resolution image synthesis with latent diffusion models, 2021.
- High-resolution image synthesis with latent diffusion models, 2022.
- Imagenet large scale visual recognition challenge, 2015.
- A novel just-noticeable-difference-based saliency-channel attention residual network for full-reference image quality predictions, 2020.
- A statistical evaluation of recent full reference image quality assessment algorithms. IEEE Transactions on Image Processing, 15(11):3440–3451, 2006.
- Denoising diffusion implicit models, 2022.
- Efficient object localization using convolutional networks, 2015.
- Image quality assessment: from error visibility to structural similarity. IEEE Transactions on Image Processing, 13(4):600–612, 2004.
- Group normalization, 2018.
- Gradient magnitude similarity deviation: A highly efficient perceptual image quality index. IEEE Transactions on Image Processing, 23(2):684–695, February 2014. ISSN 1941-0042.
- Fsim: A feature similarity index for image quality assessment. IEEE Transactions on Image Processing, 20(8):2378–2386, 2011.
- Vsi: A visual saliency-induced index for perceptual image quality assessment. IEEE Transactions on Image Processing, 23(10):4270–4281, 2014.
- The unreasonable effectiveness of deep features as a perceptual metric, 2018.
Paper Prompts
Sign up for free to create and run prompts on this paper using GPT-5.
Top Community Prompts
Collections
Sign up for free to add this paper to one or more collections.