PyTorch Image Quality: Metrics for Image Quality Assessment (2208.14818v1)

Published 31 Aug 2022 in eess.IV and cs.CV

Abstract: Image Quality Assessment (IQA) metrics are widely used to quantitatively estimate the extent of image degradation following some forming, restoring, transforming, or enhancing algorithms. We present PyTorch Image Quality (PIQ), a usability-centric library that contains the most popular modern IQA algorithms, guaranteed to be correctly implemented according to their original propositions and thoroughly verified. In this paper, we detail the principles behind the foundation of the library, describe the evaluation strategy that makes it reliable, provide the benchmarks that showcase the performance-time trade-offs, and underline the benefits of GPU acceleration given the library is used within the PyTorch backend. PyTorch Image Quality is an open source software: https://github.com/photosynthesis-team/piq/.

Authors (4)

Sergey Kastryulin
Jamil Zakirov
Denis Prokopenko
Dmitry V. Dylov

Citations (35)

Summary

The paper introduces a comprehensive PyTorch library implementing 38 IQA metrics with GPU acceleration and benchmark validation.
It details user-friendly APIs that integrate full-reference, no-reference, and distribution-based metrics into deep learning workflows.
The library supports metric-based loss functions and customizable feature extractors to enhance model training and performance analysis.

The paper presents PyTorch Image Quality (PIQ), a library for Image Quality Assessment (IQA) built on the PyTorch framework. It provides implementations of modern IQA metrics that are verified against their original formulations and can run on a GPU.

Library Overview

PIQ provides 38 metrics in three categories: Full-Reference (FR) metrics, which compare a distorted image with a pristine reference; No-Reference (NR) metrics, which score a single image without a reference; and Distribution-Based (DB) metrics, which compare two sets of images. Applications range from high-quality visual media to medical imaging. Widely used examples include SSIM and PSNR (FR), BRISQUE (NR), and FID (DB). Each metric operates on PyTorch tensors, so it can be dropped into existing machine learning workflows.
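To make the full-reference idea concrete, here is a minimal sketch of PSNR, the simplest FR metric named above, written in plain Python rather than with PIQ. The function name `psnr`, the `data_range` parameter, and the flat-list image representation are assumptions made for this illustration and are not taken from the library.

```python
import math


def psnr(reference, distorted, data_range=1.0):
    """Peak signal-to-noise ratio in dB between two equally sized pixel lists.

    Illustrative only: images are flat lists of floats in [0, data_range].
    """
    if len(reference) != len(distorted):
        raise ValueError("images must have the same number of pixels")
    # Mean squared error between the reference and the distorted image.
    mse = sum((r - d) ** 2 for r, d in zip(reference, distorted)) / len(reference)
    if mse == 0:
        return math.inf  # identical images
    # PSNR = 10 * log10(MAX^2 / MSE), with MAX the largest possible pixel value.
    return 10.0 * math.log10(data_range ** 2 / mse)


reference = [0.0, 0.5, 1.0, 0.5]
distorted = [0.1, 0.5, 0.9, 0.5]
print(f"PSNR: {psnr(reference, distorted):.2f} dB")  # about 23.01 dB
```

A higher value means the distorted image is closer to the reference. NR and DB metrics differ in their inputs (a single image, or two sets of images) rather than in this general pattern of returning a scalar score.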

Design Principles

The library adheres to three key principles: user-friendliness, reliability, and pragmatic design. By providing intuitive APIs and ensuring implementation accuracy, PIQ addresses common inconsistencies in public IQA implementations. The emphasis on GPU acceleration improves computational efficiency, which is crucial given the increasing complexity of image processing tasks.

Metrics Utilization and Evaluation

Besides standalone evaluation, PIQ metrics can serve as loss functions for model training, because they are differentiable through PyTorch's automatic differentiation. The feature extractors that DB image quality metrics depend on can also be swapped out, which lets users test how the choice of extractor affects the resulting scores. Chrominance versions of certain metrics are supported as well, to align more closely with human visual perception.
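The following sketch illustrates what "metric as a loss" means in practice. It deliberately does not use PIQ's own loss classes; instead it defines a hypothetical `psnr_loss` helper in plain PyTorch and optimizes an image tensor directly, assuming pixel values in [0, 1]. The helper name, the `eps` stabilizer, and the optimizer settings are all assumptions made for illustration.

```python
import torch


def psnr_loss(prediction, target, data_range=1.0, eps=1e-8):
    """Negative PSNR, so that minimizing the loss maximizes quality.

    Hypothetical helper for illustration; not part of PIQ.
    """
    mse = torch.mean((prediction - target) ** 2)
    # eps keeps the ratio finite if the two images become identical.
    return -10.0 * torch.log10(data_range ** 2 / (mse + eps))


torch.manual_seed(0)
target = torch.rand(1, 3, 16, 16)                          # reference image
prediction = torch.rand(1, 3, 16, 16, requires_grad=True)  # image being optimized
optimizer = torch.optim.Adam([prediction], lr=0.05)

initial_loss = psnr_loss(prediction, target).item()
for _ in range(50):
    optimizer.zero_grad()
    loss = psnr_loss(prediction, target)
    loss.backward()   # autograd propagates gradients through the metric
    optimizer.step()
final_loss = psnr_loss(prediction, target).item()

print(f"loss before: {initial_loss:.2f}, after: {final_loss:.2f}")
```

In a real training loop, `prediction` would be the output of a network and the optimizer would update the network's parameters instead of the image itself.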

Validation and Performance

The PIQ implementations were evaluated on established IQA datasets such as TID2013 and KADID-10k, which pair distorted images with human quality ratings. Agreement between metric scores and those ratings is measured with correlation coefficients such as Spearman's rank correlation coefficient (SRCC). The benchmarks also expose a trade-off between computation time and metric quality, with metrics such as MDSI and HaarPSI offering a favorable balance between the two.
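As a rough illustration of the correlation step, the sketch below computes SRCC between metric scores and human ratings using only the Python standard library. The helper names and the sample numbers are invented for this example; they are not values from the paper or from either dataset.

```python
import math


def average_ranks(values):
    """Rank values from 1..n, giving tied values the mean of their ranks."""
    order = sorted(range(len(values)), key=lambda i: values[i])
    ranks = [0.0] * len(values)
    i = 0
    while i < len(order):
        j = i
        # Extend j over the run of values tied with values[order[i]].
        while j + 1 < len(order) and values[order[j + 1]] == values[order[i]]:
            j += 1
        mean_rank = (i + j) / 2.0 + 1.0
        for k in range(i, j + 1):
            ranks[order[k]] = mean_rank
        i = j + 1
    return ranks


def pearson(a, b):
    """Pearson correlation coefficient of two equally long sequences."""
    mean_a, mean_b = sum(a) / len(a), sum(b) / len(b)
    cov = sum((x - mean_a) * (y - mean_b) for x, y in zip(a, b))
    var_a = sum((x - mean_a) ** 2 for x in a)
    var_b = sum((y - mean_b) ** 2 for y in b)
    if var_a == 0 or var_b == 0:
        raise ValueError("correlation is undefined for constant input")
    return cov / math.sqrt(var_a * var_b)


def srcc(metric_scores, human_ratings):
    """Spearman's rank correlation: Pearson correlation of the ranks."""
    return pearson(average_ranks(metric_scores), average_ranks(human_ratings))


# Made-up scores for four images, for illustration only.
metric_scores = [0.9, 0.7, 0.8, 0.2]
human_ratings = [4.5, 3.9, 3.0, 1.2]
print(f"SRCC: {srcc(metric_scores, human_ratings):.2f}")  # 0.80
```

Because SRCC depends only on rank order, it rewards a metric for ordering images the way human raters do, regardless of the scale of its raw scores.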

Implications and Future Work

The development of PIQ as an open-source tool has significant implications for researchers and practitioners. It streamlines the validation of image processing algorithms through reliable IQA metrics and enhances reproducibility in research. Future development will focus on incorporating emerging IQA trends and improving algorithm scalability within PIQ.

Conclusion

PIQ is a versatile IQA library aimed at the computational imaging community. By providing verified, GPU-accelerated implementations of a broad set of metrics, it supports both research and application development. Its continued development is expected to track advances in IQA and image processing.