Quaternion Representation of Color Images
- Quaternion representation of color images is a framework that encodes RGB values into quaternions, fusing color channel data for integrated processing.
- It leverages quaternion algebra to preserve inter-channel correlations using techniques like low-rank models, efficient ADMM schemes, and QSVD.
- The approach enhances applications such as inpainting, denoising, and classification, with extensions to incorporate depth cues and neural architectures.
Quaternion representation of color images is a mathematical and algorithmic framework in which each pixel’s color channels (Red, Green, Blue) are encoded as a single quaternion algebraic entity—typically as a pure-imaginary quaternion—with many extensions involving “full” quaternions incorporating scalar information such as luminance or depth. This approach permits joint modeling of channel correlations and facilitates advanced image processing methodologies, including matrix/tensor completion, compression, enhancement, classification, and inpainting. Quaternion models are now prominent in computer vision tasks where cross-channel statistical structure is critical.
1. Mathematical Formulation of Quaternion-Based Encoding
A quaternion is defined as , where and satisfy with non-commutative multiplication (e.g., , ). In the canonical pure-imaginary encoding for color images, the RGB channels of a pixel at are mapped to a quaternion: In this representation, the real part vanishes and the three imaginary coefficients represent Red, Green, and Blue intensities. The entire image forms a quaternion matrix , enabling holistic processing of color data (Tatsumi et al., 21 Mar 2025, Song et al., 2020, Miao et al., 2020, Miao et al., 2019).
Extensions such as the full quaternion representation incorporate scalar information (e.g., luminance average or depth) into the real part, yielding representations of the form: where may be zero, the luminance, or a depth value (Tatsumi et al., 21 Mar 2025, Parchami et al., 2020).
2. Preservation of Channel Correlations and Algebraic Advantages
Quaternion encoding couples the RGB channels, facilitating operations that naturally respect the geometric and statistical interdependence among color channels. Algebraic manipulation such as addition, unitary transforms, Hamilton product, and singular value decomposition act on all three channels simultaneously. This avoids per-channel artifacts and enables models to capture spatial–chromatic features otherwise lost in independent channel processing. Notably, quaternion representations preserve hue and saturation relationships during transformation, enhancement, or reconstruction tasks (Zeng et al., 2015, Miao et al., 2020, Grigoryan et al., 2017).
3. Low-Rank Quaternion Matrix and Tensor Models
Low-rank priors in the quaternion domain underpin state-of-the-art approaches to color image inpainting, denoising, deblurring, and recovery. The typical minimization seeks a low-rank quaternion matrix or tensor matching observed pixels: where is the quaternion nuclear norm (sum of quaternion singular values), and restricts to observed indices (Miao et al., 2019, Miao et al., 2020). Factorization surrogates are used to avoid the high computational cost of large-scale Quaternion SVDs (QSVD), relying on optimization over smaller quaternion matrix factors : Quaternions also permit higher-order color video or multispectral data modeling using tensor decompositions such as Tucker rank, tensor ring, and Quaternion HOSVD (QHOSVD) (Miao et al., 2023, Miao et al., 2021).
4. Depth-Augmented Quaternion Representation
In recent advances, scalar scene-depth is embedded in the real part of each quaternion pixel, forming "depth-aided" color image inpainting. The process involves preliminary restoration using conventional LRQMC, followed by depth estimation (typically via a pretrained network) and subsequent fusion into the real part: This depth-augmented encoding improves completion in regions where depth strongly predicts color structure. Empirical results confirm systematic gains in PSNR and SSIM over pure-imaginary schemes (Tatsumi et al., 21 Mar 2025).
5. Quaternion Structure in Deep and Neural Architectures
Quaternion neural networks have emerged in classification and generative tasks, where quaternion convolutional layers perform joint rotation and scaling in RGB space. Quaternion-based principal component analysis (QPCA), Quaternion CNNs (QCNNs), and Generative Adversarial Networks (QGANs) extend real-valued architectures to exploit cross-channel structure via Hamilton product-based convolution and quaternion batch normalization. These models demonstrate superior convergence speed, robustness to noise, and accuracy—especially for color-sensitive recognition and inpainting (Zhu et al., 2019, Wang et al., 2024, Zeng et al., 2015, Chen et al., 9 Dec 2025).
6. Optimization, Regularization, and Efficient Algorithms
Matrix and tensor completion models employ regularizers tailored to quaternion structure. These include nuclear norm minus Frobenius norm (QNMF), truncated nuclear norm, and -column sparse surrogates, all implemented via efficient ADMM schemes. Computational gains are achieved using QR over QSVD-based factorizations or deep implicit priors (untrained QCNN). Sparse regularization in the quaternion DCT or QDFT frequency domain preserves texture and edge fidelity (Guo et al., 2024, Yang et al., 2022, Han et al., 2022, Han et al., 2023, Miao et al., 2023).
7. Applications and Empirical Impact
Quaternion representation has demonstrated benefits in image inpainting, denoising, deblurring, classification, face recognition, video recovery, compression, and image enhancement:
- Image inpainting: Low-rank and tensor-completion quaternion schemes yield sharper restoration and less artifacting at lower sample ratios than channel-wise or tensor-based approaches (Miao et al., 2019, Miao et al., 2023, Miao et al., 2020, Wang et al., 2024).
- Compression: Full quaternion models via autoencoder-based transformation halve matrix width, preserve cross-covariance, and accelerate QSVD (Parchami et al., 2020).
- Feature extraction, recognition, and classification: Weighted quaternion PCA/projection and LSQMM classification robustly extract discriminative color features, showing superior performance in small-sample or noisy scenarios (Zeng et al., 2015, Chen et al., 9 Dec 2025, Jia et al., 2020).
- Denoising and fusion: QHOSVD and QNMF models deliver higher PSNR, SSIM, and perceptual quality than conventional matrix methods, with competitive runtime via efficient ADMM and factorization (Miao et al., 2021, Guo et al., 2024).
Table: Common Quaternion Color Image Encodings
| Representation Type | Formula (pixel) | Real part meaning |
|---|---|---|
| Pure-imaginary quaternion | None (zero) | |
| Full quaternion (general) | Luminance or depth | |
| Depth-augmented quaternion | Depth (estimated) |
These structural options offer flexibility in encoding auxiliary cues beyond color, such as geometry or illumination.
Conclusion
Quaternion representation of color images provides an algebraically coherent, computationally efficient, and statistically robust framework for color image analysis. By encoding RGB vectors as unified entities and leveraging quaternion-specific optimization schemes, these models outperform conventional channel-wise and tensor methods across a range of vision applications. Recent integration of depth and advances in neural architectures suggest continued expansion of quaternion methodologies in multidimensional image processing (Tatsumi et al., 21 Mar 2025, Song et al., 2020, Miao et al., 2020, Zeng et al., 2015, Chen et al., 9 Dec 2025, Parchami et al., 2020, Wang et al., 2024).