Lossless Haar Wavelet Pyramid
- Lossless Haar wavelet pyramid is a fully invertible, multilevel decomposition framework that uses integer arithmetic for exact signal reconstruction.
- It employs a multiplierless lifting scheme with shift-and-add operations to perform efficient prediction and update steps, minimizing computational resources.
- The design achieves high throughput on FPGA platforms by cascading optimized modules in a multilevel pipeline for real-time embedded signal processing.
A lossless Haar wavelet pyramid is a multilevel, fully invertible decomposition of discrete signals using integer arithmetic and a multiplierless architecture, specifically optimized for efficient implementation in parallel and reprogrammable logic. Employing the Haar wavelet in its lifting scheme variant, the architecture enables precise lossless transformation and reconstruction of signals with minimal computational resources, eliminating all multipliers and substituting them with shift-and-add logic. The core modules operate on 8-bit samples and are configured to process data in streaming fashion, achieving substantial hardware savings and real-time throughput suitable for FPGA-based applications (Kolev, 2010).
1. Multiplierless Integer Wavelet Filter–Bank Architecture
The foundation of the lossless Haar wavelet pyramid consists of forward (analysis) and inverse (reconstruction) modules designed for integer arithmetic without multipliers. Each basic processing element comprises:
- Two programmable delays for past sample buffering.
- Three registers for temporal alignment during prediction and update steps.
- A single adder or subtractor; no explicit multiplier.
- Division by 2 or 4 is implemented via right-shift operations with conditional one-bit correction to emulate integer rounding (floor).
Forward (Analysis) Module Workflow
Given 8-bit input samples :
- Prediction:
- Update:
Output is the detail and coarse-scale .
Inverse (Reconstruction) Module Workflow
Given and :
- Undo update:
- Undo predict:
Output is the interleaved reconstructed samples .
Integer division operations use bit shifts plus single-bit conditional corrections for rounding:
2. Lifting Scheme Equations for Integer Haar Wavelet
The lossless Haar wavelet lifting steps formalize decomposition and reconstruction using integer arithmetic. Let denote input at level , the detail at level , and the coarse output.
- Predict step (P):
- Update step (U):
Inverse lifting undoes the forward steps:
This integer lifting formulation ensures full reversibility with no loss of information due to rounding, which is essential for lossless operations.
3. Multilevel (Pyramidal) Decomposition and Reconstruction
The Haar lifting modules are cascaded to build an -level wavelet pyramid. Coarse outputs at each stage serve as inputs for the next level, creating a hierarchy of resolution representations.
- Decomposition ():
- Reconstruction (reverse):
After levels, one obtains the low-band and all detail bands , forming a fully lossless Haar wavelet pyramid. This structure is exact and supports perfect reconstruction.
4. Hardware Optimization: Multiplier Elimination and Resource Savings
The architecture completely dispenses with multipliers, replacing all fixed-point divisions by simple bit shifts plus minor corrections for sign and rounding, implemented in hardware as follows:
- Division by 2: with conditional subtraction if and least significant bit equals 1.
- Division by 4: with subtraction if and the two-bit LSB .
Resource utilization in the critical path is greatly minimized:
- Per sample: 2 shifts plus 2 additions.
- Compared to floating-point implementations or 5/3 filter banks (8 multiplies/adds per sample), operational complexity is dramatically reduced.
This design is directly traceable to the modules proposed by Kolev, which serve as the blueprint for efficient reconfigurable logic wavelet processing (Kolev, 2010).
5. Implementation and Performance on Reprogrammable Logic Devices
The modules are implemented on FPGAs using standard synthesis tools:
- Devices: Virtex xc2v200e-pq240-8 (analysis), Spartan2 xc2s150-fg256-6 (reconstruction).
- Clock rate: 100 MHz.
- Resource use (forward): 30 registers, one 8-bit subtractor, one 9-bit adder.
- Resource use (inverse): 8 registers, two 8-bit adders, four 8-bit subtractors.
- Throughput: One sample per clock cycle ($100$ MSamples/s).
- Line processing: For 256 samples at 8 bits, the Haar pyramid executes in approximately s per line, while comparable floating-point DSP lifting requires s per line.
Instantiating the forward module times for decomposition and the inverse module for reconstruction yields a lossless, multilevel Haar wavelet pyramid pipeline suitable for real-time embedded or FPGA-based image and signal processing without rounding loss.
6. Context, Application, and Significance
The lossless Haar wavelet pyramid constructed with multiplierless integer lifting modules is uniquely suited to applications requiring exact reconstruction and minimal hardware cost, including real-time image processing and embedded signal analysis. The absence of multipliers directly translates to energy savings, reduced silicon area, and higher throughput, with the critical path comprising only shifts and adds.
A plausible implication is that such factorized architectures represent an efficient tradeoff between numerical fidelity and resource utilization, especially when compared with floating-point or traditional 5/3 filter approaches. Given the typical usage patterns in FPGAs and ASICs, this method enables direct integration into existing digital signal pipelines, leveraging reprogrammable logic for dynamic applications.
The approach is directly attributable to the work of Kolev and was rigorously formalized and benchmarked in (Kolev, 2010), demonstrating its technical viability and operational efficiency for lossless, integer-based wavelet decomposition and reconstruction in hardware.