Complex Kernel LMS Overview
- CKLMS is a nonlinear adaptive filtering algorithm that maps complex-valued signals into an RKHS using complex kernels for enhanced learning of nonlinear input–output relationships.
- The algorithm leverages an extension of Wirtinger calculus and augmented filtering to efficiently process both circular and noncircular (improper) signals, achieving lower mean-squared error and faster convergence.
- Sparsification strategies and pseudo-kernel design in CKLMS control computational complexity while enabling practical implementations in tasks such as channel equalization and system identification.
A Complex Kernel Least-Mean-Square (CKLMS) algorithm is a nonlinear adaptive filtering method that extends the classical LMS algorithm to complex-valued signals and nonlinear input–output relationships via a reproducing kernel Hilbert space (RKHS) framework. The CKLMS leverages either complex kernels or the complexification of real RKHSs, exploits an extension of Wirtinger calculus for infinite-dimensional settings, and allows for both standard ("pure") and augmented ("widely linear" or "augmented CKLMS", ACKLMS) variants. The augmented form brings significant improvements in regression and equalization tasks involving improper (noncircular) complex data by modeling both covariance and pseudo-covariance structures.
1. Mathematical Framework and Formulation
Let denote a training sequence with and . Data is mapped to a complex RKHS by a positive-definite kernel , typically the complex Gaussian kernel
with the associated feature map (Bouboulis et al., 2010).
For adaptive filtering, the estimator in CKLMS is
where is the unknown representer. In the augmented CKLMS (ACKLMS), two elements are used to implement a widely linear (augmented) filter:
where denotes the complex-conjugate feature (Bouboulis et al., 2011).
The cost function is the instantaneous squared error:
2. Kernel Construction and Complexification
Two principal ways exist to define the RKHS:
- Native Complex Kernel: Employ kernels defined directly on , such as the complex Gaussian kernel.
- Complexification of Real RKHS: Given a real kernel on , rewrite and define with the real-feature map for . The inner product structure persists: (Bouboulis et al., 2010, Bouboulis et al., 2010).
The complexification approach allows the re-use of well-established real kernels for complex-valued learning, while native complex kernels can exploit signal geometry tailored to complex domains.
3. Algorithmic Derivation and Wirtinger Calculus
The functional gradient of the cost function in is calculated using the extension of Wirtinger calculus to infinite-dimensional settings. For , treating variables and their conjugates independently, the derivatives are
where . The update rules, with step-size , are
For standard CKLMS, only the update is present (Bouboulis et al., 2011, Bouboulis et al., 2010).
4. Relation to Linear and Nonlinear Filtering
The increase in model expressivity over standard complex LMS arises through the kernel mapping, which enables nonlinear regression via the "kernel trick" in RKHSs. The standard (non-augmented) CKLMS recovers many properties of classic LMS, including convergence and misadjustment, but offers substantially lower mean-squared error and faster convergence in nonlinear scenarios (Bouboulis et al., 2010, Bouboulis et al., 2010, Bouboulis et al., 2010).
The widely linear (augmented) CKLMS exploits "pseudo-covariance" and complementary information present in improper (noncircular) complex signals. For circular signals, standard and augmented CKLMS yield identical results (Bouboulis et al., 2011).
Table: Comparison of CKLMS Variants
| Variant | Model Formulation | Key Advantage |
|---|---|---|
| CKLMS | Standard nonlinear adaptive estimation for complex signals | |
| ACKLMS | Enhanced modeling for noncircular signals |
5. Sparsification, Complexity, and Implementation
The CKLMS and ACKLMS algorithms possess an inherent computational growth, as each time step potentially increases the expansion in the RKHS. To control complexity, a sparsification strategy (novelty criterion) is typically employed:
- Accept a new center into the dictionary only if the distance for all exceeds a threshold , and .
- Otherwise, discard as a non-novel or sufficiently modeled sample.
This results in a manageable dictionary size (often hundreds of centers for several thousand samples), making real-time operation feasible. Each iteration then requires kernel and inner product operations, where is the size of the current dictionary (Bouboulis et al., 2010, Bouboulis et al., 2010).
6. Performance Characteristics and Empirical Results
Empirical studies in nonlinear channel equalization and identification tasks demonstrate the benefits of CKLMS and its augmented extension:
- For noncircular inputs (), ACKLMS achieves approximately $2$ dB lower steady-state MSE than standard CKLMS.
- For circular inputs, both kernel methods yield virtually identical performance.
- Both CKLMS and ACKLMS achieve faster convergence and significantly reduced error floors compared to classical LMS, widely linear LMS, and other nonlinear algorithms such as MLP or nonlinear gradient descent (Bouboulis et al., 2011, Bouboulis et al., 2010, Bouboulis et al., 2010).
- The generalized Complex Kernel LMS (gCKLMS) further demonstrates superior performance by including a pseudo-kernel, providing better convergence and steady-state MSE especially when real and imaginary signal components display different statistics or are correlated (Boloix-Tortosa et al., 2019).
7. Extensions and Kernel Design Considerations
The generalized framework developed in "The Generalized Complex Kernel Least-Mean-Square Algorithm" (Boloix-Tortosa et al., 2019) provides a kernel and pseudo-kernel approach rooted in widely linear RKHS theory. Design rules dictate:
- Use only the kernel if and are independent and share the same statistics; set the pseudo-kernel .
- Employ distinct kernel bandwidths or pseudo-kernel cross-terms if real and imaginary parts differ in smoothness or are statistically dependent.
- Pseudo-kernel introduction is critical for learning tasks with complex-valued signals where real and imaginary parts are not independent or have distinct properties.
The learning rate and kernel parameters should be selected to satisfy stability bounds and adjusted based on validation data. Dictionary management is essential for computational tractability in long-running applications.
References
- "The Augmented Complex Kernel LMS" (Bouboulis et al., 2011)
- "Extension of Wirtinger's Calculus to Reproducing Kernel Hilbert Spaces and the Complex Kernel LMS" (Bouboulis et al., 2010)
- "The Complex Gaussian Kernel LMS algorithm" (Bouboulis et al., 2010)
- "Extension of Wirtinger Calculus in RKH Spaces and the Complex Kernel LMS" (Bouboulis et al., 2010)
- "The Generalized Complex Kernel Least-Mean-Square Algorithm" (Boloix-Tortosa et al., 2019)