Papers
Topics
Authors
Recent
2000 character limit reached

Weighted Minimum Mean-Squared Error Algorithm

Updated 27 November 2025
  • Weighted Minimum Mean-Squared Error Algorithm is an optimization method that minimizes a weighted quadratic error function, essential in MIMO systems and resource allocation.
  • It employs weight matrices and matrix-monotone functions to generalize traditional MSE minimization, enabling adaptive designs like beamforming and multi-agent estimation.
  • Its solution structure, leveraging SVD and water-filling strategies, facilitates iterative convergence while satisfying power and fairness constraints.

A weighted minimum mean-squared error (WMMSE) algorithm is a class of optimization techniques for systems where error covariance plays a central role in performance, and where the objective is to minimize a quadratic form of estimation or detection error, possibly after transformation by application-specific weightings. In particular, the WMMSE paradigm generalizes classic MSE minimization by introducing weight matrices (scalar or matrix-valued) and by minimizing a monotone function (e.g., trace, log-determinant) of the weighted MSE matrix. This approach provides a unified foundation for the design of MIMO transceivers, multi-agent estimators, array processing, resource allocation, beamforming, and beyond, under a variety of power, fairness, and structural constraints.

1. Mathematical Foundations and Problem Statement

The core WMMSE optimization problem arises in vector Gaussian models, such as the general MIMO transceiver system: y=HFs+n,y = H F s + n, with transmitted symbol vector sCN×1s \in \mathbb C^{N \times 1}, precoder FCNtx×NF \in \mathbb C^{N_{tx} \times N}, channel HCNrx×NtxH \in \mathbb C^{N_{rx} \times N_{tx}}, and noise nCN(0,Rn)n \sim \mathcal{CN}(0, R_n). For a linear receiver GG, the error MSE matrix is

Φ(G,F)=E[(Gys)(Gys)H].\Phi(G, F) = \mathbb E \big[(G y - s)(G y - s)^{H}\big].

Classical MSE objectives operate on the diagonal or trace of Φ\Phi, e.g. Tr[Φ]\operatorname{Tr}[\Phi]. The weighted MSE approach introduces a weighting operator, extending to a matrix-field as

Ψ(G,F)=k=1KWkHΦ(G,F)Wk+Π,\Psi(G, F) = \sum_{k=1}^K W_k^{H} \Phi(G, F) W_k + \Pi,

where WkW_k and Π0\Pi \succeq 0 are predetermined, not necessarily diagonal (Xing et al., 2013, Xing et al., 2016).

The general WMMSE objective chooses an increasing matrix-monotone scalar function ff (e.g., Tr\operatorname{Tr}, log-\log|\cdot|, λmax\lambda_{\max}) and solves

minG,Ff(Ψ(G,F)),s.t.Tr(FFH)P.\min_{G, F} f\big(\Psi(G, F)\big), \quad \text{s.t.} \quad \operatorname{Tr}(F F^H) \leq P.

Alternatives include per-antenna constraints, per-user constraints, or MIMO network extensions (Zhao et al., 2022).

2. Solution Structure: SVD Parametrization and Water-Filling

The WMMSE problem's solution admits a canonical structure based on the eigen-decomposition or SVD of the effective channel. For any fixed FF, the optimal GG is the LMMSE equalizer: Gopt=(HFFHHH+Rn)1HF,G_{\mathrm{opt}} = (H F F^H H^H + R_n)^{-1} H F, implying

Φ(Gopt,F)=(FHHHRn1HF+I)1.\Phi(G_{\mathrm{opt}}, F) = (F^H H^H R_n^{-1} H F + I)^{-1}.

The remaining precoder design reduces to an optimization over FF: Ψ(F)=k=1KWkH(FHHHRn1HF+I)1Wk+Π.\Psi(F) = \sum_{k=1}^K W_k^H \left( F^H H^H R_n^{-1} H F + I \right)^{-1} W_k + \Pi. By SVD,

Rn1/2H=UHΛHVHH,R_n^{-1/2} H = U_H \Lambda_H V_H^H,

and the optimal FF is parametrized as

Fopt=VHΛFUFH,F_{\mathrm{opt}} = V_H \Lambda_F U_F^H,

where ΛF\Lambda_F is diagonal (power allocation) and UFU_F is a unitary "alignment" dependent on the weights and ff (Xing et al., 2013, Xing et al., 2016). The corresponding power allocation is solved via scalar water-filling.

Special Cases

Sum-MSE Minimization (f=Trf = \operatorname{Tr}): Alignment UFU_F matches the left-singular vectors of WW, and ΛF\Lambda_F is obtained by

fj2=[μwj21λh,j]+,f_j^2 = \left[ \frac{\mu}{w_j^2} - \frac{1}{\lambda_{h, j}}\right]^+,

with μ\mu chosen for the power constraint.

Capacity Maximization (f=logΨf = -\log|\Psi|): Alignment UFU_F aligns with the eigenvectors of WΠ1WHW\Pi^{-1}W^H, and the scalar optimization is another water-filling problem.

This water-filling-based structure is central in all algorithmic implementations.

3. Alternating Optimization Algorithms and Iterative Schemes

In the general matrix-monotone case, closed-form solutions for FoptF_{\mathrm{opt}} are unavailable, requiring iterative alternating minimization (Xing et al., 2013, Xing et al., 2016):

  1. Initialize F(0)F^{(0)} within the feasible set.
  2. Iterate until convergence:

    1. Update G(t)G^{(t)} via LMMSE formula.
    2. Compute Ψ(t)\Psi^{(t)}.
    3. Update F(t+1)F^{(t+1)} via diagonalization (SVD) and water-filling on power allocations, with updated alignment.
    4. Check for convergence in f[Ψ]f[\Psi].

Each iteration involves an SVD, possible eigenvalue computations, and a water-filling step, ensuring monotonic decrease in objective and convergence to a stationary point (Xing et al., 2016).

Algorithmic Complexity

  • Per Iteration: O(n3)O(n^3) for SVDs and EVDs, O(n)O(n) for water-filling.

  • Total: Moderately efficient for moderate system sizes; scalable relaxations enable applications in large-scale/capacity-constrained scenarios (Pellaco et al., 2022, Zhao et al., 2022).

4. Extensions: Matrix-Field, Robustness, and Application Domains

The matrix-field extension Ψ(G,F)\Psi(G, F) allows for highly general objectives. By choice of ff and weight matrices, WMMSE formulations encompass:

  • Fairness objectives (e.g., min–max-MSE, realized by max-eigenvalue ff and DFT weighting).
  • Nonlinear transceiver structures: Matrix WW set to lower-triangular or structured, yielding designs for Tomlinson-Harashima or Decision-Feedback Equalizers.
  • Capacity maximization in relays/multi-hop MIMO: Capacity objectives (log-determinant) are special cases of matrix-field WMMSE (Xing et al., 2013).
  • Robust, per-antenna, ARQ, network MIMO design: By varying weights and monotone ff, diverse constraints and settings are encompassed (Xing et al., 2016).
  • Team mean-squared error in multi-agent filtering: Weighted quadratic costs generalize to block-matrix weights for coordinating multiple estimators (Afshari et al., 2019).
  • Adaptive selection and bandit problems: Weighted MSE arises in maximizing statistical informativeness under sampling constraints (Boda et al., 2019).

These extensions unify a spectrum of estimation, control, and communications problems (Xing et al., 2013, Xing et al., 2016).

5. Representative Algorithms and Implementation Strategies

Below is archetypal high-level pseudocode for the general matrix-field WMMSE algorithm (Xing et al., 2013):

1
2
3
4
5
6
7
8
9
10
11
12
13
Input: H, R_n, {W_k}, Π, P, f(·), tolerance ε
1: [U_H, Λ_H, V_H] = svd(R_n^{-1/2} H)
2: Initialize F = V_H (sqrt(P/r) I_r) U_0^H
3: repeat
4:   G = (H F F^H H^H + R_n)^{-1} H F
5:   A = (F^H H^H R_n^{-1} H F + I)^{-1}
6:   Ψ = sum_k W_k^H A W_k + Π
7:   Compute alignment unitary U_F from EVDs as per f(·), {W_k}, Π
8:   Solve for power allocations {f_j}: min g(diag_j{…}) s.t. sum f_j^2 ≤ P
9:   Λ_F = diag(f_1,…,f_r)
10:  F = V_H Λ_F U_F^H
11: until |f(Ψ_new) - f(Ψ_old)| < ε
12: Return F*, G*

This encodes the block-coordinate procedure with per-block convexity, SVD structure, and monotonic cost decrease.

6. Key Methodological Variations and Theoretical Guarantees

Distinct methodological perspectives underpin WMMSE solvers:

  • Lagrange-multiplier method: Solves KKT conditions directly for structured weights, with closed-form SVD and scalar water-filling (Xing et al., 2016).
  • Majorization-theory approach: Exploits Schur-concavity/convexity of weighted MSE objectives to deduce optimal alignment and power allocation by eigenvalue ordering, often yielding one-shot solutions.
  • Block-coordinate descent (BCD) and MM: General nonconvex WMMSE settings are handled by BCD, with convergence to stationary points ensured by separable convexity within blocks and compactness of the feasible set (Xing et al., 2016).
  • Matrix-inverse-free and accelerated variants: For massive, high-dimensional, or hardware-accelerated settings, Schulz iterations or gradient schemes are used to avoid explicit inversion, offering parallelizability and speed (Pellaco et al., 2022, Gao et al., 23 Oct 2025).

These variants accommodate large-scale problems and unconventional scenarios while adhering to foundational monotonicity and convergence assurances.

7. Interconnections and Impact

The weighted MMSE paradigm is mathematically equivalent, by suitable reformulation, to a range of utility-maximizing resource allocation problems in wireless, control, and distributed estimation:

  • Weighted sum-rate maximization is equivalent to minimizing WMMSE via an auxiliary variable reformulation (Zhang et al., 2023).
  • Fractional programming (WSR-FP) and MM approaches coincide with WMMSE updates for appropriate surrogate functions.
  • Special cases (classic ZF, MMSE, THP/DFE, robust filtering) correspond to particular forms of weight matrices and ff (Xing et al., 2013).
  • In multi-agent best-arm identification or adaptive sensor selection, minimization of weighted (possibly coupled) MSEs maps to WMMSE objectives (Boda et al., 2019).

The unifying character and algorithmic tractability of WMMSE methods have made them a central design tool in MIMO systems, resource allocation, distributed filtering, and convex optimization in high-dimensional inference (Xing et al., 2013, Xing et al., 2016, Zhao et al., 2022).


Principal References:

  • "A Matrix-Field Weighted Mean-Square-Error Model for MIMO Transceiver Designs" (Xing et al., 2013)
  • "On Weighted MSE Model for MIMO Transceiver Optimization" (Xing et al., 2016)
Slide Deck Streamline Icon: https://streamlinehq.com

Whiteboard

Forward Email Streamline Icon: https://streamlinehq.com

Follow Topic

Get notified by email when new papers are published related to Weighted Minimum Mean-Squared Error Algorithm.