Tensor-Based Proximal Alternating Minimization
- The paper introduces a novel tensor-based proximal alternating minimization method that reformulates inhomogeneous quartic optimization into a four-block multilinear problem, enabling efficient closed-form updates.
- It employs block coordinate descent with proximal regularization, ensuring strong convexity of each subproblem and guaranteeing convergence under mild assumptions.
- Empirical studies on Bose–Einstein condensate simulations demonstrate rapid convergence and lower computational cost compared to inexact ADMM across various discretizations.
A tensor-based proximal alternating minimization (PAM) algorithm is a numerical optimization approach developed for solving inhomogeneous quartic polynomial optimization problems on the sphere, with a structure inspired by applications such as discretized Bose–Einstein condensate (BEC) ground state computations. The method constructs an equivalence between a fourth-degree inhomogeneous polynomial minimization and a four-block multilinear optimization problem (MOP), exploiting tensor representations and block coordinate descent with proximal regularization for efficient, closed-form iterative updates. The convergence of the algorithm is established under mild assumptions, and empirical studies demonstrate notable gains compared to alternative methods such as inexact ADMM (Chen et al., 31 Dec 2025).
1. Problem Formulation and Tensor-Multilinear Equivalence
The method addresses minimization of an inhomogeneous quartic polynomial of the form: where and is symmetric. Any degree-4 polynomial in variables can be expressed via a symmetric order-4 tensor as . In the inhomogeneous case, the variable is lifted to , and the homogenized tensor encodes the full objective.
Defining a corresponding four-way multilinear function,
with each (first entry fixed at 1, remainder from the respective variables), a key result is that, assuming a mild concavity condition on , minimization of the original quartic is equivalent to minimizing over four blocks constrained to the unit sphere: To guarantee global concavity, a shift is introduced, with . The modified problem is
and its multilinear equivalent is
with the minimizers of both formulations coinciding under these conditions (Chen et al., 31 Dec 2025).
2. Multi-Block Structure and Blockwise Minimization
The MOP
exhibits a four-block structure, each block constrained to the unit sphere. By freezing three blocks, the subproblem for the remaining block reduces to a quartic–quadratic function composed with a proximal term. Multilinearity ensures that block coordinate descent (BCD) delivers closed-form updates at each step.
This natural cyclic update scheme is the basis for the overall proximal alternating minimization algorithm.
3. Proximal Alternating Minimization Updates
Defining , the PAM algorithm performs, for ,
Each block update is a strongly convex quadratic minimization on the sphere, with a closed-form solution: where
Here, selects the relevant indices in the lifted variable. The regularization parameters ensure the strong convexity of each subproblem.
4. Convergence Properties
Key convergence assumptions include (i) for concavity, (ii) positive proximal coefficients , and (iii) compactness of the feasible set . Theoretical results (Theorem 4.1 and 4.2) establish:
- Descent: After each full cycle, the objective decreases at least by a multiple of the squared-step norm, guaranteeing monotonic convergence.
- Vanishing Steps: The sum of squared step sizes is finite; thus, as (with collecting all lifted variables).
- Cluster Points: Compactness ensures existence of accumulation points.
- Stationarity: Any cluster point satisfies the KKT conditions for the MOP, i.e., each block’s variational inequality for .
5. Computational Complexity Per Iteration
Each block update's primary expense is the tensor contraction . For a general dense order-4 tensor, this cost scales as . Nevertheless, when arises from BEC applications, e.g., via discretized Gross–Pitaevskii energies, the tensor exhibits significant sparsity and symmetry; the dominant cost then reduces to or even per contraction for 1D and 2D finite-difference grids.
Each block update further requires operations for normalization and vector addition, so, in practice, total cost per full iteration is in 1D and can approach in 2D with optimal exploitation of tensor structure.
6. Hyperparameter Selection and Acceleration
Several algorithmic choices affect performance:
- Shift parameter : Chosen to satisfy . Slightly larger improves concavity and numerical stability but excessive scaling suppresses the multilinear term, slowing convergence.
- Proximal coefficients : Any positive value yields convergence. In practice, a moderate constant (e.g., $0.5$) balances progress and stability.
- Initialization: Random unit-norm initialization suffices; marginally improved warm starts may be obtained via a few steps of the power method or MBI.
- Acceleration strategies: Proximal coefficients may be increased adaptively when little progress is detected, or SQUAREM-type extrapolation may be applied to the iterates for superlinear convergence in early phases.
7. Numerical Experiments and Performance Comparison
Empirical validation employed synthetic BEC data, with both 1D and 2D Gross–Pitaevskii energy discretizations. The algorithm was compared to an inexact ADMM (with Newton-type inner iterations), across a range of discretization sizes:
- Iteration and runtime efficiency: PAM required $4$–$20$ outer iterations and $0.001$–$0.05$ seconds, while ADMM took $50$–$300$ iterations and $0.02$–$0.2$ seconds, with both methods attaining identical objective values to machine accuracy.
- Convergence profile: PAM's objective exhibited rapid initial decrease and smooth convergence, contrasting with the slower final-phase descent in ADMM, due to inexact Newton steps.
- Ground-state recovery: Final minimizers reproduced nonnegative, symmetric, and exponentially decaying ground-state wavefunctions, in agreement with analytical expectations for BEC.
- Sensitivity to : Modest variation of from to had little impact on final objectives, with runtime varying by at most a factor of two. Setting generated instability; large slowed progress.
In sum, the tensor-based PAM method exploits the equivalence between inhomogeneous quartic and multilinear formulations to provide efficient blockwise updates with simple convergence guarantees, demonstrating superior per-iteration efficacy compared to ADMM in the context of synthetic BEC tests (Chen et al., 31 Dec 2025).