Papers
Topics
Authors
Recent
Search
2000 character limit reached

Cayley Transform Parametrization

Updated 19 February 2026
  • Cayley Transform Parametrization is a rational mapping from Lie algebra elements (skew-symmetric or anti-Hermitian) to Lie groups like SO(n) and SU(n), ensuring automatic orthogonality/unitarity.
  • It enables efficient computation in diverse fields such as quantum mechanics, optimization on manifolds, machine learning, and robotics by using unconstrained variable representations.
  • Extensions like the scaled and block-skew variants allow global representations for matrices with problematic eigenvalues and facilitate robust optimization on the Stiefel manifold and SE(3) spaces.

The Cayley transform parametrization is a classical and versatile technique that provides a rational (non-exponential) map from the Lie algebra of skew-symmetric (or anti-Hermitian, or more generally Lie algebra) elements to certain matrix Lie groups such as SO(n), SU(n), and the Stiefel manifold. Its algebraic structure, computational efficiency, and unconstrained variable usage have made it fundamental in diverse areas, including quantum mechanics, optimization on manifolds, machine learning, molecular simulation, K-theory, and geometric statistics.

1. Mathematical Foundation of the Cayley Transform Parametrization

The Cayley transform for a Lie group GG (e.g., SO(n)SO(n), SU(n)SU(n)) is a rational map from the associated Lie algebra g\mathfrak{g} (typically skew-symmetric or anti-Hermitian matrices) to GG. The essential form is

C(M)=(I+M)(IM)1C(M) = (I + M)(I - M)^{-1}

where MM is skew-symmetric (M=MM^\top = -M) or anti-Hermitian (M=MM^\dagger = -M), and IMI - M is invertible. For SU(2)SU(2), one takes M=2ianJM=2i a\, \mathbf{n}\cdot\mathbf{J} with J\mathbf{J} the spin generators, aRa \in \mathbb{R}, and n2=1\mathbf{n}^2=1 (Kortryk, 2015).

For real orthogonal matrices UO(n)U\in O(n), the Cayley transform maps skew-symmetric SS to orthogonal QQ via Q=(IS)(I+S)1Q=(I-S)(I+S)^{-1}, if I+SI+S is invertible (Biborski, 22 Jan 2026). Conversely, for certain UU, especially with eigenvalue 1-1, a "signature" diagonal D{±1}nD\in\{\pm1\}^n is used so that DUDU lies in the Cayley domain. Then U=D(IS)(I+S)1U = D(I - S)(I + S)^{-1} is a global representation for all O(n)O(n) matrices (Biborski, 22 Jan 2026, Helfrich et al., 2017).

A central property is that orthogonality/unitarity is automatic: if S=SS^\top = -S, then

QQ=[(I+S)1(IS)](IS)(I+S)1=IQ^\top Q = [(I+S)^{-1}(I-S)]^\top (I-S)(I+S)^{-1} = I

since the product (IS2)=(IS)(I+S)=(I+S)(IS)(I-S^2) = (I-S)(I+S) = (I+S)(I-S). Analogous constructions hold for skew-Hermitian AA and the unitary group U(n)U(n).

2. Extensions: Scaled and Generalized Cayley Parametrizations

The classical Cayley parametrization does not cover matrices with eigenvalue 1-1 because ISI-S becomes singular. To resolve this, Kahan and successors introduced a "scaled Cayley transform": W=(I+A)1(IA)DW = (I + A)^{-1}(I - A) D where AA is skew-symmetric and DD is a fixed diagonal of signs for O(n)O(n) (Helfrich et al., 2017, Maduranga et al., 2018). In the complex/unitary case, AA becomes skew-Hermitian and D=diag(eiθj)D=\operatorname{diag}(e^{i\theta_j}) is a diagonal of phases, allowing a differentiable parameterization of U(n)U(n) with AA and θ\theta as unconstrained variables (Maduranga et al., 2018).

For rectangular orthogonal matrices and the Stiefel manifold $\St(n,p)$, the Cayley transform is generalized via block-skew coordinates. Given a "center" SO(n)S\in O(n), the following map holds for $U\in \St(n,p)$: ΦS(U)=[AS(U)BS(U) BS(U)0]\Phi_S(U) = \begin{bmatrix}A_S(U) & -B_S(U)^\top \ B_S(U) & 0\end{bmatrix} with AS(U)=2(Ip+SleTU)T(UTSle)(Ip+SleTU)1A_S(U)=2(I_p+S_\text{le}^T U)^{-T}(U^T S_\text{le})(I_p+S_\text{le}^TU)^{-1} and BS(U)=SriTU(Ip+SleTU)1B_S(U) = -S_\text{ri}^T U (I_p+S_\text{le}^T U)^{-1}, so every UU in an open dense subset has an unconstrained vector representation (Kume et al., 2023, Kume et al., 2023, Jauch et al., 2018).

The Cayley parametrization is further extended to the group of rigid-body motions SE(3)SE(3) in robotics as: TCay(ξ)=(I412ξ^)1(I4+12ξ^)T_\text{Cay}(\xi) = \big(I_4 - \frac{1}{2}\hat{\xi}\big)^{-1} \big(I_4 + \frac{1}{2}\hat{\xi}\big) where ξ\xi is a 6-vector encoding translation and rotation, and ξ^\hat{\xi} is its se(3) matrix representation (Barfoot et al., 2021).

3. Computational Techniques and Analytical Properties

A key computational advantage is that the Cayley transform yields polynomial or rational forms in unconstrained variables, which allows efficient evaluation, gradient computation, and inversion. In the SU(2)SU(2) spin-jj case, the Cayley transform of a spin operator leads to a degree-$2j$ matrix polynomial whose coefficients are explicit rational functions (in tan(θ/2)\tan(\theta/2)) and can be computed via resolvent expansions and truncations of determinants (Kortryk, 2015).

In high-dimensional settings, the construction of the Cayley parameters can be performed algorithmically, e.g., for any UO(n)U\in O(n) the signature matrix DD can be found by a Gaussian-elimination-style pivoting in O(n3)O(n^3) arithmetic steps, such that U=D(IS)(I+S)1U = D(I - S)(I + S)^{-1}, with S21+2n\|S\|_2 \le 1 + 2^n (Biborski, 22 Jan 2026).

For over-parameterized cases like the Stiefel manifold (n>pn > p), the parameter space is a Euclidean ball of dimension np12p(p+1)np - \frac{1}{2}p(p+1), and direct optimization in this space is possible. Adaptive recentring strategies, which change the center SS whenever optimization steps approach a domain singularity, mitigate ill-conditioning and ensure bounded parameter norms (Kume et al., 2023, Kume et al., 2023).

In optimization, Cayley-based retractions serve as efficient alternatives to matrix exponentials, QR, or polar decompositions. The iterative Cayley retraction on Stiefel avoids inversion and achieves O(np2)O(n p^2) complexity per step, markedly accelerating training in deep learning or Riemannian settings (2002.01113).

4. Practical Applications in Physics, Machine Learning, and Geometry

Cayley parametrization is utilized in a spectrum of modern applications:

  • Quantum spin and SU(2)SU(2) theory: Offers a closed, computationally efficient polynomial form for arbitrary spin-j rotations, advantageous over the exponential/CFZ expansion for iterative algorithms and rational approximations (Kortryk, 2015).
  • K-theory and operator algebras: The Cayley transform provides an explicit isomorphism at the level of cycles between van Daele KK-theory and KK-theory for graded CC^*-algebras. It preserves Morita equivalence, compatibility with Kasparov products, and index pairings, underpins KOKO-theory, and enables explicit representative computations for topological phases of matter (Bourne et al., 2019).
  • Robotics and Vision: The SE(3) Cayley map enables fast, fully polynomial pose updates without transcendental functions, with improved numerical conditioning for iterative pose alignment (the "CayPer" algorithm) and geometric optimization (Barfoot et al., 2021).
  • Deep Learning:
    • Orthogonal and unitary RNNs enforce norm-preserving transformations via Cayley-parametrized weight matrices, yielding better gradient flow and robustness. The "scaled Cayley" overcomes blind spots at eigenvalue 1-1, allowing fully expressive, differentiable unitary or orthogonal parameterizations (Helfrich et al., 2017, Maduranga et al., 2018).
    • Orthogonal convolution layers via Cayley transforms parameterize filters as skew-symmetric convolutions, enabling direct, fast, and exact orthogonality constraints in both spatial and Fourier domains. This confers stability, tighter Lipschitz bounds, and certified adversarial robustness (Trockman et al., 2021).
    • Layerwise and end-to-end Lipschitz control in 1D CNNs is enabled by combining Cayley-orthogonal parameterizations with the controllability Gramian, resulting in architectures with provable robustness and unconstrained training (Pauli et al., 2023).
  • Optimization on Manifolds:
    • On the Stiefel (and Grassmann) manifolds, the Cayley parametrization converts orthogonality constraints into unconstrained vector optimization, facilitating Euclidean algorithm transfer, fast iteration, and global convergence guarantees (Kume et al., 2023, Kume et al., 2023, 2002.01113, Jauch et al., 2018).
  • Data Fitting and Geometric Statistics:
    • Ellipsoid fitting via Cayley parametrization replaces constrained search over orthogonal transformations with unconstrained skew-symmetric blocks, allowing globally elliptic solutions and efficient nonlinear least-squares with explicit gradients (Melikechi et al., 2023).
    • Stochastic simulation and random matrix theory exploit the Cayley change-of-variables to sample from Stiefel or Grassmann distributions, with Jacobians computed via Kronecker-structured derivatives and asymptotic normal approximations linking random Cayley parameters to Gaussians under Haar measure (Jauch et al., 2018).
  • SU(3) Gauge Theory Simulation:
    • Modified Cayley maps for SU(3)\mathrm{SU}(3) include a nonlinear phase to enforce unit determinant, providing a local diffeomorphism to SU(3)SU(3) suitable for Hamiltonian splitting and hybrid Monte Carlo integration in lattice gauge theory. This construction maintains reversibility, preserves volume, and can enhance numerical stability compared to exponential mappings (Schäfers et al., 2024).

5. Limitations and Handling of Singularities

The surjectivity of the classical Cayley map fails on group elements with eigenvalue 1-1 (or, for SE(3)SE(3), on 180° rotations). Scaled variants (introducing a diagonal DD or phase factors in complex/unitary cases) guarantee global parameterizations by moving potential singularities away from the current chart (Helfrich et al., 2017, Biborski, 22 Jan 2026, Maduranga et al., 2018). In Stiefel/Grassmann optimization the domain of the Cayley map is only an open dense subset, but adaptive strategies (center point shifting) or pivoting ensure that optimization iterates remain within the domain (Kume et al., 2023, Kume et al., 2023).

Jacobians and differentials of the transform can be singular at chart boundaries, complicating change-of-variable formulas or gradient propagation. In practice, careful numerical safeguards (e.g., monitoring parameter norms, recentering) are employed to avoid these ill-conditioned regions (Jauch et al., 2018, Kume et al., 2023).

6. Comparative Analysis With Exponential and Other Parametrizations

The Cayley transform is a rational function in the algebra parameter, which avoids trigonometric and transcendental evaluations required by the exponential map. For SU(2)SU(2), the Cayley polynomial for UCay(a)U_{\rm Cay}(a) maps directly and exactly to the exponential parametrization via a=tan(θ/2)a = \tan(\theta/2); the Cayley coefficients are purely rational and more amenable to analytic and numerical computation than the truncated Taylor/arcsin series of the exponential (CFZ) expansion (Kortryk, 2015).

In optimization and machine learning, the Cayley parametrization leads to O(n3n^3) complexity per orthogonality-enforcing update or layer, matching or improving on polar or QR retraction methods especially in high dimensions, and conferring exactness by construction (2002.01113). Compared to penalty or SVD-based approaches for orthogonal convolutions, the Cayley approach is both more expressive and computationally efficient (Trockman et al., 2021).

For Lie group-valued integration in molecular dynamics (SU(3)SU(3)), modified Cayley transforms offer improved step-size stability and acceptance rates compared to exponential maps, notably in moderate–coarse regimes; standard exponential maps retain advantages at very small step sizes (Schäfers et al., 2024).

7. Summary Table of Canonical Forms and Their Domains

Group / Manifold Classical Cayley Map Global Extension Domain / Notes
SO(n)SO(n) Q=(IS)(I+S)1Q = (I-S)(I+S)^{-1} Q=D(IS)(I+S)1Q = D(I-S)(I+S)^{-1}, DD sign I+SI+S invertible; all O(n)O(n)
SU(n)SU(n) Q=(IS)(I+S)1Q = (I-S)(I+S)^{-1}, S=SS^\dagger=-S Q=(IS)(I+S)1DQ = (I-S)(I+S)^{-1} D, DD unitary phase I+SI+S invertible
St(n,p)St(n,p) U=S(IV)(I+V)1In×pU = S(I-V)(I+V)^{-1}I_{n\times p} Adaptive center, block-skew VV Open dense set
SE(3)SE(3) TCay(ξ)T_{Cay}(\xi) as above 180180^\circ rotation excluded I12ξ^I - \frac{1}{2}\hat\xi nonsing
SU(3)\mathrm{SU}(3) Q=(IΩ)1(I+Ω)Q = (I-\Omega)^{-1}(I+\Omega) Modified with phase θ(Ω)\theta(\Omega) Local diffeomorphism

References

  • "Cayley transforms of su(2) representations" (Kortryk, 2015)
  • "A Constructive Cayley Representation of Orthogonal Matrices and Applications to Optimization" (Biborski, 22 Jan 2026)
  • "Orthogonal Recurrent Neural Networks with Scaled Cayley Transform" (Helfrich et al., 2017)
  • "Complex Unitary Recurrent Neural Networks using Scaled Cayley Transform" (Maduranga et al., 2018)
  • "Adaptive Localized Cayley Parametrization for Optimization over Stiefel Manifold" (Kume et al., 2023)
  • "Generalized Left-Localized Cayley Parametrization for Optimization with Orthogonality Constraints" (Kume et al., 2023)
  • "Efficient Riemannian Optimization on the Stiefel Manifold via the Cayley Transform" (2002.01113)
  • "Orthogonalizing Convolutional Layers with the Cayley Transform" (Trockman et al., 2021)
  • "Random orthogonal matrices and the Cayley transform" (Jauch et al., 2018)
  • "Ellipsoid fitting with the Cayley transform" (Melikechi et al., 2023)
  • "A modified Cayley transform for SU(3) molecular dynamics simulations" (Schäfers et al., 2024)
  • "Vectorial Parameterizations of Pose" (Barfoot et al., 2021)
  • "The Cayley transform in complex, real and graded KK-theory" (Bourne et al., 2019)
  • "Lipschitz-bounded 1D convolutional neural networks using the Cayley transform and the controllability Gramian" (Pauli et al., 2023)
  • "Cayley parametrization and the rotation group over a non-archimedean pythagorean field" (Mahmoudi, 2016)

The Cayley transform parametrization is thus a structurally transparent, computationally efficient, and algebraically robust tool for representing, analyzing, and optimizing orthogonality and unitarity constraints in both finite and infinite-dimensional settings, across mathematics, physics, engineering, and machine learning.

Topic to Video (Beta)

Whiteboard

No one has generated a whiteboard explanation for this topic yet.

Follow Topic

Get notified by email when new papers are published related to Cayley Transform Parametrization.