Clifford-Steerable CNNs
- Clifford-Steerable CNNs are deep learning architectures that achieve exact equivariance under pseudo-Euclidean isometries using multivector-valued feature fields.
- They employ implicit, group-steerable convolutional kernels via Clifford group-equivariant networks to model complex physical phenomena in tasks like fluid dynamics and electrodynamics.
- The method integrates FFT-based optimizations and conditional kernel techniques to restore full harmonic bases, offering enhanced accuracy and stability despite higher computational costs.
Clifford-Steerable Convolutional Neural Networks (CS-CNNs) are a class of deep learning architectures designed to achieve exact equivariance under pseudo-Euclidean isometry groups, unifying the treatment of translational, orthogonal, and Lorentzian symmetries within a single convolutional framework. These networks process feature fields whose values are multivectors in Clifford algebras, allowing natural and explicit representations for a wide range of physical quantities. CS-CNNs implement implicit, group-steerable convolutional kernels via Clifford group-equivariant neural networks, and have demonstrated state-of-the-art performance on partial differential equation (PDE) forecasting tasks including fluid dynamics and relativistic electrodynamics (Zhdanov et al., 2024, Szarvas et al., 15 Oct 2025).
1. Pseudo-Euclidean Symmetry and Clifford Algebra
A central aspect of CS-CNNs is their principled equivariance to the isometry group , where acts as the pseudo-orthogonal (Lorentz) group on , a vector space with metric signature ; for example, for classical Euclidean space and for Minkowski spacetime.
The Clifford algebra , generated by orthogonalizing basis vectors under the bilinear form , provides the representation space for the multivector-valued feature fields directly encoding scalar, vector, and higher-grade quantities. The action is lifted to the entire algebra via the natural grading-preserving algebra automorphism 0, enabling precise equivariant transformations for all geometric components present in the data (Zhdanov et al., 2024, Szarvas et al., 15 Oct 2025).
2. Equivariant Convolutions and Steerable Kernel Parameterization
A convolutional layer 1 is 2-equivariant if and only if its kernel 3 satisfies the steerability relation:
4
CS-CNNs realize such steerable kernels via an implicit parameterization. Specifically, a Clifford group-equivariant neural network (CGENN) 5 produces c-channel multivector outputs. These are passed through a fixed, linear "kernel head" 6 that expands multivector entries into 7-linear endomorphisms of 8 by weighted, partial geometric product expansions, with learnable mixing weights 9:
0
Together, equivariance of 1 and 2 ensures that their composition 3 satisfies the group steerability constraint (Zhdanov et al., 2024).
3. Network Architecture, Implementation, and Optimization
The principal components of CS-CNNs include:
- Input/Output: Channel-wise multivector fields 4 decomposed into grades.
- Steerable convolution: 5.
- Nonlinearity: Grade-wise geometric-product gating, e.g., 6 with a scalar nonlinearity 7 on the grade-0 part.
- Residual blocks: Combination of steerable convolution, group normalization, activation, and skip connections.
Optimization techniques include pre-computation of geometric-product Cayley tables for efficient kernel expansion, the use of FFT-based (Fast Fourier Transform) convolution and backpropagation, and the implementation of CGENNs with grade-structured linear and geometric-product layers (Zhdanov et al., 2024).
4. Kernel Basis Completeness and Conditional Kernels
The standard CS-CNN parameterization does not realize a complete basis of equivariant steerable kernels. For example, in the O(2)-equivariant vector-to-vector setting, harmonic components at frequency two (cos8, sin9) are missing from the representable space when kernels depend solely on the spatial offset 0, limiting expressivity.
Conditional Clifford-Steerable CNNs (C-CSCNNs) solve this by incorporating global, translation-invariant summaries 1 of the input field as additional kernel arguments:
2
Here, 3 is typically the mean pooled multivector over spatial regions, ensuring both translation and 4-equivariance. The conditional kernel 5 is parameterized via an O(6)-equivariant network 7 and the kernel head 8 as before, restoring the missing harmonics and the full kernel basis in a single layer (Szarvas et al., 15 Oct 2025).
5. Empirical Performance in PDE Modeling
Empirical evaluations demonstrate that CS-CNNs and their conditional variants achieve robust, data-efficient results on diverse PDE forecasting tasks including Navier–Stokes (R9), shallow-water equations, non-relativistic and relativistic Maxwell equations in both Euclidean and Minkowski space:
| Task | Metric | Best C-CSCNN Result | Comparison |
|---|---|---|---|
| Navier–Stokes | 1-step MSE | C-CSCNN halves MSE of plain CS-CNN with 512 trajectories | Outperforms ResNet, Transolver, Swin-Transformer, FNO |
| Shallow Water SWE | 5-step relative L0 | C-CSCNN-L: 2.94% (55M params); C-CSCNN-S: 3.51% (10M) | Better or competitive with much larger baselines |
| Maxwell (3D) | 1-step field MSE | C-CSCNN halves error vs. CS-CNN and outperforms FNO/G-FNO | - |
| Maxwell (R1) | 1-stepfield MSE | Similar data-efficiency gains across data regimes | - |
In all tasks, both CSCNNs and C-CSCNNs maintain relative equivariance error 2 and error maps demonstrate lower transient and stable long-horizon errors compared to baselines. Notably, C-CSCNNs achieve improved data efficiency, enabling smaller models to match or surpass much larger transformer- and FNO-based architectures (Szarvas et al., 15 Oct 2025, Zhdanov et al., 2024).
6. Limitations, Computational Cost, and Extensions
CS-CNNs currently support only full multivector (Cl(3)) representations and do not directly realize all 4 irreducible representations (e.g., spinors). Some kernel degrees of freedom require stacking multiple layers due to the implicit parameterization, and practical implementation on noncompact 5 groups necessitates grid truncation, leading to approximate equivariance under large boosts.
Feature dimensionality scales as 6 per channel (with 7), and geometric-product layers are computationally more expensive than conventional ReLU nonlinearities, though tractable up to 8. Prospective extensions include applying alternative Clifford-based nonlinearities, conditioning at local or hierarchical levels, adaptations to pseudo-Riemannian manifolds, and the construction of physics-informed architectures, e.g., using steerable PDE operators. Conditioning on global or learned multivector summaries further improves the kernel basis, enabling stable and expressive modeling of a broader class of physical phenomena (Szarvas et al., 15 Oct 2025, Zhdanov et al., 2024).
7. Impact and Future Research Directions
Clifford-Steerable CNNs provide a unified, E(9)-equivariant deep learning framework grounded in Clifford algebra and group theory, enabling exact symmetry-preserving modeling of fields with diverse geometric content across varied pseudo-Euclidean spaces. Conditional kernel mechanisms resolve basis incompleteness while maintaining computational tractability, expanding applicability to challenging PDEs and physical forecasting problems with notable gains in accuracy, efficiency, and long-term stability.
Possible future research includes: local/hierarchical or adaptive conditioning, expansion to curved manifold scenarios via parallel-transport convolution, and application domains such as molecular dynamics, geophysical flows, and high-energy physics on space-times of different signatures. This suggests continued investigation of Clifford-algebraic neural architectures and generalized symmetry-informed networks for scientific machine learning (Szarvas et al., 15 Oct 2025, Zhdanov et al., 2024).