Papers
Topics
Authors
Recent
Search
2000 character limit reached

Group Equivariant GSA

Updated 12 March 2026
  • Group Equivariant GSA is a neural network model that combines sparse autoencoding with explicit equivariance, ensuring its latent features transform consistently under group actions.
  • It employs dual losses—invariance to ensure consistent reconstruction across group transformations and equivariance loss to adaptively fit transformation matrices.
  • Empirical evaluations show that these models improve mechanistic interpretability and performance on symmetry-based tasks, with high R² and elevated F1 scores in structured probing.

Group Equivariant Group Sparse Autoencoders (Group Equivariant GSA) are a class of neural network models that combine sparse autoencoding with explicit equivariance to group actions, allowing the discovery and utilization of symmetry-aware latent features. These models extend the sparse autoencoder architecture by enforcing structured constraints reflecting the symmetries present in the data, with the result that learned features exhibit predictable transformations under the action of a specified symmetry group, such as rotations. This approach enhances downstream task performance and mechanistic interpretability, particularly for scientific and structured domains where group symmetries are intrinsic (Erdogan et al., 12 Nov 2025).

1. Group Equivariance and Sparse Autoencoding

Group equivariance refers to the property that a network layer or mapping commutes with the action of a group GG. Formally, for a function ff, group action gGg\in G, and representation ρ\rho, equivariance requires f(ρ(g)x)=ρ(g)f(x)f(\rho(g)\,x) = \rho'(g)\,f(x). In the context of sparse autoencoders, this principle is incorporated by requiring the encoder fθf_\theta and decoder gϕg_\phi of the autoencoder to interact with a family of linear maps MpM^p (one per group element gpg^p) such that the encoded features and reconstructive output transform consistently with GG's action. Specifically, for data xx, autoencoder activations ψ(x)\psi(x), and group element gpg^p, the equivariance constraints are: fθ(ψ(gp ⁣x))=Mpfθ(ψ(x))f_\theta\bigl(\psi(g^p\!\cdot x)\bigr) = M^p\,f_\theta\bigl(\psi(x)\bigr) and

MpD(E(ψ(gpx)))=ψ(gpx)M^p\,D(E(\psi(g^p x))) = \psi(g^p x)

where EE and DD denote encoder and decoder, respectively (Erdogan et al., 12 Nov 2025).

This structured constraint ensures that the learned feature activations correspond to physically or semantically meaningful entities, which follow the same symmetry as the data and underlying domain.

2. Mathematical Formulation and Training Objectives

Group Equivariant GSA models incorporate both an invariance-driven autoencoding loss and a transformation-fitting loss:

  • Invariance Loss: Enforces that all group-augmented versions of an input xx produce decodings matching the canonical features ψ(x)\psi(x),

Linv(θ,ϕ)=ExX1Gp=0G1ψ(x)D(E(ψ(gpx)))22\mathcal{L}_{\mathrm{inv}}(\theta,\phi) = \mathbb{E}_{x\sim\mathcal{X}} \frac{1}{|G|}\sum_{p=0}^{|G|-1} \left\|\psi(x)-D(E(\psi(g^p x)))\right\|_2^2

  • Equivariance Loss: Fits the set of linear maps MpM^p to capture how the base model’s representations transform under group actions,

LM(M)=ExX1Gp=0G1ψ(gpx)Mpψ(x)22\mathcal{L}_M(M) = \mathbb{E}_{x\sim\mathcal{X}} \frac{1}{|G|}\sum_{p=0}^{|G|-1} \left\|\psi(g^p x)-M^p\psi(x)\right\|_2^2

Sparsity in the encoder is enforced via a hard TopK operator, selecting the KK largest activations in the latent vector. These losses are jointly optimized, yielding a family of sparse features whose transformations under GG are explicitly parameterized (Erdogan et al., 12 Nov 2025).

3. Architecture and Implementation Details

A representative Group Equivariant GSA employs the following components (as described for cyclic group rotations G=C4G=C_4):

  • Base model ψ\psi: Either an MLP or CNN autoencoder maps xR64×64x \in \mathbb{R}^{64 \times 64} to d=256d=256 pre-activations.
  • Encoder E=fθE=f_{\theta}: A two-layer MLP, culminating in a TopK-sparsified nn-dimensional vector (n=8×d=2048n=8\times d = 2048).
  • Decoder D=gϕD=g_{\phi}: A single linear layer reconstructing the feature space.
  • Group action maps M={Mp}M = \{M^p\}: Each MpRd×dM^p \in \mathbb{R}^{d \times d} is initialized as identity and trained to map base activations according to the group structure.

The encoder-decoder parameters (θ,ϕ)(\theta,\phi) are updated via Adam optimization to minimize Linv\mathcal{L}_{\mathrm{inv}}, while MM is periodically updated to minimize LM\mathcal{L}_M. No explicit regularization is placed on MM to enforce group representation structure, reflecting a post-hoc, adaptive fit to the observed equivariance in the underlying network (Erdogan et al., 12 Nov 2025).

4. Adaptive Equivariance and Feature Analysis

The adaptive character of the approach lies in the flexibility of MpM^p: they are fit to best explain the behavior of the (possibly non-equivariant) base model activations under group transformation. This mechanism reveals the degree of approximate symmetry already present, and can distinguish invariant from truly equivariant (transforming) directions in the latent space. A quantitative result for a model trained on rotated synthetic images shows that the learned MpM^p explain R2=0.987±0.001R^2 = 0.987 \pm 0.001 of the variance in ψ(gpx)\psi(g^p x) for both MLP and CNN activations, substantially outperforming the naive baseline Mp=IM^p = I (Erdogan et al., 12 Nov 2025).

Downstream, this enables partitioning decoder atoms into strictly invariant and equivariant features, aiding mechanistic interpretability and producing representations attuned to structured probing tasks (e.g., those querying the presence, position, or orientation of an object).

5. Empirical Performance and Probing Results

Empirical evaluation on synthetic image datasets with known symmetry structure demonstrates that Group Equivariant GSA models yield latent representations and reconstructions that are more informative for group-structured probing tasks. In quantitative probing over four grouped binary classification frameworks (detecting shape SS, shape-position SPSP, shape-orientation SOSO, and shape-position-orientation SPOSPO), Equivariant SAEs (Equi-SAE) achieve higher or equal mean F1 scores on both latent and decoder-reconstruction probes compared to regular and wide SAEs:

Model S SP SO SPO
Reg-SAE (recon) 0.93 0.83 0.70 0.30
Wide-SAE (recon) 0.94 0.83 0.65 0.22
Equi-SAE (recon) 0.96 0.89 0.70 0.35

This suggests that incorporating equivariance explicitly in the autoencoder yields more structure-disentangled and semantically meaningful features, particularly for tasks aligned with the symmetry group (Erdogan et al., 12 Nov 2025).

6. Connections to Broader Group-Equivariant Architectures

The Group Equivariant GSA principle is conceptually aligned with other approaches enforcing equivariance in neural networks, such as group equivariant convolutions (Lengyel et al., 2021), equivariant self-attention mechanisms (Romero et al., 2020), and general frameworks for equivariant neural networks on reductive Lie groups (Batatia et al., 2023). However, the focus in Group Equivariant GSA is on interpretability and feature disentanglement, as opposed to predictive modeling alone.

Distinctively, the adaptive mechanism for learning representation matrices MpM^p enables Group Equivariant GSA to work post-hoc on arbitrarily pretrained neural activations, providing a tool for mechanistic analysis even when the original network is not strictly equivariant.

7. Limitations and Future Directions

Group Equivariant GSA models, as presented, learn a set of linear operators MpM^p individually per group element, without explicit enforcement of the group structure (such as Mp+q=MpMqM^{p+q} = M^p M^q or MG=IM^{|G|} = I for cyclic groups). This suggests some inductive bias could be further injected, possibly through parameter tying or regularization. Additionally, the current models employ hard TopK sparsity; alternative sparsity-inducing methods could offer finer control. Scaling to continuous or higher-dimensional groups would require extending the parameterization of MpM^p and possibly integrating more advanced representation-theoretic tools. A plausible implication is that such extensions could bridge the methodology with group-equivariant Lie group neural architectures (Batatia et al., 2023).

Group Equivariant GSA provides an effective and interpretable approach for leveraging group symmetries in feature learning and mechanistic probing, particularly for data domains with explicit group actions. Its effectiveness has been empirically established in synthetic structured data and forms an adaptable foundation for further developments in group-aware unsupervised learning (Erdogan et al., 12 Nov 2025).

Topic to Video (Beta)

No one has generated a video about this topic yet.

Whiteboard

No one has generated a whiteboard explanation for this topic yet.

Follow Topic

Get notified by email when new papers are published related to Group Equivariant GSA.