Neural Representation with Spherical Harmonics

Updated 17 March 2026

Continuous neural representation with spherical harmonics is a framework that uses an orthonormal basis to effectively model and interpolate functions on the sphere.
It employs spectral transforms and learnable weights to achieve high accuracy, rotation-equivariance, and stability near coordinate singularities.
Hybrid architectures merge neural parameterizations with traditional SH methods, enabling efficient operator learning and precise interpolation in spatial applications.

Continuous neural representation with spherical harmonics refers to the construction of neural function approximators on the sphere $\mathbb{S}^2$ (and higher-dimensional spheres) using the spherical-harmonic basis for both theoretical and practical advances. This paradigm produces neural models capable of learning, representing, and interpolating functions that are continuous, globally valid, and potentially equivariant under $\mathrm{SO(3)}$ (rotations). Spherical harmonics offer a mathematically principled, completeness-guaranteed, and non-redundant basis for $L^2$ -integrable functions on the sphere, enabling neural architectures to natively respect spherical geometry, avoid coordinate artifacts (e.g., pole singularities), and achieve high accuracy for spatial and spatio-angular data.

1. Mathematical Foundation of Spherical Harmonics in Neural Representations

Spherical harmonics, denoted $Y_\ell^m(\theta, \phi)$ (complex form) or $Y_{\ell, m}(\theta, \phi)$ (real form), form an orthonormal basis for square-integrable functions on $\mathbb{S}^2$ . Each basis function is indexed by degree $\ell \ge 0$ and order $m \in [-\ell, \ell]$ : $Y_\ell^m(\theta, \phi) = \sqrt{\frac{2\ell+1}{4\pi} \frac{(\ell - m)!}{(\ell + m)!}} P_\ell^m(\cos\theta) e^{i m \phi},$ with the associated Legendre polynomial $P_\ell^m$ . Orthonormality holds: $\mathrm{SO(3)}$ 0 Any $\mathrm{SO(3)}$ 1 can be expanded as

$\mathrm{SO(3)}$ 2

where $\mathrm{SO(3)}$ 3 are SH coefficients, and $\mathrm{SO(3)}$ 4 is the truncation.

Numerically robust evaluation of $\mathrm{SO(3)}$ 5 and their derivatives is critical for neural networks; implementations such as sphericart provide $\mathrm{SO(3)}$ 6 complexity for each sample and stable Cartesian recurrences for gradients (Bigi et al., 2023).

2. Spherical Harmonic Transforms and Spectral Neural Architectures

A continuous neural representation on the sphere is constructed by alternating between the spatial domain and the frequency (SH) domain. The forward SHT projects sample values onto SH coefficients (typically by least squares). In matrix form, for $\mathrm{SO(3)}$ 7 sampled directions $\mathrm{SO(3)}$ 8: $\mathrm{SO(3)}$ 9 where $L^2$ 0 contains $L^2$ 1. The inverse SHT (ISHT) maps coefficients to sampled values: $L^2$ 2 Spectral convolution is realized by filtering (Hadamard multiplication) in SH space with learnable spectral weights, followed by ISHT (Chen et al., 2023). For example, with a zonal kernel (only $L^2$ 3 coefficients nonzero), per-coefficient modulation implements rotation-equivariant convolution.

Clebsch–Gordan Nets perform all operations—including quadratic nonlinearity—entirely in the SH domain using the CG transform for tensor product decomposition, achieving exact $L^2$ 4 equivariance by construction (Kondor et al., 2018).

3. Operator Learning and Green’s Functions Formulation

Modern spherical neural operator architectures employ spectral integral operators derived from Green’s function expansions. Any Green’s function $L^2$ 5 solving $L^2$ 6 can be decomposed spectrally: $L^2$ 7 with eigenvalues $L^2$ 8 of $L^2$ 9. This enables operator learning via

$Y_\ell^m(\theta, \phi)$ 0

with trainable spectral weights $Y_\ell^m(\theta, \phi)$ 1, generalizing classical convolution (which restricts to per- $Y_\ell^m(\theta, \phi)$ 2). GSNO (Generalized Spherical Neural Operator) blocks introduce relative (SO(3)-equivariant) and absolute (position-dependent) spectral filtering, enabling controlled modeling of anisotropy and spatially localized features. The GSHNet architecture implements a U-Net style spectral hierarchy, handling multi-scale signal structure via band-limited SH representations and up/down-sampling in the frequency domain (Tang et al., 11 Dec 2025).

4. Neural Parameterizations of Spherical Harmonic Coefficients

Instead of training a separate SH series per spatial or spatio-angular location, coordinate-based neural parameterizations yield continuous representations. The Neural Spherical Harmonics (NeSH) approach uses an MLP whose input is a positional encoding of coordinates $Y_\ell^m(\theta, \phi)$ 3; output are all $Y_\ell^m(\theta, \phi)$ 4 up to degree $Y_\ell^m(\theta, \phi)$ 5. The reconstructed signal is

$Y_\ell^m(\theta, \phi)$ 6

for $Y_\ell^m(\theta, \phi)$ 7. Training proceeds on measured data $Y_\ell^m(\theta, \phi)$ 8 using a data-fidelity term (e.g., smooth L1) and sparsity regularization on coefficients. Once trained, the network enables continuous upsampling in both space and angle, denoising, and improved coherence, as shown for diffusion MRI signals (Hendriks et al., 2023).

5. Hybrid Architectures and Spherical Harmonics as Positional Embeddings

Another prominent design encodes input spherical coordinates $Y_\ell^m(\theta, \phi)$ 9 via truncated SH expansions (vector $Y_{\ell, m}(\theta, \phi)$ 0). Feeding this vector into a neural network—for example, a Sinusoidal Representation Network (SIREN) block—yields a representation expressible as a composition of globally smooth, spectrally controlled basis functions and learned nonlinear transformations. Empirical results in geospatial tasks, synthetic benchmarks, and multi-task regression demonstrate that SH embedding (either with only a linear layer or with a shallow SIREN) achieves state-of-the-art generalization and stability near the poles, outperforming rectangular Fourier (DFS) encodings (Rußwurm et al., 2023). The spatial resolution is tunable by $Y_{\ell, m}(\theta, \phi)$ 1, and SIREN layers extend frequency expressiveness beyond linear SH models.

6. Theoretical Approximation and Learning Properties

Function approximation on the sphere via neural networks is strongly linked to the properties of truncated SH spaces and their reproducing kernels. For functions in the Sobolev space $Y_{\ell, m}(\theta, \phi)$ 2, deep convolutional-spline architectures achieve optimal uniform rates by approximating the SH expansion with convolutionally constructed features (Toeplitz filters implement $Y_{\ell, m}(\theta, \phi)$ 3), followed by shallow nonlinear modules mapping $Y_{\ell, m}(\theta, \phi)$ 4 polynomial kernel features (Fang et al., 2020). The total number of parameters scales linearly with network depth for a given approximation error $Y_{\ell, m}(\theta, \phi)$ 5. Additive ridge functions (i.e., $Y_{\ell, m}(\theta, \phi)$ 6) can be efficiently approximated in this framework, matching univariate complexity per component.

7. Computational and Implementation Considerations

Real-valued SH are preferred for many neural implementations due to numerical advantages; vectorized, stable Cartesian tensor recursions improve both performance and gradient propagation (Bigi et al., 2023).
GPU-optimized libraries (e.g., sphericart) permit efficient large-batch evaluation and automatic differentiation.
Computational cost per point is $Y_{\ell, m}(\theta, \phi)$ 7; batching and parallelization are necessary for high $Y_{\ell, m}(\theta, \phi)$ 8 or large datasets.
When used as neural layers or features, SH are combined with learnable spectral weights or as input to MLP blocks; parameter count and expressiveness are tunably traded off via $Y_{\ell, m}(\theta, \phi)$ 9 and network width/depth.
Best practices include using scaled (solid) harmonics, integrating with radial basis filters for point cloud or graph networks, and exploiting batched SHT/ISHT operations for convolutional networks.

In summary, continuous neural representation with spherical harmonics is a versatile, theoretically grounded, and practically effective framework for constructing spherical neural networks and neural operators. SH-based models deliver precise, equivariant, and pole-stable representations, support fast computation, and exhibit universal approximation properties, making them core to advances in spherical machine learning, operator learning, and spherical scientific applications (Chen et al., 2023, Tang et al., 11 Dec 2025, Hendriks et al., 2023, Rußwurm et al., 2023, Kondor et al., 2018, Fang et al., 2020, Bigi et al., 2023).