Sphere2Vec: Spherical Positional Encoding

Updated 18 February 2026

Sphere2Vec is a family of multi-scale spherical positional encoders that represent geospatial points as high-dimensional vectors while preserving great-circle distances.
It leverages a Double Fourier Sphere framework with sinusoidal basis functions and scales to capture both global and local spatial context without projection distortions.
Empirical evaluations show that Sphere2Vec achieves improved accuracy in synthetic and real-world datasets, particularly in challenging polar and sparse regions.

Sphere2Vec is a family of multi-scale positional encoders designed to represent point locations on a spherical surface as high-dimensional vectors while explicitly preserving great-circle (geodesic) distances. Developed to address the shortcomings of planar or Euclidean-based encodings when applied to global-scale geospatial prediction and classification, Sphere2Vec constructs location embeddings that avoid map projection distortion and maintain invariants important for geo-aware machine learning. This methodology is grounded in the Double Fourier Sphere (DFS) framework, enabling injective, analytically tractable, and scalable encodings for downstream neural models in tasks involving species distribution, remote sensing, and more (Mai et al., 2022, Mai et al., 2023).

1. Motivation: Limitations of Planar and Euclidean Encodings

Geospatial coordinates, naturally distributed over the Earth's surface, reside on a sphere rather than a plane or in $\mathbb{R}^3$ . Traditional methods, including Space2Vec, grid-cell encoders, or NeRF-style positional encodings, interpret latitude $\phi$ and longitude $\lambda$ as planar $(x, y)$ or embed them into $\mathbb{R}^3$ . These approaches inherently introduce two principal sources of distortion:

Map projection distortion: Any planar mapping of the sphere misrepresents true surface (great-circle) distances, particularly at high latitudes, leading to discrepancies between encoded metric and geodesic reality. As a result, location-aware models can yield biased predictions in polar and data-sparse regions.
Spherical-to-Euclidean error: Embedding points as $(\cos\phi\cos\lambda, \cos\phi\sin\lambda, \sin\phi)$ in $\mathbb{R}^3$ preserves inner products only at scale-0 (unit sphere), but multi-scale Fourier expansions on axes fail to mimic the spherical law of cosines essential for distance preservation at arbitrary scales (Mai et al., 2023).

Sphere2Vec responds by supplying positional encodings $\mathrm{PE}: \mathbb{S}^2 \rightarrow \mathbb{R}^d$ with analytic guarantees of spherical distance preservation and injectivity, regardless of position or density on the globe (Mai et al., 2022).

2. Mathematical Foundations and Encoding Schemes

For two points $p = (\phi_1, \lambda_1)$ , $q = (\phi_2, \lambda_2)$ on a sphere of radius $R$ , the geodesic (great-circle) distance is

$d_s(p, q) = R \arccos\left[ \sin\phi_1\sin\phi_2 + \cos\phi_1\cos\phi_2\cos(\lambda_1 - \lambda_2)\right].$

Sphere2Vec encodes locations via multi-scale sinusoidal basis functions parameterized by frequency bands $\omega_s$ , where each scale $s$ enables the capture of both global and local spatial context. The family of encoders includes:

sphereC (Canonical): At each scale $s$ , compute

$[\sin(\omega_s\phi),\ \cos(\omega_s\phi)\cos(\omega_s\lambda),\ \cos(\omega_s\phi)\sin(\omega_s\lambda)]$

and concatenate across $S$ scales for a $3S$-dimensional vector.

sphereM: Adds explicit high-frequency mixing terms for additional expressivity.
sphereC+ and sphereM+: Concatenate the corresponding spherical encoding with classical 2D “grid” (planar) terms: $[\sin(\omega_s\phi), \cos(\omega_s\phi), \sin(\omega_s\lambda), \cos(\omega_s\lambda)]$ for each scale.
sphereDFS: A high-dimensional embedding that includes all two-way products of frequency bands up to $S-1$ in both latitude and longitude, mirroring the complete DFS basis from pseudospectral analysis. The dimension is $4S^2 + 4S$ (Mai et al., 2022, Mai et al., 2023).

The scaling parameters are typically geometrically spaced between a minimum and maximum frequency:

$\omega_s = \min \cdot \left(\frac{\max}{\min}\right)^{s/(S-1)}, \quad s = 0,\ldots,S-1.$

3. Theoretical Guarantees: Distance Preservation and Injectivity

Sphere2Vec encodings are constructed so that, at the coarsest scale ( $S=1$ ), the three-dimensional embedding produces inner products that are strictly monotonic in the true spherical distance:

$\langle \mathrm{PE}_1(p), \mathrm{PE}_1(q) \rangle = \sin\phi_1\sin\phi_2 + \cos\phi_1\cos\phi_2\cos(\lambda_1 - \lambda_2) = \cos\left(\frac{d_s(p, q)}{R}\right),$

which ensures that

$\left\|\mathrm{PE}_1(p) - \mathrm{PE}_1(q)\right\|_2 = 2 \sin\left(\frac{d_s(p, q)}{2R}\right)$

and for small $d_s$ , this is approximately linear in $d_s/R$ . Multi-scale concatenations preserve injectivity: no two distinct points map to the same vector. Full proofs are provided in (Mai et al., 2022, Mai et al., 2023); these results do not hold for 2D “wrap” or planar encoders, nor for grid-cell-based encodings.

4. Implementation and Computational Aspects

Sphere2Vec encodings are non-learned—each basis expansion is analytic with $O(d)$ cost per point, where $d$ is the embedding dimension. Typical configurations are:

$S=32$ for sphereC, sphereM, etc. ( $d=96$ , $160$, $192$, $256$)
$S=8$ for sphereDFS ( $d=4S^2+4S=256$ )

The embedding process consists of:

Converting $\phi, \lambda$ to radians.
Computing $\omega_s$ for $s\in[0, S-1]$ .
Generating the basis function terms (as per the encoding choice).
Concatenating across scales.

Location embeddings are then integrated into downstream models, typically as input to an FFN or as an auxiliary “geo-prior” branch. No normalization is needed beyond the bounded range of trigonometric functions.

5. Empirical Performance and Evaluation

Synthetic Benchmarks

On synthetic datasets defined by mixtures of von Mises–Fisher distributions sampled across the sphere (20 datasets with varying class and concentration structure), Sphere2Vec variants (notably sphereM+, sphereC+) outperform grid, planar wrap, xyz (Euclidean $\mathbb{R}^3$ ), NeRF-style, and RBF/random Fourier features across all metrics. In particular, maximal absolute Top-1 accuracy gains are observed up to 2%, with error rate reductions up to 30.8%. Gains are especially pronounced for class-conditional distributions concentrated at high latitudes or in sparse regions (Mai et al., 2023).

Real-World Geospatial Prediction and Classification

Sphere2Vec has been evaluated on seven global-scale datasets, including fine-grained species recognition (BirdSnap, NABirds, iNat2017/2018), remote sensing (fMoW), and Flickr YFCC-GEO100. Summary results:

All Sphere2Vec variants systematically outperform 2D/3D baselines in location-only and image+location settings, with mean Reciprocal Rank (MRR) and Top-1 accuracy gains of 0.3–1.0% and robust improvements in polar/sparse regimes.
Qualitative analyses (e.g., Arctic fox predicted distributions) confirm that Sphere2Vec produces sharply localized, geodesically accurate probability maps, compared to spread or overgeneralized baselines.
Latitude-band stratification demonstrates that performance superiority is magnified in bands above $80^\circ$ latitude or below $-80^\circ$ (Mai et al., 2022, Mai et al., 2023).

6. Limitations and Directions for Further Research

Sphere2Vec encodes points on a perfect sphere; as such, it does not directly account for ellipsoidal shape (WGS84) or altitude. The selection of scale parameters $(\min, \max, S)$ affects locality and receptive field, but defaults are robust. The O( $S^2$ ) complexity of the full DFS basis may present memory/performance trade-offs for very high resolutions.

Potential improvements include:

Learning frequency bands $\omega_s$ end-to-end.
Extending encoders to ellipsoidal coordinates or incorporating vertical dimension.
Integrating with graph-based geographic kernels or positional encodings for global Earth system models.
Adapting the framework for sequence and trajectory modeling on the sphere, and for spatially structured transformers (Mai et al., 2023).

7. Broader Impact and Applicability

Sphere2Vec represents a principled solution to a longstanding limitation in geospatial AI: the reliable, distortion-free embedding of locations for machine learning on global-scale data. Its analytic construction ensures the preservation of key geometric invariants, leading to measurable gains in diverse applications, especially where spatial sparsity or polar distribution is significant. A plausible implication is that Sphere2Vec, by providing a universal geospatial positional encoding, stands to become an essential component in future foundation models for biodiversity prediction, climate modeling, and beyond (Mai et al., 2023).

Markdown Report Issue Upgrade to Chat

References (2)

Sphere2Vec: Multi-Scale Representation Learning over a Spherical Surface for Geospatial Predictions (2022)

Sphere2Vec: A General-Purpose Location Representation Learning over a Spherical Surface for Large-Scale Geospatial Predictions (2023)

Topic to Video (Beta)

No one has generated a video about this topic yet.

Whiteboard

No one has generated a whiteboard explanation for this topic yet.

Follow Topic

Get notified by email when new papers are published related to Sphere2Vec.

Sphere2Vec: Spherical Positional Encoding

1. Motivation: Limitations of Planar and Euclidean Encodings

2. Mathematical Foundations and Encoding Schemes

3. Theoretical Guarantees: Distance Preservation and Injectivity

4. Implementation and Computational Aspects

5. Empirical Performance and Evaluation

Synthetic Benchmarks

Real-World Geospatial Prediction and Classification

6. Limitations and Directions for Further Research

7. Broader Impact and Applicability

Topic to Video (Beta)

Whiteboard

Follow Topic

Continue Learning

Don't miss out on important new AI/ML research

Sphere2Vec: Spherical Positional Encoding

1. Motivation: Limitations of Planar and Euclidean Encodings

2. Mathematical Foundations and Encoding Schemes

3. Theoretical Guarantees: Distance Preservation and Injectivity

4. Implementation and Computational Aspects

5. Empirical Performance and Evaluation

Synthetic Benchmarks

Real-World Geospatial Prediction and Classification

6. Limitations and Directions for Further Research

7. Broader Impact and Applicability

Topic to Video (Beta)

Whiteboard

Follow Topic

Continue Learning

Related Topics

Don't miss out on important new AI/ML research

Sign up for free to explore the frontiers of research