Papers
Topics
Authors
Recent
Search
2000 character limit reached

OmniSplat: Real-Time 3D Gaussian Splatting

Updated 2 June 2026
  • OmniSplat is a feed-forward 3D Gaussian Splatting framework that generates 3D radiance fields from sparse omnidirectional images in a single neural inference step.
  • It employs a Yin–Yang grid decomposition to counter non-uniform sampling and uneven Gaussian distribution challenges inherent in 360° imagery.
  • By leveraging pre-trained perspective CNNs, OmniSplat achieves real-time, editable 3D scene acquisition without the need for per-scene optimization.

OmniSplat is a feed-forward 3D Gaussian Splatting (3DGS) framework developed to directly generate 3D radiance field representations from sparse omnidirectional (360°) images in a single neural inference step, eliminating the requirement for per-scene optimization and specialized omnidirectional training data. The system leverages the Yin–Yang grid decomposition to circumvent the geometric and sampling challenges inherent to omnidirectional images, enabling the reuse of convolutional neural networks (CNNs) pre-trained on perspective datasets and facilitating real-time, editable 3D scene acquisition (Lee et al., 2024).

1. Feed-Forward 3D Gaussian Splatting and Omnidirectional Challenges

Feed-forward 3D Gaussian Splatting systems such as PixelSplat and MVSplat take a small number of perspective (pinhole) image inputs and directly predict a point-based radiance field, where each point is parameterized by a mean position μ\mu, covariance Σ\Sigma, opacity α\alpha, and a set of spherical harmonics color coefficients cc. These methods are an order of magnitude faster than iterative per-scene optimization pipelines (e.g., ODGS), as they avoid computationally-intensive refinement.

Adapting feed-forward 3DGS architectures to omnidirectional imagery, such as equirectangular or fisheye images, introduces two critical issues:

  • Non-uniform Sampling: Equirectangular projections sample latitude (near the poles) more densely than the equator, producing feature distortions when processed with standard CNNs.
  • Uneven Gaussian Distribution: Predicting 3DGS parameters from these distorted features yields point clouds denser at the poles and sparser at the equator. The result is "stripe" and "dot" artifacts during novel-view synthesis due to over- and under-sampling, respectively.

2. Yin–Yang Grid Decomposition

To mitigate the domain shift between perspective and omnidirectional images, OmniSplat employs the “Yin–Yang” overset grid decomposition. The sphere is partitioned into two overlapping, quasi-uniform patches: the Yin patch and the Yang patch.

2.1 Mathematical Formulation

Let (θ,ϕ)(\theta, \phi) define latitude and longitude on the sphere (θ[π/2,π/2]\theta \in [-\pi/2, \pi/2], ϕ[π,π)\phi \in [-\pi, \pi)).

  • Yin Patch: θ[π/4,π/4]\theta \in [-\pi/4, \pi/4], ϕ[3π/4,3π/4]\phi \in [-3\pi/4, 3\pi/4]
  • Yang Patch: Remainder, obtained by a 9090^\circ rotation Σ\Sigma0 about the X-axis, with

Σ\Sigma1

Each patch is mapped onto a square grid via orthographic reparameterization, minimizing distortion and producing a near-uniform sampling density analogous to perspective images.

2.2 Pixel-to-Sphere Transformation

  • For a Yin patch pixel Σ\Sigma2:

Σ\Sigma3

Σ\Sigma4

  • For the Yang patch, Σ\Sigma5 is first applied to Σ\Sigma6, and the resulting vector is mapped back to Σ\Sigma7 for grid sampling.

This decomposition enables the direct application of unmodified perspective-based CNN architectures to omnidirectional content.

3. Network Architecture and Processing Pipeline

OmniSplat’s workflow is structured as follows:

  1. Image Decomposition: Two omnid
Definition Search Book Streamline Icon: https://streamlinehq.com
References (1)

Topic to Video (Beta)

No one has generated a video about this topic yet.

Whiteboard

No one has generated a whiteboard explanation for this topic yet.

Follow Topic

Get notified by email when new papers are published related to OmniSplat.