Coordinates Encoding Module (OPE)

Updated 10 March 2026

The Coordinates Encoding Module (CEM) is a method that maps spatial coordinates to high-dimensional feature vectors using truncated, orthogonal Fourier bases, which allows precise reconstruction of band-limited image patches.
It integrates local latent codes from a convolutional encoder with analytic OPE vectors, enabling efficient, resolution-agnostic upsampling without additional trainable parameters.
This parameter-free, interpretable approach achieves performance comparable to state-of-the-art methods, offering significant computational and memory efficiency for continuous image synthesis.

A Coordinates Encoding Module (CEM), as instantiated by Orthogonal Position Encoding (OPE) in the OPE-SR framework, provides an analytical, parameter-free mapping from continuous spatial coordinates to high-dimensional feature vectors for continuous image reconstruction, particularly in arbitrary-scale image super-resolution tasks. OPE leverages orthogonal 2D basis functions—truncated real-valued Fourier bases—allowing for precise, lossless reconstruction of band-limited image patches. The upsampling module operates without trainable weights, instead combining local encoder-produced latent codes linearly with coordinate-dependent OPE vectors, enabling efficient, resolution-agnostic image synthesis (Song et al., 2023).

1. Mathematical Foundation of Orthogonal Position Encoding

Orthogonal Position Encoding constructs a Fourier-type orthonormal basis on the domain $[-1,1]^2$. For a maximum frequency $n$, the one-dimensional positional encoding is defined as

$\gamma(x) = [ 1,\ \sqrt{2}\cos(\pi x),\ \sqrt{2}\sin(\pi x),\ \sqrt{2}\cos(2\pi x),\ \sqrt{2}\sin(2\pi x),\ \ldots,\ \sqrt{2}\cos(n\pi x),\ \sqrt{2}\sin(n\pi x) ] \in \mathbb{R}^{2n+1}$

The two-dimensional basis functions for an image patch $f(x,y)$ are derived from the outer product of the one-dimensional encodings:

$e_{i,j}(x,y) = \gamma_i(x)\gamma_j(y)$

where $i,j=0,\ldots,2n$. These basis functions are orthonormal with respect to the normalized $L^{2}([-1,1]^2)$ inner product:

$\langle g,h \rangle = \frac{1}{4} \int_{-1}^1 \int_{-1}^1 g(x,y) h(x,y)\ dx\,dy$

and

$\langle e_{i_1, j_1}, e_{i_2, j_2} \rangle = \delta_{i_1,i_2}\delta_{j_1,j_2}$

ensuring the basis spans band-limited 2D functions up to frequency $n$ per axis (Song et al., 2023).
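
The orthonormality relation can be checked numerically. The following is a minimal NumPy sketch, not code from the OPE-SR paper: the helper names `gamma` and `gram_1d`, the choice $n = 3$, and the 64-node midpoint quadrature are assumptions made for illustration. Because the inner product factorizes over the two axes, the Gram matrix of the 2D basis is taken here as the Kronecker product of the 1D Gram matrix with itself.

```python
import numpy as np

def gamma(x, n):
    """1D encoding [1, √2cos(πx), √2sin(πx), ..., √2cos(nπx), √2sin(nπx)] per sample."""
    x = np.atleast_1d(np.asarray(x, dtype=float))          # (m,)
    ang = np.pi * x[:, None] * np.arange(1, n + 1)          # (m, n), frequencies 1..n
    trig = np.stack([np.cos(ang), np.sin(ang)], axis=-1)    # (m, n, 2)
    trig = np.sqrt(2.0) * trig.reshape(len(x), 2 * n)       # interleave cos, sin
    return np.hstack([np.ones((len(x), 1)), trig])          # (m, 2n+1)

def gram_1d(n, m=64):
    """Approximate (1/2) * integral of gamma_i * gamma_k over [-1, 1] by the midpoint rule."""
    x = -1.0 + (2.0 * np.arange(m) + 1.0) / m               # m equally spaced midpoints
    G = gamma(x, n)
    return G.T @ G / m

n = 3                                                        # assumed maximum frequency
G1 = gram_1d(n)                                              # (2n+1, 2n+1)
G2 = np.kron(G1, G1)                                         # Gram matrix of e_{i,j} under (1/4)∬
print(np.abs(G1 - np.eye(2 * n + 1)).max())
print(np.abs(G2 - np.eye((2 * n + 1) ** 2)).max())
```

Both printed deviations from the identity should be at the level of floating-point round-off, since the midpoint rule integrates trigonometric polynomials of this degree exactly over a full period.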

2. Mapping Coordinates to OPE Feature Vectors

For each continuous spatial coordinate $(x,y) \in [-1,1]^2$, the OPE mapping proceeds in the following steps:

Compute $\gamma(x)$ and $\gamma(y)$
Form the outer product $\gamma(x)^{\top}\gamma(y) \in \mathbb{R}^{(2n+1)\times(2n+1)}$
Flatten $\gamma(x)^{\top}\gamma(y)$ into the OPE vector $\mathrm{OPE}(x,y) \in \mathbb{R}^{(2n+1)^2}$

Each entry of $\mathrm{OPE}(x,y)$ is a product of cosines/sines at different frequencies, encoding both low- and high-frequency spatial structure. This direct analytic mapping ensures that any $(x,y)$ can be queried at arbitrary resolution, providing a consistent interface irrespective of output image size.
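
The three steps above can be sketched in a few lines of NumPy. This is an illustrative sketch under stated assumptions rather than the reference implementation: the function names `gamma` and `ope` are hypothetical, and the row-major flattening order (index $i(2n+1)+j$ for entry $\gamma_i(x)\gamma_j(y)$) is one possible convention.

```python
import numpy as np

def gamma(x, n):
    """1D encoding of a scalar x: [1, √2cos(πx), √2sin(πx), ..., √2cos(nπx), √2sin(nπx)]."""
    ang = np.pi * np.arange(1, n + 1) * x                    # (n,)
    trig = np.stack([np.cos(ang), np.sin(ang)], axis=1)      # (n, 2)
    return np.concatenate([[1.0], np.sqrt(2.0) * trig.ravel()])

def ope(x, y, n):
    """Flattened outer product; entry i*(2n+1)+j equals gamma_i(x) * gamma_j(y)."""
    outer = np.outer(gamma(x, n), gamma(y, n))               # (2n+1, 2n+1)
    return outer.ravel()                                     # assumed row-major order

v = ope(0.25, -0.5, n=3)
print(v.shape)   # one feature vector of length (2n+1)^2 per queried coordinate
```

The vector depends only on the coordinate and on $n$, so it can be computed for any query location without reference to an output grid.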

3. Structure and Workflow of the OPE-Upscale (Coordinates Encoding) Module

The OPE-Upscale Module operates as follows:

Encoder: A conventional convolutional backbone (e.g., EDSR-base, RDN) processes the input LR image, yielding a feature map $Z \in \mathbb{R}^{H \times W \times C}$, where $C = 3(2n+1)^2$. Each location's feature is interpreted as the latent code $z$ for a local patch.
Upsampling: For each target HR pixel $q$:
1. Determine its continuous coordinate.
2. Identify the four nearest encoder locations and their latent codes $z_t$, $t \in \{00, 01, 10, 11\}$.
3. Compute relative patch coordinates $(x'_t, y'_t)$ for each neighbor, that is, the offset of the query coordinate from the center of that neighbor's patch, expressed in the basis domain $[-1,1]^2$.
4. Generate $\mathrm{OPE}(x'_t, y'_t)$.
5. For each color channel $c \in \{R, G, B\}$, reconstruct $I_{t,c} = z_{t,c} \cdot \mathrm{OPE}(x'_t, y'_t)$, where $z_{t,c} \in \mathbb{R}^{(2n+1)^2}$ is the block of $z_t$ belonging to channel $c$.
6. Aggregate values with bilinear-style weights $w_t$ proportional to relative areas:

$I_c(q) = \sum_{t} w_t\, I_{t,c}, \qquad \sum_{t} w_t = 1$

This architecture achieves spatial smoothness through patch ensembles and enables continuous, arbitrary-scale super-resolution, as each HR pixel is a deterministic function of $(x,y)$, local latent codes, and known basis functions (Song et al., 2023).
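
The per-pixel workflow can be illustrated with a small NumPy sketch. Several details here are assumptions rather than facts taken from the paper: the latent layout `Z[H, W, 3, (2n+1)**2]`, query coordinates expressed in LR-grid units with latent code `(r, c)` centered at `(r, c)`, relative coordinates taken as the plain offset to each neighbor (so each patch is assumed to span two grid cells), and standard bilinear area weights. The function names are hypothetical, and the coordinate normalization used in OPE-SR may differ.

```python
import numpy as np

def gamma(x, n):
    """1D encoding of a scalar x."""
    ang = np.pi * np.arange(1, n + 1) * x
    trig = np.stack([np.cos(ang), np.sin(ang)], axis=1)
    return np.concatenate([[1.0], np.sqrt(2.0) * trig.ravel()])

def ope(x, y, n):
    """Flattened outer product of the two 1D encodings."""
    return np.outer(gamma(x, n), gamma(y, n)).ravel()

def render_pixel(Z, qy, qx, n):
    """Render one RGB value at query (qy, qx), given in LR-grid units (assumed convention)."""
    H, W = Z.shape[:2]
    r0 = int(np.clip(np.floor(qy), 0, H - 2))    # top-left of the four neighbors
    c0 = int(np.clip(np.floor(qx), 0, W - 2))
    rgb = np.zeros(3)
    for r in (r0, r0 + 1):
        for c in (c0, c0 + 1):
            dy, dx = qy - r, qx - c              # relative coordinates, assumed in [-1, 1]
            w = (1 - abs(dy)) * (1 - abs(dx))    # area-based bilinear weight; weights sum to 1
            rgb += w * (Z[r, c] @ ope(dx, dy, n))  # linear combination, no trainable weights
    return rgb

n = 3
Z = np.zeros((4, 4, 3, (2 * n + 1) ** 2))
Z[..., 0] = [0.2, 0.5, 0.8]                      # only the constant (DC) coefficient per channel
print(render_pixel(Z, 1.3, 2.7, n))
```

With only the constant coefficient set, every neighbor predicts the same color and the weights sum to one, so the query returns that color regardless of position. This gives a quick sanity check on the weighting.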

4. Parameter-Free, Resolution-Agnostic Upsampling

The Coordinates Encoding Module introduces no trainable parameters in the upsampling or decoding stage. All weights are confined to the encoder. Once local latent codes are extracted, every output pixel is computed analytically via basis evaluation and linear operations. There is no learned neural decoder (e.g., MLPs typical of implicit neural representations), making the upsampling "parameter-free." Arbitrary output resolutions are possible as $(x,y)$ coordinates may be sampled at any density, unconstrained by training-time resolution or grid structure (Song et al., 2023).
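
To make the resolution-agnostic property concrete, the sketch below renders one patch from the same coefficient block at two sampling densities. It is a hedged illustration: `render_patch` is a hypothetical helper, the random coefficients merely stand in for one color channel of an encoder latent code, and pixel centers are assumed to lie at the midpoints of a uniform grid on $[-1,1]$.

```python
import numpy as np

def gamma(x, n):
    """1D encoding for an array of samples; returns shape (m, 2n+1)."""
    x = np.atleast_1d(np.asarray(x, dtype=float))
    ang = np.pi * x[:, None] * np.arange(1, n + 1)
    trig = np.stack([np.cos(ang), np.sin(ang)], axis=-1)
    trig = np.sqrt(2.0) * trig.reshape(len(x), 2 * n)
    return np.hstack([np.ones((len(x), 1)), trig])

def render_patch(coeffs, size, n):
    """Evaluate sum_ij coeffs[i, j] * gamma_i(x) * gamma_j(y) at size x size pixel centers.

    Axis 0 of the result indexes x and axis 1 indexes y.
    """
    t = -1.0 + (2.0 * np.arange(size) + 1.0) / size   # pixel centers in [-1, 1]
    G = gamma(t, n)                                   # (size, 2n+1)
    return G @ coeffs @ G.T

n = 3
rng = np.random.default_rng(0)
coeffs = rng.normal(size=(2 * n + 1, 2 * n + 1))      # stand-in for one channel of a latent code
low = render_patch(coeffs, 4, n)                      # 4 x 4 samples
high = render_patch(coeffs, 12, n)                    # 12 x 12 samples, same coefficients
print(np.abs(high[1::3, 1::3] - low).max())
```

The centers of the 4-sample grid coincide with every third center of the 12-sample grid, so the two renderings agree at those locations. Changing the output size only changes where the fixed analytic function is sampled.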

5. Orthogonality, Completeness, and Reconstruction Guarantees

The orthonormality of the OPE basis functions guarantees that, for functions band-limited to frequency $n$ per axis, the truncated reconstruction

$f(x,y) = \sum_{i=0}^{2n} \sum_{j=0}^{2n} a_{i,j}\, e_{i,j}(x,y)$

with $a_{i,j} = \langle f, e_{i,j} \rangle$ is exact within this subspace. In practice, the encoder's latent code $z$ stores an approximate block of these coefficients over each grid cell. Upon querying, the OPE basis is evaluated locally at arbitrary spatial locations, extracting the appropriate mixture of coefficients for continuous patch reconstruction. For sufficiently large $n$ (typically $n = \ldots$ or $n = \ldots$), local details are preserved up to the Nyquist frequency of the encoder's output. Increasing $n$ beyond this range can recover finer structure if encoded, but may introduce ringing artefacts if higher frequencies are not captured by the backbone (Song et al., 2023).
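
The exactness claim can be illustrated numerically. In the sketch below, the test signal `f`, the choice $n = 3$, and the 64-node midpoint quadrature are assumptions chosen for illustration. The coefficients are obtained by projecting `f` onto the basis with the normalized inner product, and the reconstruction is then compared with `f` at random points.

```python
import numpy as np

def gamma(x, n):
    """1D encoding for an array of samples; returns shape (m, 2n+1)."""
    x = np.atleast_1d(np.asarray(x, dtype=float))
    ang = np.pi * x[:, None] * np.arange(1, n + 1)
    trig = np.stack([np.cos(ang), np.sin(ang)], axis=-1)
    trig = np.sqrt(2.0) * trig.reshape(len(x), 2 * n)
    return np.hstack([np.ones((len(x), 1)), trig])

def f(x, y):
    # Test signal containing only frequencies <= 3 per axis
    return 0.3 + np.cos(2 * np.pi * x) * np.sin(np.pi * y) - 0.5 * np.sin(3 * np.pi * x)

n, m = 3, 64
t = -1.0 + (2.0 * np.arange(m) + 1.0) / m         # midpoint quadrature nodes on [-1, 1]
G = gamma(t, n)                                   # (m, 2n+1)
F = f(t[:, None], t[None, :])                     # F[a, b] = f(x_a, y_b)
A = G.T @ F @ G / m**2                            # A[i, j] approximates <f, e_{i,j}>

rng = np.random.default_rng(0)
xs, ys = rng.uniform(-1.0, 1.0, size=(2, 1000))   # off-grid query points
recon = np.einsum("pi,ij,pj->p", gamma(xs, n), A, gamma(ys, n))
max_err = np.max(np.abs(recon - f(xs, ys)))
print(max_err)
```

For this band-limited signal the reconstruction error should sit at round-off level. If `f` contained a frequency above $n$, the projection would discard that component and the error would no longer vanish.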

6. Practical Implications, Efficiency, and Interpretability

The OPE-based CEM replaces the traditional MLP decoder commonly used in coordinate-based implicit representations with an entirely analytical, interpretable mechanism. The analytic, orthogonal design assures flip-consistent and efficient decoding, resulting in high computational and memory efficiency relative to state-of-the-art (SOTA) SR models. In experimental evaluations, the OPE-Upscale module achieves results comparable to SOTA methods, while offering a concise SR framework with practical advantages in efficiency and resource consumption (Song et al., 2023). This suggests a broader utility of orthogonal, parameter-free coordinate encoding in continuous signal reconstruction tasks beyond image super-resolution.

References

Song et al. (2023). OPE-SR: Orthogonal Position Encoding for Designing a Parameter-free Upsampling Module in Arbitrary-scale Image Super-Resolution.
