REMAC: Martian Asymmetrical Image Compression
- The paper demonstrates REMAC's ability to shift computational load from Mars rovers to Earth decoders, reducing encoder FLOPs by 43.51% and improving rate-distortion performance.
- The framework integrates reference-guided entropy modeling and multi-scale ref-decoding to exploit both intra-image and inter-image redundancies in Martian scenes.
- A latent feature recycling mechanism minimizes onboard processing by selecting optimal reference features, enabling efficient compression under strict bandwidth and compute constraints.
Reference-Based Martian Asymmetrical Image Compression (REMAC) is an image compression framework purpose-built for Martian remote sensing scenarios, where the uplink communication bandwidth is severely constrained and rover computational resources are highly limited. REMAC achieves compression by leveraging both intra-image and inter-image similarities specific to Martian scenes, using a reference-based architecture that shifts computational burden predominantly to the terrestrial (Earth-side) decoder. The approach integrates a reference-guided entropy model, an asymmetrical encoder–decoder structure, and a latent feature recycling strategy to reduce rover-side complexity while enhancing compression rates compared to conventional learned codecs (Ding et al., 26 Jan 2026).
1. Martian Image Compression Problem and Similarity Analysis
REMAC formalizes the Martian compression problem as optimizing a constrained rate-distortion objective: subject to the Mars-side encoder's FLOPs not exceeding a strict compute budget . Here, is the expected bitrate, the distortion (often MSE), a Lagrange multiplier, an input Martian image, the bitstream, the reconstructed image, and the encoder and decoder networks.
Extensive quantitative analysis motivates the approach:
- Intra-image texture similarity: BM3D denoising reveals PSNR values surpassing those for Earth images under identical noise, indicating stronger patch self-similarity.
- Inter-image texture similarity: GLCM features yield pairwise distances that are markedly smaller for Martian images, facilitating reference-based strategies.
- Color similarity: CIE LAB a/b channel standard deviations are substantially lower for Martian images (3.00/4.54 vs 8.44/13.95 for Earth), with chrominance clusters more compactly distributed.
- Semantic similarity: VGG16 ReLU-layer embeddings display reduced distances between Martian images.
These empirical findings favor approaches that exploit both intra- and inter-image redundancies in the encoding process.
2. Asymmetrical Encoder–Decoder Architecture and Computational Shifting
The REMAC architecture is strongly asymmetrical, purpose-built to allocate complexity away from the resource-constrained rover encoder to the Earth-side decoder. The Mars-side encoder is characterized by shallow depth, narrow channels, and small-kernel convolutions, tuned to minimize FLOPs. The decoder—operating with minimal resource constraints—employs deep multi-scale modules, large receptive fields, and reference-integration layers.
A summary FLOPs comparison for compressing a Martian image is as follows:
| Method | Encoder FLOPs (G) |
|---|---|
| Hyperprior (Ballé) | 151.33 |
| Minnen et al. | 362.77 |
| Cheng et al. | 699.55 |
| WACNN | 1297.64 |
| REMAC | 395.16 |
Parameter counts: Mars-side encoder ≈ 0.5M, decoder ≈ 5M.
3. Reference-Guided Entropy Module
REMAC's entropy module incorporates side information from candidate reference images to improve probabilistic latent modeling and reduce required code length. The architecture includes:
- Hyper-analysis : Four convolutions and GDN, producing side-latent .
- Hyper-entropy (): Each has two convolutions generating and from .
- Ref-entropy (): Each with two convolutions yielding , from the reference features .
- Concatenation and autoregressive modeling:
followed by slice-wise channel autoregression ().
The conditional latent probability is: where feature map concatenation (not attention) is used for efficiency.
4. Multi-Scale Ref-Decoder and Intra-Image Similarity
The decoder synthesizes high-quality reconstructions by explicitly modeling intra-image similarity via deep multi-scale feature extraction:
- Input-decoder : Four up-sampling convolutions + iGDN with kernels to reconstruct a coarse image .
- Ref-decoder pathway: Adds learned residual using:
- Multi-scale reference analysis networks (each: three serial convs + GDN, yielding an effective receptive field).
- Reference synthesis and residual synthesis .
- Residual-Fusion Block (CRB): Concatenates , two convs with ReLU, skip-connection, and up-sampling as required.
Reconstruction is expressed as:
5. Latent Feature Recycling for Mars-Side Complexity Reduction
To minimize computational overhead during inference, REMAC employs a latent feature recycling mechanism:
- Precompute: For all reference images, compute via the full encoder.
- Inference: For each input , compute deep features once, measure distance to all , select best-matching “deep” reference. Only shallow features are recomputed for selected reference and .
- Pseudocode summary:
1 2 3 4 5 6 7 8 9 10 11 12 |
# Precompute on Mars: for n in 1…N: y_ref[n] ← Encoder_ShallowDeep(x_ref[n]) # During inference: y ← Encoder_ShallowDeep(x) # deep part only once s[n] = ‖y – y_ref[n]‖₁ # deep-feature L1 distance q ← argmin_n s[n] # select “deep” ref j ← pre‐selected shallow ref index y_shallow ← Encoder_Shallow(x) y_ref_shallow ← Encoder_Shallow(x_ref[j]) b ← ReferenceGuidedEntropy([y_shallow, y_ref_shallow, y_ref[q]]) send b → Earth |
- Complexity reduction: Encoder GFLOPs decrease from 406.85 to 395.16 (2.87%); overall encoder complexity is reduced by 43.51% versus SOTA.
6. Optimization Objective and Training Regimen
REMAC is trained end-to-end with a rate-distortion (R–D) loss: Where: No auxiliary or adversarial losses are used.
7. Empirical Results and Comparative Performance
Experiments are carried out on the MIC dataset (3,088 Martian images: 2,702 train, 386 reference), using patches and eight settings for variable bitrate. Evaluation metrics include PSNR, MS-SSIM (dB scale), LPIPS, and their BD (Bjøntegaard Delta) variants.
| Method | Encoder FLOPs (G) | BD-PSNR (dB) |
|---|---|---|
| Cheng et al. | 699.55 | +0.6830 |
| REMAC | 395.16 | +0.9494 |
| Reduction and Gain | –43.51% | +0.2664 dB |
REMAC demonstrates substantial Mars-side complexity reduction and superior rate-distortion performance against symmetric learned codecs.
8. Conclusions and Prospective Extensions
REMAC systematically exploits intrinsic similarities in Martian imaging through reference-guided entropy modeling and deep multi-scale ref-decoding, achieving a marked shift of computational effort from the rover to ground-side processing. The latent feature recycling mechanism further streamlines Mars-side operations, enabling deployment in severely resource-constrained environments (e.g., 256 MB DRAM).
Planned extensions include decoder-side model compression (quantization, pruning), adaptation to other homogeneous remote sensing domains (e.g., lunar, underwater), and integration of advanced attention modules within the entropy model to sharpen conditional probabilities (Ding et al., 26 Jan 2026).