Hi-GRPO: Optimized 3D Volumetric Imaging
- Hi-GRPO is a family of unsupervised inverse optimization methods that reconstruct high-fidelity 3D volumetric images while effectively reducing ring artifacts.
- The approach integrates a differentiable forward model with implicit neural fields and multiresolution hash encoding for robust, memory-efficient artifact correction.
- Empirical results demonstrate that Hi-GRPO outperforms traditional and supervised techniques, achieving higher PSNR and SSIM in both simulated and real CT imaging scenarios.
Hi-GRPO refers to the class of highly general, multi-parameter inverse problem optimization methods for reconstructing high-fidelity volumetric images from raw physical measurements, with specific application in 3D imaging modalities such as cone-beam computed tomography (CBCT) with strong non-ideal physical artifacts. This family of techniques is characterized by unsupervised, memory-efficient joint estimation frameworks in which imaging artifacts (e.g., detector gain non-uniformity, dead pixels) are modeled as learnable physical parameters, and the artifact-free image is represented via implicit neural fields. These methods are exemplified by recent proposals such as AR3D-R1 (also called Riner), which combines a physically accurate forward model, differentiable programming, and neural spectral bias for robust artifact reduction in large-scale 3D imaging problems (Wu et al., 8 Dec 2024).
1. Physical Model of Ring-Artifact Formation in 3D CBCT
In 3D CBCT, a volumetric object is imaged by acquiring a series of 2D X-ray projections using a planar detector array at a set of angles . The idealized system assumes all detector channels have unit gain () and no defects; in practice, each detector may have a unique unknown gain () or may be non-responsive (), leading to prominent ring artifacts and invalid signal channels.
The raw photon count at detector and angle follows from Lambert–Beer's law:
where is the path traced by a ray through the volume. The negative log-ratio, after masking dead channels, defines the physically-consistent forward model:
with , for dead () and 1 otherwise. In operator form:
where and . Measurement noise is modeled as .
2. Multi-Parameter Inverse Problem and Implicit Neural Regularization
The Hi-GRPO methodology, as instantiated by AR3D-R1, is framed as an unsupervised joint inverse problem:
- Recover the artifact-free attenuation volume ,
- Estimate channel responses , directly from raw noisy data , without paired ground truth.
The problem is ill-posed due to the multiplicity of physical unknowns. Rather than explicit or total variation penalties, regularization is achieved by parameterizing as a neural field (MLP with multiresolution hash encoding), leveraging its spectral bias for implicit smoothness. Formally:
where is an MLP mapping . Mini-batches are sampled per iteration for stochastic optimization.
3. Differentiable Forward Model and Memory Efficiency
All operations in the forward model—ray sampling, MLP evaluation, log gain offset, dead-channel masking—are implemented via differentiable operators (e.g., ReLU, hash/Fourier encoding, log-sum) in frameworks such as PyTorch. Gradient backpropagation is enabled for both neural field weights and physical gains .
Memory efficiency is a central design constraint. Hi-GRPO employs "ray marching" mini-batch optimization: at each step, only a small set of rays (e.g., ), covering a fraction of angles and detector pixels, are processed. Volume-wide predictions are avoided; memory scales as instead of . This enables fitting to volumes as large as on commodity GPUs.
Training proceeds by stochastic ray sampling, forward projection, loss calculation against measurements, and simultaneous Adam-based updates of both field parameters and detector gains.
4. Neural Representation: Hash-Encoding Neural Fields
The artifact-free attenuation field is parameterized as an implicit neural field with a multiresolution hash encoding architecture (following Müller et al. 2022):
- resolution levels,
- hash table size ,
- feature dimension ,
- two fully connected layers with 64 ReLU activations.
The hash encoding induces bias toward coarse geometric structures initially, enabling effective quick optimization and natural recovery of high-frequency details as training proceeds. No explicit denoising or additional regularization is applied; artifact removal arises from jointly optimizing the forward physical model and the neural field.
5. Empirical Performance and Benchmarks
Benchmarking is conducted on both simulated and real datasets:
- Simulated 2D/3D: DeepLesion, LIDC (2D fan-beam CBCT), AAPM (3D, ),
- Real: Bruker SKYSCAN 1276 micro-CT (Walnut: 2D FBCT, Chicken foot: 3D CBCT, ).
Ring artifacts are synthesized by randomly assigning of detectors random gains and inserting dead pixels (), plus Poisson and Gaussian noise.
Performance is assessed using PSNR and SSIM. Key 3D AAPM results:
| Method | PSNR (dB) | SSIM |
|---|---|---|
| FDK (linear) | $0.799$ | |
| Restormer (DL-SOTA) | $0.952$ | |
| AR3D-R1 (Hi-GRPO) | $0.977$ |
Hi-GRPO consistently outperforms both supervised deep learning and traditional model-based RAR baselines by $1$–$3$ dB in PSNR (2D and 3D). Qualitatively, reconstructions exhibit clearer structural detail and effective suppression of ring artifacts.
6. Strengths, Limitations, and Extensions
Hi-GRPO avoids the domain-shift limitations of supervised RAR, as unsupervised direct fitting to measurement sinograms is agnostic to the synthetic/real divergence found in medical imaging data. The same code accommodates various geometries (2D fan-beam, 3D cone-beam, helical) via appropriate ray parameterization.
Limitations include per-scan optimization time (15–20 minutes on modern GPUs), as there is no feed-forward prediction. Potential accelerations could involve more compact encodings (e.g., K-planes, Gaussian Splatting) or transfer-based warm starting. Current models assume static objects and stationary detector gains; dynamic or time-varying gain models () would require further methodological extension.
7. Context and Research Impact
The Hi-GRPO paradigm represents a shift from paired, supervised, data- and memory-intensive approaches to unsupervised, physically-motivated, and scalable frameworks in 3D imaging artifact correction. By integrating differentiable physics, neural implicit representation, and memory-efficient inverse optimization, these methods deliver improved artifact suppression, better generalization, and the practical capacity to operate on real-world, large-scale volumetric scans directly from raw measurement data (Wu et al., 8 Dec 2024).