Polarization-Aided Adaptive Region Growing (PARG)
- The paper introduces PARG, which segments the polarized object mask into locally convex subregions to resolve π-periodicity in azimuth angles and enhance 3D reconstruction fidelity.
- It employs a 4D polarization feature tensor with adaptive weighting based on local variance to robustly guide the region growing process.
- The strategy integrates into the SMSfP framework by decomposing complex surfaces into locally well-posed convex subproblems, ensuring consistent surface normal and height recovery.
Polarization-Aided Adaptive Region Growing (PARG) is a segmentation strategy introduced within the segmentation-driven monocular shape-from-polarization (SMSfP) framework for enhanced 3D surface reconstruction from single-view polarization images. PARG addresses the intrinsic azimuth ambiguity found in conventional monocular shape-from-polarization (SfP) methods by partitioning the polarized object mask into locally convex subregions and adapting the convexity prior separately in each. This decomposition enables independent, locally well-posed convexity-prior optimizations, effectively suppressing global azimuth ambiguities while preserving overall surface coherence (Zhang et al., 8 Jan 2026).
1. Conceptual Foundations and Motivation
The central challenge in monocular SfP arises from the $\pi$-ambiguity of the azimuth angle, which destabilizes global surface normal and height recovery. Traditional global optimization methods disambiguate poorly, especially for complex geometries. PARG segments the object mask into a set of labeled, locally convex regions. Within each region, the multi-scale fusion convexity prior (MFCP) can be applied independently, so the $\pi$-periodic azimuth ambiguity is resolved consistently per region rather than globally, which improves geometric fidelity and reconstruction accuracy.
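The origin of the ambiguity can be seen in the standard polarizer transmission model. This is a textbook relation rather than a formula taken from the SMSfP paper, and the symbols $\theta$ (polarizer angle), $\phi$ (angle of polarization), $I_{\max}$ and $I_{\min}$ are notation assumed here for illustration:

```latex
I(\theta) = \frac{I_{\max} + I_{\min}}{2}
          + \frac{I_{\max} - I_{\min}}{2}\,\cos\bigl(2\theta - 2\phi\bigr),
\qquad
\cos\bigl(2\theta - 2(\phi + \pi)\bigr) = \cos\bigl(2\theta - 2\phi\bigr).
```

Replacing $\phi$ with $\phi + \pi$ leaves every measured intensity unchanged, so the azimuth recovered from it is ambiguous between two candidates separated by $\pi$.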
2. Mathematical Formulation
PARG’s mathematical structure is built around polarization-based features and local adaptation:
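- Polarization Feature Tensor: At each pixel $p$, a 4D tensor is constructed:
$$\mathbf{F}(p) = \bigl[\rho(p),\ \sin 2\phi(p),\ \cos 2\phi(p),\ \lvert\nabla\phi(p)\rvert\bigr]$$
Here, $\rho$ is the degree of polarization (DOP), $\phi$ the angle of polarization (AOP), and $\lvert\nabla\phi\rvert$ the modulus of the AOP gradient, revealing orientation discontinuities. Wrapping $\phi$ into $\sin 2\phi$ and $\cos 2\phi$ removes the $\pi$-periodicity.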
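- Local Variance and Reliability: For each pixel, a centered window is defined and local variances are computed over it. Reliability scores are obtained by normalizing these variances.
- Adaptive Channel Weighting: The pixelwise channel weights are derived from the reliability scores, with user-chosen hyperparameters setting the strength of the adaptation.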
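- Feature Distance and Region Growing: For a region $R$ grown from a seed, with current mean feature $\boldsymbol{\mu}_R$, a neighbor $q$ with feature $\mathbf{F}(q)$ and weights $\mathbf{w}(q)$ is assessed with:
$$d(q, R) = \bigl\lVert \mathbf{w}(q) \odot \bigl(\mathbf{F}(q) - \boldsymbol{\mu}_R\bigr) \bigr\rVert$$
where $\odot$ indicates element-wise multiplication. The neighbor $q$ is added if $d(q, R) < \tau$, with $\tau$ a global threshold.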
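The feature encoding and the distance test described above can be sketched in plain Python. This is a minimal illustration under stated assumptions, not the authors' implementation: the function names, the choice of a Euclidean norm, the example weights, and the threshold value `tau` are all hypothetical. The mapping from local variance to reliability scores and channel weights is not reproduced here, so the weights are passed in directly.

```python
import math

def feature_vector(dop, aop, aop_grad_mag):
    """Return the 4D feature [DOP, sin(2*AOP), cos(2*AOP), |grad AOP|]."""
    return [dop, math.sin(2.0 * aop), math.cos(2.0 * aop), aop_grad_mag]

def local_variance(values):
    """Population variance of the samples inside one centered window."""
    n = len(values)
    mean = sum(values) / n
    return sum((v - mean) ** 2 for v in values) / n

def weighted_distance(weights, feature, region_mean):
    """Norm of the element-wise weighted feature difference.

    The Euclidean norm is an assumption made for this sketch.
    """
    return math.sqrt(sum((w * (f - m)) ** 2
                         for w, f, m in zip(weights, feature, region_mean)))

# Two AOP readings that differ by pi describe the same polarization state,
# so their encoded features coincide up to floating-point error.
f_a = feature_vector(0.30, 0.40, 0.05)
f_b = feature_vector(0.30, 0.40 + math.pi, 0.05)

# Hypothetical weights and threshold, chosen only for illustration.
weights = [1.0, 0.5, 0.5, 2.0]
tau = 0.2
region_mean = feature_vector(0.32, 0.42, 0.04)

d = weighted_distance(weights, f_a, region_mean)
accept = d < tau  # the neighbor joins the region when the distance is small
```

Encoding the AOP through the sine and cosine of the doubled angle is what makes `f_a` and `f_b` agree, so the region-growing test never sees the $\pi$ jump as a feature discontinuity.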
3. Algorithmic Workflow and Pseudocode
PARG is implemented as a seeded, weighted adaptive region-growing algorithm. The pipeline follows these sequential steps:
- Feature Computation: Generate the 4D feature tensor for all mask pixels.
- Seed Initialization: Choose seeds (object boundary, skeleton, or uniform grid).
- Region Growing: For each seed:
  - Assign a new region label.
  - Initialize the region's mean feature with the seed's features.
  - Iteratively expand the region, adding unlabeled neighbors whose weighted feature distance to the current mean is below the threshold; update the mean incrementally.
- Post-processing: Apply morphological hole filling and Gaussian smoothing.
- Output: Segmentation label map.
Implementation parameters include the variance window size, the recommended adaptive weight strengths, and an empirical similarity threshold ($0.1$–$0.3$ in normalized feature-distance units). Post-processing operations further refine region boundaries (Zhang et al., 8 Jan 2026).
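The region-growing step of the workflow can be sketched as a breadth-first expansion with an incrementally updated mean. This is a hedged sketch, not the paper's code: `grow_regions`, its argument layout, the 4-connected neighborhood, the Euclidean distance, and the toy data are assumptions, and the post-processing steps (hole filling and smoothing) are omitted.

```python
import math
from collections import deque

def grow_regions(features, mask, seeds, weights, tau):
    """Seeded region growing over a 2D grid of feature vectors.

    features[r][c] is a feature list, mask[r][c] is truthy inside the object,
    and weights[r][c] is a per-pixel channel-weight list. Returns a label map
    with 0 for background or unassigned pixels and 1, 2, ... for regions.
    """
    rows, cols = len(mask), len(mask[0])
    labels = [[0] * cols for _ in range(rows)]
    next_label = 0
    for seed in seeds:
        sr, sc = seed
        if not mask[sr][sc] or labels[sr][sc]:
            continue  # seed lies outside the mask or was already absorbed
        next_label += 1
        labels[sr][sc] = next_label
        mean = list(features[sr][sc])  # running mean feature of the region
        count = 1
        queue = deque([seed])
        while queue:
            r, c = queue.popleft()
            # 4-connected neighborhood (an assumption of this sketch)
            for nr, nc in ((r - 1, c), (r + 1, c), (r, c - 1), (r, c + 1)):
                if not (0 <= nr < rows and 0 <= nc < cols):
                    continue
                if not mask[nr][nc] or labels[nr][nc]:
                    continue
                f, w = features[nr][nc], weights[nr][nc]
                dist = math.sqrt(sum((wk * (fk - mk)) ** 2
                                     for wk, fk, mk in zip(w, f, mean)))
                if dist < tau:
                    labels[nr][nc] = next_label
                    count += 1
                    # incremental update of the region mean
                    mean = [mk + (fk - mk) / count for fk, mk in zip(f, mean)]
                    queue.append((nr, nc))
    return labels

# Toy 2x4 grid: two homogeneous halves with clearly different features.
left, right = [0.2, 0.0, 1.0, 0.0], [0.6, 1.0, 0.0, 0.0]
features = [[left, left, right, right] for _ in range(2)]
mask = [[1, 1, 1, 1] for _ in range(2)]
weights = [[[1.0] * 4 for _ in range(4)] for _ in range(2)]
labels = grow_regions(features, mask, seeds=[(0, 0), (0, 3)],
                      weights=weights, tau=0.2)
```

In this sketch, pixels that no region accepts keep label 0; a full pipeline would handle them during post-processing or with additional seeds.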
| Component | Function | Notes |
|---|---|---|
| Feature tensor | Encodes DOP, unwrapped AOP, gradient | 4D, combines intensity and geometric info |
| Reliability scores | Modulate adaptive weighting | Based on local variance |
| Seeds | Initialize region-growing locations | Chosen at boundaries, skeleton, or grid of mask |
| Threshold | Controls inclusion via feature similarity | Empirically chosen in implementation |
| Post-processing | Fills holes, smooths boundaries | Morphological and Gaussian filtering |
4. Segmentation-Driven Disambiguation Mechanisms
PARG’s core advantage is in mitigating the global $\pi$-ambiguity. By restricting each growing region to pixels with homogeneous polarization signatures, local azimuth direction variation remains smooth and consistent. Consequently, when the multi-scale fusion convexity prior is enforced on each segmented region, the ambiguity collapses to a single, internally consistent solution per segment rather than across the whole mask, preserving both geometric detail and continuity. The global height-from-polarization recovery task becomes a union of locally well-posed convex problems.
A plausible implication is that this approach could generalize to other segmentation-driven physical inversion contexts where local homogeneity improves global tractability, subject to suitable prior design and feature selection.
5. Inputs, Hyperparameters, and Practical Considerations
PARG depends on inputs derived from four polarization-state images to compute:
- Degree of Polarization $\rho$: Sensitive to surface geometry; boundaries correspond to high local changes.
- Angle of Polarization $\phi$: Encoded as $\sin 2\phi$ and $\cos 2\phi$ to remove the $\pi$-periodic ambiguity.
- Gradient Magnitude $\lvert\nabla\phi\rvert$: Emphasizes azimuthal boundaries and discontinuities.
- Foreground Mask: Binary mask limiting segmentation to the object region.
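For reference, DOP and AOP are commonly obtained from four polarizer-angle intensities through the linear Stokes parameters. The sketch below assumes the four polarization-state images were captured at 0°, 45°, 90°, and 135°, which the text does not state explicitly; the function name `dop_aop` is hypothetical.

```python
import math

def dop_aop(i0, i45, i90, i135):
    """DOP and AOP at one pixel from four polarizer-angle intensities.

    Assumes polarizer angles of 0, 45, 90 and 135 degrees.
    """
    s0 = 0.5 * (i0 + i45 + i90 + i135)  # total intensity
    s1 = i0 - i90
    s2 = i45 - i135
    if s0 <= 0.0:
        return 0.0, 0.0  # no signal: treat the pixel as unpolarized
    dop = math.sqrt(s1 * s1 + s2 * s2) / s0
    aop = (0.5 * math.atan2(s2, s1)) % math.pi  # wrap into [0, pi)
    return dop, aop

# Synthetic check: fully linearly polarized light at 30 degrees (Malus's law).
phi = math.radians(30.0)
samples = [math.cos(math.radians(a) - phi) ** 2 for a in (0, 45, 90, 135)]
dop, aop = dop_aop(*samples)
```

The AOP returned here lives in $[0, \pi)$, which is exactly the range over which the azimuth remains ambiguous before any convexity prior is applied.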
Hyperparameters include:
- Variance window size.
- Adaptive weight strengths, as set in the reported experiments.
- Similarity threshold: $0.1$–$0.3$ (empirically determined; normalized units).
- Seed strategy: mask boundary, skeleton, or grid sampling.
- Morphological and Gaussian post-filtering.
These design choices are necessary for accurate region partitioning, ensuring each subregion is “locally convex” for subsequent MFCP surface height recovery.
6. Workflow Integration within SMSfP
PARG sits between raw polarization inference and local convexity-regularized optimization. The complete SMSfP framework proceeds as follows:
- Compute DOP and AOP from raw polarization images.
- Encode pixelwise 4D feature vectors.
- Segment the masked object into locally convex regions using PARG.
- Apply region-level convexity prior (MFCP) optimization independently in each segment.
- Fuse local surface reconstructions into a global 3D solution.
By structuring global reconstruction as a composition of locally optimized subproblems, SMSfP with PARG demonstrates substantial improvement in disambiguation accuracy and surface quality compared with prior monocular physics-based SfP techniques (Zhang et al., 8 Jan 2026).
7. Limitations and Scope of Applicability
PARG’s efficacy is tightly linked to the correctness of input polarization data, the reliability of the adaptive weighting scheme, and the empirical setting of the similarity threshold and other hyperparameters. The method assumes foreground masks are available and that object surfaces can be decomposed into regions sufficiently convex to suppress azimuth ambiguities with a locally applied convexity prior. Topologically complex or highly concave shapes may challenge the region-growing process and subsequent optimization unless subregion convexity hypotheses are valid.
While the SMSfP framework with PARG is validated on both synthetic and real-world datasets, further investigation is warranted to establish generalization across diverse material classes and observation configurations.