Phase-Retrieval-Based PINN for Acoustic Fields
- The paper introduces PRB-PINN, which integrates independent magnitude and phase networks with a Helmholtz PDE loss to accurately reconstruct complex acoustic fields.
- The method leverages a joint optimization strategy that decouples magnitude and phase inference by embedding a phase-retrieval mechanism within the PINN framework.
- Experimental results in a 3D acoustic domain show that PRB-PINN reduces test log-spectral loss by 1–3 dB compared to baseline methods, especially under sparse sensor conditions.
Phase-retrieval-based physics-informed neural networks (PRB-PINNs) address the task of reconstructing acoustic magnitude fields when only spatially sparse magnitude measurements are available and phase information is inaccessible or unreliable. This method leverages PINNs by augmenting them with a phase-retrieval mechanism, enabling full-field magnitude estimation through explicit modeling of the phase and magnitude as independent neural submodules. The central innovation lies in the integration of a physics-based PDE loss that enables the implicit recovery of spatial phase information, producing a consistent complex field solution from magnitude data alone (Schrader et al., 27 Jan 2026).
1. Governing Equations and Physics Constraints
The problem formulation assumes a free-field acoustic environment at a fixed frequency, with the pressure field governed by the homogeneous Helmholtz equation: where denotes the 3D Laplacian, is the wavenumber, and is the spatial domain of interest.
To enforce physical correctness, the method defines a physics loss computed as the mean squared residual of the Helmholtz operator on randomly sampled collocation points in : No explicit Neumann or nontrivial boundary term is used in the basic acoustic room setup; Dirichlet and magnitude constraints are incorporated via data loss terms.
2. Network Architecture and Field Representation
The PRB-PINN models the magnitude and phase of the acoustic field with two independent multilayer perceptrons (MLPs):
- The magnitude network outputs .
- The phase network outputs .
Both networks are parameterized identically with 4 hidden layers, 256 units per layer, and tanh activations. The input is projected via a random Fourier feature embedding with drawn from a standard normal distribution .
At each spatial point, the complex-valued pressure field is reconstructed as:
3. Loss Function and Data Fitting
The total loss function is a weighted combination of data-fitting and physics terms: with and .
- The data-fitting loss leverages only magnitude measurements at sensor positions . A log-spectral loss is employed: which was found to correlate better with perceptual error than an MSE in magnitude.
4. Phase Retrieval Mechanism
Phase is not measured directly and is instead implicitly estimated via the PDE regularizer. The parameters of the phase network are initialized randomly. During training, minimization of the term steers the phase estimates towards physically valid configurations compatible with the Helmholtz equation.
The phase retrieval is thus embedded in the joint optimization over both magnitude and phase networks, with no explicit projection or iterative phase-unwrapping methods used. Global phase shifts are not penalized, reflecting inherent ambiguity; additional phase-smoothness terms are unnecessary in the reported scenarios.
5. Training Setup and Hyperparameter Regime
Training uses the AdamW optimizer with an initial learning rate of , decaying to every 10,000 iterations. The schedule runs for steps. The PDE loss is computed on batches of collocation points (typically several thousand), while all measurement points are utilized for the data loss at every iteration. Both networks are trained from scratch in a joint, one-stage procedure—no curriculum or pretraining is required (Schrader et al., 27 Jan 2026).
6. Experimental Evaluation
The methodology was evaluated via simulation in a rectangular room of size (ms). The reconstruction target is a cube at the center of the room, discretized on a spatial grid. The number of sensors takes values in {5, 10, 20, 50}, with measurement positions randomly sampled.
A total of 64 point sources are positioned outside the reconstruction domain, half of which are used for hyperparameter tuning, the remainder for evaluation. Single frequencies at 200, 400, and 600 Hz are considered.
Performance is quantified by the test-set log-spectral loss in decibels as defined above. Baselines include nearest-neighbor interpolation and a neural field without PDE regularization (). PRB-PINN consistently outperformed both baselines across sensor counts and frequencies, achieving $1$–$3$ dB lower test losses. Visualizations display high-fidelity reconstruction of standing-wave amplitude patterns otherwise blurred or distorted by baseline methods.
A summary of the experimental setup and results is given below.
| Sensor Count | Frequency (Hz) | Method | Typical log-spectral loss (dB) | Qualitative Outcome |
|---|---|---|---|---|
| 20 | 200 | PRB-PINN | Lower by 1–3 dB | Pattern closely matches ground truth |
| 20 | 200 | Baseline NF | Higher | Smoothed/blurred nodes |
| 20 | 200 | Interpolator | Highest | Significant loss of fine features |
7. Discussion, Limitations, and Extensions
The PRB-PINN approach reliably converges in iterations. Accuracy improves sharply with increased sensor count, though at higher frequencies (e.g., 600 Hz), even relatively dense sampling may not suffice for full spatial resolution. The PDE regularizer offers substantial robustness to data sparsity and moderate magnitude noise (perturbations up to dB).
Potential extensions include multi-frequency joint optimization to resolve phase ambiguity, explicit phase-smoothness or total-variation regularization, application to domains with scattering or nontrivial boundaries, and integration of learned priors to further reduce sensor requirements. Limitations include potential non-uniqueness in extremely sparse settings and sensitivity at high frequencies or for pathological sensor placements (Schrader et al., 27 Jan 2026).
The method represents a principled architecture for incorporating physical constraints into neural field regression tasks under magnitude-only measurement regimes, with demonstrated superiority over purely data-driven approaches.