- The paper proposes a novel physics-informed RNN that integrates machine learning with FAO-based physical models to forecast crop yield loss under water scarcity.
- It uses an LSTM backbone and a custom loss function to sequentially estimate biophysical properties (ET_a and K_y) while enforcing physical consistency.
- Experiments show the PI-RNN achieves an R² of 0.77 using high-resolution Sentinel-2 and weather data, outperforming standard RNNs and simulation models.
This paper, "Exploring Physics-Informed Neural Networks for Crop Yield Loss Forecasting" (2501.00502), addresses the critical need to accurately forecast crop yield loss under extreme weather conditions, particularly water scarcity, in the context of climate change.
Problem: Traditional crop simulation models, while explainable and grounded in physical principles, often struggle with high-resolution data, are computationally expensive, require extensive calibration, and can be inaccurate due to simplified assumptions. Pure data-driven ML models excel at scalability and accuracy with complex data but act as black boxes, lacking interpretability and often failing to adhere to underlying physical principles of crop growth. This gap limits their utility for understanding crop physiology and supporting informed decision-making. Specifically, assessing yield loss due to water scarcity is challenging because accurately determining actual evapotranspiration (ETa​) at a high resolution is difficult, making the classic FAO yield response to water equation hard to apply precisely in practice.
Proposed Solution: The authors propose a novel physics-informed neural network (PI-RNN) approach that combines the data-handling capabilities of ML with the interpretability of physical models. The core idea is to use an RNN to learn key biophysical properties, actual water use (ETa​) and crop sensitivity to water scarcity (Ky​), at the pixel level. These learned properties are then used to estimate yield loss based on the established FAO equation for crop yield response to water scarcity. Physical consistency is enforced through a specialized loss function.
Methodology:
- Physical Foundation: The model is built upon the FAO equation for relative yield loss (Yl​):
Yl​=(1−Yx​Ya​​)=Ky​(1−ETx​ETa​​)
where Ya​ is actual yield, Yx​ is maximum (potential) yield, ETa​ is actual evapotranspiration, ETx​ is maximum evapotranspiration, and Ky​ is the crop yield response factor. The paper assumes ETx​ can be accurately simulated using methods like FAO paper-56, focusing on learning ETa​ and Ky​.
- Neural Network Architecture: A Recurrent Neural Network (RNN), specifically an LSTM backbone, is used to process time series data. The output layer of the RNN is designed to predict the two biophysical properties, ETa​ and Ky​, for each time step. The architecture includes sequential layers with linear transformations, batch normalization, and dropout before the final output layers for Ky​ and ETa​. The RNN is trained to predict these properties sequentially over the growing season.
- Physics Integration through Loss Function: A two-component loss function enforces physical consistency:
Ltotal​=λ1​Ll​+λ2​Lphys​
* Ll​: Standard mean squared error (MSE) between the predicted yield loss (ya​^​) derived from the FAO equation (using the predicted ETa​ and Ky​) and the ground truth actual yield loss (ya​). Ll​=E[(ya​^​−ya​)2]. This term drives the model to accurately predict the final yield loss.
* Lphys​: A penalty term designed to ensure ETa​ predictions are physically plausible and contribute meaningfully to solving the FAO equation. This term specifically encourages ETa​ values to be within the bounds [0, ETx​] and close to ETx​ (under non-limiting conditions).
Lphys​=E[1{ETa​<0}⋅(ETa​)2​+1{ETa​>ETx​}⋅(ETa​−ETx​)2​+1{0≤ETa​≤ETx​}⋅(ETa​−ETx​)2​]
The indicator function 1{⋅} applies penalties if ETa​ is outside the [0, ETx​] range and an MSE-like term within bounds. The λ1​ and λ2​ are hyperparameters weighting the two loss components.
- Data:
- Ground Truth: Pixel-level combine harvester yield data for cereal crops in Switzerland (2017-2021), with 54,098 samples from 54 yield maps (10m resolution). Yield loss is calculated relative to the maximum yield sample within the dataset (Yx​).
- Simulated Data: ETx​ values simulated over time using the FAO paper-56 method.
- Input Data:
- Sentinel-2 (S2) L2A multispectral time series (10 bands, 10m resolution) from seeding to harvest.
- Weather data (total precipitation, temperature) derived from ERA5 (aggregated between S2 time steps).
- Data modalities (S2 and weather) are combined using early fusion of raw time series.
Experiments and Results:
- Comparison with SOTA ML: The PI-RNN was compared against standard RNN (LSTM) and Transformer models for yield loss prediction using S2 + Weather data.
- The PI-RNN achieved an R2 of 0.77, outperforming the standard RNN (R2=0.75) and performing similarly to the Transformer (R2=0.78). MAE and RMSE were also comparable or better than the standard RNN.
- Importantly, unlike the standard ML models, the PI-RNN provides interpretable outputs (ETa​, Ky​).
- Comparison with Simulation Model: The simulation model (predicting using weather data) showed significantly poorer performance (R2=−6.92) compared to the ML models, highlighting the difficulty of applying coarse-resolution simulations at a fine pixel scale and accurately estimating ETa​ without detailed field-level data.
- Modality Ablation: Experiments with the PI-RNN using different input modalities showed the importance of S2 data:
- Weather only: R2=0.46
- Sentinel-2 only: R2=0.75
- Sentinel-2 + Weather: R2=0.77
- This indicates that high-resolution S2 imagery is crucial for capturing spatial variability and achieving high accuracy at the pixel level.
- Interpretable Outputs: The sequential estimations of ETa​ and Ky​ demonstrated physically consistent trends over the growing season. Estimated ETa​ was consistently below simulated ETx​, reflecting water limitation and associated yield loss. Ky​ values were generally below 1, suggesting some resilience to water scarcity in the dataset used. Predictions became more accurate closer to harvest.
Implementation Considerations:
- Data Preprocessing: Requires handling time series data from S2 (potential missing values due to clouds need interpolation/imputation) and aligning it with weather data and pixel-level yield observations. Georeferencing and resampling might be necessary.
- Model Architecture: An LSTM is used, which is suitable for sequence data. Implementing the custom two-component loss function is crucial for enforcing physical constraints. Libraries like PyTorch or TensorFlow allow for custom loss functions.
- Hyperparameter Tuning: The weights λ1​ and λ2​ in the loss function need tuning. Standard training procedures for RNNs apply, including optimization algorithms (e.g., Adam) and learning rate scheduling.
- Computational Resources: Training RNNs on time series data from high-resolution imagery can be computationally intensive, requiring GPUs. The dataset size (54,098 pixel samples) is significant.
- Scalability: The pixel-level approach can be scaled spatially if sufficient high-resolution data (S2, weather, and ground truth yield) is available for larger areas. Simulating ETx​ also needs to be scalable.
- Data Availability: Access to high-quality, georeferenced pixel-level yield data is often a limiting factor for training and validating such models. S2 and ERA5 data are publicly available, but aligning them precisely with yield data requires careful processing.
Practical Applications:
- Yield Loss Assessment: Provides estimates of yield loss at a fine spatial resolution, allowing for targeted interventions or risk assessment.
- Understanding Water Stress: The intermediate outputs (ETa​, Ky​) offer insights into how water availability affects crop performance throughout the season, similar to process-based models but derived from observed data.
- Supporting Decision Making: Provides actionable information for farmers (irrigation planning), policymakers (drought impact assessment, resource allocation), and industry (insurance, supply chain forecasting).
- Early Forecasting: The model can provide increasingly accurate yield loss forecasts as the season progresses, as shown in the results.
Limitations and Future Work (as mentioned in the paper):
- Further validation on larger and more diverse datasets is needed to confirm the model's validity across various crops, regions, and yield-limiting conditions (not just water scarcity).
- Exploring how to incorporate uncertainty quantification into the PI-RNN could provide more robust forecasts and risk assessments.
In summary, the paper presents a promising physics-informed deep learning approach for pixel-level crop yield loss forecasting, demonstrating improved or comparable performance to state-of-the-art ML models while providing valuable, physically consistent intermediate outputs that enhance interpretability for agricultural stakeholders.