Attribution of Under-Predictions to Model Deficiencies vs Ground-Truth Differences
Establish whether the slight under-predictions of 2-meter temperature anomaly magnitude by GraphCast and PanguWeather during the 2021 Pacific Northwest heatwave are due to deficiencies in these ML models or are instead a consequence of discrepancies between the ERA5 and HRES-fc0 ground-truth datasets used for evaluation.
References
One needs to keep in mind that the ERA5 ground truth and the HRES-fc0 ground truth don't coincide exactly, and thus it is not clear whether one can attribute this slight under-predictions of GraphCast and PanguWeather to model deficiencies.
                — Validating Deep Learning Weather Forecast Models on Recent High-Impact Extreme Events
                
                (2404.17652 - Pasche et al., 26 Apr 2024) in Appendix A, Section A.2: Further Analysis on the Spatial Extent of Forecasts