Regression modelling of spatiotemporal extreme U.S. wildfires via partially-interpretable neural networks (2208.07581v4)
Abstract: Risk management in many environmental settings requires an understanding of the mechanisms that drive extreme events. Useful metrics for quantifying such risk are extreme quantiles of response variables conditioned on predictor variables that describe, e.g., climate, biosphere and environmental states. Typically these quantiles lie outside the range of observable data and so, for estimation, require specification of parametric extreme value models within a regression framework. Classical approaches in this context utilise linear or additive relationships between predictor and response variables and suffer in either their predictive capabilities or computational efficiency; moreover, their simplicity is unlikely to capture the truly complex structures that lead to the creation of extreme wildfires. In this paper, we propose a new methodological framework for performing extreme quantile regression using artificial neural networks, which are able to capture complex non-linear relationships and scale well to high-dimensional data. The "black box" nature of neural networks means that they lack the desirable trait of interpretability often favoured by practitioners; thus, we unify linear, and additive, regression methodology with deep learning to create partially-interpretable neural networks that can be used for statistical inference but retain high prediction accuracy. To complement this methodology, we further propose a novel point process model for extreme values which overcomes the finite lower-endpoint problem associated with the generalised extreme value class of distributions. Efficacy of our unified framework is illustrated on U.S. wildfire data with a high-dimensional predictor set and we illustrate vast improvements in predictive performance over linear and spline-based regression techniques.
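The idea of a partially-interpretable predictor can be illustrated with a minimal sketch, under assumptions that go beyond the abstract. Here a parameter of an extreme value model is written as an intercept plus a linear term in some "interpreted" predictors plus the output of a small ReLU network applied to the remaining predictors. For simplicity the sketch feeds this predictor into the scale of a generalised Pareto distribution; this is a stand-in chosen for brevity, not the point process model proposed in the paper. All names (`partially_interpretable_predictor`, `gpd_quantile`), dimensions and weight values are hypothetical, and the weights are random rather than fitted.

```python
import numpy as np

rng = np.random.default_rng(0)


def relu(z):
    """Rectified linear unit activation."""
    return np.maximum(z, 0.0)


def partially_interpretable_predictor(x_lin, x_nn, beta0, beta, W1, b1, w2):
    """Return beta0 + x_lin @ beta + m(x_nn), with m a one-hidden-layer ReLU network.

    The linear coefficients `beta` keep their usual regression interpretation;
    the network term absorbs the effect of the remaining predictors.
    """
    linear_part = x_lin @ beta
    hidden = relu(x_nn @ W1 + b1)
    nn_part = hidden @ w2
    return beta0 + linear_part + nn_part


def gpd_quantile(p, sigma, xi):
    """p-quantile of generalised Pareto excesses (scale sigma > 0, shape xi != 0)."""
    return sigma / xi * ((1.0 - p) ** (-xi) - 1.0)


# Hypothetical toy data: 5 observations, 2 interpreted and 4 non-interpreted predictors.
n, d_lin, d_nn, hidden_units = 5, 2, 4, 8
x_lin = rng.normal(size=(n, d_lin))
x_nn = rng.normal(size=(n, d_nn))

# Illustrative (unfitted) parameter values.
beta0 = 0.5
beta = np.array([0.3, -0.2])
W1 = rng.normal(scale=0.1, size=(d_nn, hidden_units))
b1 = np.zeros(hidden_units)
w2 = rng.normal(scale=0.1, size=hidden_units)

eta = partially_interpretable_predictor(x_lin, x_nn, beta0, beta, W1, b1, w2)
sigma = np.exp(eta)  # log link keeps the scale positive (an assumption of this sketch)
q99 = gpd_quantile(0.99, sigma, xi=0.2)  # conditional 0.99-quantile of the excesses
```

In this sketch, changing `beta` shifts the predictor by exactly `x_lin @ delta`, which is the sense in which the linear part remains interpretable while the network part stays a black box.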