- The paper introduces the Fortran-Keras Bridge (FKB) that enables advanced Keras deep learning models to operate within legacy Fortran codebases without extensive rewrites.
- It details a two-way conversion process that translates complete neural network architectures, including weights and layers, between Python and Fortran environments.
- Extensive testing on a climate simulator shows that optimizer choice, notably the use of the Adam optimizer, significantly influences online simulation stability and model accuracy.
A Fortran-Keras Deep Learning Bridge for Scientific Computing
The paper "A Fortran-Keras Deep Learning Bridge for Scientific Computing" addresses the challenge of integrating modern deep learning techniques, predominantly implemented in Python using high-level frameworks like Keras, with the computationally efficient legacy codebases, often written in Fortran, that underpin large-scale scientific computations. This integration is critical for fields such as climate modeling, computational fluid dynamics, and earthquake simulation, where existing Fortran-based software dominates.
Summary and Technical Innovations
The Fortran-Keras Bridge (FKB) introduced in the paper gives researchers access to advanced deep learning models within the Fortran environment without rewriting large existing Fortran codebases. This is achieved through a two-way bridge between Python and Fortran that allows models to move seamlessly between the two environments.
Key technical innovations in FKB include the ability to translate Keras models, stored as HDF5 files after training, into a format compatible with a neural network implementation in Fortran, and vice versa. This translation goes beyond converting weights and biases: it also reproduces the network architecture, including layers such as dropout and batch normalization, making it possible to deploy sophisticated models.
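To give a feel for what such a translation involves (this is a minimal sketch, not FKB's actual converter or file format), the weights and biases of a stack of dense layers can be dumped to a flat text layout that a Fortran reader could parse with simple formatted reads; the layout here is hypothetical.

```python
import numpy as np

def export_weights_text(layers, path):
    """Write a list of (weight_matrix, bias_vector) pairs to a flat text
    file: a layer count, then per layer a 'rows cols' header followed by
    the flattened weights and the biases, one line each."""
    with open(path, "w") as f:
        f.write(f"{len(layers)}\n")
        for w, b in layers:
            rows, cols = w.shape
            f.write(f"{rows} {cols}\n")
            f.write(" ".join(map(str, w.ravel())) + "\n")
            f.write(" ".join(map(str, b.ravel())) + "\n")

def import_weights_text(path):
    """Read the file back into (weight_matrix, bias_vector) pairs."""
    layers = []
    with open(path) as f:
        n = int(f.readline())
        for _ in range(n):
            rows, cols = map(int, f.readline().split())
            w = np.array(f.readline().split(), dtype=float).reshape(rows, cols)
            b = np.array(f.readline().split(), dtype=float)
            layers.append((w, b))
    return layers
```

A real converter additionally has to carry layer types, activations, and parameters such as dropout rates, but the round-trip idea is the same.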
The FKB Fortran side builds upon the neural-fortran library, adding support for custom layers, customizable loss functions, and ensemble methods. Importantly, the Fortran implementation supports online training, allowing models to be refined directly within computationally intensive simulations, such as those found in climate modeling.
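The ensemble support amounts to running several independently trained networks and combining their outputs. A minimal sketch of that idea (in Python for brevity; the names and the mean-combination rule are illustrative assumptions, not FKB's API):

```python
import numpy as np

def relu(x):
    return np.maximum(x, 0.0)

def forward(x, layers):
    """Tiny dense feed-forward pass: ReLU on hidden layers, linear output.
    `layers` is a list of (weight_matrix, bias_vector) pairs."""
    for i, (w, b) in enumerate(layers):
        x = x @ w + b
        if i < len(layers) - 1:
            x = relu(x)
    return x

def ensemble_predict(x, members):
    """Average the predictions of several independently trained networks."""
    return np.mean([forward(x, m) for m in members], axis=0)
```

Averaging over members typically reduces prediction variance, which is one route to the uncertainty reduction discussed later in the summary.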
Strong Numerical Results and Claims
The paper provides robust numerical results through a case study involving the SPCAM3 climate simulator. Using FKB, the authors conducted an extensive hyperparameter search over 108 candidate neural network architectures to model subgrid-scale physical processes within a global atmospheric context. A clear, quantifiable link is established between offline validation error and online simulation stability, challenging previous assumptions that such correlations might not exist for deep learning emulators in dynamic climate models.
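A search over 108 candidates is what falls out of a small grid of hyperparameter axes. The axes and values below are hypothetical (the paper's exact grid may differ); they merely show how such a candidate list is enumerated:

```python
from itertools import product

# Hypothetical search axes; 3 * 3 * 3 * 2 * 2 = 108 candidates.
depths     = [4, 8, 12]
widths     = [128, 256, 512]
optimizers = ["sgd", "rmsprop", "adam"]
dropouts   = [0.0, 0.25]
batch_norm = [False, True]

configs = [
    {"depth": d, "width": w, "optimizer": o, "dropout": p, "batch_norm": bn}
    for d, w, o, p, bn in product(depths, widths, optimizers, dropouts, batch_norm)
]
```

Each configuration is then trained offline, and its validation error is compared against the stability of the resulting online coupled simulation.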
Moreover, a comprehensive hyperparameter analysis identified the choice of optimizer as a crucial factor impacting model stability in this domain. This insight is particularly significant, as optimizer selection had not been previously highlighted as a primary factor influencing the stability of neural networks in climate applications. The authors demonstrate that models trained using certain optimizers, specifically Adam, are more stable when coupled with complex fluid dynamics simulations.
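For reference, the difference between the optimizers being compared comes down to their update rules. Below is a minimal sketch of plain SGD versus the standard Adam update (Kingma and Ba) on a toy quadratic objective; this illustrates the algorithms only, not the paper's training setup:

```python
import numpy as np

def sgd_step(theta, grad, lr=0.1):
    """Plain gradient descent: step against the raw gradient."""
    return theta - lr * grad

class Adam:
    """Standard Adam update on a single scalar parameter:
    bias-corrected moving averages of the gradient and its square."""
    def __init__(self, lr=0.1, b1=0.9, b2=0.999, eps=1e-8):
        self.lr, self.b1, self.b2, self.eps = lr, b1, b2, eps
        self.m = self.v = 0.0
        self.t = 0

    def step(self, theta, grad):
        self.t += 1
        self.m = self.b1 * self.m + (1 - self.b1) * grad
        self.v = self.b2 * self.v + (1 - self.b2) * grad ** 2
        m_hat = self.m / (1 - self.b1 ** self.t)
        v_hat = self.v / (1 - self.b2 ** self.t)
        return theta - self.lr * m_hat / (np.sqrt(v_hat) + self.eps)

# Minimize f(theta) = theta^2 (gradient 2*theta) with each optimizer.
theta_sgd = theta_adam = 5.0
opt = Adam(lr=0.1)
for _ in range(200):
    theta_sgd = sgd_step(theta_sgd, 2 * theta_sgd)
    theta_adam = opt.step(theta_adam, 2 * theta_adam)
```

Adam's per-parameter step-size normalization is one plausible reason its trained emulators behave differently when coupled to a stiff dynamical system, though the paper's finding is empirical.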
Implications for AI and Scientific Computing
The practical implications of FKB are immediate and substantial for scientific computing fields reliant on Fortran. By enabling these domains to leverage state-of-the-art deep learning models without substantial rewrites of existing code, the paper significantly lowers the barrier to incorporating machine learning into traditional scientific workflows. This advancement could lead to more accurate climate models and better simulations in other scientific domains.
Theoretically, the demonstrated correlation between the offline training performance and the emergent behavior in complex dynamical systems encourages further exploration of ML models in data-driven approximations of physical processes. It provides a new foothold for investigating stability and fidelity considerations of neural networks in hybrid physical-machine learning models.
Future Directions
Future research could expand on the lessons learned from applying hyperparameter optimization in Fortran-based settings, improving robustness across diverse applications. Additionally, leveraging FKB’s ensemble capabilities could open new avenues for reducing uncertainty in simulations, an area of keen interest across scientific disciplines involving complex, multiscale processes.
Expanding the library to accommodate non-traditional layer types or incorporating more advanced optimization schedules within the Fortran environment could further enhance its utility and adaptivity to rapidly evolving deep learning paradigms.
In conclusion, the Fortran-Keras Bridge represents a significant step towards modernizing legacy scientific software, enabling it to harness advances in AI seamlessly. Such integration is poised to bolster computational research with improved predictive capabilities, bridging traditional methodologies with contemporary machine learning advancements.