Efficient, Accurate and Stable Gradients for Neural ODEs (2410.11648v2)

Published 15 Oct 2024 in cs.LG and stat.ML

Abstract: Training Neural ODEs requires backpropagating through an ODE solve. The state-of-the-art backpropagation method is recursive checkpointing that balances recomputation with memory cost. Here, we introduce a class of algebraically reversible ODE solvers that significantly improve upon both the time and memory cost of recursive checkpointing. The reversible solvers presented calculate exact gradients, are high-order and numerically stable -- strictly improving on previous reversible architectures.

Citations (1)

View on Semantic Scholar