Calibrating multi-dimensional complex ODE from noisy data via deep neural networks (2106.03591v2)
Abstract: Ordinary differential equations (ODEs) are widely used to model complex dynamics that arises in biology, chemistry, engineering, finance, physics, etc. Calibration of a complicated ODE system using noisy data is generally very difficult. In this work, we propose a two-stage nonparametric approach to address this problem. We first extract the de-noised data and their higher order derivatives using boundary kernel method, and then feed them into a sparsely connected deep neural network with ReLU activation function. Our method is able to recover the ODE system without being subject to the curse of dimensionality and complicated ODE structure. When the ODE possesses a general modular structure, with each modular component involving only a few input variables, and the network architecture is properly chosen, our method is proven to be consistent. Theoretical properties are corroborated by an extensive simulation study that demonstrates the validity and effectiveness of the proposed method. Finally, we use our method to simultaneously characterize the growth rate of Covid-19 infection cases from 50 states of the USA.
- Neural machine translation by jointly learning to align and translate. arXiv preprint arXiv:1409.0473.
- On deep learning as a remedy for the curse of dimensionality in nonparametric regression. Ann. Statist. 47(4), 2261–2285.
- Bauschke, H. H. and P. L. Combettes (2017). Convex Analysis and Monotone Operator Theory in Hilbert Spaces (2nd ed.). Springer Publishing Company, Incorporated.
- Benson, M. (1979). Parameter fitting in dynamic models. Ecological Modelling 6(2), 97 – 115.
- Bayesian two-step estimation in differential equation models. Electron. J. Statist. 9(2), 3124–3154.
- Nonlinear parameter estimation: A case study comparison. AIChE Journal 32(1), 29–45.
- Numerical analysis. Cengage learning.
- A first-order primal-dual algorithm for convex problems with applications to imaging. Journal of mathematical imaging and vision 40, 120–145.
- Data-driven discovery of coordinates and governing equations. Proceedings of the National Academy of Sciences 116(45), 22445–22451.
- Neural ordinary differential equations. In S. Bengio, H. Wallach, H. Larochelle, K. Grauman, N. Cesa-Bianchi, and R. Garnett (Eds.), Advances in Neural Information Processing Systems, Volume 31, pp. 6571–6583. Curran Associates, Inc.
- Network reconstruction from high-dimensional ordinary differential equations. Journal of the American Statistical Association 112(520), 1697–1707. PMID: 29618851.
- Deep neural network approximation theory. IEEE Transactions on Information Theory 67(5), 2581–2623.
- Deep neural networks for estimation and inference. Econometrica 89(1), 181–213.
- Nonparametric functional data analysis: theory and practice. Springer Science & Business Media.
- Gasser, T. and H.-G. Müller (1984). Estimating regression functions and their derivatives by the kernel method. Scandinavian Journal of Statistics 11(3), 171–185.
- Kernels for nonparametric curve estimation. Journal of the Royal Statistical Society. Series B (Methodological) 47(2), 238–252.
- Quick and easy one-step parameter estimation in differential equations. Journal of the Royal Statistical Society: Series B (Statistical Methodology) 76(4), 735–748.
- Hammer, B. (2000). On the approximation capability of recurrent neural networks. Neurocomputing 31(1), 107 – 123.
- Deep iv: A flexible approach for counterfactual prediction. In International Conference on Machine Learning, pp. 1414–1423. PMLR.
- Deep residual learning for image recognition. In 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR), pp. 770–778.
- Network reconstruction using nonparametric additive ode models. PLOS ONE 9(4), 1–15.
- An ecological theory of changing human population dynamics. People and Nature 1(1), 31–43.
- Normalizing flows: An introduction and review of current methods. IEEE Transactions on Pattern Analysis and Machine Intelligence, 1–1.
- On the rate of convergence of fully connected deep neural network regression estimates. The Annals of Statistics 49(4), 2231 – 2249.
- Lassonet: A neural network with feature sparsity. Journal of Machine Learning Research 22(127), 1–29.
- Li, K. (2022). Variable selection for nonlinear cox regression model via deep learning. arXiv preprint arXiv:2211.09287.
- Deep feature screening: Feature selection for ultra high-dimensional data via deep neural networks. Neurocomputing 538, 126186.
- Semiparametric regression for spatial data via deep learning. arXiv preprint arXiv:2301.03747.
- Deep learning via dynamical systems: An approximation perspective. arXiv preprint arXiv:1912.10382.
- Feature screening via distance correlation learning. Journal of the American Statistical Association 107(499), 1129–1139.
- On the curse of memory in recurrent neural networks: Approximation and optimization analysis. arXiv preprint arXiv:2009.07799.
- Parameter estimation for differential equation models using a framework of measurement error in regression models. Journal of the American Statistical Association 103(484), 1570–1583. PMID: 19956350.
- Optimal nonparametric inference via deep neural network. Journal of Mathematical Analysis and Applications 505(2), 125561.
- On deep instrumental variables estimate. arXiv preprint arXiv:2004.14954.
- Deep network approximation for smooth functions. SIAM Journal on Mathematical Analysis 53(5), 5465–5506.
- High-dimensional odes coupled with mixed-effects modeling techniques for dynamic gene regulatory network identification. Journal of the American Statistical Association 106(496), 1242–1258. PMID: 23204614.
- Deep learning for universal linear embeddings of nonlinear dynamics. Nat Commun 9, 4950.
- Transformed l1 regularization for learning sparse deep neural networks. Neural Networks 119, 286–298.
- Deep learning in bioinformatics. Briefings in bioinformatics 18(5), 851–869.
- Nonparametric estimation of dynamics of monotone trajectories. The Annals of Statistics 44(6), 2401–2432.
- Non-parametric function fitting. Journal of the Royal Statistical Society: Series B (Methodological) 34(3), 385–392.
- Nonparametric regression using deep neural networks with relu activation function. Ann. Statist. 48(4), 1875–1897.
- Stone, C. J. (1985). Additive regression and other nonparametric models. The annals of Statistics, 689–705.
- Modelling temporal biomarkers with semiparametric nonlinear dynamical systems. Biometrika 108(1), 199–214.
- Control of chaotic systems: Application to the lorenz equations. In Nonlinear Vibrations, American Society of Mechanical Engineers, Design Engineering Division (Publication) DE, pp. 47–58. Publ by ASME. Winter Annual Meeting of the American Society of Mechanical Engineers ; Conference date: 08-11-1992 Through 13-11-1992.
- Estimation of the mean function of functional data via deep neural networks. Stat 10(1), e393.
- Weinan, E. (2017). A proposal on machine learning via dynamical systems. Communications in Mathematics and Statistics 5(1), 1–11.
- Parameter estimation and variable selection for big systems of linear ordinary differential equations: A matrix-based approach. Journal of the American Statistical Association 114(526), 657–667.
- Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1), 1–38.
- On the selection of ordinary differential equation models with application to predator-prey dynamical models. Biometrics 71(1), 131–138.
- An iterative approach to distance correlation-based sure independence screening. Journal of Statistical Computation and Simulation 85(11), 2331–2345.
- Kexuan Li (16 papers)
- Fangfang Wang (16 papers)
- Ruiqi Liu (51 papers)
- Fan Yang (878 papers)
- Zuofeng Shang (48 papers)