On the accuracy of interpolation based on single-layer artificial neural networks with a focus on defeating the Runge phenomenon
Abstract: In the present paper, we consider one-hidden-layer ANNs with a feedforward architecture, also referred to as shallow or two-layer networks, so that the structure is determined by the number and types of neurons. The parameters that define the function are determined, in a step called training, by solving the approximation problem, that is, by imposing interpolation through a set of specific nodes. We present the case where the parameters are trained using a procedure referred to as Extreme Learning Machine (ELM), which leads to a linear interpolation problem. Under these hypotheses, the existence of an ANN interpolating function is guaranteed. The focus is then on the accuracy of the interpolation away from the sampling nodes when these are equispaced, Chebychev, or randomly selected. The study is motivated by the well-known bell-shaped Runge example, which makes it clear that a global interpolating polynomial is accurate only if constructed on suitably chosen nodes, for example the Chebychev ones. To evaluate the behavior as the number of interpolation nodes grows, we increase the number of neurons in our network and compare the result with the interpolating polynomial. We test using Runge's function and other well-known examples with different regularities. As expected, the accuracy of the approximation with a global polynomial increases only if the Chebychev nodes are considered. In contrast, the error of the ANN interpolating function always decays, and in most cases we observe that the convergence follows what is observed in the polynomial case on Chebychev nodes, regardless of the set of nodes used for training.
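The comparison described in the abstract can be sketched in a few lines of NumPy. This is a minimal illustration under stated assumptions, not the authors' implementation: the tanh activation, the rule for drawing the hidden weights (slopes uniform in [-n/2, n/2], centers uniform in [-1, 1]), the use of a minimum-norm least-squares solve for the output weights, and all function names are choices made here for illustration. Only the overall setup comes from the abstract: a one-hidden-layer network with as many neurons as nodes, hidden parameters fixed at random, and a linear interpolation problem for the output weights, compared against the global interpolating polynomial on Runge's function.

```python
import numpy as np


def runge(x):
    """Runge's bell-shaped example on [-1, 1]."""
    return 1.0 / (1.0 + 25.0 * x**2)


def equispaced_nodes(n):
    return np.linspace(-1.0, 1.0, n)


def chebyshev_nodes(n):
    # Chebyshev points of the first kind (roots of T_n).
    k = np.arange(n)
    return np.cos((2 * k + 1) * np.pi / (2 * n))


def hidden_layer(x, a, b):
    # Column j holds tanh(a_j * x + b_j) evaluated at every sample in x.
    return np.tanh(np.outer(x, a) + b)


def elm_interpolant(x_nodes, y_nodes, rng):
    """Shallow network with one neuron per node; only output weights are solved."""
    n = len(x_nodes)
    # ASSUMPTION (not from the paper): slopes scale with n, centers lie in [-1, 1].
    a = rng.uniform(-n / 2.0, n / 2.0, size=n)
    c = rng.uniform(-1.0, 1.0, size=n)
    b = -a * c  # so that neuron j is tanh(a_j * (x - c_j))
    H = hidden_layer(x_nodes, a, b)  # square collocation matrix
    # The matrix can be badly conditioned, so use a minimum-norm
    # least-squares solve instead of a direct linear solve.
    w, *_ = np.linalg.lstsq(H, y_nodes, rcond=None)

    def evaluate(x):
        return hidden_layer(np.asarray(x, dtype=float), a, b) @ w

    return evaluate


def poly_interpolant(x_nodes, y_nodes):
    # Degree n-1 fit through n nodes is the interpolating polynomial;
    # the Chebyshev basis is better conditioned than monomials.
    deg = len(x_nodes) - 1
    return np.polynomial.Chebyshev.fit(x_nodes, y_nodes, deg, domain=[-1.0, 1.0])


def max_error(approx, x_test):
    return float(np.max(np.abs(approx(x_test) - runge(x_test))))


n = 21
x_test = np.linspace(-1.0, 1.0, 2001)
rng = np.random.default_rng(0)

errors = {}
for name, nodes in [("equispaced", equispaced_nodes(n)),
                    ("chebyshev", chebyshev_nodes(n))]:
    y = runge(nodes)
    errors[name] = {
        "poly": max_error(poly_interpolant(nodes, y), x_test),
        "elm": max_error(elm_interpolant(nodes, y, rng), x_test),
    }
    print(f"{name:>10}: poly={errors[name]['poly']:.3e}  elm={errors[name]['elm']:.3e}")
```

With 21 equispaced nodes the polynomial error is large near the endpoints (the Runge phenomenon), while on Chebychev nodes it is small. The ELM errors depend on the random draw and on the assumed sampling rule, so the printed values should be read as indicative only. Increasing `n` in steps and re-running the loop gives a rough picture of the convergence behavior the abstract reports.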