- The paper presents SCNs, a novel class of randomized neural networks that use a supervisory mechanism for adaptive input weight and bias selection.
- It details three algorithms—SC-I, SC-II, and SC-III—that compute output weights through constructive, local least squares, and global least squares approaches.
- Simulation results show that SCNs achieve accelerated learning and superior accuracy in both regression and classification compared to traditional methods.
Insights into Stochastic Configuration Networks: An Analytical Overview
The paper "Stochastic Configuration Networks: Fundamentals and Algorithms" by Dianhui Wang and Ming Li presents advancements in the area of randomized neural network learning algorithms. The authors introduce the concept of Stochastic Configuration Networks (SCNs), offering an incremental approach to learning that contrasts with existing randomized techniques for single layer feed-forward neural networks (SLFNNs). Their novel approach involves stochastic configuration (SC) algorithms that utilize a supervisory mechanism to randomly determine input weights and biases for hidden nodes, while output weights are analytically computed either constructively or selectively.
Theoretical Foundations
A notable contribution of this paper is the establishment of theoretical results on the universal approximation property of SCNs. The authors present three variations of SC algorithms designed for regression tasks that are equally applicable to classification. The crucial theoretical underpinning is an inequality constraint, evaluated on the current residual error, that screens candidate hidden nodes and adaptively sets the scope from which their random parameters are drawn. This constraint guarantees the universal approximation capability of the constructed models and distinguishes SCNs from random vector functional-link (RVFL) networks, whose random parameters are typically drawn from a fixed, data-independent range.
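For reference, the acceptance condition for a candidate hidden node $g_L$ takes roughly the following form (stated here from the paper's notation as best as can be reconstructed, with current residual $e_{L-1}$, contraction factor $0 < r < 1$, and a nonnegative tolerance sequence $\mu_L \le 1 - r$):

$$
\xi_{L,q} \;=\; \frac{\langle e_{L-1,q},\, g_L \rangle^{2}}{\langle g_L,\, g_L \rangle} \;-\; (1 - r - \mu_L)\,\lVert e_{L-1,q} \rVert^{2} \;\ge\; 0,
\qquad q = 1, \dots, m,
$$

where $m$ is the number of output components. A candidate node is added only if the inequality holds, and the range from which its weights and bias are drawn is enlarged when no candidate passes, which is how the scope of the random parameters adapts to the data.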
Algorithmic Innovation
Three SC algorithms are described: SC-I, SC-II, and SC-III. Each employs the same supervisory mechanism but differs in how the output weights are computed. SC-I uses a constructive scheme that leaves previously computed output weights unchanged and evaluates only the weight of the newly added node. SC-II recalculates a subset of the existing output weights by solving a local least squares problem over a user-defined window, while SC-III solves a global least squares problem to update all output weights simultaneously, as sketched below.
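To make this division of labour concrete, here is a hedged NumPy sketch of an SC-III-style constructive loop: candidate nodes are drawn from a pool, screened with the supervisory inequality (summed over output dimensions here, a simplification of the per-output check), and all output weights are then refit by global least squares. Function and parameter names, the tanh activation, the fixed sampling scope, and the simple choice of the tolerance sequence are illustrative assumptions; the paper additionally enlarges the sampling scope when no candidate passes, which is omitted here.

```python
import numpy as np

def scn_sc3_sketch(X, T, max_nodes=50, candidates=100, scope=1.0,
                   r=0.9, tol=1e-3, seed=0):
    """Hypothetical SC-III-style loop: add hidden nodes one at a time,
    accept only candidates passing a supervisory residual inequality,
    then refit all output weights by global least squares."""
    rng = np.random.default_rng(seed)
    n, d = X.shape
    H = np.empty((n, 0))                       # hidden-layer output matrix
    beta = np.zeros((0, T.shape[1]))           # output weights (empty to start)
    e = T.copy()                               # current residual
    Ws, bs = [], []
    for L in range(1, max_nodes + 1):
        mu = (1.0 - r) / (L + 1)               # one simple tolerance sequence
        best_xi, best = -np.inf, None
        for _ in range(candidates):
            w = rng.uniform(-scope, scope, size=d)
            b = rng.uniform(-scope, scope)
            g = np.tanh(X @ w + b)
            # supervisory inequality, summed over output components
            xi = np.sum((e.T @ g) ** 2) / (g @ g) - (1.0 - r - mu) * np.sum(e ** 2)
            if xi > best_xi:
                best_xi, best = xi, (w, b, g)
        if best_xi < 0:                        # no candidate passed (paper would widen scope)
            break
        w, b, g = best
        Ws.append(w); bs.append(b)
        H = np.column_stack([H, g])
        beta, *_ = np.linalg.lstsq(H, T, rcond=None)   # global refit of all output weights
        e = T - H @ beta
        if np.linalg.norm(e) < tol:
            break
    return np.array(Ws), np.array(bs), beta
```

Under the same skeleton, SC-I would keep the previously computed output weights fixed and assign only the new node's weight, and SC-II would refit only the weights inside a sliding window of recent nodes.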
Experimental Validation
Simulation results emphasize the merits of SCNs in both function approximation and real-world data regression tasks. Key advantages highlighted include reduced human intervention in sizing the network, an adaptively chosen scope for the random parameters, fast learning, and sound generalization. The authors evaluate the SC algorithms on several benchmark datasets, demonstrating better learning efficiency and accuracy than existing methods such as Modified Quickprop (MQ) and incremental RVFL (IRVFL).
Implications and Future Prospects
The implications of this research extend both practically and theoretically. Practically, SCNs offer a scalable solution for data modeling in situations with varying data distribution characteristics. Theoretical contributions lie in resolving limitations inherent in existing randomized learning frameworks, presenting a pathway towards more effective and efficient neural network architectures.
The paper leaves room for further exploration into areas such as robustness analysis under various conditions, potential applications of SCNs in deep learning for complex data representation, and extending the framework towards ensemble and online learning paradigms. These directions suggest a broad horizon for the application of SC frameworks in enhancing the robustness and adaptability of AI models across diverse computational environments.