Simplified Predictive Local Rule (SPLR)
- SPLR is a biologically inspired local learning algorithm that uses prediction errors to trigger sparse, binary updates in Extreme Learning Machines.
- It replaces costly global backpropagation with per-sample, O(M) updates, significantly reducing computational and memory complexity.
- Designed for resource-constrained environments, SPLR is ideal for FPGA and edge device implementations where energy efficiency and real-time processing are critical.
The Simplified Predictive Local Rule (SPLR) is a biologically inspired learning algorithm designed for extreme learning machines (ELMs) that eschews global backpropagation and intensive matrix operations in favor of local, prediction-driven, sparse, and hardware-efficient synaptic updates. SPLR operates in an online regimen with per-sample updates that are explicitly triggered only by prediction errors, resulting in significant reductions in computational and memory complexity. Its local and binary update mechanism makes it particularly suitable for logic-only implementation in resource-constrained environments and edge devices (Zang et al., 25 Dec 2025).
1. Mathematical Formulation and Derivation
SPLR is embedded within the standard ELM framework, which consists of a fixed, randomly projected hidden layer and a linear readout. The principal steps of the SPLR formulation are as follows:
A. Hidden-layer Activation and Output Prediction:
Let be the input vector, the fixed random projection, and the bias. The hidden state is computed as:
where denotes the Heaviside (step) function. The output score for each class is obtained via a linear readout:
B. Loss and Local Gradients:
A winner-take-all (WTA) loss penalizes the gap between the predicted class and the true class :
The gradient with respect to the output weights is,
0
C. Discrete, Sparse Update Rule:
The SPLR simplifies the update to trigger only when 1, absorbing the error term’s sign into a binary learning rate 2:
3
Weights are clipped into 4 after update. No updates occur for correct predictions.
2. Online Algorithmic Implementation
SPLR operates in an incremental, single-pass (online) manner. The algorithm proceeds as:
- Compute hidden activations: 5
- Compute output vector: 6
- Determine predicted class: 7
- On error (8), update only weights linked to classes 9 and 0:
- 1
- 2
This differs fundamentally from standard ELM, which amasses the entire hidden response matrix 3 and computes the optimal output weights in a batch, closed-form inversion,
4
with 5 cost. SPLR requires only 6 time per misclassified sample, operating in a streaming, error-driven paradigm (Zang et al., 25 Dec 2025).
3. Computational Complexity
The breakdown of computational costs is as follows:
| Operation | Standard ELM | SPLR–ELM |
|---|---|---|
| Training matrix inversion | 7 | -- |
| Per-sample learning | 8 (in batch accumulation) | 9 |
| Total (per 0 samples) | 1 | 2 |
For large hidden layers (3), SPLR’s per-sample complexity is linear in 4, while standard ELM is cubic. This constitutes a major advantage for memory- and compute-constrained scenarios. The dominant update in SPLR is binary-driven vector addition to two output weight columns during errors (Zang et al., 25 Dec 2025).
4. Accuracy and Convergence Characteristics
On the MNIST dataset (5,000 training / 1,000 test samples, 5 hidden units):
- Standard ELM achieves 6 training and 7 test accuracy.
- SPLR-ELM reaches 8 training (9 absolute) and 0 test accuracy (1 absolute).
SPLR typically stabilizes within a single online pass, whereas standard ELM yields an exact, batch-optimal result but is not suited for streaming input. This suggests SPLR is well-matched to online or continual learning deployments (Zang et al., 25 Dec 2025).
5. FPGA Implementation and Resource Utilization
The SPLR rule is amenable to highly parallel and energy-efficient hardware implementation:
- Hidden-Layer Neuron Modules (HN): Each composed of a pseudo-random number generator (LFSR) for on-the-fly 2 reconstruction, multiply-accumulate logic for projections, comparator for thresholding, and serial shift register output.
- Output-Layer Neuron Modules (ON): For each class, prediction is accumulated, and, upon mistaken predictions, weights are updated in external BRAM.
- Pipeline: Data are streamed in, hidden activations are serialized, outputs are computed, and weight updates are performed via parallel BRAM access.
On a ZCU104 Ultrascale+ running at 224 MHz with 3 hidden units, resource and throughput metrics are:
| Metric | Value |
|---|---|
| LUTs | 205,000 |
| FFs | 158,000 |
| DSPs | 1,700 |
| BRAM | 5 blocks |
| Max Freq | 224 MHz |
| Training FPS | 63,454 |
| Inference FPS | 122,336 |
| Power | 4 W |
| Power Eff. | 5 FPS/W |
Compared to R-STDP and STDP-based FPGA cores, SPLR-ELM realizes 6 FPS in training and 7 FPS in inference at comparable or lower per-neuron footprint (Zang et al., 25 Dec 2025).
6. Biological Motivation and Implications for Edge AI
SPLR is distilled from the predictive learning rule (PLR) of Saponati & Vinck (2023), derived from models of cortical predictive coding and spike-timing-dependent plasticity (STDP). Updates are strictly local, depending only on the presynaptic spike (8) and winner/loser class indicators, mirroring synaptic changes driven by mismatch (prediction error) signals found in neurobiology.
The update rule does not require global synchronization, backpropagation, or eligibility traces, thereby enabling efficient, online, and hardware-friendly learning. This makes SPLR highly applicable to energy-constrained, always-on edge devices such as vision sensors and anomaly detectors, where continuous adaptation with minimal overhead is essential (Zang et al., 25 Dec 2025).