Papers
Topics
Authors
Recent
Search
2000 character limit reached

RocketStack: Deep Recursive Ensemble

Updated 6 December 2025
  • RocketStack is a deep recursive ensemble learning framework that extends stacking architectures with up to ten layers, integrating predictions through adaptive model pruning and feature compression.
  • The framework leverages recursive stacking, noise-perturbed pruning, and periodic feature compression methods (such as SFE, autoencoder, and attention-based selection) to control computational and feature complexity.
  • Empirical results on binary and multi-class datasets show significant accuracy improvements (up to 6.11% gains) and substantial runtime and feature dimensionality reductions compared to traditional stacking methods.

RocketStack is a level-aware deep recursive ensemble learning framework designed to extend the depth of stacking architectures while controlling computational and feature complexity through adaptive model pruning, feature compression, and stochastic regularization. The methodology systematically advances beyond conventional horizontal diversity in ensemble learning by enabling recursive stacking up to ten levels, thus promoting deeper representational integration across base learners with tractable computational costs (Demirel, 20 Jun 2025).

1. Recursive Stacking Architecture

RocketStack generalizes traditional stacking by constructing a hierarchy of ensemble layers, each integrating predictions from the preceding level through meta-feature concatenation and selective pruning. Let X(0)Rn×dX^{(0)} \in \mathbb{R}^{n \times d} represent the original nn-sample, dd-feature training set, with Xte(0)X_{\rm te}^{(0)} as its hold-out counterpart. At stacking level \ell, the ensemble consists of M(1)={m1,...,mM1}\mathcal{M}^{(\ell-1)} = \{ m_1, ..., m_{M_{\ell-1}} \}, where M1M_{\ell-1} is the number of models retained post-pruning from the previous level.

Each model undergoes KK-fold cross-validation; its concatenated out-of-fold (OOF) predictions form pi()Rnp_i^{(\ell)} \in \mathbb{R}^n. Aggregating predictions across all models yields P()Rn×M1P^{(\ell)} \in \mathbb{R}^{n \times M_{\ell-1}}. The iterative meta-feature expansion is defined as:

nn0

where nn1 denotes column-concatenation and nn2 the vector of hold-out predictions.

Model pruning is performed at each level to ensure nn3. Raw OOF performance scores nn4 (accuracy or AUC) are computed for each nn5, and a custom threshold nn6 is defined as the quantile at nn7, where nn8 may be the raw or noise-perturbed score. Only models meeting nn9 are retained.

2. Pruning Strategies and Feature Compression Mechanisms

A key innovation in RocketStack is the introduction of noise-perturbed pruning and adaptive feature compression, implemented as follows:

Noise-perturbed Pruning

Mild Gaussian noise is added to OOF scores prior to pruning to serve as a regularizer:

dd0

where dd1. Strict (dd2) and randomized (dd3) schemes are compared.

Feature Compression

Feature dimensionality is controlled either at every level or periodically (e.g., levels 3, 6, 9), using one of three compressors:

  • Simple, Fast, Efficient (SFE) Filter: Utility dd4; select features with highest utility.
  • Autoencoder (AE) Compression: Nonlinear reduction using dd5 minimize dd6 with bottleneck dd7.
  • Attention-Based Selection: Compute dd8; keep dd9 with Xte(0)X_{\rm te}^{(0)}0.

A simplified pseudocode of the framework orchestrates the OOF generation, optional feature compression, model evaluation, noise injection, and dynamic pruning per level, with user-specified settings for stacking depth Xte(0)X_{\rm te}^{(0)}1, cross-validation folds Xte(0)X_{\rm te}^{(0)}2, pruning noise Xte(0)X_{\rm te}^{(0)}3, compression mode, compressor type, periodicity, and minimum model count.

3. Model Training, Meta-Learner Pooling, and Computational Complexity

At each level, retained base learners are re-trained on the augmented feature matrix Xte(0)X_{\rm te}^{(0)}4, recursively constructing deeper meta-representations. Rather than a single fixed meta-learner, the ensemble at each level comprises all surviving models Xte(0)X_{\rm te}^{(0)}5, with optional selection of the top-Xte(0)X_{\rm te}^{(0)}6 or the singular top performer for inference.

The computational complexity of each level is dominated by Xte(0)X_{\rm te}^{(0)}7 for cross-validated training, Xte(0)X_{\rm te}^{(0)}8 for filter-based compression (or Xte(0)X_{\rm te}^{(0)}9 for autoencoders), and \ell0 for pruning. Sublinear runtime growth with increasing \ell1 is achieved through aggressive pruning and feature reduction, supporting practical exploration to depths of \ell2.

4. Empirical Evaluation across Binary and Multi-Class Datasets

Experiments across 33 OpenML datasets (23 binary, 10 multi-class) demonstrate the efficacy and scalability of RocketStack:

Binary Classification (Periodic SFE at Levels 3/6/9)

  • Strict pruning (\ell3): 88.08% accuracy at level 10
  • Light randomization (\ell4): 88.40% (+0.32%)
  • Runtime reduction: \ell510.5% compared to no compression
  • Feature count at L10: \ell66 (vs. 177 with no compression)

Multi-Class Classification (Periodic Attention)

  • Strict pruning: 93.29%
  • Light randomization: 93.67% (+0.38%)
  • Ultimate accuracy at L10: 98.60% (vs. 92.49% best baseline; +6.11%)
  • Runtime reduction: \ell756.1% relative to no compression
  • Feature reduction at L10: From \ell8145 to \ell938 (M(1)={m1,...,mM1}\mathcal{M}^{(\ell-1)} = \{ m_1, ..., m_{M_{\ell-1}} \}074%)

Linear mixed model analysis indicates significant accuracy increases with stacking depth in most configurations (M(1)={m1,...,mM1}\mathcal{M}^{(\ell-1)} = \{ m_1, ..., m_{M_{\ell-1}} \}1). Periodic compression schemes yield the strongest trends (M(1)={m1,...,mM1}\mathcal{M}^{(\ell-1)} = \{ m_1, ..., m_{M_{\ell-1}} \}2), while per-level compression often lacks a significant trend (M(1)={m1,...,mM1}\mathcal{M}^{(\ell-1)} = \{ m_1, ..., m_{M_{\ell-1}} \}3).

5. Staged Ensemble Dynamics: The Rocket Analogy

RocketStack is conceptualized around the metaphor of multistage rocket engineering, encapsulated as “Prune – Compress – Propel”:

  • Prune: Analogous to jettisoning empty fuel tanks, underperforming learners are removed to prevent superfluous complexity.
  • Compress: Periodic feature compression parallels stage separation, allowing informative meta-features to accumulate before redundancy is discarded.
  • Propel: Mild Gaussian randomization in pruning induces a controlled instability, analogous to guidance feedback in rocket dynamics, promoting diversity and mitigating the risk of premature convergence.

These coordinated mechanisms facilitate deep recursive ensembling with sustainable complexity, enabling superior predictive performance relative to shallower and horizontally-diverse stacking architectures.

6. Significance and Implementation Considerations

RocketStack establishes a scalable paradigm for deep ensemble integration, demonstrating that controlled regularization and staged dimensionality reduction can overcome saturation and complexity barriers that previously limited the depth of stack-based learning. Its modular design accommodates advances in feature compression, meta-learner architectures, and adaptive pruning for continued empirical and theoretical exploration (Demirel, 20 Jun 2025). The detailed pseudocode and equation definitions provided in the original manuscript enable rigorous reimplementation and comparative benchmarking.

Definition Search Book Streamline Icon: https://streamlinehq.com
References (1)

Topic to Video (Beta)

No one has generated a video about this topic yet.

Whiteboard

No one has generated a whiteboard explanation for this topic yet.

Follow Topic

Get notified by email when new papers are published related to RocketStack.