Randomized Memetic ABC (RMABC) Algorithm
- The paper introduces RMABC, which integrates memetic strategies with randomized perturbations to enhance convergence and solution stability in multi-objective optimization.
- It employs expert-guided search mechanisms with an aging strategy and adaptive control signals to balance global exploration and local refinement.
- Empirical results on curriculum sequencing benchmarks demonstrate RMABC’s superior efficiency and improved optimization metrics compared to traditional methods.
The Expert-Guided Memetic Walrus Optimizer (MWO) is an advanced evolutionary optimization algorithm designed for Adaptive Curriculum Sequencing (ACS), a multi-objective problem situated in personalized online learning environments. MWO integrates an expert-guided strategy with an agent aging mechanism, a nonlinear adaptive control signal framework, and a hierarchical educational priority mechanism to produce high-quality, stable, and contextually meaningful curriculum sequences. Empirical validation on the OULAD dataset and standard optimization benchmarks establishes its superior optimization stability, curriculum relevance metrics, and generalization to complex multi-objective scenarios (Huang et al., 16 Jun 2025).
1. Multi-Objective Optimization Formulation
MWO formulates ACS as a multi-objective binary selection problem, where the selection of educational materials for students and materials is encoded as . For each student, the overall loss function is
where:
- : concept coverage/redundancy, penalizing uncovered required concepts and minimally penalizing redundant coverage,
with , .
- : time constraint violation,
- 0: learning-style compatibility,
1
where 2 is the 3-th FSLSM coordinate of the student, 4 that of material 5.
Typical hyperparameter settings: 6, 7, 8, 9. This combination strongly penalizes missing concepts, with lower cost for redundancy or time/style mismatches.
2. Expert-Guided Strategy and Aging Mechanism
MWO maintains an expert pool comprising all agents, assigning each an exponentially decaying influence weight based on age-in-population. For agent 0: 1
2
For updating, better-fitness experts 3 are sampled with probability 4, and the update is
5
This mechanism improves exploitation and avoids premature convergence by favouring recent successful search directions.
3. Adaptive Control Signal Framework
MWO employs two nonlinear control signals per iteration:
- Danger signal 6: Promotes exploration in early search phases.
7
- Safety signal 8: Gradually increases exploitation.
9
Update strategies: 1. If 0: Migration update.
1
- Else if 2:
- Top-3 ("males"): Halton-based global moves.
- Others: Local search around 4.
- Else: Combine best and second-best positions:
5
6
7
This regime facilitates a dynamic and data-driven balance between global exploration and local refinement.
4. Three-Tier Curriculum Sequencing and Priority Assignment
After selecting optimal material subsets, MWO applies a hierarchical sequencing:
- High priority (8): Full concept coverage, within student ability, prioritized by prerequisite strength,
9
- Medium priority (0): Partial coverage, matched to ability,
1
- Challenge set (2): Slightly above ability, promotes progress,
3
- Final sequence score:
4
Materials are sorted by descending 5, ensuring prerequisite constraints.
5. Algorithmic Structure and Computational Complexity
MWO’s control flow follows the pseudocode:
1
With recommended parameters: population 6, iterations 7, materials 8. Per-iteration complexity is 9, overall 0, dominated by fitness computation and sequence sorting.
6. Empirical Validation and Performance Analysis
Experimental Results on OULAD
MWO demonstrates superior convergence stability: | Algorithm | Avg. fitness | Std. deviation | |-----------|--------------|----------------| | MWO | 598.48 | 18.02 | | WO | 641.62 | 28.29 | | SCSO | 814.85 | 329.43 | | SOA | 972.65 | 422.11 | | PEOA | 866.35 | 315.62 |
Difficulty Progression Rate (DPR), the fraction of consecutive curriculum materials with non-decreasing difficulty:
- MWO: 1
- WO: 2
- SCSO: 3
- SOA: 4
- PEOA: 5
For a representative student (S1): | Algorithm | Coverage (%) | DPR (%) | Alignment (%) | First 6 Materials | |-----------|-------------|---------|--------------|-----------------------| | MWO | 100.0 | 90.7 | 100.0 | 28→90→61→76→35→47… | | WO | 100.0 | 83.3 | 96.7 | 28→90→133→76→47→65… |
MWO converges in 6 iterations (7s runtime), WO in 8 iterations (9s).
Standard Benchmark Function Validation
MWO achieves high performance and statistical superiority (Wilcoxon test, 0) on 8/9 functions versus SCSO and PEOA, and on 7/9 versus SOA. Notably, MWO attains solutions with standard deviation near zero on unimodal functions, and matches or outperforms state-of-the-art on multimodal and hybrid benchmarks.
7. Synthesis and Impact
MWO synergizes three distinct innovations: (1) an expert-guided, aging-weighted search dynamic that improves exploitation while preventing premature convergence; (2) a nonlinear, data-driven adaptive control framework for exploration–exploitation trade-off; and (3) a multi-level, pedagogically motivated priority sequencing mechanism ensuring educational relevance and adherence to constraints.
The empirical findings demonstrate MWO’s efficacy in generating curriculum sequences with high concept coverage, realistic progression, learner-style alignment, and robust optimization stability. Its framework generalizes robustly to domain-agnostic optimization problems, suggesting potential for broader application in constrained multi-objective optimization domains where solution stability and nuanced prioritization are critical (Huang et al., 16 Jun 2025).