Papers
Topics
Authors
Recent
Search
2000 character limit reached

Expert-Guided Memetic Walrus Optimizer

Updated 29 May 2026
  • The paper formulates adaptive curriculum sequencing as a multi-objective combinatorial optimization problem balancing concept coverage, time constraints, and learning-style compatibility.
  • It employs an expert-guided, aging-weighted exploitation strategy combined with dual adaptive control signals to dynamically manage exploration and exploitation.
  • A three-tier priority material sequencing mechanism ensures pedagogical validity by optimizing learning progression and adherence to prerequisite structures.

The Expert-Guided Memetic Walrus Optimizer (MWO) is a population-based metaheuristic designed for constrained, multi-objective optimization in adaptive curriculum sequencing (ACS) settings. MWO integrates memetic exploitation with domain-specific guidance, innovative adaptive exploration control, and a pedagogically informed sequence prioritization strategy to optimize learning material selection for personalized education scenarios.

1. Multi-Objective Problem Formulation

MWO formulates the ACS task as a multi-objective combinatorial optimization problem. The learning material selection for TsT_s students from TmT_m candidates is represented as x{0,1}Tmx \in \{0,1\}^{T_m}. The objective function is a weighted sum over three pedagogically critical criteria: minF(x)=ω1O1(x)+ω2O2(x)+ω3O3(x)\min F(x) = \omega_1\,\mathcal{O}_1(x) + \omega_2\,\mathcal{O}_2(x) + \omega_3\,\mathcal{O}_3(x) where:

  • Concept Coverage/Redundancy:

O1=ε1(RRE)+ε2(ERE)\mathcal{O}_1 = \varepsilon_1 \bigl(|\mathcal{R}|-|\mathcal{R} \cap \mathcal{E}|\bigr) + \varepsilon_2 \bigl(|\mathcal{E}|-|\mathcal{R} \cap \mathcal{E}|\bigr)

with R\mathcal{R} as the union of concepts covered by selected materials, E\mathcal{E} as the required concepts. Penalty weight ε2ε1\varepsilon_2 \gg \varepsilon_1 prioritizes omission over redundancy.

  • Time Constraint Violation:

O2={ε3,j=1TmxjTsj[T,T] 0,otherwise\mathcal{O}_2 = \begin{cases} \varepsilon_3, & \sum_{j=1}^{T_m} x_j Ts_j \notin [\underline{T}, \overline{T}] \ 0, & \text{otherwise} \end{cases}

  • Learning-Style Compatibility:

O3=j=1Tmxjk=14pkpmj,k\mathcal{O}_3 = \sum_{j=1}^{T_m} x_j \sum_{k=1}^4 |p_k - pm_{j,k}|

where TmT_m0, TmT_m1 are FSLSM-style coordinates of the student and material TmT_m2.

Experimental settings typically use TmT_m3, TmT_m4, TmT_m5, TmT_m6 (Huang et al., 16 Jun 2025).

2. Expert-Guided Search with Aging

A core innovation in MWO is the expert-guided, aging-weighted exploitation strategy. Unlike conventional metaheuristics, MWO maintains an expert pool and models each individual’s influence as a decaying exponential function of “age” (iterations since last being top-2). The update mechanism includes:

  • Aging:
    • TmT_m7 each iteration; reset to zero if TmT_m8 is top-2.
    • Influence TmT_m9 if x{0,1}Tmx \in \{0,1\}^{T_m}0, zero otherwise, with x{0,1}Tmx \in \{0,1\}^{T_m}1.
  • Guided Update:
    • For agent x{0,1}Tmx \in \{0,1\}^{T_m}2, select expert x{0,1}Tmx \in \{0,1\}^{T_m}3 (from x{0,1}Tmx \in \{0,1\}^{T_m}4) with probability x{0,1}Tmx \in \{0,1\}^{T_m}5.
    • Position update:

    x{0,1}Tmx \in \{0,1\}^{T_m}6

This mechanism allows for informed, diversity-enhanced exploitation and mitigates stagnation in local optima (Huang et al., 16 Jun 2025).

3. Adaptive Control Signal Framework

MWO employs a dual "danger/safety" signal system for dynamically balancing exploration and exploitation:

  • Danger Signal (x{0,1}Tmx \in \{0,1\}^{T_m}7): Primarily active in early iterations, calculated as x{0,1}Tmx \in \{0,1\}^{T_m}8, where x{0,1}Tmx \in \{0,1\}^{T_m}9 and minF(x)=ω1O1(x)+ω2O2(x)+ω3O3(x)\min F(x) = \omega_1\,\mathcal{O}_1(x) + \omega_2\,\mathcal{O}_2(x) + \omega_3\,\mathcal{O}_3(x)0. High minF(x)=ω1O1(x)+ω2O2(x)+ω3O3(x)\min F(x) = \omega_1\,\mathcal{O}_1(x) + \omega_2\,\mathcal{O}_2(x) + \omega_3\,\mathcal{O}_3(x)1 triggers migration steps to preserve diversity.

  • Safety Signal (minF(x)=ω1O1(x)+ω2O2(x)+ω3O3(x)\min F(x) = \omega_1\,\mathcal{O}_1(x) + \omega_2\,\mathcal{O}_2(x) + \omega_3\,\mathcal{O}_3(x)2): Becomes prominent as the search progresses, defined by

minF(x)=ω1O1(x)+ω2O2(x)+ω3O3(x)\min F(x) = \omega_1\,\mathcal{O}_1(x) + \omega_2\,\mathcal{O}_2(x) + \omega_3\,\mathcal{O}_3(x)3

with minF(x)=ω1O1(x)+ω2O2(x)+ω3O3(x)\min F(x) = \omega_1\,\mathcal{O}_1(x) + \omega_2\,\mathcal{O}_2(x) + \omega_3\,\mathcal{O}_3(x)4.

Update actions depend on (minF(x)=ω1O1(x)+ω2O2(x)+ω3O3(x)\min F(x) = \omega_1\,\mathcal{O}_1(x) + \omega_2\,\mathcal{O}_2(x) + \omega_3\,\mathcal{O}_3(x)5, minF(x)=ω1O1(x)+ω2O2(x)+ω3O3(x)\min F(x) = \omega_1\,\mathcal{O}_1(x) + \omega_2\,\mathcal{O}_2(x) + \omega_3\,\mathcal{O}_3(x)6) values, modulating between migration, global search (with Halton sequences), local refinement, and best-case recombination to adjust search breadth as needed (Huang et al., 16 Jun 2025).

4. Three-Tier Priority Material Sequencing

Post-optimization, MWO introduces a three-tier mechanism for sequencing selected learning materials:

  • High Priority: Materials fully covering required concepts and within estimated ability, score minF(x)=ω1O1(x)+ω2O2(x)+ω3O3(x)\min F(x) = \omega_1\,\mathcal{O}_1(x) + \omega_2\,\mathcal{O}_2(x) + \omega_3\,\mathcal{O}_3(x)7 (prerequisite-weighted).

  • Medium Priority: Within ability but only partially covering concepts, minF(x)=ω1O1(x)+ω2O2(x)+ω3O3(x)\min F(x) = \omega_1\,\mathcal{O}_1(x) + \omega_2\,\mathcal{O}_2(x) + \omega_3\,\mathcal{O}_3(x)8, minF(x)=ω1O1(x)+ω2O2(x)+ω3O3(x)\min F(x) = \omega_1\,\mathcal{O}_1(x) + \omega_2\,\mathcal{O}_2(x) + \omega_3\,\mathcal{O}_3(x)9.

  • Challenge: Materials slightly above current ability, O1=ε1(RRE)+ε2(ERE)\mathcal{O}_1 = \varepsilon_1 \bigl(|\mathcal{R}|-|\mathcal{R} \cap \mathcal{E}|\bigr) + \varepsilon_2 \bigl(|\mathcal{E}|-|\mathcal{R} \cap \mathcal{E}|\bigr)0, O1=ε1(RRE)+ε2(ERE)\mathcal{O}_1 = \varepsilon_1 \bigl(|\mathcal{R}|-|\mathcal{R} \cap \mathcal{E}|\bigr) + \varepsilon_2 \bigl(|\mathcal{E}|-|\mathcal{R} \cap \mathcal{E}|\bigr)1 (progress-indexed).

Final score: O1=ε1(RRE)+ε2(ERE)\mathcal{O}_1 = \varepsilon_1 \bigl(|\mathcal{R}|-|\mathcal{R} \cap \mathcal{E}|\bigr) + \varepsilon_2 \bigl(|\mathcal{E}|-|\mathcal{R} \cap \mathcal{E}|\bigr)2 with O1=ε1(RRE)+ε2(ERE)\mathcal{O}_1 = \varepsilon_1 \bigl(|\mathcal{R}|-|\mathcal{R} \cap \mathcal{E}|\bigr) + \varepsilon_2 \bigl(|\mathcal{E}|-|\mathcal{R} \cap \mathcal{E}|\bigr)3. Materials are sorted by descending O1=ε1(RRE)+ε2(ERE)\mathcal{O}_1 = \varepsilon_1 \bigl(|\mathcal{R}|-|\mathcal{R} \cap \mathcal{E}|\bigr) + \varepsilon_2 \bigl(|\mathcal{E}|-|\mathcal{R} \cap \mathcal{E}|\bigr)4, enforcing prerequisite graph constraints to yield the learning sequence. This structuring explicitly maintains pedagogical validity while optimizing for progression and prerequisite adherence (Huang et al., 16 Jun 2025).

5. Workflow and Computational Complexity

The overall MWO algorithm involves initializing a population of candidate solutions, iteratively updating positions based on the adaptive expert-guided scheme and control signals, continuously tracking best solutions, and generating the final learning sequence by the priority mechanism.

Abridged pseudocode structure:

E\mathcal{E}6

Each iteration incurs O1=ε1(RRE)+ε2(ERE)\mathcal{O}_1 = \varepsilon_1 \bigl(|\mathcal{R}|-|\mathcal{R} \cap \mathcal{E}|\bigr) + \varepsilon_2 \bigl(|\mathcal{E}|-|\mathcal{R} \cap \mathcal{E}|\bigr)5 time, dominated by fitness evaluations (O1=ε1(RRE)+ε2(ERE)\mathcal{O}_1 = \varepsilon_1 \bigl(|\mathcal{R}|-|\mathcal{R} \cap \mathcal{E}|\bigr) + \varepsilon_2 \bigl(|\mathcal{E}|-|\mathcal{R} \cap \mathcal{E}|\bigr)6) and sequence sorting, yielding an overall time complexity of O1=ε1(RRE)+ε2(ERE)\mathcal{O}_1 = \varepsilon_1 \bigl(|\mathcal{R}|-|\mathcal{R} \cap \mathcal{E}|\bigr) + \varepsilon_2 \bigl(|\mathcal{E}|-|\mathcal{R} \cap \mathcal{E}|\bigr)7. Standard configuration: O1=ε1(RRE)+ε2(ERE)\mathcal{O}_1 = \varepsilon_1 \bigl(|\mathcal{R}|-|\mathcal{R} \cap \mathcal{E}|\bigr) + \varepsilon_2 \bigl(|\mathcal{E}|-|\mathcal{R} \cap \mathcal{E}|\bigr)8, O1=ε1(RRE)+ε2(ERE)\mathcal{O}_1 = \varepsilon_1 \bigl(|\mathcal{R}|-|\mathcal{R} \cap \mathcal{E}|\bigr) + \varepsilon_2 \bigl(|\mathcal{E}|-|\mathcal{R} \cap \mathcal{E}|\bigr)9, R\mathcal{R}0, R\mathcal{R}1, R\mathcal{R}2 (Huang et al., 16 Jun 2025).

6. Experimental Evaluation

Comprehensive empirical evaluation is conducted on the OULAD dataset and standard continuous optimization benchmarks:

OULAD Results:

  • Convergence Stability: Average fitness (±std, 30 runs):
Algorithm Avg. Fitness Std. Deviation
MWO 598.48 18.02
WO 641.62 28.29
SCSO 814.85 329.43
SOA 972.65 422.11
PEOA 866.35 315.62
  • Difficulty Progression Rate (DPR): Fraction of consecutive items with non-decreasing difficulty:

    • MWO: 95.3%
    • WO: 87.2%
    • SCSO: 85.1%
    • SOA: 83.4%
    • PEOA: 78.9%
  • Sequence Quality (Student S1 Example):
    • MWO: Coverage 100.0%, DPR 90.7%, Alignment 100.0%; first 6 materials: 28→90→61→76→35→47
    • WO: Coverage 100.0%, DPR 83.3%, Alignment 96.7%; first 6 materials: 28→90→133→76→47→65
  • Runtime: MWO 497 iterations / 150.6s; WO 495 iterations / 158.0s

Benchmark Functions:

  • On R\mathcal{R}3–R\mathcal{R}4, MWO achieves objective R\mathcal{R}5–R\mathcal{R}6 (zero std).
  • On TFR\mathcal{R}7: R\mathcal{R}8 versus optimum R\mathcal{R}9.
  • TFE\mathcal{E}0 (Ackley) and TFE\mathcal{E}1: zero std.
  • TFE\mathcal{E}2–E\mathcal{E}3: matches or outperforms competitive methods with std E\mathcal{E}4.
  • Wilcoxon tests (E\mathcal{E}5): MWO is statistically superior on 8/9 functions against SCSO/PEOA, 7/9 vs. SOA (Huang et al., 16 Jun 2025).

7. Synthesis and Significance

MWO's design incorporates (i) an expert-guided, aging-weighted search for robust and stable exploitation, (ii) a dual adaptive control signal framework for dynamic regulation of global/local search behaviors, and (iii) a pedagogically structured three-tier priority sequencing for curriculum validity. These synergistic elements yield state-of-the-art performance on ACS tasks—including high difficulty progression rates and convergence stability—while also exhibiting competitive robustness on standard global optimization benchmarks. This suggests applicability to a wider class of combinatorial and multi-objective optimization challenges, particularly where domain structure and constraint satisfaction are paramount (Huang et al., 16 Jun 2025).

Definition Search Book Streamline Icon: https://streamlinehq.com
References (1)

Topic to Video (Beta)

No one has generated a video about this topic yet.

Whiteboard

No one has generated a whiteboard explanation for this topic yet.

Follow Topic

Get notified by email when new papers are published related to Expert-Guided Memetic Walrus Optimizer (MWO).