Architecture–algorithm co-design questions for reinforced MoE and hardware-aware objectives
Identify and solve key architecture–algorithm co-design challenges for reinforced mixture-of-experts and hardware-aware objectives, including: (i) designing robust multi-objective reward functions that avoid trivial solutions such as all-expert sparsity; (ii) achieving stable credit assignment when architectural actions change network topology; and (iii) amortizing architecture policy learning across prompts, tasks, and deployment scales.
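As one illustration of challenge (i), the sketch below shows a scalarized reward with a quality floor. It is a minimal sketch under stated assumptions, not a method taken from the source: it reads "all-expert sparsity" as a policy that deactivates nearly every expert to minimize cost, and the names `RolloutStats`, `hardware_aware_reward`, and all default weights are hypothetical choices made for illustration only.

```python
from dataclasses import dataclass


@dataclass
class RolloutStats:
    """Hypothetical per-rollout measurements (names are assumptions)."""
    quality: float          # task score in [0, 1], higher is better
    active_fraction: float  # fraction of experts activated, in [0, 1]
    latency_ms: float       # measured or modeled latency in milliseconds


def hardware_aware_reward(
    stats: RolloutStats,
    quality_floor: float = 0.6,       # assumed minimum acceptable quality
    latency_budget_ms: float = 50.0,  # assumed deployment latency budget
    cost_weight: float = 0.3,         # assumed trade-off weight
    floor_penalty: float = 5.0,       # assumed penalty slope below the floor
) -> float:
    # Cost grows with the share of active experts and with any
    # relative overshoot of the latency budget.
    latency_over = max(0.0, stats.latency_ms / latency_budget_ms - 1.0)
    cost = cost_weight * (stats.active_fraction + latency_over)

    # Hinge penalty: falling below the quality floor costs more than
    # the compute saved, so "switch everything off" is not optimal.
    shortfall = max(0.0, quality_floor - stats.quality)

    return stats.quality - cost - floor_penalty * shortfall


# Three illustrative policies with made-up numbers.
collapsed = RolloutStats(quality=0.10, active_fraction=0.00, latency_ms=5.0)
balanced = RolloutStats(quality=0.80, active_fraction=0.25, latency_ms=40.0)
dense = RolloutStats(quality=0.85, active_fraction=1.00, latency_ms=100.0)

rewards = {
    name: hardware_aware_reward(stats)
    for name, stats in [("collapsed", collapsed), ("balanced", balanced), ("dense", dense)]
}
best = max(rewards, key=rewards.get)
```

With these made-up numbers the balanced policy scores highest, because the collapsed policy pays the floor penalty and the dense policy pays the cost term. A hinge on a quality floor is only one possible guard against degenerate solutions; constrained or Pareto-based formulations are alternatives, and the source does not say which approach is preferred.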
References
Key open questions include designing robust multi-objective reward functions that avoid trivial solutions (e.g., all-expert sparsity), achieving stable credit assignment when architectural actions modify network topology, and amortizing architecture policy learning across prompts, tasks, and deployment scales.