Boundary between prompt-based and weight-based learning for compound AI systems
Characterize the boundary between reflective prompt evolution (GEPA) and weight-space reinforcement learning methods (such as Group Relative Policy Optimization with LoRA or full-parameter finetuning) for optimizing compound AI systems, by determining the data and rollout regimes under which prompt-based optimization versus weight updates are expected to outperform one another.
Sponsor
References
The boundary between prompt-based and weight-based learning is not well understood—although GEPA excels when rollouts are expensive, it is likely that weight updates will outperform prompting in regimes with abundant data or when large-scale rollouts are feasible.
— GEPA: Reflective Prompt Evolution Can Outperform Reinforcement Learning
(2507.19457 - Agrawal et al., 25 Jul 2025) in Limitations and Future Work