ExpSeek as rollout augmentation for Agentic Reinforcement Learning
Investigate whether using ExpSeek to augment rollouts in Agentic Reinforcement Learning improves training convergence speed and sampling quality, given that it has been observed to significantly increase pass@k performance in web-agent evaluations.
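For readers unfamiliar with the metric, pass@k is the probability that at least one of k sampled rollouts solves a task, which is why it is a natural proxy for rollout sampling quality. The sketch below uses the standard unbiased combinatorial estimator, 1 - C(n-c, k) / C(n, k); it is an illustration only, and the function name `pass_at_k` and its arguments are hypothetical. The source does not state which estimator ExpSeek's evaluation uses.

```python
from math import comb


def pass_at_k(n: int, c: int, k: int) -> float:
    """Estimate pass@k from n sampled rollouts, of which c succeeded.

    Hypothetical helper for illustration; not taken from the ExpSeek paper.
    """
    if not 0 <= c <= n:
        raise ValueError("c must satisfy 0 <= c <= n")
    if not 1 <= k <= n:
        raise ValueError("k must satisfy 1 <= k <= n")
    # With fewer than k failures, every size-k subset contains a success.
    if n - c < k:
        return 1.0
    # One minus the chance that a random size-k subset holds only failures.
    return 1.0 - comb(n - c, k) / comb(n, k)
```

Under this reading, a rollout-augmentation method that raises pass@k for a fixed sampling budget yields more tasks with at least one successful trajectory per batch, which is the property the open question proposes to exploit during training.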
References
Since ExpSeek can also significantly improve pass@k performance, it has not yet been studied whether it can serve as an enhancement technique for Agentic Reinforcement Learning rollout to improve training convergence speed and sampling quality.
— ExpSeek: Self-Triggered Experience Seeking for Web Agents
(2601.08605 - Zhang et al., 13 Jan 2026) in Limitations Section