Conditioning prompts to generate naturalistic yet verifiable terminal tasks
Determine a conditioning strategy for the language-model prompt used to generate task descriptions in the Endless Terminals pipeline that produces more naturalistic, user-style requests for terminal-use tasks while simultaneously maintaining sufficient explicit specification to support automated verification via initial-state and completion tests.
References
Conditioning the generation prompt to produce more naturalistic requests while maintaining sufficient specification for verification remains an open challenge.
— Endless Terminals: Scaling RL Environments for Terminal Agents
(2601.16443 - Gandhi et al., 23 Jan 2026) in Discussion (Limitations)