Data scalability for long-horizon GUI agent learning
Develop scalable data generation and curation methodologies that enable long-horizon training of GUI-centered agents by producing large-scale interactive trajectories with reasoning, actions, environment states, and feedback at feasible cost.
References
While recent advances in native agent models have shown promise by unifying perception, reasoning, action, and memory through end-to-end learning, open problems remain in data scalability, multi-turn reinforcement learning (RL), the limitations of GUI-only operation, and environment stability.
— UI-TARS-2 Technical Report: Advancing GUI Agent with Multi-Turn Reinforcement Learning
(2509.02544 - Wang et al., 2 Sep 2025) in Abstract (Page 1)