Sim-to-Real Transfer for File-System Behavioral Personalization
Determine how to achieve robust sim-to-real transfer for memory-centric personalized file-system agents evaluated on FileGramBench by enabling methods that maintain high performance when moving from simulated behavioral trajectories to real-world human screen recordings in the benchmark’s Real-World setting.
References
With 20 profiles and 640 trajectories the benchmark operates at moderate scale; the sharp accuracy drop in the Real-World setting confirms that sim-to-real transfer remains an open challenge.
— FileGram: Grounding Agent Personalization in File-System Behavioral Traces
(2604.04901 - Liu et al., 6 Apr 2026) in Appendix: Discussion and Resources, Ethical Considerations