Dice Question Streamline Icon: https://streamlinehq.com

Effect of demonstrations on domain-specific problem solving for SWE-agent

Determine whether including demonstration trajectories in the SWE-agent prompt (via the Demonstration Template that shows a solved trajectory) actually improves agents’ domain-specific problem solving capabilities in software engineering tasks, as opposed to primarily teaching proper command formatting and interface usage.

Information Square Streamline Icon: https://streamlinehq.com

Background

SWE-agent optionally prepends a demonstration trajectory to the prompt to illustrate correct interface use. Ablations indicate small benefits, and the authors hypothesize demonstrations may mainly help with formatting and command usage rather than impart domain-specific expertise.

However, the authors explicitly state uncertainty about whether demonstrations help agents understand nuanced, domain-specific problem solving, leaving open the question of demonstrations’ true contribution beyond interface familiarization.

References

We are unsure if demonstrations actually help agents understand the nuances of domain specific problem solving.

SWE-agent: Agent-Computer Interfaces Enable Automated Software Engineering (2405.15793 - Yang et al., 6 May 2024) in Appendix, Section “Prompts,” Demonstration Template