Teaching Code LLMs to Use Autocompletion Tools in Repository-Level Code Generation (2401.06391v3)
Abstract: Code LLMs face limitations in repository-level code generation due to their lack of awareness of repository-level dependencies (e.g., user-defined attributes), resulting in dependency errors such as undefined-variable and no-member errors. In this work, we introduce ToolGen, an approach that integrates autocompletion tools into the code LLM generation process to address these dependencies. ToolGen comprises two main phases: Trigger Insertion and Model Fine-tuning (Offline), and Tool-integrated Code Generation (Online). During the offline phase, ToolGen augments functions within a given code corpus with a special mark token, indicating positions to trigger autocompletion tools. These augmented functions, along with their corresponding docstrings, are then used to fine-tune a selected code LLM. In the online phase, ToolGen iteratively generates functions by predicting tokens step-by-step using the fine-tuned LLM. Whenever a mark token is encountered, ToolGen invokes the autocompletion tool to suggest code completions and selects the most appropriate one. We conduct comprehensive experiments to evaluate ToolGen's effectiveness in repository-level code generation. To facilitate this evaluation, we create a benchmark comprising 671 real-world code repositories and introduce two new dependency-based metrics: Dependency Coverage and Static Validity Rate. The results demonstrate that ToolGen significantly improves Dependency Coverage by 31.4% to 39.1% and Static Validity Rate by 44.9% to 57.7% across the three LLMs, while maintaining competitive or improved performance in widely recognized similarity metrics such as BLEU-4, CodeBLEU, Edit Similarity, and Exact Match. On the CoderEval dataset, ToolGen achieves improvements of 40.0% and 25.0% in Pass@1 for CodeT5 and CodeLlama, respectively.
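The abstract describes an online loop in which the fine-tuned LLM decodes token by token and, on predicting the special mark token, hands control to an autocompletion tool before resuming generation. The sketch below illustrates that loop under stated assumptions; it is not the paper's implementation. It assumes a HuggingFace-style causal LM fine-tuned with a hypothetical `<COMP>` mark token and uses Jedi (the autocompletion tool cited in the paper) for suggestions; names such as `MARK_TOKEN`, `pick_completion`, and the model path are illustrative only.

```python
# Minimal sketch of a tool-integrated generation loop in the spirit of ToolGen.
# Assumptions (not from the paper's code): a HuggingFace-style causal LM
# fine-tuned with a special <COMP> mark token, and Jedi as the autocompletion
# tool. MARK_TOKEN, pick_completion, and the model path are hypothetical.
import jedi
import torch
from transformers import AutoModelForCausalLM, AutoTokenizer

MARK_TOKEN = "<COMP>"  # hypothetical mark token inserted during Trigger Insertion

tokenizer = AutoTokenizer.from_pretrained("path/to/fine-tuned-model")
model = AutoModelForCausalLM.from_pretrained("path/to/fine-tuned-model")


def pick_completion(source: str, repo_path: str) -> str:
    """Query the autocompletion tool (Jedi) at the end of the partial code."""
    lines = source.splitlines() or [""]
    script = jedi.Script(source, path=f"{repo_path}/generated.py")
    suggestions = script.complete(line=len(lines), column=len(lines[-1]))
    # ToolGen selects the most appropriate suggestion; this sketch naively
    # takes the first one.
    return suggestions[0].complete if suggestions else ""


def generate_function(docstring: str, repo_path: str, max_steps: int = 256) -> str:
    """Greedily decode a function body, deferring to the tool at mark tokens."""
    code = ""
    for _ in range(max_steps):
        inputs = tokenizer(docstring + code, return_tensors="pt")
        with torch.no_grad():
            next_id = model(**inputs).logits[0, -1].argmax().item()
        token = tokenizer.decode([next_id])
        if token == tokenizer.eos_token:
            break
        if token.strip() == MARK_TOKEN:
            # Mark token predicted: invoke the autocompletion tool instead of
            # letting the LLM guess a repository-level dependency name.
            code += pick_completion(code, repo_path)
        else:
            code += token
    return code
```

In the offline phase the same mark token would be inserted into the training corpus at positions where repository-level dependencies (e.g., user-defined attributes) are accessed, so that the fine-tuned model learns to emit it exactly where a tool call is useful.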
Authors: Chong Wang, Jian Zhang, Yebo Feng, Tianlin Li, Weisong Sun, Yang Liu, Xin Peng