Essay on "PanGu-Coder: Program Synthesis with Function-Level Language Modeling"
The paper "PanGu-Coder: Program Synthesis with Function-Level Language Modeling" introduces PanGu-Coder, a large language model designed specifically for text-to-code generation. PanGu-Coder adopts the PanGu-α architecture, a decoder-only transformer augmented with an additional query layer that distributes attention over positional embeddings during language modeling.
Model Architecture and Training Strategy
PanGu-Coder leverages a uni-directional, decoder-only transformer with a supplementary query layer, an architecture designed to scale up to hundreds of billions of parameters. For the task of program synthesis, the model is trained with Python as the primary programming language. The architecture handles natural language (NL) prompts and translates them into functional code.
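To make the query-layer idea concrete, the sketch below shows one plausible way such a layer could sit on top of a decoder-only stack, with the attention query drawn from a learned positional embedding rather than from the token states. The class, tensor shapes, and masking details are illustrative assumptions, not the paper's exact implementation.

```python
import torch
import torch.nn as nn

class TopQueryLayer(nn.Module):
    """Illustrative sketch of an extra "query layer" on top of a decoder-only
    transformer: the attention query comes from a learned positional embedding
    for the position being predicted, while keys and values come from the final
    hidden states of the decoder stack (an assumed design, not the paper's code)."""

    def __init__(self, d_model: int, n_heads: int, max_len: int):
        super().__init__()
        self.pos_query = nn.Embedding(max_len, d_model)
        self.attn = nn.MultiheadAttention(d_model, n_heads, batch_first=True)

    def forward(self, hidden: torch.Tensor) -> torch.Tensor:
        # hidden: (batch, seq_len, d_model) from the uni-directional decoder.
        batch, seq_len, _ = hidden.shape
        positions = torch.arange(seq_len, device=hidden.device)
        q = self.pos_query(positions).unsqueeze(0).expand(batch, -1, -1)
        # Causal mask so position i only attends to hidden states at <= i.
        causal = torch.triu(
            torch.ones(seq_len, seq_len, dtype=torch.bool, device=hidden.device),
            diagonal=1,
        )
        out, _ = self.attn(q, hidden, hidden, attn_mask=causal)
        return out  # passed on to the language-modeling head
```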
The training of PanGu-Coder follows a two-stage process. The first stage uses Causal Language Modeling (CLM) over raw programming language data interleaved with natural language elements such as docstrings, familiarizing the model with code and its structure. The second stage combines CLM with Masked Language Modeling (MLM) objectives on curated NL-code pairs; by decoding code directly from natural language descriptions, the model is focused on the text-to-code synthesis task.
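A minimal sketch of what a stage-two, code-focused causal objective could look like is given below: the loss is computed only over the code tokens, while the natural language description serves purely as conditioning context. The function name, the code_start bookkeeping, and the Hugging Face-style model interface are illustrative assumptions rather than the authors' implementation.

```python
import torch
import torch.nn.functional as F

def code_clm_loss(model, input_ids: torch.Tensor, code_start: list) -> torch.Tensor:
    """Next-token prediction where only code tokens contribute to the loss;
    the NL description tokens before code_start[i] act purely as context
    (illustrative sketch, not the paper's exact objective)."""
    logits = model(input_ids).logits              # (batch, seq_len, vocab), HF-style API
    shift_logits = logits[:, :-1, :]
    shift_labels = input_ids[:, 1:].clone()
    for i, start in enumerate(code_start):
        shift_labels[i, : max(start - 1, 0)] = -100   # mask out the NL prompt positions
    return F.cross_entropy(
        shift_logits.reshape(-1, shift_logits.size(-1)),
        shift_labels.reshape(-1),
        ignore_index=-100,
    )
```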
Evaluation and Results
Evaluation of PanGu-Coder centers on benchmarking against prominent models such as Codex. Focusing on whether generated programs execute correctly, PanGu-Coder performs comparably or better than other models in several settings, including the HumanEval and MBPP benchmarks. Despite training on a narrower dataset and with a smaller context window, it achieves notable results, underscoring the impact of its architectural decisions and training regime.
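Since correctness here means "the generated program runs and its unit tests pass," a toy version of the scoring step might look like the following; real harnesses run candidates in sandboxed subprocesses with timeouts, which this sketch deliberately omits.

```python
def passes_tests(candidate_src: str, test_src: str) -> bool:
    """Toy HumanEval/MBPP-style check: define the generated function, then run
    the benchmark's assert-based tests; any exception counts as a failure.
    (Real evaluation sandboxes the execution; this sketch does not.)"""
    namespace: dict = {}
    try:
        exec(candidate_src, namespace)   # define the candidate function
        exec(test_src, namespace)        # run the hidden unit tests
        return True
    except Exception:
        return False
```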
<p>The critical performance metric utilized is pass@$k,capturingtheproportionofcorrectprogramswithinasampleofgeneratedoutputs.PanGu−Coderdemonstratesstrongperformanceinpass@1,suggestinghighprecisioningeneratingfunctionallycorrectprimarysolutions.Furtheranalysisrevealedthatoptimizingforprecision(i.e.,pass@1$) benefits from specific decoding strategies like temperature scaling and nucleus sampling.
Implications and Future Directions
The paper highlights how specialized training data and architectural choices shape the generation of functional code. Separating the embeddings of source prompts and code also appears important for strengthening the model's understanding and generation capabilities. Notably, the model benefits from fine-tuning on competitive programming data and real-world problem-solving tasks, aligning it more closely with the target distribution for text-to-code generation.
Future directions may involve expanding PanGu-Coder's capabilities to cover more diverse programming languages beyond Python. Additionally, broader testing in real-world applications or competitive programming contexts could further substantiate its utility and adaptability. The success of PanGu-Coder indicates an effective pathway for refining AI models tailored for specific domains within program synthesis and software engineering.
Overall, PanGu-Coder represents a significant contribution to the field of automatic code generation, navigating the challenges of producing functionally accurate code from natural language prompts with architectural ingenuity and strategic training methodologies.