Papers
Topics
Authors
Recent
Gemini 2.5 Flash
Gemini 2.5 Flash
51 tokens/sec
GPT-4o
60 tokens/sec
Gemini 2.5 Pro Pro
44 tokens/sec
o3 Pro
8 tokens/sec
GPT-4.1 Pro
50 tokens/sec
DeepSeek R1 via Azure Pro
28 tokens/sec
2000 character limit reached

DolphCoder: Echo-Locating Code Large Language Models with Diverse and Multi-Objective Instruction Tuning (2402.09136v1)

Published 14 Feb 2024 in cs.CL and cs.AI

Abstract: Code LLMs (Code LLMs) have demonstrated outstanding performance in code-related tasks. Several instruction tuning approaches have been proposed to boost the code generation performance of pre-trained Code LLMs. In this paper, we introduce a diverse instruction model (DolphCoder) with self-evaluating for code generation. It learns diverse instruction targets and combines a code evaluation objective to enhance its code generation ability. Our model achieves superior performance on the HumanEval and MBPP benchmarks, demonstrating new insights for future code instruction tuning work. Our key findings are: (1) Augmenting more diverse responses with distinct reasoning paths increases the code capability of LLMs. (2) Improving one's ability to evaluate the correctness of code solutions also enhances their ability to create it.

User Edit Pencil Streamline Icon: https://streamlinehq.com
Authors (11)
  1. Yejie Wang (15 papers)
  2. Keqing He (47 papers)
  3. Guanting Dong (46 papers)
  4. Pei Wang (240 papers)
  5. Weihao Zeng (24 papers)
  6. Muxi Diao (11 papers)
  7. Yutao Mou (16 papers)
  8. Mengdi Zhang (37 papers)
  9. Jingang Wang (71 papers)
  10. Xunliang Cai (63 papers)
  11. Weiran Xu (58 papers)
Citations (6)