Leveraging Reinforcement Learning and Large Language Models for Code Optimization (2312.05657v1)

Published 9 Dec 2023 in cs.LG, cs.AI, cs.PL, and cs.SE

Abstract: Code optimization is a daunting task that requires a significant level of expertise from experienced programmers, and such expertise cannot keep pace with the rapid development of new hardware architectures. To advance the code optimization process, recent approaches rely on machine learning and artificial intelligence techniques. This paper introduces a new framework to decrease the complexity of code optimization. The proposed framework builds on LLMs and reinforcement learning (RL) and enables LLMs to receive feedback from their environment (i.e., unit tests) during the fine-tuning process. We compare our framework with existing state-of-the-art models and show that it is more efficient with respect to speed and computational usage, as a result of the reduced number of training steps and its applicability to models with fewer parameters. Additionally, our framework reduces the possibility of logical and syntactical errors. To evaluate our approach, we run several experiments on the PIE dataset using a CodeT5 LLM and RRHF, a new reinforcement learning algorithm. We adopt a variety of evaluation metrics with regard to optimization quality and speedup. The evaluation results demonstrate that the proposed framework achieves results similar to existing models while using shorter training times and smaller pre-trained models. In particular, we achieve increases of 5.6% and 2.2 over the baseline models on the %OPT and SP metrics, respectively.
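The abstract describes fine-tuning a CodeT5 model with RRHF, where candidate optimized programs are ranked by feedback from unit tests rather than by a learned reward model. As a rough illustration (not the authors' code), the sketch below shows an RRHF-style objective under the assumption that each sampled candidate is scored by executing its unit tests and timing it: the model's length-normalized log-likelihoods are pushed to agree with that reward ranking, and a cross-entropy term keeps imitating the best-scoring candidate. The function name `rrhf_loss` and the exact reward scheme (failed candidates score zero, passing candidates score their measured speedup) are assumptions made for this example.

```python
# Minimal sketch of an RRHF-style objective for code optimization (assumed, not the paper's code).
import torch

def rrhf_loss(seq_logprobs: torch.Tensor,
              seq_lengths: torch.Tensor,
              rewards: torch.Tensor,
              best_ce_loss: torch.Tensor) -> torch.Tensor:
    """Combine a reward-ranking loss with an SFT term.

    seq_logprobs: sum of token log-probs per candidate program, shape (k,)
    seq_lengths:  token counts per candidate, shape (k,)
    rewards:      unit-test / speedup scores per candidate, shape (k,)
    best_ce_loss: cross-entropy of the highest-reward candidate (scalar)
    """
    # Length-normalized log-likelihood p_i of each candidate under the model.
    p = seq_logprobs / seq_lengths
    # Pairwise differences: entry [i, j] holds p_i - p_j and r_i - r_j.
    diff_p = p.unsqueeze(1) - p.unsqueeze(0)
    diff_r = rewards.unsqueeze(1) - rewards.unsqueeze(0)
    # Ranking term: penalize whenever the model scores a lower-reward
    # candidate (r_i < r_j) higher than a higher-reward one (p_i > p_j).
    rank_loss = torch.relu(diff_p[diff_r < 0]).sum()
    # SFT term: keep imitating the best-scoring candidate.
    return rank_loss + best_ce_loss
```

A hypothetical call with four sampled rewrites of one slow program might look like `rrhf_loss(torch.tensor([-45.0, -60.0, -50.0, -70.0]), torch.tensor([30.0, 40.0, 35.0, 50.0]), torch.tensor([1.8, 0.0, 2.5, 0.0]), torch.tensor(1.2))`, where a reward of 0.0 marks candidates that failed their unit tests. The appeal of this formulation, as the abstract suggests, is that the ranking signal comes directly from executing the generated code, so training needs only sampled candidates and their test results rather than a separate reward model.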

References (26)
  1. Training a helpful and harmless assistant with reinforcement learning from human feedback, 2022.
  2. Learning to superoptimize programs. CoRR, abs/1611.01787, 2016. URL http://arxiv.org/abs/1611.01787.
  3. Evaluating large language models trained on code, 2021.
  4. ProGraML: Graph-based deep learning for program optimization and analysis, 2020.
  5. The three pillars of machine programming. In Proceedings of the 2nd ACM SIGPLAN International Workshop on Machine Learning and Programming Languages, pp.  69–80, 2018a.
  6. The three pillars of machine programming. In Proceedings of the 2nd ACM SIGPLAN International Workshop on Machine Learning and Programming Languages, MAPL 2018, pp.  69–80, New York, NY, USA, 2018b. Association for Computing Machinery. ISBN 9781450358347. doi: 10.1145/3211346.3211355. URL https://doi.org/10.1145/3211346.3211355.
  7. Measuring coding challenge competence with APPS. NeurIPS, 2021.
  8. CodeSearchNet challenge: Evaluating the state of semantic code search. arXiv preprint arXiv:1909.09436, 2019.
  9. CodeRL: Mastering code generation through pretrained models and deep reinforcement learning. In Oh, A. H., Agarwal, A., Belgrave, D., and Cho, K. (eds.), Advances in Neural Information Processing Systems, 2022. URL https://openreview.net/forum?id=WaGvb7OzySA.
  10. RLTF: Reinforcement learning from unit test feedback, 2023.
  11. CodeXGLUE: A machine learning benchmark dataset for code understanding and generation. CoRR, abs/2102.04664, 2021.
  12. Learning performance-improving code edits. arXiv preprint arXiv:2302.07867, 2023.
  13. CodeGen2: Lessons for training LLMs on programming and natural languages. ICLR, 2023a.
  14. CodeGen: An open large language model for code with multi-turn program synthesis. ICLR, 2023b.
  15. CodeNet: A large-scale AI for code dataset for learning a diversity of coding tasks, 2021.
  16. Exploring the limits of transfer learning with a unified text-to-text transformer. Journal of Machine Learning Research, 21(140):1–67, 2020. URL http://jmlr.org/papers/v21/20-074.html.
  17. Proximal policy optimization algorithms, 2017.
  18. PanGu-Coder2: Boosting large language models for code with ranking feedback, 2023.
  19. Execution-based code generation using deep reinforcement learning, 2023.
  20. Preference ranking optimization for human alignment, 2023.
  21. Learning to summarize from human feedback. CoRR, abs/2009.01325, 2020. URL https://arxiv.org/abs/2009.01325.
  22. Attention is all you need. CoRR, abs/1706.03762, 2017. URL http://arxiv.org/abs/1706.03762.
  23. CodeT5: Identifier-aware unified pre-trained encoder-decoder models for code understanding and generation. In Proceedings of the 2021 Conference on Empirical Methods in Natural Language Processing, pp.  8696–8708, Online and Punta Cana, Dominican Republic, November 2021. Association for Computational Linguistics. doi: 10.18653/v1/2021.emnlp-main.685. URL https://aclanthology.org/2021.emnlp-main.685.
  24. Reinforcement learning from diverse human preferences, 2023.
  25. RRHF: Rank responses to align language models with human feedback without tears, 2023.
  26. Siren’s song in the AI ocean: A survey on hallucination in large language models, 2023.
Authors (11)
  1. Shukai Duan (11 papers)
  2. Nikos Kanakaris (9 papers)
  3. Xiongye Xiao (16 papers)
  4. Heng Ping (9 papers)
  5. Chenyu Zhou (15 papers)
  6. Nesreen K. Ahmed (76 papers)
  7. Guixiang Ma (20 papers)
  8. Mihai Capota (9 papers)
  9. Theodore L. Willke (21 papers)
  10. Shahin Nazarian (31 papers)
  11. Paul Bogdan (51 papers)
Citations (2)