Papers
Topics
Authors
Recent
Gemini 2.5 Flash
Gemini 2.5 Flash
38 tokens/sec
GPT-4o
59 tokens/sec
Gemini 2.5 Pro Pro
41 tokens/sec
o3 Pro
7 tokens/sec
GPT-4.1 Pro
50 tokens/sec
DeepSeek R1 via Azure Pro
28 tokens/sec
2000 character limit reached

Quality Estimation & Interpretability for Code Translation (2012.07581v2)

Published 4 Dec 2020 in cs.SE and cs.PL

Abstract: Recently, the automated translation of source code from one programming language to another by using automatic approaches inspired by Neural Machine Translation (NMT) methods for natural languages has come under study. However, such approaches suffer from the same problem as previous NMT approaches on natural languages, viz. the lack of an ability to estimate and evaluate the quality of the translations; and consequently ascribe some measure of interpretability to the model's choices. In this paper, we attempt to estimate the quality of source code translations built on top of the TransCoder model. We consider the code translation task as an analog of machine translation (MT) for natural languages, with some added caveats. We present our main motivation from a user study built around code translation; and present a technique that correlates the confidences generated by that model to lint errors in the translated code. We conclude with some observations on these correlations, and some ideas for future work.

User Edit Pencil Streamline Icon: https://streamlinehq.com
Authors (8)
  1. Mayank Agarwal (30 papers)
  2. Kartik Talamadupula (38 papers)
  3. Stephanie Houde (18 papers)
  4. Fernando Martinez (8 papers)
  5. Michael Muller (70 papers)
  6. John Richards (16 papers)
  7. Steven Ross (2 papers)
  8. Justin D. Weisz (26 papers)
Citations (9)