Context-Aware Parse Trees (2003.11118v1)

Published 24 Mar 2020 in cs.PL and cs.AI

Abstract: The simplified parse tree (SPT) presented in Aroma, a state-of-the-art code recommendation system, is a tree-structured representation used to infer code semantics by capturing program structure rather than program syntax. This is a departure from the classical abstract syntax tree, which is principally driven by programming language syntax. While we believe a semantics-driven representation is desirable, the specifics of an SPT's construction can impact its performance. We analyze these nuances and present a new tree structure, heavily influenced by Aroma's SPT, called a context-aware parse tree (CAPT). CAPT enhances SPT by providing a richer level of semantic representation. Specifically, CAPT provides additional binding support for language-specific techniques for adding semantically-salient features, and language-agnostic techniques for removing syntactically-present but semantically-irrelevant features. Our research quantitatively demonstrates the value of our proposed semantically-salient features, enabling a specific CAPT configuration to be 39% more accurate than SPT across the 48,610 programs we analyzed.
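To make the abstract's idea concrete, the following is a minimal illustrative sketch (not the paper's implementation, and the tree shape, filter set, and function names are hypothetical): a toy parse tree whose leaves include structural tokens, plus a CAPT-style, language-agnostic pass that removes syntactically-present but semantically-irrelevant tokens while preserving the tree's structure.

```python
# Illustrative sketch only: a toy tree representation and a filtering pass
# in the spirit of CAPT's language-agnostic feature removal. All names here
# are hypothetical; the paper's actual construction differs in its details.

from dataclasses import dataclass, field
from typing import List, Union

Node = Union["Tree", str]

@dataclass
class Tree:
    label: str
    children: List[Node] = field(default_factory=list)

# Hypothetical set of tokens that are syntactically present but carry no
# semantic weight for similarity purposes (punctuation, delimiters).
IRRELEVANT_TOKENS = {"(", ")", "{", "}", ";", ","}

def filter_tree(t: Tree) -> Tree:
    """Return a copy of t with semantically-irrelevant leaf tokens removed,
    keeping subtree structure intact."""
    kept: List[Node] = []
    for child in t.children:
        if isinstance(child, Tree):
            kept.append(filter_tree(child))
        elif child not in IRRELEVANT_TOKENS:
            kept.append(child)
    return Tree(t.label, kept)

# Toy parse of the statement `f(x);`
spt_like = Tree("call", ["f", "(", "x", ")", ";"])
capt_like = filter_tree(spt_like)
print(capt_like.children)  # ['f', 'x']
```

In this sketch, two syntactically different snippets (say, `f(x);` versus `f(x)`) map to the same filtered tree, which is the kind of syntax-insensitivity a structure-driven representation aims for.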

Authors (11)
  1. Fangke Ye (3 papers)
  2. Shengtian Zhou (8 papers)
  3. Anand Venkat (5 papers)
  4. Ryan Marcus (33 papers)
  5. Paul Petersen (2 papers)
  6. Jesmin Jahan Tithi (19 papers)
  7. Tim Mattson (10 papers)
  8. Tim Kraska (78 papers)
  9. Pradeep Dubey (31 papers)
  10. Vivek Sarkar (16 papers)
  11. Justin Gottschlich (21 papers)
Citations (2)
