Predicting Entire Experiment Outcomes from Raw Code Using Regression Language Models
Determine whether code-based Regression Language Models—encoder–decoder language models that read source code or computation graph text and autoregressively output numeric metrics—can predict the numeric outcomes of entire experiments directly from raw code, extending their applicability from program-level metrics (such as memory usage, kernel latency, and architecture accuracy/latency) to complete experimental pipelines.
References
A key open question is whether such code-based RLMs can be more broadly used to predict the numeric outcome of entire experiments from raw code, but we leave this to future work and hope this paper will be a valuable reference for multiple scientific communities in automated machine learning, programming languages, and computer architecture.