
Multi-modal Program Inference: a Marriage of Pre-trained Language Models and Component-based Synthesis (2109.02445v1)

Published 3 Sep 2021 in cs.AI and cs.PL

Abstract: Multi-modal program synthesis refers to the task of synthesizing programs (code) from their specification given in different forms, such as a combination of natural language and examples. Examples provide a precise but incomplete specification, and natural language provides an ambiguous but more "complete" task description. Machine-learned pre-trained models (PTMs) are adept at handling ambiguous natural language, but struggle with generating syntactically and semantically precise code. Program synthesis techniques can generate correct code, often even from incomplete but precise specifications, such as examples, but they are unable to work with the ambiguity of natural languages. We present an approach that combines PTMs with component-based synthesis (CBS): PTMs are used to generate candidate programs from the natural language description of the task, which are then used to guide the CBS procedure to find the program that matches the precise example-based specification. We use our combination approach to instantiate multi-modal synthesis systems for two programming domains: the domain of regular expressions and the domain of CSS selectors. Our evaluation demonstrates the effectiveness of our domain-agnostic approach in comparison to a state-of-the-art specialized system, and the generality of our approach in providing multi-modal program synthesis from natural language and examples in different programming domains.
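
The combination described in the abstract can be illustrated with a toy sketch for the regular-expression domain. This is not the authors' algorithm: the candidate list below is hard-coded as a stand-in for PTM output, the components are hand-picked from those candidates, and the helper names `consistent` and `synthesize` are hypothetical. The sketch only shows the general shape of the idea: try the model's candidates against the examples, and if none fits, recombine their parts until some program satisfies every example.

```python
import re
from itertools import product


def consistent(pattern, positives, negatives):
    """Return True if pattern fully matches every positive and no negative example."""
    try:
        rx = re.compile(pattern)
    except re.error:
        return False
    accepts_all = all(rx.fullmatch(s) for s in positives)
    rejects_all = not any(rx.fullmatch(s) for s in negatives)
    return accepts_all and rejects_all


def synthesize(candidates, components, positives, negatives, max_parts=3):
    """Toy search: check candidates first, then concatenations of components."""
    # Step 1: a candidate may already satisfy the examples.
    for cand in candidates:
        if consistent(cand, positives, negatives):
            return cand
    # Step 2: enumerate concatenations of components, shortest first.
    for n in range(1, max_parts + 1):
        for parts in product(components, repeat=n):
            pattern = "".join(parts)
            if consistent(pattern, positives, negatives):
                return pattern
    return None


# Assumed task: "three digits, a dash, then four digits".
# Assumed (hard-coded) candidates standing in for PTM output; both are wrong.
candidates = [r"\d+-\d+", r"\d{3}\d{4}"]
# Assumed components, hand-extracted from the candidates above.
components = [r"\d{3}", r"\d{4}", r"-", r"\d+"]

positives = ["555-1234"]
negatives = ["5551234", "55-1234", "555-12345"]

result = synthesize(candidates, components, positives, negatives)
print(result)
```

Here neither candidate satisfies the examples on its own, but their pieces are enough for the enumeration to assemble a pattern that does. A real CBS procedure would work over a structured grammar with far better pruning and ranking than this brute-force concatenation.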

Authors (8)
  1. Kia Rahmani (6 papers)
  2. Mohammad Raza (9 papers)
  3. Sumit Gulwani (55 papers)
  4. Vu Le (26 papers)
  5. Daniel Morris (15 papers)
  6. Arjun Radhakrishna (23 papers)
  7. Gustavo Soares (21 papers)
  8. Ashish Tiwari (44 papers)
Citations (38)