Papers
Topics
Authors
Recent
Gemini 2.5 Flash
Gemini 2.5 Flash
119 tokens/sec
GPT-4o
56 tokens/sec
Gemini 2.5 Pro Pro
43 tokens/sec
o3 Pro
6 tokens/sec
GPT-4.1 Pro
47 tokens/sec
DeepSeek R1 via Azure Pro
28 tokens/sec
2000 character limit reached

Formal Specifications from Natural Language (2206.01962v2)

Published 4 Jun 2022 in cs.SE, cs.LG, and cs.PL

Abstract: We study the generalization abilities of LLMs when translating natural language into formal specifications with complex semantics. In particular, we fine-tune LLMs on three datasets consisting of English sentences and their corresponding formal representation: 1) regular expressions (regex), frequently used in programming and search; 2) First-order logic (FOL), commonly used in software verification and theorem proving; and 3) linear-time temporal logic (LTL), which forms the basis for industrial hardware specification languages. Our experiments show that, in these diverse domains, the LLMs maintain their generalization capabilities from pre-trained knowledge of natural language to generalize, e.g., to new variable names or operator descriptions. Additionally, they achieve competitive performance, and even outperform the state-of-the-art for translating into regular expressions, with the benefits of being easy to access, efficient to fine-tune, and without a particular need for domain-specific reasoning.

User Edit Pencil Streamline Icon: https://streamlinehq.com
Authors (6)
  1. Christopher Hahn (33 papers)
  2. Frederik Schmitt (10 papers)
  3. Julia J. Tillman (1 paper)
  4. Niklas Metzger (12 papers)
  5. Julian Siber (10 papers)
  6. Bernd Finkbeiner (118 papers)
Citations (23)

Summary

We haven't generated a summary for this paper yet.

X Twitter Logo Streamline Icon: https://streamlinehq.com