Papers
Topics
Authors
Recent
Gemini 2.5 Flash
Gemini 2.5 Flash
102 tokens/sec
GPT-4o
59 tokens/sec
Gemini 2.5 Pro Pro
43 tokens/sec
o3 Pro
6 tokens/sec
GPT-4.1 Pro
50 tokens/sec
DeepSeek R1 via Azure Pro
28 tokens/sec
2000 character limit reached

nl2spec: Interactively Translating Unstructured Natural Language to Temporal Logics with Large Language Models (2303.04864v1)

Published 8 Mar 2023 in cs.LO, cs.AI, and cs.LG

Abstract: A rigorous formalization of desired system requirements is indispensable when performing any verification task. This often limits the application of verification techniques, as writing formal specifications is an error-prone and time-consuming manual task. To facilitate this, we present nl2spec, a framework for applying LLMs to derive formal specifications (in temporal logics) from unstructured natural language. In particular, we introduce a new methodology to detect and resolve the inherent ambiguity of system requirements in natural language: we utilize LLMs to map subformulas of the formalization back to the corresponding natural language fragments of the input. Users iteratively add, delete, and edit these sub-translations to amend erroneous formalizations, which is easier than manually redrafting the entire formalization. The framework is agnostic to specific application domains and can be extended to similar specification languages and new neural models. We perform a user study to obtain a challenging dataset, which we use to run experiments on the quality of translations. We provide an open-source implementation, including a web-based frontend.

User Edit Pencil Streamline Icon: https://streamlinehq.com
Authors (5)
  1. Matthias Cosler (3 papers)
  2. Christopher Hahn (33 papers)
  3. Daniel Mendoza (4 papers)
  4. Frederik Schmitt (10 papers)
  5. Caroline Trippel (19 papers)
Citations (37)