Papers
Topics
Authors
Recent
Gemini 2.5 Flash
Gemini 2.5 Flash
110 tokens/sec
GPT-4o
56 tokens/sec
Gemini 2.5 Pro Pro
44 tokens/sec
o3 Pro
6 tokens/sec
GPT-4.1 Pro
47 tokens/sec
DeepSeek R1 via Azure Pro
28 tokens/sec
2000 character limit reached

SpeCrawler: Generating OpenAPI Specifications from API Documentation Using Large Language Models (2402.11625v1)

Published 18 Feb 2024 in cs.CL

Abstract: In the digital era, the widespread use of APIs is evident. However, scalable utilization of APIs poses a challenge due to structure divergence observed in online API documentation. This underscores the need for automatic tools to facilitate API consumption. A viable approach involves the conversion of documentation into an API Specification format. While previous attempts have been made using rule-based methods, these approaches encountered difficulties in generalizing across diverse documentation. In this paper we introduce SpeCrawler, a comprehensive system that utilizes LLMs to generate OpenAPI Specifications from diverse API documentation through a carefully crafted pipeline. By creating a standardized format for numerous APIs, SpeCrawler aids in streamlining integration processes within API orchestrating systems and facilitating the incorporation of tools into LLMs. The paper explores SpeCrawler's methodology, supported by empirical evidence and case studies, demonstrating its efficacy through LLM capabilities.

User Edit Pencil Streamline Icon: https://streamlinehq.com
Authors (7)
  1. Koren Lazar (5 papers)
  2. Matan Vetzler (6 papers)
  3. Guy Uziel (12 papers)
  4. David Boaz (3 papers)
  5. Esther Goldbraich (5 papers)
  6. David Amid (1 paper)
  7. Ateret Anaby-Tavor (21 papers)
Citations (1)

Summary

We haven't generated a summary for this paper yet.