Papers
Topics
Authors
Recent
Gemini 2.5 Flash
Gemini 2.5 Flash
80 tokens/sec
GPT-4o
59 tokens/sec
Gemini 2.5 Pro Pro
43 tokens/sec
o3 Pro
7 tokens/sec
GPT-4.1 Pro
50 tokens/sec
DeepSeek R1 via Azure Pro
28 tokens/sec
2000 character limit reached

Beyond Text-to-SQL for IoT Defense: A Comprehensive Framework for Querying and Classifying IoT Threats (2406.17574v1)

Published 25 Jun 2024 in cs.CL

Abstract: Recognizing the promise of natural language interfaces to databases, prior studies have emphasized the development of text-to-SQL systems. While substantial progress has been made in this field, existing research has concentrated on generating SQL statements from text queries. The broader challenge, however, lies in inferring new information about the returned data. Our research makes two major contributions to address this gap. First, we introduce a novel Internet-of-Things (IoT) text-to-SQL dataset comprising 10,985 text-SQL pairs and 239,398 rows of network traffic activity. The dataset contains additional query types limited in prior text-to-SQL datasets, notably temporal-related queries. Our dataset is sourced from a smart building's IoT ecosystem exploring sensor read and network traffic data. Second, our dataset allows two-stage processing, where the returned data (network traffic) from a generated SQL can be categorized as malicious or not. Our results show that joint training to query and infer information about the data can improve overall text-to-SQL performance, nearly matching substantially larger models. We also show that current LLMs (e.g., GPT3.5) struggle to infer new information about returned data, thus our dataset provides a novel test bed for integrating complex domain-specific reasoning into LLMs.

User Edit Pencil Streamline Icon: https://streamlinehq.com
Authors (14)
  1. Ryan Pavlich (1 paper)
  2. Nima Ebadi (5 papers)
  3. Richard Tarbell (2 papers)
  4. Billy Linares (2 papers)
  5. Adrian Tan (3 papers)
  6. Rachael Humphreys (1 paper)
  7. Jayanta Kumar Das (10 papers)
  8. Rambod Ghandiparsi (2 papers)
  9. Hannah Haley (1 paper)
  10. Jerris George (1 paper)
  11. Rocky Slavin (3 papers)
  12. Kim-Kwang Raymond Choo (59 papers)
  13. Glenn Dietrich (2 papers)
  14. Anthony Rios (25 papers)