Papers
Topics
Authors
Recent
Gemini 2.5 Flash
Gemini 2.5 Flash
41 tokens/sec
GPT-4o
60 tokens/sec
Gemini 2.5 Pro Pro
44 tokens/sec
o3 Pro
8 tokens/sec
GPT-4.1 Pro
50 tokens/sec
DeepSeek R1 via Azure Pro
28 tokens/sec
2000 character limit reached

TeleQnA: A Benchmark Dataset to Assess Large Language Models Telecommunications Knowledge (2310.15051v1)

Published 23 Oct 2023 in cs.IT, cs.AI, cs.LG, and math.IT

Abstract: We introduce TeleQnA, the first benchmark dataset designed to evaluate the knowledge of LLMs in telecommunications. Comprising 10,000 questions and answers, this dataset draws from diverse sources, including standards and research articles. This paper outlines the automated question generation framework responsible for creating this dataset, along with how human input was integrated at various stages to ensure the quality of the questions. Afterwards, using the provided dataset, an evaluation is conducted to assess the capabilities of LLMs, including GPT-3.5 and GPT-4. The results highlight that these models struggle with complex standards related questions but exhibit proficiency in addressing general telecom-related inquiries. Additionally, our results showcase how incorporating telecom knowledge context significantly enhances their performance, thus shedding light on the need for a specialized telecom foundation model. Finally, the dataset is shared with active telecom professionals, whose performance is subsequently benchmarked against that of the LLMs. The findings illustrate that LLMs can rival the performance of active professionals in telecom knowledge, thanks to their capacity to process vast amounts of information, underscoring the potential of LLMs within this domain. The dataset has been made publicly accessible on GitHub.

User Edit Pencil Streamline Icon: https://streamlinehq.com
Authors (6)
  1. Ali Maatouk (35 papers)
  2. Fadhel Ayed (25 papers)
  3. Nicola Piovesan (23 papers)
  4. Antonio De Domenico (36 papers)
  5. Zhi-Quan Luo (115 papers)
  6. Merouane Debbah (269 papers)
Citations (27)