Papers
Topics
Authors
Recent
Gemini 2.5 Flash
Gemini 2.5 Flash
38 tokens/sec
GPT-4o
59 tokens/sec
Gemini 2.5 Pro Pro
41 tokens/sec
o3 Pro
7 tokens/sec
GPT-4.1 Pro
50 tokens/sec
DeepSeek R1 via Azure Pro
28 tokens/sec
2000 character limit reached

Exploring the Limits of ChatGPT in Software Security Applications (2312.05275v1)

Published 8 Dec 2023 in cs.CR and cs.AI

Abstract: LLMs have undergone rapid evolution and achieved remarkable results in recent times. OpenAI's ChatGPT, backed by GPT-3.5 or GPT-4, has gained instant popularity due to its strong capability across a wide range of tasks, including natural language tasks, coding, mathematics, and engaging conversations. However, the impacts and limits of such LLMs in system security domain are less explored. In this paper, we delve into the limits of LLMs (i.e., ChatGPT) in seven software security applications including vulnerability detection/repair, debugging, debloating, decompilation, patching, root cause analysis, symbolic execution, and fuzzing. Our exploration reveals that ChatGPT not only excels at generating code, which is the conventional application of LLMs, but also demonstrates strong capability in understanding user-provided commands in natural languages, reasoning about control and data flows within programs, generating complex data structures, and even decompiling assembly code. Notably, GPT-4 showcases significant improvements over GPT-3.5 in most security tasks. Also, certain limitations of ChatGPT in security-related tasks are identified, such as its constrained ability to process long code contexts.

User Edit Pencil Streamline Icon: https://streamlinehq.com
Authors (7)
  1. Fangzhou Wu (11 papers)
  2. Qingzhao Zhang (21 papers)
  3. Ati Priya Bajaj (1 paper)
  4. Tiffany Bao (6 papers)
  5. Ning Zhang (278 papers)
  6. Ruoyu "Fish" Wang (1 paper)
  7. Chaowei Xiao (110 papers)
Citations (6)