Evaluation of ChatGPT Model for Vulnerability Detection (2304.07232v1)

Published 12 Apr 2023 in cs.CR, cs.AI, and cs.SE

Abstract: In this technical report, we evaluated the performance of the ChatGPT and GPT-3 models for the task of vulnerability detection in code. Our evaluation was conducted on our real-world dataset, using binary and multi-label classification tasks on CWE vulnerabilities. We decided to evaluate the model because it has shown good performance on other code-based tasks, such as solving programming challenges and understanding code at a high level. However, we found that the ChatGPT model performed no better than a dummy classifier for both binary and multi-label classification tasks for code vulnerability detection.
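As an illustration of what comparing against a dummy classifier means in the binary task, the following is a minimal sketch, not the authors' evaluation code. It assumes a majority-class baseline and the F1 score as the metric; the function names (`majority_baseline`, `f1_binary`) and all label values are hypothetical toy data chosen for the example, not results from the report.

```python
from collections import Counter


def majority_baseline(train_labels, n):
    """Dummy classifier: predict the most frequent training label for all n samples."""
    most_common = Counter(train_labels).most_common(1)[0][0]
    return [most_common] * n


def f1_binary(y_true, y_pred, positive=1):
    """F1 score for the positive class, computed from raw counts."""
    pairs = list(zip(y_true, y_pred))
    tp = sum(1 for t, p in pairs if t == positive and p == positive)
    fp = sum(1 for t, p in pairs if t != positive and p == positive)
    fn = sum(1 for t, p in pairs if t == positive and p != positive)
    if tp == 0:
        return 0.0
    precision = tp / (tp + fp)
    recall = tp / (tp + fn)
    return 2 * precision * recall / (precision + recall)


# Hypothetical toy labels: 1 = vulnerable, 0 = not vulnerable.
train_labels = [1, 1, 1, 0, 0]
y_true = [1, 0, 1, 1, 0, 1]
model_pred = [1, 1, 0, 1, 0, 1]  # stand-in for a model's predictions (assumed)

dummy_pred = majority_baseline(train_labels, len(y_true))
model_f1 = f1_binary(y_true, model_pred)
dummy_f1 = f1_binary(y_true, dummy_pred)

print(f"model F1 = {model_f1:.2f}, dummy F1 = {dummy_f1:.2f}")
```

On this toy data the stand-in model does not beat the baseline, which is the kind of outcome the abstract describes: a model that looks reasonable in isolation may add nothing over always predicting the majority class.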

Authors (3)
  1. Anton Cheshkov (4 papers)
  2. Pavel Zadorozhny (3 papers)
  3. Rodion Levichev (3 papers)
Citations (53)