2000 character limit reached
Large Language Model for Vulnerability Detection: Emerging Results and Future Directions (2401.15468v1)
Published 27 Jan 2024 in cs.SE
Abstract: Previous learning-based vulnerability detection methods relied on either medium-sized pre-trained models or smaller neural networks from scratch. Recent advancements in Large Pre-Trained LLMs have showcased remarkable few-shot learning capabilities in various tasks. However, the effectiveness of LLMs in detecting software vulnerabilities is largely unexplored. This paper aims to bridge this gap by exploring how LLMs perform with various prompts, particularly focusing on two state-of-the-art LLMs: GPT-3.5 and GPT-4. Our experimental results showed that GPT-3.5 achieves competitive performance with the prior state-of-the-art vulnerability detection approach and GPT-4 consistently outperformed the state-of-the-art.
- Xin Zhou (319 papers)
- Ting Zhang (174 papers)
- David Lo (229 papers)