Papers
Topics
Authors
Recent
Gemini 2.5 Flash
Gemini 2.5 Flash
110 tokens/sec
GPT-4o
56 tokens/sec
Gemini 2.5 Pro Pro
44 tokens/sec
o3 Pro
6 tokens/sec
GPT-4.1 Pro
47 tokens/sec
DeepSeek R1 via Azure Pro
28 tokens/sec
2000 character limit reached

Learning to map source code to software vulnerability using code-as-a-graph (2006.08614v1)

Published 15 Jun 2020 in cs.SE, cs.CR, cs.LG, and cs.PL

Abstract: We explore the applicability of Graph Neural Networks in learning the nuances of source code from a security perspective. Specifically, whether signatures of vulnerabilities in source code can be learned from its graph representation, in terms of relationships between nodes and edges. We create a pipeline we call AI4VA, which first encodes a sample source code into a Code Property Graph. The extracted graph is then vectorized in a manner which preserves its semantic information. A Gated Graph Neural Network is then trained using several such graphs to automatically extract templates differentiating the graph of a vulnerable sample from a healthy one. Our model outperforms static analyzers, classic machine learning, as well as CNN and RNN-based deep learning models on two of the three datasets we experiment with. We thus show that a code-as-graph encoding is more meaningful for vulnerability detection than existing code-as-photo and linear sequence encoding approaches. (Submitted Oct 2019, Paper #28, ICST)

User Edit Pencil Streamline Icon: https://streamlinehq.com
Authors (5)
  1. Sahil Suneja (9 papers)
  2. Yunhui Zheng (11 papers)
  3. Yufan Zhuang (16 papers)
  4. Jim Laredo (8 papers)
  5. Alessandro Morari (10 papers)
Citations (29)

Summary

We haven't generated a summary for this paper yet.