Papers
Topics
Authors
Recent
Gemini 2.5 Flash
Gemini 2.5 Flash
119 tokens/sec
GPT-4o
56 tokens/sec
Gemini 2.5 Pro Pro
43 tokens/sec
o3 Pro
6 tokens/sec
GPT-4.1 Pro
47 tokens/sec
DeepSeek R1 via Azure Pro
28 tokens/sec
2000 character limit reached

Prompted Contextual Vectors for Spear-Phishing Detection (2402.08309v2)

Published 13 Feb 2024 in cs.LG, cs.CL, and cs.CR

Abstract: Spear-phishing attacks present a significant security challenge, with LLMs escalating the threat by generating convincing emails and facilitating target reconnaissance. To address this, we propose a detection approach based on a novel document vectorization method that utilizes an ensemble of LLMs to create representation vectors. By prompting LLMs to reason and respond to human-crafted questions, we quantify the presence of common persuasion principles in the email's content, producing prompted contextual document vectors for a downstream supervised machine learning model. We evaluate our method using a unique dataset generated by a proprietary system that automates target reconnaissance and spear-phishing email creation. Our method achieves a 91% F1 score in identifying LLM-generated spear-phishing emails, with the training set comprising only traditional phishing and benign emails. Key contributions include an innovative document vectorization method utilizing LLM reasoning, a publicly available dataset of high-quality spear-phishing emails, and the demonstrated effectiveness of our method in detecting such emails. This methodology can be utilized for various document classification tasks, particularly in adversarial problem domains.

User Edit Pencil Streamline Icon: https://streamlinehq.com
Authors (4)
  1. Daniel Nahmias (1 paper)
  2. Gal Engelberg (2 papers)
  3. Dan Klein (99 papers)
  4. Asaf Shabtai (119 papers)
Citations (8)

Summary

We haven't generated a summary for this paper yet.