An Explainable Transformer-based Model for Phishing Email Detection: A Large Language Model Approach (2402.13871v1)
Abstract: Phishing email is a serious cyber threat that tries to deceive users by sending false emails with the intention of stealing confidential information or causing financial harm. Attackers, often posing as trustworthy entities, exploit technological advancements and sophistication to make detection and prevention of phishing more challenging. Despite extensive academic research, phishing detection remains an ongoing and formidable challenge in the cybersecurity landscape. LLMs and Masked LLMs (MLMs) possess immense potential to offer innovative solutions to address long-standing challenges. In this research paper, we present an optimized, fine-tuned transformer-based DistilBERT model designed for the detection of phishing emails. In the detection process, we work with a phishing email dataset and utilize the preprocessing techniques to clean and solve the imbalance class issues. Through our experiments, we found that our model effectively achieves high accuracy, demonstrating its capability to perform well. Finally, we demonstrate our fine-tuned model using Explainable-AI (XAI) techniques such as Local Interpretable Model-Agnostic Explanations (LIME) and Transformer Interpret to explain how our model makes predictions in the context of text classification for phishing emails.
- Phishing email detection using natural language processing techniques: a literature survey, Procedia Computer Science 189 (2021) 19–28.
- A comprehensive survey of ai-enabled phishing attacks detection techniques, Telecommunication Systems 76 (2021) 139–154.
- I. H. Sarker, Machine learning for intelligent data analysis and automation in cybersecurity: current and future prospects, Annals of Data Science 10 (2023) 1473–1498.
- Transformer in transformer, Advances in Neural Information Processing Systems 34 (2021) 15908–15919.
- An improved transformer-based model for detecting phishing, spam, and ham: A large language model approach, arXiv preprint arXiv:2311.04913 (2023).
- A survey of large language models, arXiv preprint arXiv:2303.18223 (2023).
- A survey on large language model (llm) security and privacy: The good, the bad, and the ugly, arXiv preprint arXiv:2312.02003 (2023).
- M. Koroteev, Bert: a review of applications in natural language processing and understanding, arXiv preprint arXiv:2103.11943 (2021).
- Sentiment analysis on the impact of coronavirus in social life using the bert model, Social Network Analysis and Mining 11 (2021) 33.
- Bert: Pre-training of deep bidirectional transformers for language understanding, arXiv preprint arXiv:1810.04805 (2018).
- Survey of bert-base models for scientific text classification: Covid-19 case study, Applied Sciences 12 (2022) 2891.
- Albert: A lite bert for self-supervised learning of language representations, arXiv preprint arXiv:1909.11942 (2019).
- Roberta: A robustly optimized bert pretraining approach, arXiv preprint arXiv:1907.11692 (2019).
- Distilbert, a distilled version of bert: smaller, faster, cheaper and lighter, arXiv preprint arXiv:1910.01108 (2019).
- Explainable ai: A brief survey on history, research areas, approaches and challenges, in: Natural Language Processing and Chinese Computing: 8th CCF International Conference, NLPCC 2019, Dunhuang, China, October 9–14, 2019, Proceedings, Part II 8, Springer, 2019, pp. 563–574.
- Metrics for explainable ai: Challenges and prospects, arXiv preprint arXiv:1812.04608 (2018).
- Interpretable bangla sarcasm detection using bert and explainable ai, in: 2023 IEEE 13th Annual Computing and Communication Workshop and Conference (CCWC), IEEE, 2023, pp. 1272–1278.
- The role of machine learning in cybersecurity, Digital Threats: Research and Practice 4 (2023) 1–38.
- A. Yasin, A. Abuhasan, An intelligent classification model for phishing email detection, arXiv preprint arXiv:1608.02196 (2016).
- A machine learning approach towards phishing email detection, in: Proceedings of the Anti-Phishing Pilot at ACM International Workshop on Security and Privacy Analytics (IWSPA AP), volume 2013, 2018, pp. 455–468.
- Using feature selection and classification scheme for automating phishing email detection (2013).
- Phishing web site detection using diverse machine learning algorithms, The Electronic Library 38 (2020) 65–80.
- A. Alhogail, A. Alsabih, Applying machine learning and natural language processing to detect phishing email, Computers & Security 110 (2021) 102414.
- Intelligent deep learning based cybersecurity phishing email detection and classification., Computers, Materials & Continua 74 (2023).
- M. Dewis, T. Viana, Phish responder: A hybrid machine learning approach to detect phishing and spam emails, Applied System Innovation 5 (2022) 73.
- Phishing email detection using improved rcnn model with multilevel vectors and attention mechanism, IEEE Access 7 (2019) 56329–56340.
- J. Zhang, X. Li, Phishing detection method based on borderline-smote deep belief network, in: Security, Privacy, and Anonymity in Computation, Communication, and Storage: SpaCCS 2017 International Workshops, Guangzhou, China, December 12-15, 2017, Proceedings 10, Springer, 2017, pp. 45–53.
- Classifying phishing urls using recurrent neural networks, in: 2017 APWG symposium on electronic crime research (eCrime), IEEE, 2017, pp. 1–8.
- Detection of online phishing email using dynamic evolving neural network based on reinforcement learning, Decision Support Systems 107 (2018) 88–102.
- Evaluation of federated learning in phishing email detection, Sensors 23 (2023) 4346.
- S. Atawneh, H. Aljehani, Phishing email detection model using deep learning, Electronics 12 (2023) 4261.
- Tinybert: Distilling bert for natural language understanding, arXiv preprint arXiv:1909.10351 (2019).
- Catbert: Context-aware tiny bert for detecting social engineering emails, arXiv preprint arXiv:2010.03484 (2020).
- Bert-based models for phishing detection (2023).
- A large-scale pretrained deep model for phishing url detection, in: ICASSP 2023-2023 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), IEEE, 2023, pp. 1–5.
- Urltran: Improving phishing url detection using transformers, in: MILCOM 2021-2021 IEEE Military Communications Conference (MILCOM), IEEE, 2021, pp. 197–204.
- Comparative analyses of bert, roberta, distilbert, and xlnet for text-based emotion recognition, in: 2020 17th International Computer Conference on Wavelet Active Media Technology and Information Processing (ICCWAMTIP), IEEE, 2020, pp. 117–121.
- R. Lin, Analysis on the selection of the appropriate batch size in cnn neural network, in: 2022 International Conference on Machine Learning and Knowledge Engineering (MLKE), IEEE, 2022, pp. 106–109.
- I. Loshchilov, F. Hutter, Decoupled weight decay regularization, arXiv preprint arXiv:1711.05101 (2017).
- Understanding adamw through proximal methods and scale-freeness, arXiv preprint arXiv:2202.00089 (2022).
- Explainable ai methods-a brief overview, in: International workshop on extending explainable AI beyond deep models and classifiers, Springer, 2022, pp. 13–38.
- " why should i trust you?" explaining the predictions of any classifier, in: Proceedings of the 22nd ACM SIGKDD international conference on knowledge discovery and data mining, 2016, pp. 1135–1144.
- Cyber-attack detection through ensemble-based machine learning classifier, in: International Conference on Machine Intelligence and Emerging Technologies, Springer, 2022, pp. 386–396.
- Mohammad Amaz Uddin (2 papers)
- Iqbal H. Sarker (36 papers)