信息安全研究2025,Vol.11Issue(12):1117-1124,8.DOI:10.12379/j.issn.2096-1057.2025.12.07
基于语言模型与低秩适配的钓鱼邮件高效检测方法
An Efficient Detection Method of Phishing Email Based on Language Model and LoRA
摘要
Abstract
Phishing email detection is critical in cybersecurity,as it faces significant challenges due to the diverse and complex nature of phishing emails.This paper proposes a phishing email detection method integrating the pre-trained language model DistilBERT with Low-Rank Adaptation(LoRA).DistilBERT is used to extract deep semantic features from email text,while LoRA fine-tunes a small number of parameters,thereby reducing the dependence on large-scale labeled data and enhancing the model generalization.Experimental results show that compared to traditional machine learning methods and deep learning models(such as RNN,LSTM,and Bidirectional LSTM),DistilBERT+LoRA outperforms them in key metrics including accuracy,precision,recall,and F1-score,achieving 96%accuracy and 97%F1-score,which significantly surpassing comparative models.Additionally,it demonstrates better balance between precision and recall than other deep learning models,particularly demonstrating robustness and adaptability in detecting complex phishing emails.Experiments further reveal that the model's performance improves with the increase in LoRA's rank parameters.By leveraging the powerful feature extraction capabilities of pre-trained language models and the efficient fine-tuning advantages of LoRA,this method provides an innovative and effective solution for accurate and efficient phishing email detection.关键词
网络安全/钓鱼邮件检测/预训练语言模型/低秩适配/语义特征提取Key words
cybersecurity/phishing email detection/pre-trained language model/LoRA(low-rank adaptation)/semantic feature extraction分类
信息技术与安全科学引用本文复制引用
Li Zichuan,Ji Duo,Zhou Song..基于语言模型与低秩适配的钓鱼邮件高效检测方法[J].信息安全研究,2025,11(12):1117-1124,8.基金项目
国家自然科学基金项目(62406342) (62406342)
辽宁省自然科学基金项目(2022-MS-168) (2022-MS-168)
中国刑事警察学院重点科研课题(2022XKGJ0108) (2022XKGJ0108)