信息安全研究2026,Vol.12Issue(2):151-163,13.DOI:10.12379/j.issn.2096-1057.2026.02.07
基于大语言模型的钓鱼邮件检测技术研究
Research on Phishing Email Detection Based on Large Language Model
摘要
Abstract
With the rapid increase in phishing email volumes and the continuous evolution of adversarial techniques,traditional phishing detection methods have encountered significant challenges regarding efficiency and accuracy.To address issues such as low detection rates,high false-negative rates,and poor human-computer interaction in existing systems,the authors proposed a phishing email detection system based on large language model.Through comprehensive analysis of key phishing email characteristics—including header fields,body content,URLs,QR codes,attachments,and HTML pages—they constructed a high-quality training dataset using feature insertion algorithms.Building upon the pre-trained LLaMA model,the researchers implemented LoRA fine-tuning technology,achieving domain knowledge transfer by updating only 0.72%of model parameters(approximately 50MB).Experimental results demonstrate that compared to traditional methods,the LLM-based detection approach achieves 94.5%overall accuracy with enhanced robustness,effectively reduces false-positive rates,improves classification and interpretation capabilities for phishing email features,and provides a more practical and reliable solution for phishing detection.关键词
钓鱼邮件/大语言模型/预训练语言模型/低秩自适应/微调Key words
phishing email/large language model/pre-trained language model/low-rank adaptation/fine-tuning分类
信息技术与安全科学引用本文复制引用
袁斌,杨克涵,邹德清,刘勇,张乾坤..基于大语言模型的钓鱼邮件检测技术研究[J].信息安全研究,2026,12(2):151-163,13.基金项目
国家自然科学基金项目(62372191) (62372191)
湖北省自然科学基金项目(2023AFB258) (2023AFB258)
嵩山实验室项目(241110210200) (241110210200)