首页|期刊导航|信息安全研究|基于大语言模型的钓鱼邮件检测技术研究

基于大语言模型的钓鱼邮件检测技术研究

袁斌杨克涵邹德清刘勇张乾坤

信息安全研究2026，Vol.12Issue(2)：151-163,13.

信息安全研究2026，Vol.12Issue(2)：151-163,13.DOI:10.12379/j.issn.2096-1057.2026.02.07

基于大语言模型的钓鱼邮件检测技术研究

Research on Phishing Email Detection Based on Large Language Model

袁斌 ¹杨克涵 ²邹德清 ²刘勇 ³张乾坤²

作者信息

1. 华中科技大学网络空间安全学院武汉 430074||嵩山实验室郑州 452470
2. 华中科技大学网络空间安全学院武汉 430074
3. 中关村实验室北京 100190||奇安信集团股份有限公司北京 100044
折叠

摘要

Abstract

With the rapid increase in phishing email volumes and the continuous evolution of adversarial techniques,traditional phishing detection methods have encountered significant challenges regarding efficiency and accuracy.To address issues such as low detection rates,high false-negative rates,and poor human-computer interaction in existing systems,the authors proposed a phishing email detection system based on large language model.Through comprehensive analysis of key phishing email characteristics—including header fields,body content,URLs,QR codes,attachments,and HTML pages—they constructed a high-quality training dataset using feature insertion algorithms.Building upon the pre-trained LLaMA model,the researchers implemented LoRA fine-tuning technology,achieving domain knowledge transfer by updating only 0.72％of model parameters(approximately 50MB).Experimental results demonstrate that compared to traditional methods,the LLM-based detection approach achieves 94.5％overall accuracy with enhanced robustness,effectively reduces false-positive rates,improves classification and interpretation capabilities for phishing email features,and provides a more practical and reliable solution for phishing detection.

关键词

钓鱼邮件/大语言模型/预训练语言模型/低秩自适应/微调

Key words

phishing email/large language model/pre-trained language model/low-rank adaptation/fine-tuning

分类

信息技术与安全科学

引用本文复制引用

袁斌,杨克涵,邹德清,刘勇,张乾坤..基于大语言模型的钓鱼邮件检测技术研究[J].信息安全研究,2026,12(2):151-163,13.

基金项目

国家自然科学基金项目(62372191) （62372191）

湖北省自然科学基金项目(2023AFB258) （2023AFB258）

嵩山实验室项目(241110210200) （241110210200）

信息安全研究

ISSN：2096-1057

访问量0

下载量0

段落导航