首页|期刊导航|软件导刊|面向检索增强式私有问答大模型的幻觉检测方法

面向检索增强式私有问答大模型的幻觉检测方法

李铂鑫鲁骁张霄王斌

软件导刊2026，Vol.25Issue(1)：39-46,8.

软件导刊2026，Vol.25Issue(1)：39-46,8.DOI:10.11907/rjdk.241759

面向检索增强式私有问答大模型的幻觉检测方法

A Hallucination Detection Method for Retrieval-Augmented Private Question-Answering Large Language Models

李铂鑫 ¹鲁骁 ²张霄 ²王斌²

作者信息

1. 北京小米移动软件有限公司小米人工智能实验室,北京 100085||中国科学院软件研究所中文信息处理实验室,北京 100190
2. 北京小米移动软件有限公司小米人工智能实验室,北京 100085
折叠

摘要

Abstract

The existence of Large Language Models'(LLMs)hallucination phenomenon seriously restricts its implementation in practical sce-narios,existing hallucination detection methods were singular,and there was a lack of hallucination detection work under the paradigm of Re-trieval-Augmented Generation(RAG).To address these issues,a hallucination detection method was proposed for private Question-Answer-ing(Q&A)LLMs under the paradigm of RAG.This method integrated two types of hallucination detection methods,one based on uncertainty measure indexes and the other on LLMs'automatic evaluation,which effectively harnessed the hallucination features in the LLMs'generation process and the LLMs'automatic evaluation capability.To enhance the automatic assessment of open-source LLMs,a method of instruction fine-tuning was applied to distill the automatic evaluation ability from closed-source LLMs to open-source ones.Furthermore,a hallucination assessment dataset under the RAG's paradigm was constructed to verify the effectiveness of the proposed method.Experimental results on the above dataset indicate that the proposed integrated method achieved the highest AUC-ROC value,significantly increasing by 11.1％and 4.3％compared to the two baseline methods of uncertainty measure index and LLMs'automatic evaluation,respectively,proving the complementary nature of the two baseline methods.Moreover,the instruction fine-tuned open-source LLM exhibits an 18.6％increase in the AUC-ROC val-ue,with a significant improvement in automatic evaluation capability,validating the effectiveness of the instruction fine-tuning method in en-hancing the automatic evaluation ability of open-source LLMs.

关键词

大模型/幻觉检测/检索增强生成/指令微调/大模型自动评估

Key words

large language model/hallucination detection/retrieval-augmented generation/instruction fine-tuning/automatic evaluation of large language model

分类

信息技术与安全科学

引用本文复制引用

李铂鑫,鲁骁,张霄,王斌..面向检索增强式私有问答大模型的幻觉检测方法[J].软件导刊,2026,25(1):39-46,8.

软件导刊

ISSN：1672-7800

访问量1

下载量0

段落导航