首页|期刊导航|计算机科学与探索|基于大语言模型的公安专业小样本知识抽取方法研究

基于大语言模型的公安专业小样本知识抽取方法研究

裴炳森李欣蒋章涛刘明帅

计算机科学与探索2024，Vol.18Issue(10)：2630-2642,13.

计算机科学与探索2024，Vol.18Issue(10)：2630-2642,13.DOI:10.3778/j.issn.1673-9418.2403039

基于大语言模型的公安专业小样本知识抽取方法研究

Research on Public Security Professional Small Sample Knowledge Extraction Method Based on Large Language Model

裴炳森 ¹李欣 ¹蒋章涛 ¹刘明帅¹

作者信息

1. 中国人民公安大学信息网络安全学院,北京 100038
折叠

摘要

Abstract

The rapid development of informatization and digitalization in public security business has generated a large amount of law enforcement case data in public security work.However,due to various types of text and large amount of information,front-line police officers often face problems such as low reading efficiency and difficulty in aggregating information in the process of reading case files.In order to further utilize the law enforcement case text,it is necessary to conduct intelligent analysis and knowledge extraction.However,due to the professionalism,data sensitivity,confidentiality of public security professional law enforcement case text,as well as the requirements of public security data going out of the network,only a small number of learning training samples can be obtained,and the traditional deep learning model has unsatisfactory extraction effect.Therefore,this paper proposes to build a large language model in vertical fields with fewer resources and data,and realize the adaptation of the model to the public security profession.The model uses knowledge editing technology MEMIT(mess-editing memory in a trans-former),low-resource fine-tuning technology LoRA(low-rank adaptation),and prompt templates to improve the model's understanding of public security knowledge such as police terminology and common sense.Moreover,in order to further improve the knowledge extraction effect of the model,a small sample law enforcement case text data extraction process is designed to better integrate the professional knowledge related to the case in the model.Experimental results show that the accuracy of the public security professional vertical field large language model integrated with the extraction process in various knowledge extraction tasks is significantly improved compared with the traditional methods,which helps front-line police officers quickly,objectively and accurately analyze law enforcement case text,dig out potential case information,and support the intelligent development of public security work.

关键词

大语言模型/知识抽取/小样本数据/公安执法办案

Key words

large language model/knowledge extraction/small sample data/public security law enforcement

分类

信息技术与安全科学

引用本文复制引用

裴炳森,李欣,蒋章涛,刘明帅..基于大语言模型的公安专业小样本知识抽取方法研究[J].计算机科学与探索,2024,18(10):2630-2642,13.

基金项目

国家重点研发计划(2022300070005).This work was supported by the National Key Research and Development Program of China(2022300070005). （2022300070005）

计算机科学与探索

OA北大核心CSTPCD

ISSN：1673-9418

访问量3

下载量0

段落导航