首页|期刊导航|计算机技术与发展|基于实体信息增强思维链的政策长文本摘要方法

基于实体信息增强思维链的政策长文本摘要方法

赵景欣王志强武永亮董佳唐松

计算机技术与发展2026，Vol.36Issue(4)：121-129,9.

计算机技术与发展2026，Vol.36Issue(4)：121-129,9.DOI:10.20165/j.cnki.ISSN1673-629X.2025.0299

基于实体信息增强思维链的政策长文本摘要方法

A Long Policy Text Summarization Method Based on Entity Information Enhancement Chain of Thought

赵景欣 ¹王志强 ¹武永亮 ²董佳 ¹唐松¹

作者信息

1. 河北省科学院应用数学研究所,河北石家庄 050081||河北省信息安全认证工程技术研究中心,河北石家庄 050081
2. 石家庄铁道大学信息科学与技术学院,河北石家庄 050043
折叠

摘要

Abstract

With the deepening of digital governance,the efficient interpretation of policy official documents has become the key requirement of government affairs and social governance,and text summarization technology has become an important method to quickly extract the core content.However,the length of policy official documents is long and the elements are complex.In view of the inefficiency of traditional manual summarization and the problems of entity omission and logical fault when dealing with long-text policies in general large model,a framework for generating long-text summaries of policy official documents based on entity information enhancement chain of thought is proposed,which consists of three core modules.Firstly,the key elements are extracted through entity ex-traction module to guide the large model to pay attention to the core content.Then,through the relationship modeling module,the semantic association is constructed,and the internal logic of official documents is sorted out.Finally,the final document summary is generated by the prompt generation module,and the key information of the policy text is accurately extracted and the summary is generated.The experimental results show that the F1 values of the framework on Rouge-1,Rouge-2,Rouge-L and BERTScore reach 62.48%,33.02%,34.54%and 75.34%,respectively,which are significantly better than that of other comparison models,and are improved by 4.38 percentage points,7.1 percentage points,10.9 percentage points,and 5.37 percentage points respectively compared with the basic Qwen2.5-7B model.The example of generating summaries shows that the summaries generated by the framework are well characterized in terms of accuracy,completeness of details and language fluency.

关键词

政策公文/文本摘要/大语言模型/思维链/实体抽取/关系建模

Key words

policy official documents/text summarization/large language model/chain of thought/entity extraction/relational modeling

分类

信息技术与安全科学

引用本文复制引用

赵景欣,王志强,武永亮,董佳,唐松..基于实体信息增强思维链的政策长文本摘要方法[J].计算机技术与发展,2026,36(4):121-129,9.

基金项目

河北省科学院科技计划项目(25606) （25606）

河北省自然科学基金资助项目(F2024210005) （F2024210005）

计算机技术与发展

ISSN：1673-629X

访问量0

下载量0

段落导航