| 注册
首页|期刊导航|电子科技大学学报|基于异构图和关键词的抽取式文本摘要模型

基于异构图和关键词的抽取式文本摘要模型

朱颀林 王羽 徐建

电子科技大学学报2024,Vol.53Issue(2):259-270,12.
电子科技大学学报2024,Vol.53Issue(2):259-270,12.DOI:10.12178/1001-0548.2023019

基于异构图和关键词的抽取式文本摘要模型

Extractive Document Summarization Model Based on Heterogeneous Graph and Keywords

朱颀林 1王羽 2徐建1

作者信息

  • 1. 南京理工大学计算机科学与工程学院,南京 210094
  • 2. 国防科技大学信息系统工程重点实验室,长沙 410003||中国电子科技集团公司第二十八研究所,南京 210007
  • 折叠

摘要

Abstract

Extractive document summarization uses certain strategies to select some sentences from lengthy texts to form a summary,whose key is to use as much semantic and structural information of the text as possible.In order to better mine such information and then use it to guide the summarization,an extractive document summarization model based on heterogeneous graph and keywords(HGKSum)is proposed,which models the text as a heterogeneous graph composed of sentence nodes and word nodes.The model uses the graph attention networks to learn the features of the nodes in the graph.The multi-task learning is applied to the model,which considers the keywords extraction task as an auxiliary task of the document summarization task.The candidate summary which derived from the prediction of the neural networks in the model is often highly redundant,so the model refines it to create the final summary of low redundancy.The comparative experiment on the document summarization benchmark shows that the proposed model outperforms the baselines.Besides,ablation studies also demonstrate the necessity of introducing heterogeneous nodes and keywords.

关键词

抽取式文本摘要/异构图/关键词/图注意力网络/多任务学习/关键词抽取任务

Key words

extractive document summarization/heterogeneous graph/keywords/graph attention network/multi-task learning

分类

信息技术与安全科学

引用本文复制引用

朱颀林,王羽,徐建..基于异构图和关键词的抽取式文本摘要模型[J].电子科技大学学报,2024,53(2):259-270,12.

基金项目

国家自然科学基金(61872186) (61872186)

国防基础科研计划国防科技重点实验室稳定支持项目(WDZC20225250405) (WDZC20225250405)

电子科技大学学报

OA北大核心CSTPCD

1001-0548

访问量0
|
下载量0
段落导航相关论文