| 注册
首页|期刊导航|四川大学学报(自然科学版)|生成式摘要的事实一致性与文本质量的平衡性研究

生成式摘要的事实一致性与文本质量的平衡性研究

杨昱睿 何禹瞳 琚生根

四川大学学报(自然科学版)2025,Vol.62Issue(2):347-358,12.
四川大学学报(自然科学版)2025,Vol.62Issue(2):347-358,12.DOI:10.19907/j.0490-6756.240241

生成式摘要的事实一致性与文本质量的平衡性研究

The study on balancing factual consistency and text quality in abstractive summarization

杨昱睿 1何禹瞳 2琚生根1

作者信息

  • 1. 四川大学计算机学院,成都 610065
  • 2. 兰州大学信息科学与工程学院,兰州 730000
  • 折叠

摘要

Abstract

Enhancing factual consistency has become a research hotspot in the field of abstractive summariza-tion.Current mainstream methods can be categorized into two approaches:post-editing of generated summa-ries and optimization of model mechanisms.While these methods effectively improve factual consistency,they often sacrifice text quality and readability.To address this issue,the authors propose an abstractive sum-marization model named SumRCL(Summarization with Reinforcement and Contrastive Learning)that com-bines reinforcement learning with ranking-based contrastive learning.On the one hand,the authors leverage ranking-based contrastive learning on candidate summaries to enhance the correlation between the probability assigned to a summary by the model and its factual consistency.On the other hand,the authors employ rein-forcement learning based on text quality metrics to preserve high-quality text.Specifically,the authors utilize Monte Carlo search to address the issue of intermediate summary evaluation.Experiments on the CNN/DM and XSUM datasets demonstrate that our proposed SumRCL model indeed contributes to generating summa-ries with both high factual consistency and text quality.The authors analyze the effects of the number of candi-date summaries and the choice of ranking metrics in contrastive learning on the final performance.Finally,through manual evaluation,the authors demonstrate that SumRCL exhibits superior factual consistency com-pared to popular large language models.

关键词

生成式摘要/事实一致性/强化学习/对比学习/大语言模型

Key words

Abstractive summarization/Factual consistency/Reinforcement learning/Contrastive learning/Large language model

分类

计算机与自动化

引用本文复制引用

杨昱睿,何禹瞳,琚生根..生成式摘要的事实一致性与文本质量的平衡性研究[J].四川大学学报(自然科学版),2025,62(2):347-358,12.

基金项目

国家自然科学基金重点项目(62137001) (62137001)

四川省重点研发项目(2023YFG0265) (2023YFG0265)

四川大学学报(自然科学版)

OA北大核心

0490-6756

访问量0
|
下载量0
段落导航相关论文