首页|期刊导航|四川大学学报（自然科学版）|生成式摘要的事实一致性与文本质量的平衡性研究

生成式摘要的事实一致性与文本质量的平衡性研究

杨昱睿何禹瞳琚生根

四川大学学报（自然科学版）2025，Vol.62Issue(2)：347-358,12.

四川大学学报（自然科学版）2025，Vol.62Issue(2)：347-358,12.DOI:10.19907/j.0490-6756.240241

生成式摘要的事实一致性与文本质量的平衡性研究

The study on balancing factual consistency and text quality in abstractive summarization

杨昱睿 ¹何禹瞳 ²琚生根¹

作者信息

1. 四川大学计算机学院,成都 610065
2. 兰州大学信息科学与工程学院,兰州 730000
折叠

摘要

Abstract

Enhancing factual consistency has become a research hotspot in the field of abstractive summariza-tion.Current mainstream methods can be categorized into two approaches:post-editing of generated summa-ries and optimization of model mechanisms.While these methods effectively improve factual consistency,they often sacrifice text quality and readability.To address this issue,the authors propose an abstractive sum-marization model named SumRCL(Summarization with Reinforcement and Contrastive Learning)that com-bines reinforcement learning with ranking-based contrastive learning.On the one hand,the authors leverage ranking-based contrastive learning on candidate summaries to enhance the correlation between the probability assigned to a summary by the model and its factual consistency.On the other hand,the authors employ rein-forcement learning based on text quality metrics to preserve high-quality text.Specifically,the authors utilize Monte Carlo search to address the issue of intermediate summary evaluation.Experiments on the CNN/DM and XSUM datasets demonstrate that our proposed SumRCL model indeed contributes to generating summa-ries with both high factual consistency and text quality.The authors analyze the effects of the number of candi-date summaries and the choice of ranking metrics in contrastive learning on the final performance.Finally,through manual evaluation,the authors demonstrate that SumRCL exhibits superior factual consistency com-pared to popular large language models.

关键词

生成式摘要/事实一致性/强化学习/对比学习/大语言模型

Key words

Abstractive summarization/Factual consistency/Reinforcement learning/Contrastive learning/Large language model

分类

信息技术与安全科学

引用本文复制引用

杨昱睿,何禹瞳,琚生根..生成式摘要的事实一致性与文本质量的平衡性研究[J].四川大学学报（自然科学版）,2025,62(2):347-358,12.

基金项目

国家自然科学基金重点项目(62137001) （62137001）

四川省重点研发项目(2023YFG0265) （2023YFG0265）

四川大学学报（自然科学版）

OA北大核心

ISSN：0490-6756

访问量0

下载量0

段落导航