融合预训练模型与注意力的事件抽取方法OA北大核心
Incorporating Pre-Trained Model and Attention Mechanism for Event Extraction
事件抽取旨在从大量无结构的文本中抽取出结构化的事件信息,然而现有的研究工作存在难以抽取重叠角色,子任务间缺乏交互以及语义特征表达能力不足的问题.针对上述问题提出了一种中文事件抽取模型PACJEE(pre-trained language model and attention mechanism based Chinese joint event extraction).该模型采用预训练语言模型RoBERTa来提取文本特征,对文本进行事件类型分类,在触发词识别阶段,将提取到的事件类型先验特征与文本特征进行融合,并且使用自注意力机制获取内部特征相关性,在论元角色分类阶段引入卷积神经网络与注意力机制来加强触发词特征的表达能力,通过多层指针标注进行重叠角色的识别.该方法在中文数据集ACE2005和DuEE上进行了实验分析,结果显示,相较于基准方法,在触发词分类上的F1值分别提升1.6和0.5个百分点,在论元角色分类上的F1值分别提升3.3和2.5个百分点,说明该模型能显著提升事件抽取效果,并且在一定程度上提升了对角色重叠事件的识别准确率.
Event extraction aims to extract structured event information from a large amount of unstructured texts,but existing research work has problems such as difficulty in extracting overlapping roles,lack of interaction between sub-tasks,and insufficient semantic feature expression ability.This paper proposes a Chinese event extraction model PACJEE(pre-trained language model and attention mechanism based Chinese joint event extraction)to address these problems.The model uses the pre-trained language model RoBERTa to extract text features,then classifies the event types of the text,in the trigger word recognition stage,fuses the extracted event type prior features with the text features,and uses self-attention mechanism to obtain the internal feature relevance,in the argument role classification stage,introduces convolu-tional neural network and attention mechanism to enhance the trigger word feature expression ability,and finally uses multi-layer pointer tagging to identify overlapping roles.The method is experimentally analyzed on the Chinese datasets ACE2005 and DuEE,and the results show that compared with the baseline methods,the F1 values of trigger word classifi-cation are increased by 1.6 and 0.5 percentage points respectively,and the F1 values of argument role classification are increased by 3.3 and 2.5 percentage points respectively,indicating that the model can significantly improve the event extraction effect,and to a certain extent,improve the recognition accuracy of overlapping role events.
肖立中;殷晨旭
上海应用技术大学 计算机科学与信息工程学院,上海 201418上海应用技术大学 计算机科学与信息工程学院,上海 201418
计算机与自动化
事件抽取角色重叠特征融合注意力机制
event extractionoverlapping rolesfeature integrationattention mechanism
《计算机工程与应用》 2025 (4)
130-140,11
上海市自然科学基金(20ZR1455600).
评论