| 注册
首页|期刊导航|计算机工程与应用|基于Fin-BERT的中文金融领域事件抽取方法

基于Fin-BERT的中文金融领域事件抽取方法

李熠 耿朝阳 杨丹

计算机工程与应用2024,Vol.60Issue(14):123-132,10.
计算机工程与应用2024,Vol.60Issue(14):123-132,10.DOI:10.3778/j.issn.1002-8331.2304-0224

基于Fin-BERT的中文金融领域事件抽取方法

Fin-BERT-Based Event Extraction Method for Chinese Financial Domain

李熠 1耿朝阳 1杨丹1

作者信息

  • 1. 西安工业大学计算机科学与工程学院,西安 710021
  • 折叠

摘要

Abstract

Event extraction aims to extract human-interest information from massive amounts of unstructured text.Cur-rently,most existing event extraction methods are based on general corpora and rarely consider domain-specific prior knowledge.Moreover,most methods cannot handle well the case where multiple events exist in the same document,and they perform poorly when faced with a large number of negative examples.To address these issues,this paper proposes a model called Fin-PTPCG based on Fin-BERT(financial bidirectional encoder representation from Transformers)and PTPCG(pseudo-trigger-aware pruned complete graph).This method fully utilizes the expression ability of the Fin-BERT pre-training model and incorporates domain-specific prior knowledge during the encoding stage.In the event detection module,multiple binary classifiers are stacked to ensure that the model can effectively identify the situation of multiple events in a document and screen out negative examples.Combined with the decoding module of the PTPCG model,entities are extracted and connected into a complete graph and pruned by calculating a similarity matrix.The problem of unlabeled triggers is solved by selecting pseudo-triggers.Finally,the event extraction is achieved by the event classifier.This method achieves a 0.7 and 3.7 percentage points improvement in F1 score compared to the baselines on the ChFinAnn and Duee-fin datasets for the event extraction task.

关键词

事件抽取/事件检测/信息抽取/自然语言处理

Key words

event extraction/event detection/information extraction/natural language processing

分类

信息技术与安全科学

引用本文复制引用

李熠,耿朝阳,杨丹..基于Fin-BERT的中文金融领域事件抽取方法[J].计算机工程与应用,2024,60(14):123-132,10.

计算机工程与应用

OA北大核心CSTPCD

1002-8331

访问量4
|
下载量0
段落导航相关论文