Digital Library Forum, 2024, Vol. 20, Issue 3: 25-33, 9. DOI: 10.3772/j.issn.1673-2286.2024.03.003
Structure Function Recognition of Scientific Research Project Application Based on Multi-Stage Classification
Abstract
Research project applications contain rich scientific knowledge and are widely used as basic data for scientific and technological information analysis. Some analyses, such as duplicate detection and analysis and mining, can only be carried out once the structure function of the applications has been clarified. This paper therefore proposes a structure function recognition model for research project applications based on multi-stage classification. First, the applications are preprocessed: their main content and multimodal elements are identified, and the text paragraphs are standardized. Next, a BiLSTM-Attention model distinguishes chapter titles from body text, and the primary structure function is recognized from the titles. Finally, the fine-grained structure function of the application is identified. Experiments show that the precision and recall of the model reach 93.7% and 93.1%, respectively. The model can support structured analysis of scientific research project applications and provide a reference for structure function recognition in other types of academic texts.
Keywords
Scientific Research Project Application / Structure Function Recognition / Multi-Stage Classification / BiLSTM-Attention
Classification
Social Sciences
Cite this article
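Lin Xin (林鑫), Du Ying (杜莹), Luo Yu (罗宇). Structure Function Recognition of Scientific Research Project Application Based on Multi-Stage Classification[J]. Digital Library Forum, 2024, 20(3): 25-33, 9.
Funding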
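This research was supported by the National Social Science Fund of China project "Research on Semantic Annotation and Object Linking of Academic Papers for Multimodal Publishing" (No. 23BTQ083).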