数字图书馆论坛 (Digital Library Forum), Issue (10): 39-47, 9. DOI: 10.3772/j.issn.1673-2286.2017.10.007
科技论文中学术信息的提取方法综述
A Method Review on Academic Information Extracting from Scientific Papers
Abstract
To make better use of the rich information in academic papers, there is an urgent and practical need to identify and extract the academic information they contain. Academic information extraction has broad application prospects in text mining, information retrieval, topic monitoring, informetrics and many other fields. Academic information falls into five kinds: title information, section information, citation information, reference information and other information. This paper reviews methods for extracting academic information from the full text of academic papers. Different methods can be used to extract different kinds of academic information from different types of full text, PDF or HTML/XML. Finally, the paper lists current tools for extracting academic information.
Keywords
Academic Information / Full Text / Information Extraction / Machine Learning
Classification
Social Sciences
Cite this article
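HU Zhigang (胡志刚), TIAN Wencan (田文灿), SUN Taian (孙太安), HOU Haiyan (侯海燕). 科技论文中学术信息的提取方法综述 [A Method Review on Academic Information Extracting from Scientific Papers] [J]. 数字图书馆论坛 (Digital Library Forum), 2017, (10): 39-47, 9.
Funding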
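This research was supported by the National Natural Science Foundation of China project "Full-Text Citation Analysis Methods and Applications in the Context of Open Access" (Grant No. 71503031).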