| 注册
首页|期刊导航|数字图书馆论坛|科技论文中学术信息的提取方法综述

科技论文中学术信息的提取方法综述

胡志刚 田文灿 孙太安 侯海燕

数字图书馆论坛Issue(10):39-47,9.
数字图书馆论坛Issue(10):39-47,9.DOI:10.3772/j.issn.1673-2286.2017.10.007

科技论文中学术信息的提取方法综述

A Method Review on Academic Information Extracting from Scientific Papers

胡志刚 1田文灿 2孙太安 1侯海燕2

作者信息

  • 1. 大连理工大学科学学与科技管理研究所,大连116024
  • 2. 大连理工大学WISE实验室,大连116024
  • 折叠

摘要

Abstract

In order to make better use of rich information in academic papers, it is a very urgent and realistic requirement to identify and extract academic information within. The academic information extracting has a broad application prospect in text mining, information retrieval, theme monitoring, information metrology and many other fields. There are five kinds of academic information, such as title information, section information, citation information, reference information and other information. This paper reviews the methods of academic information extracting from the ful text of academic papers. Different methods could be used to extract different kinds of academic information from different types of ful texts, PDF or HTML/XML. Final y, the paper also lists the current tools for extracting academic information.

关键词

学术信息/论文全文本/信息提取/机器学习

Key words

Academic Information/Ful Text/Information Extraction/Machine Learning

分类

社会科学

引用本文复制引用

胡志刚,田文灿,孙太安,侯海燕..科技论文中学术信息的提取方法综述[J].数字图书馆论坛,2017,(10):39-47,9.

基金项目

本研究得到国家自然科学基金项目"开放获取背景下的全文引文分析方法与应用研究"(编号:71503031)资助. (编号:71503031)

数字图书馆论坛

OACSSCICSTPCD

1673-2286

访问量0
|
下载量0
段落导航相关论文