计算机工程Issue(12):126-131,6.DOI:10.3969/j.issn.1000-3428.2014.12.023
基于自然语言处理的算法知识名称发现
Discovering Terms of Algorithmic Knowledge Based on Natural Language Processing
摘要
Abstract
There are many programming resources on the Internet. Although these programming resources have internal relations,there are often no hyperlinks connecting them. Getting the terms of algorithmic knowledge,organizing the terms to an expert file,which is used for recognizing the knowledge in the programming resources,the programming resources can be connected by the knowledge. To get the terms of algorithmic knowledge,this paper proposes a method to discover terms of algorithmic knowledge based on natural language processing. This method consists of discovering the patterns of strings which contain terms of algorithmic knowledge,extracting from programming resources that probably contain terms of algorithmic knowledge according to the discovered patterns,finding the word segmentation most likely appearing in the terms of algorithmic knowledge,and fetching the terms of algorithmic knowledge according to the word segmentation. This method increases 11 . 2% algorithmic knowledge and 13 . 6% terms of algorithmic knowledge in comparison with the manual collection of terms of algorithmic knowledge which is obtained by previous work.关键词
知识发现/模式发现/自然语言处理/算法知识名称/中文分词/词性标注Key words
knowledge discovery/pattern discovery/natural language processing/terms of algorithmic knowledge/Chinese word segmentation/part-of-speech tagging分类
信息技术与安全科学引用本文复制引用
朱国进,郑宁..基于自然语言处理的算法知识名称发现[J].计算机工程,2014,(12):126-131,6.基金项目
国家自然科学基金资助项目(60973121)。 (60973121)