计算机与数字工程 (Computer & Digital Engineering), 2019, Vol. 47, Issue 3: 535-538, 4. DOI: 10.3969/j.issn.1672-9722.2019.03.010
A Keyword Extraction Algorithm Based on Information Entropy (一种基于信息熵的关键词提取算法)
An Extraction Method for Text Keyword Based on Information Entropy
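Wu Hua (吴华) 1, Luo Shun (罗顺) 1, Sun Weijin (孙伟晋) 1
Author information
- 1. Shanghai General Recognition Technology Institute (上海通用识别技术研究所), Shanghai 201112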
Abstract
Keyword extraction underpins techniques such as information retrieval, natural language processing, and ontologies. This paper introduces a new unsupervised keyword extraction method based on information entropy. The algorithm analyzes text without prior knowledge such as a domain dictionary or word segmentation, recognizes out-of-vocabulary words well, and handles multilingual text. An experiment indicates that the algorithm achieves good precision and recall, with satisfactory results.
Keywords
text analysis / natural language processing / keyword extraction / unsupervised method
Classification
Information technology and security science
Cite this article
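Wu Hua, Luo Shun, Sun Weijin. A Keyword Extraction Algorithm Based on Information Entropy [J]. 计算机与数字工程 (Computer & Digital Engineering), 2019, 47(3): 535-538, 4.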
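Illustrative sketch. The abstract says the method uses information entropy and needs neither a dictionary nor word segmentation, but it does not say how entropy is applied. One common dictionary-free technique, shown here purely as an assumption and not as the authors' algorithm, is left/right branching entropy: a character n-gram whose neighbouring characters vary widely on both sides is likely to be a self-contained term. The names `shannon_entropy` and `branching_entropy_candidates`, and the parameters `max_len` and `min_count`, are hypothetical.

```python
import math
from collections import Counter, defaultdict


def shannon_entropy(counts):
    """Shannon entropy, in bits, of a frequency table."""
    total = sum(counts.values())
    return sum((c / total) * math.log2(total / c) for c in counts.values())


def branching_entropy_candidates(text, max_len=4, min_count=2):
    """Rank character n-grams by min(left entropy, right entropy).

    Assumption: this is a generic branching-entropy heuristic, not the
    method from the paper, whose details the abstract does not give.
    """
    left = defaultdict(Counter)   # n-gram -> counts of the preceding character
    right = defaultdict(Counter)  # n-gram -> counts of the following character
    freq = Counter()
    n = len(text)
    for size in range(2, max_len + 1):
        for i in range(n - size + 1):
            gram = text[i:i + size]
            freq[gram] += 1
            # Boundary markers give n-grams at the text edges a neighbour.
            left[gram][text[i - 1] if i > 0 else "^"] += 1
            right[gram][text[i + size] if i + size < n else "$"] += 1
    scores = {
        gram: min(shannon_entropy(left[gram]), shannon_entropy(right[gram]))
        for gram, count in freq.items()
        if count >= min_count
    }
    # Highest score first; ties broken by frequency, then alphabetically.
    return sorted(scores.items(), key=lambda kv: (-kv[1], -freq[kv[0]], kv[0]))


if __name__ == "__main__":
    sample = "a信息熵b信息熵c信息熵d"
    for gram, score in branching_entropy_candidates(sample):
        print(gram, round(score, 3))
```

Because the sketch works on raw characters, it needs no segmenter and treats any script the same way, which matches the spirit of the dictionary-free, multilingual claim; a practical system would also need frequency thresholds and stop-pattern filtering tuned on real data.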