A Keyword Extraction Algorithm Based on Information Entropy (一种基于信息熵的关键词提取算法)

Wu Hua (吴华), Luo Shun (罗顺), Sun Weijin (孙伟晋)

Computer & Digital Engineering (计算机与数字工程), 2019, Vol. 47, Issue 3: 535-538, 4. DOI: 10.3969/j.issn.1672-9722.2019.03.010

An Extraction Method For Test Keyword Based on Information Entropy

Wu Hua¹, Luo Shun¹, Sun Weijin¹

Author information

  • 1. Shanghai General Recognition Technology Institute (上海通用识别技术研究所), Shanghai 201112

Abstract

Keyword extraction is the basis for techniques such as information retrieval, natural language processing, and ontology. This paper introduces a new unsupervised keyword extraction method based on information entropy. The algorithm can analyze texts without prior knowledge such as a domain dictionary or word segmentation, can reliably recognize out-of-vocabulary words, and can handle multi-language text. An experiment indicates that the algorithm achieves good precision and recall, with satisfactory results.

Keywords

text analysis / natural language processing / keyword extraction / unsupervised method

Classification

Information Technology and Security Science

Cite this article

Wu Hua, Luo Shun, Sun Weijin. A Keyword Extraction Algorithm Based on Information Entropy [J]. Computer & Digital Engineering, 2019, 47(3): 535-538, 4.

Computer & Digital Engineering (计算机与数字工程)

OA / CSTPCD

ISSN 1672-9722
