| 注册
首页|期刊导航|计算机与现代化|文本特征选择算法MI的改进

文本特征选择算法MI的改进

方志龙

计算机与现代化Issue(7):172-175,4.
计算机与现代化Issue(7):172-175,4.DOI:10.3969/j.issn.1006-2475.2011.07.047

文本特征选择算法MI的改进

Improvement of Mutual Information of Feature Extraction

方志龙1

作者信息

  • 1. 华南师范大学计算机学院,广东,广州,510631
  • 折叠

摘要

Abstract

Feature extraction is a crucial part in text mining. After word splitting, the docs of the train set form the original feature space, but the dimension of the space is usually very large, it reaches hundreds of thousands of demensions. After feature extraction, not only the dimension of the space decreases sharply, but also, the impact of the noise is reduced. Finally, speed and precision of the classifier are both increased. This paper improves the original mutual information method, and proves it' s vilid in the experiment.

关键词

特征选择/MI/IG/标准差

Key words

feature extraction/ MI/ information gain(IG) / mean square error

分类

信息技术与安全科学

引用本文复制引用

方志龙..文本特征选择算法MI的改进[J].计算机与现代化,2011,(7):172-175,4.

计算机与现代化

OACSTPCD

1006-2475

访问量0
|
下载量0
段落导航相关论文