Computer Applications and Software, 2011, Vol. 28, Issue (4): 239-241, 3.
Study and Design of an Improved Text Feature Selection Method
符会涛 1, 卡米力·木衣丁 1
Author information
- 1. School of Information Science and Engineering, Xinjiang University, Urumqi, Xinjiang 830046
Abstract
This article explains why text classification performance is low when the mutual information method is used for feature selection, attributing this largely to the method's tendency to select rare features. It then proposes a mutual information feature selection method based on distribution degree and average frequency. Experimental results show that the improved mutual information method can significantly improve text classification performance.
Key words
Feature selection / Mutual information / Text classification
Cite this article
符会涛, 卡米力·木衣丁. Study and design of an improved text feature selection method [J]. Computer Applications and Software, 2011, 28(4): 239-241, 3.