计算机应用与软件2013,Vol.30Issue(3):193-195,3.DOI:10.3969/j.issn.1000-386x.2013.03.051
用于中文文本分类的基于类别区分词的特征选择方法
FEATURE SELECTION METHOD FOR CHINESE TEXT CATEGORISATION BASED ON CLASS DISCRIMINATING WORDS
摘要
Abstract
Feature selection is a key process in Chinese text categorisation, which will directly affect the accuracy of text categorisation. After analysing some of the feature selection methods, a new feature selection method based on class-discriminating words is proposed. Experimental results show that the new feature selection method is superior to traditional approaches in categorisation efficiency, therefore verifies the validity of the new method.关键词
文本分类/特征选择/类别区分词/信息增益/互信息/期望交叉熵Key words
Text categorisation /Feature selection /Class discriminating word/ Information gain/ Mutual information/ Excepted cross entropy
分类
信息技术与安全科学引用本文复制引用
周奇年,张振浩,徐登彩..用于中文文本分类的基于类别区分词的特征选择方法[J].计算机应用与软件,2013,30(3):193-195,3.