计算机工程与应用2009,Vol.45Issue(22):53-55,3.DOI:10.3778/j.issn.1002-8331.2009.22.018
新型快速中文文本分类器的设计与实现
Design and implementation of new Chinese text classier
陈艳秋 1熊耀华1
作者信息
- 1. 东北大学东软信息技术学院计算机科学与技术系,辽宁大连100623
- 折叠
摘要
Abstract
For improving the efficiency and accuracy of Chinese text eategnrization,this paper presents a new Chinese text classier,in which a novel feature selection is proposed according to word frequency,mutual information and classificatory information,and after analyzing the hypostasis of the traditional TF-IDF,a weight adjustment method is put forward in which the IDF function is replaced by function used in feature selection.Finally a fast Bayes theory classier is designed.Experiments prove this classier is simple and effective.关键词
中文文本分类/特征选择/特征权重/分类算法Key words
Chinese text categorization/feature selection/feature weighting/classification algorithm分类
计算机与自动化引用本文复制引用
陈艳秋,熊耀华..新型快速中文文本分类器的设计与实现[J].计算机工程与应用,2009,45(22):53-55,3.