华侨大学学报(自然科学版)2011,Vol.32Issue(4):401-404,4.
一种改进的朴素贝叶斯文本分类方法
An Improved Text Classification Method Based on Bayes
摘要
Abstract
There are huge amount of unstructured text resources in internet, a refined Naive Bayes based text categorization method is proposed in this paper for classifying these resources. Firstly, this method refines text by calculating the features of the text in order to improve the text's recognizability, and then Naive Bayes is used to classify these resources based on these features instead of the original words. The experiments show that the new method is easy setting up and renew in theory, and the accurate rate of the classification is also improved.关键词
文本分类/朴素贝叶斯方法/文档特征/卡方检验Key words
text categorization/Naive Bayes/text feature/Chi-Square test分类
信息技术与安全科学引用本文复制引用
陈叶旺,余金山..一种改进的朴素贝叶斯文本分类方法[J].华侨大学学报(自然科学版),2011,32(4):401-404,4.基金项目
福建省自然科学基金资助项目(A0810013) (A0810013)
华侨大学高层次人才科研启动项目(09BS619) (09BS619)