Computer Technology and Development (计算机技术与发展), 2012, Vol. 22, Issue (5): 83-86, 4.
A New Chinese Text Classification Algorithm: One Class SVM-KNN
Liu Wen 1, Wu Chen 1
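Author information
- 1. Intelligent Information Processing Laboratory, Jiangsu University of Science and Technology, Zhenjiang, Jiangsu 212003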
Abstract
Text classification is widely used in databases and search engines. KNN is widely used in Chinese text categorization; however, it has several shortcomings in this application. Its main deficiency is that all training samples must be kept until the samples to be classified have been processed. When the sample set is very large, storage and computation become costly, which can lead to classification deviation. One-class SVM is a simple and effective algorithm for single-class classification. To address these problems with KNN, a new algorithm that combines one-class SVM and KNN is proposed, which achieves better classification results. Experimental results show that the recall obtained with the proposed method is clearly higher than that of the KNN method.
Keywords
Chinese text classification / support vector machine / K-nearest neighbour / One Class SVM-KNN
Classification
Information technology and security science
Cite this article
Liu Wen, Wu Chen. A new Chinese text classification algorithm: the One Class SVM-KNN algorithm [J]. Computer Technology and Development, 2012, 22(5): 83-86, 4.