Journal of Central South University (Science and Technology), 2017, Vol. 48, Issue 7: 1782-1789 (8 pages). DOI: 10.11817/j.issn.1672-7207.2017.07.014

A text classification algorithm based on feature library projection

Yin Shaofeng (1), Zheng Hui (2), Xu Shaohua (1), Rong Huigui (3), Zhang Na (3)

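Author information

  • 1. Office of Campus Informatization Construction and Management, Hunan University, Changsha 410082, Hunan, China
  • 2. School of Tourism Management, Hunan University of Commerce, Changsha 410205, Hunan, China
  • 3. College of Information Engineering and Science, Hunan University, Changsha 410082, Hunan, China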

Abstract

To address the drawbacks of the KNN algorithm, namely its high time complexity and the information loss caused by feature reduction and sample clipping, a feature library projection (FLP) classification algorithm is proposed. The algorithm first stores all features of the training samples, together with their weights, in a feature library. Projection functions then convert the data in this library into new projection samples, and a new sample is classified by computing its similarity to these projection samples. The effectiveness of the algorithm was validated on text classification under two conditions, a small training set and a large training set, and compared with the KNN algorithm. The results show that the FLP algorithm does not lose classification features and achieves higher classification accuracy than the compared algorithms. Its classification efficiency is not directly tied to growth in sample size, and its time complexity is low.
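The abstract does not give the projection functions or the weighting scheme, so the following is only a minimal sketch of the general idea, not the authors' method. It assumes raw term counts as feature weights, one L2-normalised projection sample per class, and cosine similarity for matching; the function names `build_feature_library`, `project`, and `classify` are hypothetical.

```python
import math
from collections import Counter, defaultdict


def build_feature_library(samples):
    """Accumulate every feature of every training sample, per class.

    samples: iterable of (tokens, label) pairs.
    Weight = raw term count (an assumption; the paper's weighting is not given here).
    """
    library = defaultdict(Counter)
    for tokens, label in samples:
        library[label].update(tokens)  # no feature reduction, no sample clipping
    return library


def project(library):
    """Turn each class's accumulated weights into one unit-length projection sample."""
    projections = {}
    for label, weights in library.items():
        norm = math.sqrt(sum(w * w for w in weights.values()))
        projections[label] = {f: w / norm for f, w in weights.items()}
    return projections


def classify(tokens, projections):
    """Return the label whose projection sample is most cosine-similar to tokens."""
    counts = Counter(tokens)
    norm = math.sqrt(sum(c * c for c in counts.values()))
    if norm == 0:
        return None  # empty input: nothing to compare
    best_label, best_score = None, -1.0
    for label, proj in projections.items():
        score = sum(c * proj.get(f, 0.0) for f, c in counts.items()) / norm
        if score > best_score:
            best_label, best_score = label, score
    return best_label


if __name__ == "__main__":
    train = [
        (["goal", "match", "team"], "sports"),
        (["team", "score", "goal"], "sports"),
        (["stock", "market", "price"], "finance"),
        (["market", "bank", "price"], "finance"),
    ]
    projections = project(build_feature_library(train))
    print(classify(["goal", "team", "win"], projections))
```

In this sketch a query is compared against one projection sample per class rather than against every training sample, as KNN would be, so the classification cost depends on the number of classes and features instead of the training-set size. This is one way to read the abstract's statement that efficiency is not directly tied to sample growth.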

Key words

text classification / KNN algorithm / feature library projection

Classification

Information technology and security science

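Cite this article

Yin Shaofeng, Zheng Hui, Xu Shaohua, Rong Huigui, Zhang Na. A text classification algorithm based on feature library projection[J]. Journal of Central South University (Science and Technology), 2017, 48(7): 1782-1789.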

Funding

Projects (61672221, 61304184, 61672156) supported by the National Natural Science Foundation of China

Journal of Central South University (Science and Technology), ISSN 1672-7207
