计算机应用研究2023,Vol.40Issue(12):3566-3571,3577,7.DOI:10.19734/j.issn.1001-3695.2023.04.0163
考虑不平衡指数的不平衡数据集分类设计方法
Classification design method of unbalanced data sets considering unbalanced index
摘要
Abstract
The imbalance of data sets category is one of the important problems in the classification field.The unbalanced in-dex of each data set is closely related to itself,it is a key indicator of data sets.To deal with the classification design of unba-lanced data sets,this paper proposed an enhanced AdaBoost(E-AdaBoost)algorithm.In the process of iteration,the algorithm took into account unbalanced index,and the classification accuracy of the minority classed that was more important in unba-lanced data sets improving the weight updating strategy of the base classifier,and thus promoting the classification performance of unbalanced data sets.The classification design method of unbalanced data sets based on E-AdaBoost could determine the weight parameters of the base classifier according to the sample unbalanced index,so as to improve the performance of the clas-sifier.With this method that was combined with multiple classical classifiers,this paper carried out experimental analysis in terms of artificial data sets and standard data sets,and compared with relevant methods.The results show that the classification design method of unbalanced data sets based on E-AdaBoost can effectively improve the classification performance of unba-lanced data sets.关键词
不平衡分类/改进AdaBoost/不平衡指数/权重Key words
unbalanced classification/enhanced AdaBoost/unbalanced index/weight分类
信息技术与安全科学引用本文复制引用
周玉,岳学震,孙红玉..考虑不平衡指数的不平衡数据集分类设计方法[J].计算机应用研究,2023,40(12):3566-3571,3577,7.基金项目
国家自然科学基金资助项目(U1504622,31671580) (U1504622,31671580)
河南省高等学校青年骨干教师培养计划项目(2018GGJS079) (2018GGJS079)