计算机工程与应用2018,Vol.54Issue(3):166-171,211,7.DOI:10.3778/j.issn.1002-8331.1608-0122
动态误分类代价下代价敏感属性选择分治算法
Divide and conquer algorithm for cost-sensitive feature selection based on dynamic misclassification costs
摘要
Abstract
Cost-sensitive feature selection problem aims at getting an attribute subset with the minimal total cost, through considering the trade-off between test costs and misclassification costs. There are two main challenges in cost-sensitive feature selection problem. On the one hand, most of the cost-sensitive attribute selection methods only take fixed misclas-sification costs into account, thus these methods can't solve imbalance class problems. On the other hand, the efficiency is not ideal when dealing with cost-sensitive feature selection on large scale datasets. In this paper, the contributions for the two challenges are summarized as follows. Firstly, it designs a new dynamic mechanism of misclassification costs to minimize total cost. Secondly, each of datasets is adaptively divided according to the scale of the dataset based on divide and conquer method. Finally, cost-sensitive feature selection problem is redefined based on dynamic misclassification costs, and a divide and conquer algorithm is proposed for cost-sensitive feature selection problem. The proposed algo-rithm is compared with two other algorithms on seven UCI datasets. Some experiments demonstrate that the proposed algo-rithm can improve the efficiency and obtain the optimal misclassification costs as well, so as to ensure to minimize total cost.关键词
粗糙集/代价敏感/属性选择/动态误分类代价/自适应分治Key words
rough sets/cost-sensitive/feature selection/dynamic misclassification cost/adaptive divide and conquer分类
信息技术与安全科学引用本文复制引用
黄伟婷,赵红..动态误分类代价下代价敏感属性选择分治算法[J].计算机工程与应用,2018,54(3):166-171,211,7.基金项目
福建省教育厅项目(No.JAT160287,No.JAT160307) (No.JAT160287,No.JAT160307)
漳州市自然科学基金(No.ZZ2016J35) (No.ZZ2016J35)
国家自然科学基金(No.61379049,No.61379089). (No.61379049,No.61379089)