计算机工程与应用2016,Vol.52Issue(18):74-78,5.DOI:10.3778/j.issn.1002-8331.1412-0214
基于最近邻互信息的特征选择算法
Feature selection algorithm based on nearest-neighbor mutual in-formation
摘要
Abstract
Feature selection of neighborhood information system is constrained by the neighborhood size. First, this paper calculates the distance between a given sample and its nearest samples with the same and different labels to define the con-cept of nearest-neighbor, and determines the size of nearest neighbor simultaneously. Second, the notion of nearest-neighbor is extended to Shannon information theory, and the concept of nearest neighbor mutual information is presented. Then, a forward greedy strategy is used to construct feature selection algorithm based on nearest-neighbor mutual information. Finally, experiments are conducted on eight UCI data sets and two different base classifiers. Experimental results show that the proposed algorithm selects a few features and effectively improves classification performance compared with other popular algorithms.关键词
特征选择/最近邻/互信息/邻域互信息Key words
feature selection/nearest-neighbor/mutual information/neighborhood mutual information分类
信息技术与安全科学引用本文复制引用
王晨曦,林耀进,刘景华,林梦雷..基于最近邻互信息的特征选择算法[J].计算机工程与应用,2016,52(18):74-78,5.基金项目
国家自然科学基金(No.61303131);福建省自然科学基金(No.2013J01028);福建省教育厅科技项目(No.JA14192,No.JAT60866)。 ()