Computer Engineering and Applications, 2018, Vol. 54, Issue 8: 137-142, 6. DOI: 10.3778/j.issn.1002-8331.1611-0483
Density Peaks Algorithm with Automatic Determination of Cluster Centers
Abstract
Density Peaks Clustering (DPC) is a density-based clustering algorithm whose advantages are that it does not require clustering parameters to be specified and that it can discover non-spherical clusters. Two problems remain: the cutoff distance d_c computed by the density peak algorithm does not handle every scenario effectively, and manually selecting cluster centers may fail to recover the actual cluster centers. This paper proposes a method that determines the cutoff distance d_c adaptively from the Gini index and selects cluster centers automatically, which addresses the traditional DPC algorithm's inability to handle complex data sets. The algorithm first determines the cutoff distance through the Gini index, then calculates the cluster center weight of each point, and finally uses the change in slope to find the critical point. This strategy avoids the errors caused by manually selecting cluster centers from the decision graph. Experiments show that the new algorithm not only determines the cluster centers automatically but also achieves higher accuracy than the original algorithm.
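Key words
density peak / clustering / cluster center point / Gini index
Classification
Information Technology and Security Science
Cite this article
Wang Yang (王洋), Zhang Guizhu (张桂珠). Density peaks algorithm with automatic determination of cluster centers [J]. Computer Engineering and Applications, 2018, 54(8): 137-142, 6.
Funding
Natural Science Foundation of Jiangsu Province (No. BK20140165).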
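The three steps named in the abstract (a cutoff distance chosen through the Gini index, a cluster center weight for each point, and a slope-change critical point) can be illustrated with a minimal sketch. The abstract gives no formulas, so every concrete choice below is an assumption rather than the authors' method: the Gaussian-kernel density, the Gini index taken as 1 - sum(p_i^2) over normalized densities and minimized over a hypothetical candidate list, the weight gamma = rho * delta borrowed from the standard DPC decision graph, and the largest change between consecutive drops as the critical point. All function names are hypothetical.

```python
import math

def pairwise_distances(points):
    """Euclidean distance matrix as a list of lists."""
    return [[math.dist(p, q) for q in points] for p in points]

def local_density(dist, dc):
    """Gaussian-kernel density; each point's own term (exp(0) = 1) is included."""
    return [sum(math.exp(-(d / dc) ** 2) for d in row) for row in dist]

def gini_index(rho):
    """Assumed form: Gini impurity 1 - sum(p_i^2) of the normalized densities."""
    total = sum(rho)
    return 1.0 - sum((r / total) ** 2 for r in rho)

def choose_cutoff(dist, candidates):
    """Assumed criterion: the candidate d_c with the smallest Gini index."""
    return min(candidates, key=lambda dc: gini_index(local_density(dist, dc)))

def delta_distance(dist, rho):
    """Distance from each point to its nearest point of higher density."""
    n = len(rho)
    delta = []
    for i in range(n):
        # Ties in density are broken by index so only one point ranks highest.
        higher = [dist[i][j] for j in range(n)
                  if rho[j] > rho[i] or (rho[j] == rho[i] and j < i)]
        # The densest point has no higher neighbour; use its largest distance.
        delta.append(min(higher) if higher else max(dist[i]))
    return delta

def select_centers(rho, delta):
    """Rank points by gamma = rho * delta and cut where the slope changes most."""
    gamma = [r * d for r, d in zip(rho, delta)]
    order = sorted(range(len(gamma)), key=lambda i: gamma[i], reverse=True)
    ranked = [gamma[i] for i in order]
    drops = [ranked[k] - ranked[k + 1] for k in range(len(ranked) - 1)]
    # Critical point: the drop that exceeds the following drop by the most.
    k = max(range(len(drops) - 1), key=lambda k: drops[k] - drops[k + 1])
    return sorted(order[:k + 1])

# Toy data: two identical, well-separated groups of six points each.
cluster = [(0, 0), (0.1, 0), (-0.1, 0), (0, 0.1), (0, -0.1), (0.2, 0.1)]
points = cluster + [(x + 5, y + 5) for x, y in cluster]

dist = pairwise_distances(points)
dc = choose_cutoff(dist, candidates=[0.05, 0.1, 0.2, 0.5, 1.0])
rho = local_density(dist, dc)
centers = select_centers(rho, delta_distance(dist, rho))
print("cutoff distance:", dc, "center indices:", centers)
```

Including each point's own kernel term keeps the densities nearly uniform for both very small and very large d_c, so under these assumptions the Gini index has an interior minimum instead of always favoring the smallest candidate. The slope-change rule needs at least three points and is only one of several plausible ways to locate the critical point.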