南京大学学报(自然科学版)2019,Vol.55Issue(4):546-552,7.DOI:10.13232/j.cnki.jnju.2019.04.004
基于稳定性的三支聚类
Three?way clustering based on sample's stability
摘要
Abstract
Two?way clustering algorithms produce clusters with clear and sharp boundaries,which does not truly reflect the fact that a cluster may not necessarily have a well?defined boundary in many real world situations. To tackle this deficiency, three?way clustering uses three regions through a pair of sets to represent a cluster instead of using two regions to represent a cluster by a single set, which reflects the three types of relationship between an object and a cluster, namely, belong?to definitely, uncertain and not belong?to definitely. In this paper, we propose a three?way clustering algorithm by using the stability of each sample. We use clustering ensemble results to compute the sample’s stability and divide the universe into cluster core and cluster halo based on sample’s stability. The elements in the cluster core are assigned into the core region of each cluster by using traditional clustering algorithm. The elements in the cluster halo are assigned into the fringe region of corresponding cluster according to distances between the elements and the centers of the cluster core region. Therefore,a three?way clustering is naturally formed. Experimental results on UCI datasets show that this method can improve the structure of the clustering results.关键词
聚类集成/稳定性/二支聚类/三支聚类Key words
clustering ensemble/stability/two⁃way clustering/three⁃way clustering分类
信息技术与安全科学引用本文复制引用
杨鑫,施虹,王平心,徐刚..基于稳定性的三支聚类[J].南京大学学报(自然科学版),2019,55(4):546-552,7.基金项目
国家自然科学基金(61503160,61572242),江苏省高校自然科学研究重大项目(18KJA1300),江苏省高校自然科学研究项目(15KJB110004) (61503160,61572242)