计算机工程与应用2018,Vol.54Issue(13):19-26,8.DOI:10.3778/j.issn.1002-8331.1802-0186
基于子空间聚类的高维数据可视分析方法综述
A survey of high dimensional data visual analysis methods based on subspace clustering
摘要
Abstract
With the rapid development of information technology and the advent of big data era, the data show the com-plex features of high dimensionality and nonlinearity. For high-dimensional data, it is often difficult to find feature regions that reflect distribution patterns in full-dimensional space, but most of the traditional clustering algorithms only have good scalability for low-dimensional data. Therefore, when the traditional clustering algorithm processes high-dimensional data, the clustering results may not meet the needs of the current stage. The subspace clustering algorithm searches for clusters existing in the high-dimensional data subspace, and divides the original feature space of data into different subsets of fea-tures to reduce the influence of uncorrelated features and preserve the main features in the original data. The subspace clustering method can find the information that is not easy to show in high-dimensional data and display the internal struc-ture of data attributes and dimensions through visualization techniques, which provides an effective method for visual analysis of high-dimensional data. This paper summarizes the research progress of high-dimensional data visual analysis methods based on subspace clustering in recent years, and elaborates three different methods based on feature selection, subspace exploration and subspace clustering. Then, the methods and applications of its interaction analysis are analyzed, and the future development trends of visual analysis methods of high-dimensional data are prospected.关键词
高维数据/可视分析/子空间探索/子空间聚类Key words
high dimensional data/visual analysis/subspace exploration/subspace clustering分类
信息技术与安全科学引用本文复制引用
田帅,陈谊..基于子空间聚类的高维数据可视分析方法综述[J].计算机工程与应用,2018,54(13):19-26,8.基金项目
"十二五"国家科技支撑计划(No.2012BAD29B01-2) (No.2012BAD29B01-2)
国家科技基础性工作专项(No.2015FY111200) (No.2015FY111200)
北京市科技计划课题(No.Z151100001615041) (No.Z151100001615041)
虚拟现实技术与系统国家重点实验室开放基金(No.BUAA-VR-17KF-07) (No.BUAA-VR-17KF-07)
2018年研究生科研能力提升计划项目. ()