计算机与现代化 (Computer and Modernization), Issue 11: 52-57, 6. DOI: 10.3969/j.issn.1006-2475.2013.11.012
基于 MapReduce 的可扩展协同聚类算法
A Scalable Co-clustering Algorithm Based on MapReduce
Abstract
Co-clustering (collaborative clustering) is a class of clustering algorithms that clusters documents and features simultaneously. By doing so, it can uncover latent relationships among document features and thereby improve clustering quality. With the arrival of the big data era, parallel algorithms have shown their advantages. This paper presents a comprehensive study of co-clustering and extends it to a parallel setting. We study the co-clustering algorithm based on minimum sum-squared residue, and then design and implement a parallel version of it using the MapReduce model. Experimental results show that the proposed parallel co-clustering algorithm improves clustering efficiency and scales well.
Keywords
collaborative clustering / MapReduce / scalability / sum-squared residue
Classification
Information Technology and Security Science
Cite this article
MA Qiao, WAN Jianyi, WANG Mingwen. A Scalable Co-clustering Algorithm Based on MapReduce (基于 MapReduce 的可扩展协同聚类算法) [J]. 计算机与现代化 (Computer and Modernization), 2013, (11): 52-57, 6.
Funding
Project supported by the National Natural Science Foundation of China ()