Computer Engineering and Applications (计算机工程与应用), Issue 20: 112-117 (6 pages). DOI: 10.3778/j.issn.1002-8331.1201-0298
Research on massive data mining based on MapReduce
(Original title: 基于MapReduce的海量数据挖掘技术研究)
Abstract
MapReduce is a programming model for mining massive volumes of data that can run in a heterogeneous environment. It is simple to implement, does not require the programmer to attend to the underlying details, and can be used for large-scale parallel computing. In this paper, three data mining algorithms, Naive Bayes, K-modes, and ECLAT, are implemented using the MapReduce programming model. The results indicate that MapReduce can perform data mining tasks on massive volumes of data efficiently.
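To make the programming model concrete, the following is a minimal single-process sketch of how the counting step of Naive Bayes training could be phrased as map and reduce functions. It is an illustration under stated assumptions, not the implementation described in the paper: the function names (`map_record`, `reduce_counts`, `run_mapreduce`), the record layout, and the toy data are all hypothetical, and the shuffle is simulated with an in-memory dictionary rather than Hadoop.

```python
from collections import defaultdict


def map_record(record):
    """Map step (hypothetical): emit (key, 1) pairs for one labelled record.

    A record is assumed to be (features, label), with categorical features.
    """
    features, label = record
    # One count for the class itself, used for the prior P(label).
    yield ("class", label), 1
    # One count per (label, feature position, feature value), used for
    # the conditional probabilities P(value | label).
    for index, value in enumerate(features):
        yield ("feature", label, index, value), 1


def reduce_counts(key, values):
    """Reduce step (hypothetical): sum all counts emitted for one key."""
    return key, sum(values)


def run_mapreduce(records):
    """Simulate map, shuffle, and reduce in one process.

    A real MapReduce framework would distribute these phases across
    machines; here the shuffle is just a dictionary grouping by key.
    """
    grouped = defaultdict(list)
    for record in records:
        for key, value in map_record(record):
            grouped[key].append(value)
    return dict(reduce_counts(key, values) for key, values in grouped.items())


# Toy data, invented for illustration only.
records = [
    (("sunny", "hot"), "no"),
    (("sunny", "mild"), "yes"),
    (("rainy", "mild"), "yes"),
]
counts = run_mapreduce(records)
```

The resulting `counts` dictionary holds the class and feature frequencies from which a Naive Bayes classifier's probabilities could be estimated. Because each record is mapped independently and each key is reduced independently, both phases are candidates for parallel execution.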
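Key words
cloud computing / data mining / Hadoop / MapReduce
Classification
Information Technology and Security Science
Cite this article
LI Weiwei (李伟卫), ZHAO Hang (赵航), ZHANG Yang (张阳), WANG Yong (王勇). Research on massive data mining based on MapReduce [J]. Computer Engineering and Applications, 2013, (20): 112-117, 6.
Funding
National Natural Science Foundation of China (No. 60873196); Fundamental Research Funds for the Central Universities (No. QN2009092).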