首页|期刊导航|软件导刊|一种基于 MapR educ e的频繁项集挖掘算法

一种基于 MapR educ e的频繁项集挖掘算法

孙兵率

软件导刊Issue(4)：75-77,3.

软件导刊Issue(4)：75-77,3.DOI:10.11907/rjdk.151007

一种基于 MapR educ e的频繁项集挖掘算法

A Algorithm for Mining Frequent Itemsets Based on MapReduce

孙兵率¹

作者信息

1. 西安工程大学计算机科学学院，陕西西安710048
折叠

摘要

Abstract

With the arrival of the era of big data ,in view of the Apriori algorithm and FP‐Grow th algorithm in mining large scale frequent itemsets ,there exists some performance bottlenecks such as insufficient memory ,low calculation efficiency and so on .An improved Aggregating_FP algorithm is proposed ,the algorithm combines MapReduce with FP‐Growth algo‐rithm to realize the idea of parallel mining frequent itemsets .And in the output stage ,each item is processed by merging and the algorithm output only the first K frequent itemsets including the item to improve the effectiveness of mass data de‐cision value .M ultiple groups of different scale data sets are tested in Hadoop platform ,experimental results show that the Aggregating_FP algorithm is applied to analyze and deal with big data ,and has good expansibility .

关键词

频繁项集/MapReduce/Hadoop/可扩展性

Key words

Frequent Itemsets/MapReduce/Hadoop/Scalability

分类

信息技术与安全科学

引用本文复制引用

孙兵率..一种基于 MapR educ e的频繁项集挖掘算法[J].软件导刊,2015,(4):75-77,3.

软件导刊

ISSN：1672-7800

访问量4

下载量0

段落导航