Journal of Guilin University of Electronic Technology (桂林电子科技大学学报), 2016, Vol. 36, Issue 5: 387-390, 4.
DG-Apriori algorithm based on Hadoop (基于Hadoop的DG-Apriori算法)
Abstract
The Apriori algorithm scans the database repeatedly, generates large candidate item sets, and has a long computation time. To address these problems, a DG-Apriori algorithm based on Hadoop is proposed. The algorithm improves the join step for frequent item sets: the frequent k-item sets are generated by joining only the frequent 1-item sets with the frequent (k-1)-item sets, which greatly reduces the number of joins and avoids huge candidate item sets. The improved Apriori algorithm is then run on the Hadoop platform to compute frequent item sets in parallel and reduce the computation time. Experimental results show that the DG-Apriori algorithm can effectively improve the performance of the Apriori algorithm.
Key words
Apriori algorithm / database / Hadoop / frequent item sets
Classification
Information technology and security science
Cite this article
杜佼玲 (Du Jiaoling), 张向利 (Zhang Xiangli). DG-Apriori algorithm based on Hadoop [J]. Journal of Guilin University of Electronic Technology, 2016, 36(5): 387-390, 4.
Funding
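National Natural Science Foundation of China (61363031, 61461010)
Research project of the Guangxi Colleges and Universities Key Laboratory of Cloud Computing and Complex Systems (14101)
Innovation Project of Graduate Education, Guilin University of Electronic Technology (GDYCSZ201450)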
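
Illustrative sketch
The join step described in the abstract (frequent k-item sets built from frequent 1-item sets and frequent (k-1)-item sets) can be illustrated with a minimal single-machine sketch. This is not the authors' implementation: the abstract gives no details of the paper's pruning or of its Hadoop/MapReduce layout. The function name `frequent_itemsets`, the rule of appending only items larger than the current maximum (used here to avoid generating the same candidate twice), and the sample data are all assumptions made for illustration. Items are assumed to be mutually comparable, for example strings.

```python
from itertools import chain


def frequent_itemsets(transactions, min_support):
    """Return {frozenset: support_count} for every frequent item set.

    Candidates of size k are built by extending each frequent (k-1)-item set
    with one frequent single item, rather than by joining pairs of
    (k-1)-item sets as in classic Apriori. (Hypothetical helper.)
    """
    transactions = [frozenset(t) for t in transactions]

    def count(candidates):
        # One pass over the data: count the transactions containing each candidate.
        counts = {c: 0 for c in candidates}
        for t in transactions:
            for c in candidates:
                if c <= t:
                    counts[c] += 1
        # Keep only candidates that reach the minimum support count.
        return {c: n for c, n in counts.items() if n >= min_support}

    all_items = set(chain.from_iterable(transactions))
    level = count({frozenset([i]) for i in all_items})  # frequent 1-item sets
    frequent_items = sorted(next(iter(s)) for s in level)
    result = dict(level)

    while level:
        candidates = set()
        for itemset in level:
            top = max(itemset)
            for item in frequent_items:
                # Assumption: append only larger items, so each candidate
                # is produced from exactly one (k-1)-item set.
                if item > top:
                    candidates.add(itemset | {item})
        level = count(candidates)
        result.update(level)
    return result


if __name__ == "__main__":
    data = [["a", "b", "c"], ["a", "b"], ["a", "c"], ["b", "c"], ["a", "b", "c"]]
    for itemset, support in sorted(
        frequent_itemsets(data, min_support=3).items(),
        key=lambda kv: (len(kv[0]), sorted(kv[0])),
    ):
        print(sorted(itemset), support)
```

In a Hadoop setting the counting pass would presumably be distributed across mappers and reducers; the sketch keeps it in one process so that the candidate-generation idea stays visible.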