Computer Technology and Development (计算机技术与发展), 2019, Vol. 29, Issue 3: 178-182, 5. DOI: 10.3969/j.issn.1673-629X.2019.03.037
Design and Implementation of Data Mining System Based on Cloud Computing
Abstract
The exponential growth of data has sharpened the mismatch between massive data volumes and the limited computing capacity of traditional data mining systems. To address this problem, we propose a solution that organically combines cloud computing technology with data mining. Using Map/Reduce, a parallel programming model that can handle large collections of semi-structured data, we integrate cloud computing into the massive-data mining process and design and implement a cloud-based data mining system. The system is tested by mining and analyzing log datasets that record the use of library electronic documents by university teachers and students. The results show that the system can provide services to users according to their needs. The experiments show that the system runs faster and more efficiently than a single-machine system, and that its advantage in mining efficiency becomes more pronounced as the data volume grows. The system can therefore meet users' needs and effectively relieve the technical bottleneck of traditional data mining systems.
Keywords
cloud computing; data mining; massive data; Map/Reduce
Classification
Information Technology and Security Science
Cite this article
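Wang Xiaoni (王晓妮), Duan Qun (段群), Han Jiangang (韩建刚). Design and Implementation of Data Mining System Based on Cloud Computing [J]. Computer Technology and Development, 2019, 29(3): 178-182, 5.
Funding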
Shaanxi Province Education Science "13th Five-Year Plan" 2017 Project (SGH17H196)
Xianyang Normal University Special Scientific Research Fund Project (13XSYK087)