Journal of Frontiers of Computer Science and Technology (计算机科学与探索), 2012, Vol. 6, Issue 1: 46-57, 12. DOI: 10.3778/j.issn.1673-9418.2012.01.003
Research on Service-Oriented Data Mining Engine Based on Cloud Computing
Abstract
Data mining algorithms scale poorly when applied to large-scale data, and application areas differ significantly in their requirements for the knowledge discovery process. It is therefore fundamental to provide effective formalisms for designing distributed data mining applications and supporting their efficient execution. This paper proposes a novel service-oriented data mining engine based on a cloud computing framework, named CloudDM. Unlike grid-based distributed data mining frameworks, CloudDM exploits the capacity of the open source cloud computing platform Hadoop for large-scale data analysis, and supports the design and execution of distributed data mining applications according to SOA (service-oriented architecture). The paper also describes the functions of the key components and the technologies used to implement them. By following the design principles of SOA and of a cloud-based data mining engine, the proposed approach addresses problems in massive data mining systems such as big data storage, data processing, and interoperation of algorithms.
Key words
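cloud computing / Hadoop / data mining / service-oriented architecture (SOA)
Classification
General natural sciences
Cite this article
YU Yonghong, XIANG Xiaojun, GAO Yang, SHANG Lin, YANG Yubin. Research on Service-Oriented Data Mining Engine Based on Cloud Computing [J]. Journal of Frontiers of Computer Science and Technology, 2012, 6(1): 46-57, 12.
Funding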
The National Natural Science Foundation of China under Grant No. 61035003;
the International S&T Cooperation Program of China under Grant No. 2010DFA11030;
the Natural Science Foundation of Jiangsu Province of China under Grant No. SBK201150103.