电力建设2016,Vol.37Issue(11):48-54,7.DOI:10.3969/j.issn.1000-7229.2016.11.008
基于Spark的大电网广域时空序列分析平台构建
Platform Building for Wide-Area Spatiotemporal Sequences Analysis of Large-Scale Power Grid Based on Spark
摘要
Abstract
To address the energy internet trends and increasingly complex operating environment, we need to enhance the mining depth and utilization capability of energy internet multi-source data relying on big data technology. First, in the view of the wide-area spatiotemporal sequences data of large power grid, this paper expounds the Spark's advantages in distributed computing and the goal of big data platform, designs the big data platform architecture of power grid based on Spark, and describes each level of the platform in detail. Secondly, this paper describes the Spark's advantage in processing the spatiotemporal sequences data. Finally, on the basis of Spark and Hadoop experiment environment, this paper carries out typical clustering algorithm to compare the performance between Spark and Hadoop. The results verifies that Spark has a great advantage in data processing comparing with Hadoop MapReduce, which lays the foundation for the next step research.关键词
能源互联网/Spark/时空序列/流计算/聚类Key words
energy internet/Spark/spatiotemporal sequences/streaming computing/cluster分类
信息技术与安全科学引用本文复制引用
袁宝超,刘道伟,刘丽平,王泽忠..基于Spark的大电网广域时空序列分析平台构建[J].电力建设,2016,37(11):48-54,7.基金项目
国家自然科学基金项目( 51207143 ) ( 51207143 )
国家电网公司科技项目(XT71-15-056) Project supported by National Natural Science Foundation of China (51207143) (XT71-15-056)