Power System Protection and Control, 2016, Vol. 44, Issue 24: 52-57, 6. DOI: 10.7667/PSPC152053
An attribute entity recognition algorithm for power big data based on Hadoop
Abstract
With the arrival of the big data era, traditional entity recognition techniques can no longer pre-process power grid data effectively, because the data are large in scale and complex in type. Hadoop, which has risen to prominence in recent years, is better suited to processing data at this scale. This paper therefore proposes a Hadoop-based entity recognition algorithm for power big data. The algorithm uses discretization to select discrete points with higher information accuracy, and it puts forward an evaluation indicator for the discretization. Entity recognition is then carried out on wind turbine monitoring data on a Hadoop platform. Experimental results show that the proposed algorithm performs well in the correctness and breakpoint-number experiments and achieves a good speed-up ratio. The proposed algorithm can be applied to entity recognition for power big data.
Keywords
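power big data/entity recognition/discretization algorithm/information accuracy
Citation
Qi Jun, Qu Zhaoyang, Lou Jianlou, Wang Chong. An attribute entity recognition algorithm for power big data based on Hadoop [J]. Power System Protection and Control, 2016, 44(24): 52-57, 6.
Funding
This work is supported by the National Natural Science Foundation of China (No. 51277023) and the Key Science and Technology Research Project of the Social Development Division, Jilin Provincial Department of Science and Technology (No. 20150204084GX).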