Journal of Southeast University (English Edition), 2005, Vol. 21, Issue 3: 293-298 (6 pages).
一种基于采样的并行电力负荷数据流划分方法
Data partitioning based on sampling for power load streams
Abstract
A novel data stream partitioning method is proposed to address range-aggregation continuous queries over parallel streams in the power industry. The first step samples the data in parallel, using an extended reservoir-sampling algorithm. A skip factor based on the change ratio of data values is introduced to describe the distribution characteristics of the data values adaptively. The second step partitions the flux of the data streams evenly, using one of two alternative equal-depth histogram generation algorithms suited to different cases: one performs incremental maintenance based on heuristics, and the other performs periodic updates to generate an approximate partition vector. Experimental results on real data show that the method is efficient, practical, and suitable for processing time-varying data streams.
Key words
data streams/continuous queries/parallel processing/sampling/data partitioning
Classification
Information technology and security science
Cite this article
Wang Yongli, Xu Hongbing, Dong Yisheng, Qian Jiangbo, Liu Xuejun. Data partitioning based on sampling for power load streams [J]. Journal of Southeast University (English Edition), 2005, 21(3): 293-298 (6 pages).
Funding
The High Technology Research Plan of Jiangsu Province (No. BG2004034), the Foundation of Graduate Creative Program of Jiangsu Province (No. xm04-36).
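As a rough illustration of the two steps named in the abstract, the sketch below pairs classic reservoir sampling (Algorithm R) with an equal-depth split of the sample into a partition vector. It is not the paper's algorithm: the skip factor, the parallel sampling, and the incremental histogram maintenance are not reproduced, and the function names `reservoir_sample`, `equal_depth_boundaries`, and `route` are hypothetical names chosen for this example.

```python
import random
from bisect import bisect_right


def reservoir_sample(stream, k, rng=None):
    """Classic reservoir sampling (Algorithm R): a uniform sample of up to k items.

    Assumption: this is the textbook algorithm, not the extended variant
    with a skip factor described in the abstract.
    """
    rng = rng or random.Random()
    reservoir = []
    for i, value in enumerate(stream):
        if i < k:
            # Fill the reservoir with the first k items.
            reservoir.append(value)
        else:
            # Keep the new item with probability k / (i + 1).
            j = rng.randint(0, i)  # inclusive on both ends
            if j < k:
                reservoir[j] = value
    return reservoir


def equal_depth_boundaries(sample, parts):
    """Return parts - 1 split points so that each bucket holds roughly
    the same number of sample values (requires a non-empty sample)."""
    ordered = sorted(sample)
    n = len(ordered)
    return [ordered[(i * n) // parts] for i in range(1, parts)]


def route(value, boundaries):
    """Index (0 .. len(boundaries)) of the partition a value falls into."""
    return bisect_right(boundaries, value)


if __name__ == "__main__":
    rng = random.Random(0)
    # Synthetic stand-in for load readings; not data from the paper.
    loads = [rng.gauss(500.0, 80.0) for _ in range(10_000)]
    sample = reservoir_sample(loads, 200, rng)
    vector = equal_depth_boundaries(sample, 4)
    counts = [0] * 4
    for v in loads:
        counts[route(v, vector)] += 1
    print(vector, counts)
```

Because the split points come from a sample rather than the full stream, the per-partition counts are only approximately equal; a periodic rebuild of the vector, as the abstract's second histogram algorithm suggests, would be one way to follow a time-varying distribution.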