计算机与现代化 (Computer and Modernization), Issue 2: 62-66, 72 (6 pages). DOI: 10.3969/j.issn.1006-2475.2015.02.014
基于预测的云计算热点数据副本因子决策算法 (A Prediction-Based Replica Factor Decision Algorithm for Hot Data in Cloud Computing)
Dynamic Replicas Strategy Based on Predicted Popularity
Abstract
To improve data availability and cluster performance, HDFS currently adopts uniform data replication. However, files differ in popularity, sometimes enormously, and access to highly popular data may hurt job performance. To address this problem, a dynamic replica strategy based on predicted popularity is put forward. Starting from recent data popularity and a grey prediction model, we use a Markov prediction model to correct the prediction deviation caused by burst access and shifting access, and obtain an accurate predicted popularity for each file. A finite channel service model based on the predicted popularity is then established to calculate the minimum number of replicas that meets user demand. Experimental results show that, compared with default data replication, our strategy more effectively avoids contention, reduces job completion time, and alleviates network traffic.
Keywords (Chinese)
热点数据/副本管理/云计算/Hadoop/灰色预测/生灭过程
Key words
highly popular data/replica management/cloud computing/Hadoop/grey prediction/birth and death process
Classification
Information Technology and Security Science
Cite this article
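Zhang Song, Du Qingwei, Sun Jing, Sun Zhen. 基于预测的云计算热点数据副本因子决策算法 [Dynamic Replicas Strategy Based on Predicted Popularity] [J]. 计算机与现代化 (Computer and Modernization), 2015, (2): 62-66, 72.
Funding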
Supported by the National Natural Science Foundation of China ()
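Illustrative sketch

The pipeline summarized in the abstract (predict a file's popularity, then size its replica count from a service model) can be sketched roughly as follows. This record does not include the paper's formulas, so everything here is an assumption: the forecast is the textbook GM(1,1) grey model, the Markov correction step is omitted, and the "finite channel service model" is approximated by the Erlang B loss formula from birth-death queueing theory. The names `gm11_forecast`, `erlang_b`, `min_replicas`, and the parameters `service_rate`, `max_blocking`, `floor`, and `ceiling` are hypothetical and do not come from the paper.

```python
import math
from typing import Sequence


def gm11_forecast(history: Sequence[float]) -> float:
    """One-step-ahead forecast with the textbook GM(1,1) grey model.

    `history` is a positive series, e.g. access counts per time window.
    """
    n = len(history)
    if n < 4:
        raise ValueError("need at least 4 observations")
    # Accumulated generating series x1 and its background values z.
    x1, total = [], 0.0
    for v in history:
        total += v
        x1.append(total)
    z = [0.5 * (x1[k] + x1[k - 1]) for k in range(1, n)]
    y = list(history[1:])
    # Ordinary least squares for y[k] = -a * z[k] + b.
    m = len(z)
    u = [-zk for zk in z]
    su, sy = sum(u), sum(y)
    suu = sum(uk * uk for uk in u)
    suy = sum(uk * yk for uk, yk in zip(u, y))
    a = (m * suy - su * sy) / (m * suu - su * su)
    b = (sy - a * su) / m
    if abs(a) < 1e-12:
        return b  # flat series: the model degenerates to a constant
    c = history[0] - b / a

    def x1_hat(k: int) -> float:
        return c * math.exp(-a * k) + b / a

    # Differencing the accumulated forecast recovers the original scale.
    return x1_hat(n) - x1_hat(n - 1)


def erlang_b(offered_load: float, channels: int) -> float:
    """Blocking probability of an M/M/c/c loss system (Erlang B recursion)."""
    blocking = 1.0
    for c in range(1, channels + 1):
        blocking = offered_load * blocking / (c + offered_load * blocking)
    return blocking


def min_replicas(arrival_rate: float, service_rate: float,
                 max_blocking: float, floor: int = 3, ceiling: int = 64) -> int:
    """Smallest replica count whose blocking probability meets the target.

    Assumption: each replica acts as one service channel. `floor` defaults
    to 3, the HDFS default replication factor; `ceiling` is an arbitrary cap.
    """
    load = arrival_rate / service_rate
    for c in range(floor, ceiling + 1):
        if erlang_b(load, c) <= max_blocking:
            return c
    return ceiling


if __name__ == "__main__":
    predicted = gm11_forecast([10.0, 12.0, 14.4, 17.28])  # made-up counts
    print(min_replicas(predicted, service_rate=4.0, max_blocking=0.01))
```

The Erlang B recursion is used here only because it is a standard, numerically stable finite-channel result; the authors' model may differ in queue capacity or service discipline.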