Journal of Henan Polytechnic University (Natural Science), 2013, Vol. 32, Issue 3: 332-336, 5.
An Improved Data Load Balancing Algorithm for Hadoop
Abstract
This paper first introduces the principles of Hadoop and HDFS, then analyzes the Hadoop data load balancing algorithm. That algorithm balances data according to the storage space usage of each node and does not account for factors such as processing power, bandwidth, and file access frequency. As a result, response times for similar files can differ greatly. This paper devises a novel load balancing model based on file size, concurrent file access time, file access frequency, node processing power, bandwidth, and the available storage space of each node. Experimental results show that the devised model not only guarantees storage space load balancing but also makes the response times of similar files more consistent.
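Key words
Hadoop / load balancing / cloud computing / cloud storage
Classification
Information Technology and Security Science
Cite this article
Liu Kun, Niu Wenliang. An improved data load balancing algorithm for Hadoop [J]. Journal of Henan Polytechnic University (Natural Science), 2013, 32(3): 332-336, 5.
Funding
General Program of the Science and Technology Plan of the Beijing Municipal Commission of Education (SQKM201211417008)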
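Illustrative note
The abstract states that the stock Hadoop algorithm balances data purely by each node's space usage. The sketch below illustrates that kind of criterion, loosely following how the HDFS balancer compares each DataNode's utilization with the cluster average plus or minus a threshold. It is not the model proposed in the paper, whose formula the abstract does not give. The names `DataNode` and `classify` and the sample figures are assumptions made for illustration only.

```python
from dataclasses import dataclass


@dataclass
class DataNode:
    # Hypothetical record for one storage node (not a Hadoop API type).
    name: str
    used_gb: float
    capacity_gb: float

    @property
    def utilization(self) -> float:
        """Percentage of this node's capacity that is in use."""
        return 100.0 * self.used_gb / self.capacity_gb


def classify(nodes, threshold=10.0):
    """Group nodes by storage utilization relative to the cluster average.

    Only space usage is considered; processing power, bandwidth, and
    file access frequency play no role in this criterion.
    """
    total_used = sum(n.used_gb for n in nodes)
    total_capacity = sum(n.capacity_gb for n in nodes)
    average = 100.0 * total_used / total_capacity

    groups = {"over": [], "above_avg": [], "below_avg": [], "under": []}
    for n in nodes:
        u = n.utilization
        if u > average + threshold:
            groups["over"].append(n.name)        # should give data away
        elif u > average:
            groups["above_avg"].append(n.name)
        elif u >= average - threshold:
            groups["below_avg"].append(n.name)
        else:
            groups["under"].append(n.name)       # should receive data
    return average, groups


nodes = [
    DataNode("dn1", used_gb=900, capacity_gb=1000),
    DataNode("dn2", used_gb=500, capacity_gb=1000),
    DataNode("dn3", used_gb=100, capacity_gb=1000),
]
average, groups = classify(nodes)
print(average, groups)
```

In this sketch, two nodes with identical utilization are treated the same even if one serves far more frequently accessed files over a slower link, which is the limitation the abstract attributes to space-only balancing.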