计算机工程与科学2011,Vol.33Issue(3):129-135,7.DOI:10.3969/j.issn.1007-130X.2011.03.023
MapReduce:新型的分布式并行计算编程模型
MapReduce:a New Programming Model for Distributed Parallel Computing
李成华 1张新访 1金海 1向文1
作者信息
- 1. 华中科技大学计算机科学与技术学院,湖北,武汉,430074
- 折叠
摘要
Abstract
MapReduce is a programming model introduced by Google for writing applications that rapidly process vast amounts of data in parallel on large clusters of computing nodes.The model is inspired by map and reduce functions commonly used in functional programming.A Map/Reduce job usually splits the input data-set into independent chunks which are processed by the map tasks in a completely parallel manner.The reduce tasks merge all intermediate values generated by the map tasks.Users only devote themselves to how to specify the map functions and reduce functions.The details of partitioning the input data, scheduling the program's execution across a set of machines, handling machine failures, and managing the required inter-machine communication are taken care of by the run-time system of MapReduce.MapReduce will be widely adopted on the cloud computing platform.Several aspects of the Hadoop MapReduce contributed by Apache remain to be perfected.关键词
MapReduce/并行计算编程模型/云计算Key words
MapReduce/ distributed parallel computing/ cloud computing分类
信息技术与安全科学引用本文复制引用
李成华,张新访,金海,向文..MapReduce:新型的分布式并行计算编程模型[J].计算机工程与科学,2011,33(3):129-135,7.