首页|期刊导航|桂林电子科技大学学报|基于Hadoop MapReduce的大规模数据索引构建与集群性能分析

基于Hadoop MapReduce的大规模数据索引构建与集群性能分析

谌超强保华石龙

桂林电子科技大学学报2012，Vol.32Issue(4)：307-312,6.

基于Hadoop MapReduce的大规模数据索引构建与集群性能分析

Large scale data index construction and cluster efficiency analysis based on Hadoop MapReduce

谌超 ¹强保华 ¹石龙¹

作者信息

1. 桂林电子科技大学计算机科学与工程学院,广西桂林541004
折叠

摘要

Abstract

In order to satisfy the search engine's requirements of time and space and build effectively distributed index, Hadoop is used to build a distributed cluster environment) and large data inverted index can be achieved based on the MapReduce programming. The performance of the Hadoop cluster is evaluated by different network bandwidth) data volume and number of cluster nodes. Experimental results show that the greater network bandwidth is, the higher efficiency of cluster processing is< the more cluster nodes are, the stronger the ability to handle large data is. The performance of Hadoop cluster is influenced by the network communication bandwidth) high-speed cluster link can improve the performance of the cluster.

关键词

MapReduce/倒排索引/Hadoop集群

Key words

MapReducer inverted index/ Hadoop cluster

分类

信息技术与安全科学

引用本文复制引用

谌超,强保华,石龙..基于Hadoop MapReduce的大规模数据索引构建与集群性能分析[J].桂林电子科技大学学报,2012,32(4):307-312,6.

基金项目

国家自然科学基金(61163057) （61163057）

桂林电子科技大学学报

ISSN：1673-808X

访问量0

下载量0

段落导航