计算机与现代化Issue(2):26-30,5.DOI:10.3969/j.issn.1006-2475.2012.02.008
基于MapReduce的ID3决策树分类算法研究
Research on ID3 Decision Tree Classification Algorithm Based on MapReduce
钱网伟1
作者信息
- 1. 同济大学电子与信息工程学院,上海 201804
- 折叠
摘要
Abstract
Decision tree is widely used in data mining which is one of the typical classification algorithms. Traditional ID3 tree learning algorithms require training data to reside in memory on a single machine, so they cannot deal with massive datasets. To solve this problem, this paper analyzes the parallel algorithm of ID3 decision tree based on MapReduce model, then proposes a parallel and distributed algorithm for ID3 decision tree learning. The experimental results demonstrate the algorithm can scale well and efficiently process large-scale datasets on commodity computers.关键词
云计算/数据挖掘/决策树/ID3/MapReduceKey words
cloud computing/data mining/decision tree/ID3/MapReduce分类
信息技术与安全科学引用本文复制引用
钱网伟..基于MapReduce的ID3决策树分类算法研究[J].计算机与现代化,2012,(2):26-30,5.