| 注册
首页|期刊导航|计算机应用研究|基于Spark的分布式交通流数据预测系统

基于Spark的分布式交通流数据预测系统

黄廷辉 王玉良 汪振 崔更申

计算机应用研究2018,Vol.35Issue(2):405-409,416,6.
计算机应用研究2018,Vol.35Issue(2):405-409,416,6.DOI:10.3969/j.issn.1001-3695.2018.02.019

基于Spark的分布式交通流数据预测系统

Distributed traffic flow data prediction system based on Spark

黄廷辉 1王玉良 2汪振 1崔更申1

作者信息

  • 1. 桂林电子科技大学计算机与信息安全学院,广西桂林541004
  • 2. 桂林电子科技大学广西可信软件重点实验室,广西桂林541004
  • 折叠

摘要

Abstract

In the era of big data and complex urban traffic environment,real-time and accurate traffic flow forecast is a prerequisite to implementing intelligent transportation system.This paper presented a distributed urban traffic flow forecasting model which based on gradient optimization decision tree on Spark platform.It also proposed the optimization method of gradient optimization decision tree model in distributed case,which included sampling points,feature packing and layer-by-layer training.All of them could improve the training efficiency of gradient optimization decision tree in distributed case.The characteristics of time,road condition and weather were established based on the advantages of efficient,reliable and flexible expansibility of Spark distributed computing platform and the advantages of high accuracy and time complexity of gradient optimization decision tree model.The DUTP-GBDT model implemented real-time and accurate traffic flow prediction.Compared with GA-BP,GA-KNN and MSTAR models,the results prove that,the accuracy and training speed of DUTP-GBDT model using Spark platform in the distributed environment are both improved.In line with the requirements of urban traffic flow forecasting system.

关键词

交通流预测/分布式计算/Spark平台/梯度优化决策树模型

Key words

traffic flow forecast/distributed computing/Spark platform/gradient optimization decision tree model

分类

信息技术与安全科学

引用本文复制引用

黄廷辉,王玉良,汪振,崔更申..基于Spark的分布式交通流数据预测系统[J].计算机应用研究,2018,35(2):405-409,416,6.

基金项目

赛尔网络下一代互联网技术创新项目(NGII20160306) (NGII20160306)

广西科技攻关项目(PD160189) (PD160189)

计算机应用研究

OA北大核心CSCDCSTPCD

1001-3695

访问量4
|
下载量0
段落导航相关论文