| 注册
首页|期刊导航|计算机工程与科学|CNN卷积计算在移动GPU上的加速研究

CNN卷积计算在移动GPU上的加速研究

王湘新 时洋 文梅

计算机工程与科学2018,Vol.40Issue(1):34-39,6.
计算机工程与科学2018,Vol.40Issue(1):34-39,6.DOI:10.3969/j.issn.1007-130X.2018.01.005

CNN卷积计算在移动GPU上的加速研究

Accelerating CNN on mobile GPU

王湘新 1时洋 2文梅2

作者信息

  • 1. 武警湖南省消防总队信息中心,湖南长沙410205
  • 2. 国防科技大学计算机学院,湖南长沙410073
  • 折叠

摘要

Abstract

Convolutional Neural Networks (CNNs) are playing an increasingly important role in areas such as image classification and speech recognition because of their excellent performance.Some researchers have already wanted to apply this deep learning process on mobile phones,but the performance of the porting program is unsatisfactory due to the huge amount of computation of CNN.In order to explore how to solve this problem,this paper uses a deep learning framework named MXNet to realize the forward process of CNN on mobile phones and focuses on the use of GPU that is another powerful computing device on the mobile phone.Based on the OpenCL common programming framework,we use matrix multiplication to compute the most time-consuming convolution in the forward process and move it to the GPU.Besides,serval improvements are made to achieve better performance.Finally,the experimental results show that we succeed in reducing the time of the forward process to half of the original time.

关键词

CNN/手机/移动GPU/快速算法/OpenCL

Key words

CNN/mobile phone/mobile GPU/fast algorithm/OpenCL

分类

信息技术与安全科学

引用本文复制引用

王湘新,时洋,文梅..CNN卷积计算在移动GPU上的加速研究[J].计算机工程与科学,2018,40(1):34-39,6.

基金项目

国家自然科学基金(61272145) (61272145)

计算机工程与科学

OA北大核心CSCDCSTPCD

1007-130X

访问量0
|
下载量0
段落导航相关论文