国防科技大学学报2024,Vol.46Issue(1):131-140,10.DOI:10.11887/j.cn.202401014
高性能异构加速器MiniGo算子优化方法
Optimizing operator computation of MiniGo on high-performance heterogeneous accelerator
摘要
Abstract
An efficient parallel computing method based on the characteristics of the high-performance heterogeneous accelerator and the training mode of MiniGo was proposed.The on-chip computing resources were reasonably planned to achieve pipelining parallel optimization between heterogeneous devices.The shared memory programming was designed according to the existence of shared storage segments between heterogeneous devices to reduce data transmission costs.According to the characteristics of multiple computing resources in a digital signal processing cluster,combined with the computing-memory access feature of the operators,different optimization strategies were designed.At the same time,this method provides an easy-use high-performance operator library for TensorFlow.The experimental results show that this method realizes the multi-core parallel computing of operators.The speedup of convolution was 24.69 compared with that was achieved on a single core.Compared with the cropped version of the 8-core FT2000 + CPU,the speedup of training and self-play execution on this method were 3.83 and 1.5,respectively.关键词
异构计算/算子优化/卷积神经网络/强化学习Key words
heterogeneous computing/operator optimization/convolutional neural networks/reinforcement learning分类
信息技术与安全科学引用本文复制引用
乔鹏,贺周雨,李荣春,姜晶菲..高性能异构加速器MiniGo算子优化方法[J].国防科技大学学报,2024,46(1):131-140,10.基金项目
国家重点实验室稳定支持资助项目(WDZC20205500104) (WDZC20205500104)