| 注册
首页|期刊导航|现代电子技术|面向图卷积神经网络的FPGA部署及加速研究

面向图卷积神经网络的FPGA部署及加速研究

高强 邵春霖 李京润 沈宗凯

现代电子技术2024,Vol.47Issue(10):39-46,8.
现代电子技术2024,Vol.47Issue(10):39-46,8.DOI:10.16652/j.issn.1004-373x.2024.10.008

面向图卷积神经网络的FPGA部署及加速研究

Research on FPGA deployment and acceleration of graph convolutional neural network

高强 1邵春霖 1李京润 1沈宗凯2

作者信息

  • 1. 红云红河烟草(集团)有限责任公司昆明卷烟厂,云南 昆明 650000
  • 2. 昆明理工大学 信息工程与自动化学院,云南 昆明 650000
  • 折叠

摘要

Abstract

The graph convolutional neural network(GCN)algorithm has achieved breakthrough success in processing graph structured data tasks.However,training GCN requires a large amount of memory space and multiple random memory accesses,which limits the further deployment and application of the algorithm.Existing deployment and acceleration solutions for GCN mostly rely on the Vitis HLS tool,which is developed by means of C/C++.These solutions almost entirely neglect hardware description language,leading to incomplete software-hardware acceleration.To address these issues,a FPGA deployment and acceleration architecture tailored for GCN is proposed.The architecture is composed of computing modules and storage modules,which can be implemented by means of hardware description languages.In the computing module,the hardware description language is used to implement the key algorithm of GCN,mapping it to the field-programmable gate array(FPGA)for hardware acceleration.In the caching module,the read-only memory(ROM)IP core is primarily called and a two-dimensional register file is defined to store input node features,normalized adjacency matrices,quantized parameters of various layers,and intermediate variables,enhancing the parallelism of the GCN algorithm.The model training is conducted on the Pycharm platform and parameters are extracted for quantization,then the design and simulation test for GCN are conducted on the Vivado platform,and the computational performance of CPU and GPU are compared.The experimental results show that the designed GCN acceleration architecture can improve the inference speed of the model.

关键词

图卷积神经网络/FPGA加速器/硬件描述语言/计算模块/存储模块/参数量化

Key words

GCN/FPGA accelerator/hardware description language/calculation module/storage module/parameter quantification

分类

电子信息工程

引用本文复制引用

高强,邵春霖,李京润,沈宗凯..面向图卷积神经网络的FPGA部署及加速研究[J].现代电子技术,2024,47(10):39-46,8.

现代电子技术

OA北大核心CSTPCD

1004-373X

访问量0
|
下载量0
段落导航相关论文