| 注册
首页|期刊导航|计算机工程与科学|轻量化卷积神经网络硬件加速设计及FPGA实现

轻量化卷积神经网络硬件加速设计及FPGA实现

李珍琪 王强 齐星云 赖明澈 赵言亢 陆亿行 黎渊

计算机工程与科学2025,Vol.47Issue(4):582-591,10.
计算机工程与科学2025,Vol.47Issue(4):582-591,10.DOI:10.3969/j.issn.1007-130X.2025.04.002

轻量化卷积神经网络硬件加速设计及FPGA实现

Design and FPGA implementation of lightweight convolutional neural network hardware acceleration

李珍琪 1王强 1齐星云 1赖明澈 1赵言亢 1陆亿行 1黎渊1

作者信息

  • 1. 国防科技大学计算机学院,湖南长沙 410073
  • 折叠

摘要

Abstract

In recent years,convolutional neural networks(CNNs)have achieved remarkable results in fields such as computer vision.However,CNNs typically have complex network structures and sub-stantial computational requirements,making it difficult to implement them on portable devices with lim-ited computational resources and power consumption.FPGAs,with their high parallelism,energy effi-ciency,and reconfigurability,have emerged as one of the most effective computing platforms for accele-rating CNN inference on portable devices.This paper proposes a CNN accelerator that can be configured for different network structures,and optimizes its latency and power consumption through three as-pects:data reuse,pipeline optimization based on row buffers,and low-latency convolution techniques based on adder trees.Taking the YOLOv2-tiny lightweight network model as an example,a real-time target detection system was built on the Navigator ZYNQ-7020 development board.The experimental results show that the design meets low hardware and power requirements for portable devices,with 88%resource consumption and 2.959 W power consumption.It achieves a detection speed of 3.91 fps at an image resolution of 416×256.

关键词

卷积神经网络/FPGA加速/加速器/便携设备

Key words

convolutional neural network(CNN)/FPGA acceleration/accelerator/portable device

分类

计算机与自动化

引用本文复制引用

李珍琪,王强,齐星云,赖明澈,赵言亢,陆亿行,黎渊..轻量化卷积神经网络硬件加速设计及FPGA实现[J].计算机工程与科学,2025,47(4):582-591,10.

计算机工程与科学

OA北大核心

1007-130X

访问量2
|
下载量0
段落导航相关论文