首页|期刊导航|计算机应用研究|基于可重构阵列的CNN数据量化方法

基于可重构阵列的CNN数据量化方法

朱家扬蒋林李远成宋佳刘帅

计算机应用研究2024，Vol.41Issue(4)：1070-1076,7.

计算机应用研究2024，Vol.41Issue(4)：1070-1076,7.DOI:10.19734/j.issn.1001-3695.2023.07.0378

基于可重构阵列的CNN数据量化方法

CNN data quantization method based on reconfigurable array

朱家扬 ¹蒋林 ²李远成 ²宋佳 ³刘帅¹

作者信息

1. 西安科技大学通信与信息工程学院,西安 710600
2. 西安科技大学计算机科学与技术学院,西安 710600
3. 西安科技大学电气与控制工程学院,西安 710600
折叠

摘要

Abstract

Convolution operations lead to a significant increase in the network size,which makes CNN models difficult to de-ploy to the embedded hardware platform,and different granularity data is not coordinated with the underlying hardware struc-ture,which leads to low computing efficiency.Based on the reconfigurable array with the computing units supporting multiple bit widths,through software hardware cooperation and reconfigurable computing methods,this paper defined the quantization threshold using KL divergence and random integer method,proposed a strategy for finding the best basis point,designed an in-struction set and a parallel mapping scheme supporting multiple bit widths to realize three distinct bit widths in data quantiza-tion.The results show the quantization scheme with 8 bit weight and feature map can compress model parameter quantity to about 50％with 2％accuracy loss.The acceleration ratios of quantifying the test images to three different bit widths reach 1.012,1.273,and 1.556,respectively,which can shorten the execution time by up to 35.7％and reduce memory access times by 56.2％,while only bringing less than 1％relative error.This indicates that this method can achieve efficient neural network computation under three quantization bit widths,thereby implementing hardware acceleration and model compression.

关键词

卷积神经网络/数据量化/可重构结构/并行映射/加速比

Key words

convolutional neural network(CNN)/data quantization/reconfigurable structure/parallel mapping/acceleration ratio

分类

信息技术与安全科学

引用本文复制引用

朱家扬,蒋林,李远成,宋佳,刘帅..基于可重构阵列的CNN数据量化方法[J].计算机应用研究,2024,41(4):1070-1076,7.

基金项目

科技创新2030-"新一代人工智能"重大项目(2022ZD0119005) （2022ZD0119005）

国家自然科学基金重点资助项目(61834005) （61834005）

计算机应用研究

OA北大核心CSTPCD

ISSN：1001-3695

访问量0

下载量0

段落导航