| 注册
首页|期刊导航|南京师范大学学报(工程技术版)|新型轻量化神经网络结构范式的剪枝研究

新型轻量化神经网络结构范式的剪枝研究

宋晨 魏子重 姜凯 李锐 段强

南京师范大学学报(工程技术版)2023,Vol.23Issue(4):29-36,8.
南京师范大学学报(工程技术版)2023,Vol.23Issue(4):29-36,8.DOI:10.3969/j.issn.1672-1292.2023.04.004

新型轻量化神经网络结构范式的剪枝研究

Pruning Research on New Lightweight Neural Network Structures Paradigm

宋晨 1魏子重 1姜凯 1李锐 1段强1

作者信息

  • 1. 山东浪潮科学研究院有限公司,山东 济南 250014
  • 折叠

摘要

Abstract

With the widespread adoption of deep learning technology,the object detection task in image processing has made vigorous progress.Along with the popularity and development of large models,the accuracy of deep learning models continuously improves.However,these large models are difficult to deploy on edge devices that are increasingly developing.To address the current object detection tasks at the edge-side,a network structure combining MobileOne-S0 and SSD is proposed.This network structure is reparameterized to form a VGG-like network structure for the inference process.Then,three different pruning criteria are used,including unstructured weight pruning,structured BN pruning,and Taylor pruning.The results show that weight pruning has the worst effect,while the two structured pruning methods have almost the same decrease rate for FLOPs and parameter quantity with the increase of sparsity.However,the accuracy drop of BN pruning is slower than that of Taylor pruning while Taylor pruning has the best pruning effect on peak memory size.When the model precision decreases by about 10%,BN pruning can compress the parameter quantity by 22.3 times,FLOPs by 9.4 times,and peak memory usage by 2.5 times.The final model size is only 123.88 kB,making it easier to deploy on TinyML-suitable,MCU-level,low-power end-side devices.

关键词

MobileOne/SSD/深度可分离卷积/剪枝/TinyML

Key words

MobileOne/SSD/deep separable convolution/pruning/TinyML

分类

信息技术与安全科学

引用本文复制引用

宋晨,魏子重,姜凯,李锐,段强..新型轻量化神经网络结构范式的剪枝研究[J].南京师范大学学报(工程技术版),2023,23(4):29-36,8.

南京师范大学学报(工程技术版)

1672-1292

访问量4
|
下载量0
段落导航相关论文