Convolutional neural network mixed-precision quantization method considering layer sensitivity

Abstract: To address the problem of faithfully mapping neural networks onto resource-constrained embedded devices, a mixed-precision quantization method for convolutional neural networks based on layer sensitivity analysis is proposed. The sensitivity of each convolutional layer's parameters is measured by the average trace of its Hessian matrix, which provides the basis for bit-width allocation. Bit-widths are then assigned with a layer-by-layer raise-and-lower procedure, completing the mixed-precision quantization of the network model. Experimental results show that, compared with the fixed-precision quantization methods DoReFa and LSQ+, the proposed method improves recognition accuracy by 10.2% and 1.7%, respectively, at an average bit-width of 3 bits. Compared with other mixed-precision quantization methods, it improves recognition accuracy by more than 1%. In addition, noise-injected training effectively improves the robustness of the mixed-precision quantization method, raising recognition accuracy by 16% at a noise standard deviation of 0.5.
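The two steps named in the abstract (ranking layers by the average Hessian trace, then raising and lowering bit-widths layer by layer) can be illustrated with a minimal sketch. The abstract does not say how the trace is computed or what the exact raise/lower rule is, so everything below is an assumption: the trace is estimated with Hutchinson's randomized estimator, the allocation rule simply pairs the most sensitive remaining layer with the least sensitive one and moves one bit between them, the average bit-width is taken unweighted over layers, and the function names and toy diagonal Hessians are hypothetical.

```python
import random


def average_hessian_trace(hessian_vector_product, dim, num_samples=200, seed=0):
    """Estimate trace(H) / dim with Hutchinson's estimator (an assumed choice).

    hessian_vector_product(v) must return H @ v as a list of floats.
    """
    rng = random.Random(seed)
    total = 0.0
    for _ in range(num_samples):
        # Rademacher probe: each entry is +1 or -1 with equal probability.
        v = [rng.choice((-1.0, 1.0)) for _ in range(dim)]
        hv = hessian_vector_product(v)
        total += sum(vi * hvi for vi, hvi in zip(v, hv))  # v^T H v
    return total / num_samples / dim


def allocate_bit_widths(sensitivities, target_avg_bits, min_bits=2, max_bits=8):
    """Hypothetical raise/lower rule, not the paper's exact procedure.

    Every layer starts at target_avg_bits. The most sensitive remaining layer
    gains one bit and the least sensitive remaining layer loses one bit, so the
    unweighted average bit-width stays at target_avg_bits.
    """
    n = len(sensitivities)
    bits = [target_avg_bits] * n
    # Layer indices sorted from most to least sensitive.
    order = sorted(range(n), key=lambda i: sensitivities[i], reverse=True)
    hi, lo = 0, n - 1
    while hi < lo:
        up, down = order[hi], order[lo]
        if bits[up] < max_bits and bits[down] > min_bits:
            bits[up] += 1
            bits[down] -= 1
        hi += 1
        lo -= 1
    return bits


def diagonal_hvp(diagonal):
    """Hessian-vector product for a toy diagonal Hessian (illustration only)."""
    return lambda v: [d * x for d, x in zip(diagonal, v)]


# Toy stand-ins for the per-layer Hessians of five convolutional layers.
layer_hessian_diagonals = [
    [4.0, 6.0, 5.0],
    [0.1, 0.3],
    [1.0, 1.0, 1.0, 1.0],
    [0.05, 0.05],
    [2.0, 4.0],
]

sensitivities = [
    average_hessian_trace(diagonal_hvp(d), len(d)) for d in layer_hessian_diagonals
]
bits = allocate_bit_widths(sensitivities, target_avg_bits=3)
print(sensitivities)
print(bits)
```

In a real network the Hessian is never formed explicitly. The Hessian-vector product would instead come from automatic differentiation of the training loss, and the average bit-width would more plausibly be weighted by each layer's parameter count.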

Authors: LIU Haijun; ZHANG Chenxi; WANG Xiyu; CHEN Changlin; CHEN Jun; LI Zhiwei (College of Electronic Science and Technology, National University of Defense Technology, Changsha 410073, China)

Affiliation: College of Electronic Science and Technology, National University of Defense Technology

Source: Journal of National University of Defense Technology (Peking University Core Journal), 2025, No. 4, pp. 143-150 (8 pages)

Funding: National Natural Science Foundation of China (62074166, 62304254, 62104256, 62404253, U23A20322).

Keywords: convolutional neural network; model quantization; artificial intelligence; mixed precision

Classification code: TP183 [Automation and Computer Technology: Control Theory and Control Engineering]
