端到端双通道特征重标定DenseNet图像分类被引量：13

Image classification method based on end-to-end dual feature reweight DenseNet

导出

摘要目的针对密集连接卷积神经网络(Dense Net)没有充分考虑通道特征相关性以及层间特征相关性的缺点,本文结合软注意力机制提出了端到端双通道特征重标定密集连接卷积神经网络。方法提出的网络同时实现了Dense Net网络的通道特征重标定与层间特征重标定。给出了Dense Net网络通道特征重标定与层间特征重标定方法;构建了端到端双通道特征重标定密集连接卷积神经网络,该网络每个卷积层的输出特征图经过两个通道分别完成通道特征重标定以及层间特征重标定,再进行两种重标定后特征图的融合。结果为了验证本文方法在不同图像分类数据集上的有效性和适应性,在图像分类数据集CIFAR-10/100以及人脸年龄数据集MORPH、Adience上进行了实验,提高了图像分类准确率,并分析了模型的参数量、训练及测试时长,验证了本文方法的实用性。与Dense Net网络相比,40层及64层双通道特征重标定密集连接卷积神经网络DFR-DenseNet (dual feature reweight Dense Net),在CIFAR-10数据集上,参数量仅分别增加1.87%、1.23%,错误率分别降低了12%、9.11%,在CIFAR-100数据集上,错误率分别降低了5.56%、5.41%;与121层DFR-DenseNet网络相比,在MORPH数据集上,平均绝对误差(MAE)值降低了7.33%,在Adience数据集上,年龄组估计准确率提高了2%;与多级特征重标定密集连接卷积神经网络MFR-DenseNet(multiple feature reweight Dense Net)相比,DFR-DenseNet网络参数量减少了一半,测试耗时约缩短为MFR-DenseNet的61%。结论实验结果表明本文端到端双通道特征重标定密集连接卷积神经网络能够增强网络的学习能力,提高图像分类的准确率,并对不同图像分类数据集具有一定的适应性、实用性。 Objective Image classification is one of the important research technologies in computer vision. The development of deep learning and convolutional neural networks(CNNs) has laid the technical foundation for image classification. In recent years,image classification methods based on deep CNN have become an important research topic. DenseN et is one of the widely applied deep CNNs in image classification,encouraging feature reusage and alleviating the vanishing gradient problem. However,this approach has obvious limitations. First,each layer simply combines the feature maps obtained from preceding layers by concatenating operation without considering the interdependencies between different channels. The network representation can be further improved by modeling feature channel correlation and realizing channel feature recalibration. Second,the correlation of the interlayer feature map is not explicitly modeled. Thus,adaptively learning the correlation coefficients by modeling the correlation of feature maps between the layers is important. Method The conventional DenseN et networks do not adequately consider the channel feature correlation and interlayer feature correlation. To address these limitations,multiple feature reweight DenseN et(MFR-DenseN et) combines channel feature reweight DenseN et(CFR-DenseN et) and inter-layer feature reweight DenseN et(ILFR-DenseN et) by ensemble learning method,thereby improving the representation power of the DenseN et by adaptively recalibrating the channel-wise feature responses and explicitly modeling the interdependencies between the features of different convolutional layers. However,MFR-DenseN et uses two independent parallel networks for image classification,which is not end-to-end training. The CFR-DenseN et and the ILFR-DenseN et models should be trained and saved in training. First,the models and weights are loaded,and the MFR-DenseN et needs multiple save and load. The training process is cumbersome. Second,the parameters and calculations are large,so the training takes a long time. In the test,the final prediction results of the MFR-DenseN et are obtained by taking an average of predictions from the two models. The parameters and test time are almost doubled compared with a single-channel feature reweight or interlayer feature reweight network. Therefore,the MFR-DenseN et has high requirements on the storage space and computing performance of the device in practical applications,thereby limiting its application. To address these limitations of MFR-DenseN et,this paper proposes an end-to-end dual feature reweight DenseN et(DFRDenseN et) based on the soft attention mechanism. The network implements the channel feature reweight and interlayer feature reweight of DenseN et. First,the channel feature reweight and interlayer feature reweight method are integrated in DenseN et. By introducing a squeeze-and-excitation module(SEM) after each 3 × 3 convolutional layer,our method solves the problem of exploiting the channel dependencies. Each feature map of each layer in the SEM obtains a weight through a squeeze and excitation operation. The representation of the network can be improved by explicitly modeling the interdependencies between the channels. The output feature map of the convolutional layer is subjected to two squeeze excitation operations. Thus,the weight value of each layer can be obtained to achieve the reweight of the interlayer features. Then,DFRDenseN et was constructed. The output feature map of each convolution layer completes the channel feature reweight and interlayer feature reweight through two channels. The concat and convolution operations were used to achieve the combination of two types of reweighted feature maps. Result First,the DFR-DenseN et is compared with the serial fusion method and parallel-addition fusion method on the image classification dataset CIFAR-10,which proves that DFR-DenseN et is the most effective. Second,to demonstrate the advantage of the DFR-DenseN et,we performed different experiments on the image classification dataset CIFAR-10/100. To show the effectiveness of the method on the high-resolution dataset,we conducted the age classification experiment on the face dataset MORPH,and the age group classification comparison experiment was performed on the unconstrained Adience dataset. The image classification accuracy was significantly improved.The 40-layer DFR-DenseN et had a 4. 69% error and outperformed the 40-layer DenseN et by 12% on CIFAR-10 with only1. 87% more parameters. The 64-layer DFR-DenseN et resulted in a 4. 29% error on CIFAR-10 and outperformed the 64-layer DenseN et by 9. 11%. On CIFAR-100,the 40-layer DFR-DenseN et and 64-layer DFR-DenseN et resulted in a24. 29% and 21. 86% test error on the test set,and they outperformed the 40-layer DenseN et and 64-layer DenseN et by5. 56% and 5. 41%,respectively. Age estimation from a single face image is an essential task in the field of human-computer interaction and computer vision,which has a wide range of practical applications. Age estimation consists of two categories: age classification and age regression. Adience is used for age group classification and obtained 58. 79% accuracy.MORPH Album 2 is used for age regression. The 121-layer DFR-DenseN et had a 3. 16 mean absolute error and outperformed the 121-layer DenseN et by 7. 33% on the MORPH Album 2. Compared with the MFR-DenseN et,the DFRDenseN et reduced the number of parameters by half. The test time of the DFR-DenseN et network was shortened to approximately 61% in the MFR-DenseN et test. Conclusion The experimental results show that the end-to-end dual feature reweight DenseN et can enhance the learning ability of the network and improve the accuracy of image classification.

作者郭玉荣张珂王新胜苑津莎赵振兵马占宇 Guo Yurong;Zhang Ke;Wang Xinsheng;Yuan Jinsha;Zhao Zhenbing;Ma Zhanyu(The Department of Electronic and Communication Engineering,North China Electric Power University,Baoding 071000,China;School of Information and Communication Engineering,Institute of Artificial Intelligence,Beijing University of Posts and Telecommunication,Beijing 100086,China)

机构地区华北电力大学电子与通信工程系北京邮电大学信息与通信工程学院人工智能研究院

出处《中国图象图形学报》 CSCD 北大核心 2020年第3期486-497,共12页 Journal of Image and Graphics

基金国家自然科学基金项目(61871182,61922015,61773071,61302163) 河北省自然科学基金项目(F2015502062,F2016502101,F2017502016) 北京市自然科学基金项目(4192055) 中央高校基本科研经费项目(2018MS094,2018MS095)。

关键词双通道特征重标定密集连接卷积神经网络通道特征重标定层间特征重标定图像分类端到端 dual feature reweight Dense Net(DFR-DenseNet) channel feature reweight inter-layer feature reweight image classification end-to-end

分类号 TP301.6 [自动化与计算机技术—计算机系统结构]

引文网络
相关文献

参考文献2

1张珂,王新胜,郭玉荣,苏昱坤,何颖宣.人脸年龄估计的深度学习方法综述[J].中国图象图形学报,2019,0(8):1215-1230. 被引量：17
2李彦冬,郝宗波,雷航.卷积神经网络研究综述[J].计算机应用,2016,36(9):2508-2515. 被引量：584

二级参考文献71

1LECUN Y, BOTTOU L, BENGIO Y, et al. Gradient-based learning applied to document recognition [J]. Proceedings of the IEEE, 1998, 86(11): 2278-2324.
2HINTON G E, OSINDERO S, TEH Y W. A fast learning algorithm for deep belief nets [J]. Neural Computation, 2006, 18(7): 1527-1554.
3LEE H, GROSSE R, RANGANATH R, et al. Convolutional deep belief networks for scalable unsupervised learning of hierarchical representations [C]// ICML '09: Proceedings of the 26th Annual International Conference on Machine Learning. New York: ACM, 2009: 609-616.
4HUANG G B, LEE H, ERIK G. Learning hierarchical representations for face verification with convolutional deep belief networks [C]// CVPR '12: Proceedings of the 2012 IEEE Conference on Computer Vision and Pattern Recognition. Washington, DC: IEEE Computer Society, 2012: 2518-2525.
5KRIZHEVSKY A, SUTSKEVER I, HINTON G E. ImageNet classification with deep convolutional neural networks [C]// Proceedings of Advances in Neural Information Processing Systems. Cambridge, MA: MIT Press, 2012: 1106-1114.
6GIRSHICK R, DONAHUE J, DARRELL T, et al. Rich feature hierarchies for accurate object detection and semantic segmentation [C]// Proceedings of the 2014 IEEE Conference on Computer Vision and Pattern Recognition. Washington, DC: IEEE Computer Society, 2014: 580-587.
7LONG J, SHELHAMER E, DARRELL T. Fully convolutional networks for semantic segmentation [C]// Proceedings of the 2015 IEEE Conference on Computer Vision and Pattern Recognition. Washington, DC: IEEE Computer Society, 2015: 3431-3440.
8SIMONYAN K, ZISSERMAN A. Very deep convolutional networks for large-scale image recognition [EB/OL]. [2015-11-04]. http://www.robots.ox.ac.uk:5000/~vgg/publications/2015/Simonyan15/simonyan15.pdf.
9SZEGEDY C, LIU W, JIA Y, et al. Going deeper with convolutions [C]// Proceedings of the 2015 IEEE Conference on Computer Vision and Pattern Recognition. Washington, DC: IEEE Computer Society, 2015: 1-8.
10HE K, ZHANG X, REN S, et al. Deep residual learning for image recognition [EB/OL]. [2016-01-04]. https://www.researchgate.net/publication/286512696_Deep_Residual_Learning_for_Image_Recognition.

共引文献594

1侯帅鹏,石英,华逸伦,苏涛.基于改进SSD的行人检测模型[J].武汉理工大学学报,2019,41(7):95-102. 被引量：1
2苟玉晓,江永全,杨燕,周冠禄,林凯.基于全卷积神经网络的公交专用道识别[J].计算机应用研究,2020,37(S01):406-407.
3杨颖.基于MobileNet-SSD的蝶类昆虫识别算法[J].智能计算机与应用,2021,11(4):156-158. 被引量：2
4雷慧静.卷积神经网络综述[J].中国科技纵横,2018,0(16):44-47. 被引量：1
5张玮,张华熊.基于卷积神经网络的纺织面料主成分分类[J].浙江理工大学学报（自然科学版）,2019,41(1):1-8. 被引量：7
6徐思,孙仁诚.结合聚类的半监督分类方法[J].青岛大学学报（自然科学版）,2018,31(4):49-53. 被引量：2
7李辉,钟平,戴玉静,吕东辉.基于深度学习的输电线路锈蚀检测方法的研究[J].电子测量技术,2018,41(22):54-59. 被引量：22
8曾平平,李林升.基于卷积神经网络的水果图像分类识别研究[J].机械设计与研究,2019,35(1):23-26. 被引量：42
9蒋承知,于起,叶文强,甘凇元.卷积神经网络算法的比较探究[J].电子技术与软件工程,2017(7):78-80. 被引量：7
10梁锐,朱清新,廖淑娇,牛新征.基于多特征融合的深度视频自然语言描述方法[J].计算机应用,2017,37(4):1179-1184. 被引量：6

同被引文献93

1石吉勇,邹小波,赵杰文,毛罕平,王开亮,陈正伟,黄晓玮.基于近红外光谱的设施栽培水果黄瓜磷元素亏缺初期快速诊断[J].光谱学与光谱分析,2011,31(12):3264-3268. 被引量：10
2关海鸥,衣淑娟,焦峰,许少华,左豫虎,金宝石.农作物缺素症状诊断的正则化模糊神经网络模型[J].农业机械学报,2012,43(5):162-167. 被引量：15
3朱静华,李玉华,李明悦,高伟.氮、磷、钾对设施蔬菜产量及养分循环的影响[J].中国农学通报,2013,29(7):148-154. 被引量：11
4孙俊,金夏明,毛罕平,武小红,朱文静,张晓东,高洪燕.基于高光谱图像光谱与纹理信息的生菜氮素含量检测[J].农业工程学报,2014,30(10):167-173. 被引量：54
5穆俊祥,曹兴明,刘拴成.氮、磷、钾缺素培养对番茄幼苗生长的影响[J].北方园艺,2015(6):40-42. 被引量：8
6岳有军,杨雪,赵辉,王红君.基于支持向量机的油菜缺素诊断研究[J].广东农业科学,2015,42(20):145-148. 被引量：6
7刘德建.基于LeNet的花卉识别方法[J].电子技术与软件工程,2015(23):13-14. 被引量：10
8李美清,李晋阳,毛罕平.基于光谱特征和生理特征的番茄磷营养诊断方法[J].农业机械学报,2016,47(3):286-291. 被引量：12
9张凯兵,章爱群,李春生.基于HSV空间颜色直方图的油菜叶片缺素诊断[J].农业工程学报,2016,32(19):179-187. 被引量：30
10关海鸥,李佳朋,马晓丹,杜松怀,焦峰.基于冠层颜色特征的大豆缺素症状识别研究[J].西北农林科技大学学报（自然科学版）,2016,44(12):136-142. 被引量：3

引证文献13

1王建云,吴正平,雷帮军,颜洵.基于darknet框架高空视角下车辆的细分类[J].现代电子技术,2021,44(3):124-129. 被引量：4
2秦嘉奇.基于迁移学习的小样本细粒度图像分类方法[J].信息与电脑,2021,33(12):58-60.
3巫昊燕,刘高翔,李忠蔚.基于立体视觉的生态景观布局中三维特征标定[J].计算机仿真,2021,38(8):198-202. 被引量：2
4秦嘉奇.基于Mobilenet的农作物叶片病害识别方法[J].信息与电脑,2021,33(18):181-184. 被引量：4
5韩旭,赵春江,吴华瑞,朱华吉,张燕.基于注意力机制及多尺度特征融合的番茄叶片缺素图像分类方法[J].农业工程学报,2021,37(17):177-188. 被引量：24
6张珂,冯晓晗,郭玉荣,苏昱坤,赵凯,赵振兵,马占宇,丁巧林.图像分类的深度卷积神经网络模型综述[J].中国图象图形学报,2021,26(10):2305-2325. 被引量：142
7姜文涛,赵琳琳,涂潮.双分支多注意力机制的锐度感知分类网络[J].模式识别与人工智能,2023,36(3):252-267. 被引量：7
8邱云飞,张家欣,兰海,宗佳旭.融合张量合成注意力的改进ResNet图像分类模型[J].激光与光电子学进展,2023,60(6):87-96. 被引量：5
9吴清平.基于分类激活图增强的立体视觉图像分类方法[J].重庆科技学院学报（自然科学版）,2023,25(4):53-59. 被引量：1
10吴甜,刘海华,童顺延.基于深度反馈的卷积神经网络的图像分类[J].计算机与现代化,2023(9):82-86. 被引量：4

二级引证文献184

1李莉,陈心宇,高文斌.一种基于FPGA的卷积神经网络加速器实现方案[J].北京电子科技学院学报,2022,30(4):96-104. 被引量：2
2张银胜,杨宇龙,吉茹,蓝天鹤,单慧琳.改进YOLOv5s的风力涡轮机表面缺陷检测[J].电子测量与仪器学报,2023,37(1):40-49. 被引量：21
3刘斌,贾浩强,杨一,申佳,盖美辰,宋天霖.基于改进OpenPose算法的矿工危险行为识别研究[J].电视技术,2023,47(2):20-23. 被引量：7
4朱洪波,张在岩,秦育罗,宋伟东,张晋赫.农村路面多类型病害检测方法研究[J].测绘科学,2022,47(9):170-180. 被引量：5
5杨子勋,陈广新,李长荣,曹文超.基于计算机辅助诊断的皮肤癌良恶性诊断研究[J].新一代信息技术,2022,5(8):134-138.
6龚赛君,曹红.基于通道自适应动态网络剪枝算法的FPGA加速器设计与实现[J].信息与电脑,2021,33(22):66-68. 被引量：1
7张文,杨雅姿,黄驰,陈琳.一种基于YOLOV4 Tiny的目标检测算法[J].电脑与信息技术,2022,30(2):33-37. 被引量：4
8何俊,蒋昌辉,李倡洪,刘鹏,聂勇.基于EF-YOLO的输电线路鸟害检测技术研究[J].现代电子技术,2022,45(10):94-98. 被引量：5
9杨树旺.地铁车辆车号识别系统的研究与应用[J].现代城市轨道交通,2022(5):20-23. 被引量：2
10熊文军,赵山虎,李世博,杨建华,范孝波,孙洪良,陈军.清洁车智能监测与控制系统研究[J].计算机测量与控制,2022,30(5):109-114. 被引量：1

1行金玲,牛乐.高校教师职业能力熵权模糊综合评价研究[J].柳州职业技术学院学报,2020,0(1):42-47. 被引量：4
2李磊云.电气设备安装调试中存在的问题与对策探讨[J].新晋商,2020(2):104-104.
3李姝,李靖(摄).我们开学啦[J].宁夏画报,2020,0(4):46-47.
4Zhuo Zhang,Guangyuan Fu,Rongrong Ni,Jia Liu,Xiaoyuan Yang.A Generative Method for Steganography by Cover Synthesis with Auxiliary Semantics[J].Tsinghua Science and Technology,2020,25(4):516-527. 被引量：6
5刘恒,吴德鑫,徐剑.基于生成式对抗网络的通用性对抗扰动生成方法[J].信息网络安全,2020(5):57-64. 被引量：3
6黄蒙,丁黎,常海,周静,何少蓉,张林军,祝艳龙,安静.真空安定性判据对几种新型高能量密度化合物的适用性研究[J].火炸药学报,2020,43(1):39-44. 被引量：2
7杜娟,安世华,魏明磊,高冰,赵志辰,王梦媛,李保罡.基于直流微网的电力分组传输调度[J].电力系统保护与控制,2020,48(10):106-112. 被引量：1
8马倩敏,郭荣鑫,史天尧,张敏,刘倩.氯离子非稳态电迁测试方法在碱矿渣混凝土中的适用性及其改进措施[J].新型建筑材料,2020,47(3):9-11. 被引量：1
9吴斌方,陈涵,肖书浩.基于SVM与Inception-v3的手势识别[J].计算机系统应用,2020,29(5):189-195. 被引量：4
10简献忠,张雨墨,王如志.基于生成对抗网络的压缩感知图像重构方法[J].包装工程,2020,41(11):239-245. 被引量：5

中国图象图形学报

2020年第3期

浏览历史

内容加载中请稍等...

端到端双通道特征重标定DenseNet图像分类被引量：13

参考文献2

二级参考文献71

共引文献594

同被引文献93

引证文献13

二级引证文献184

相关作者

相关机构

相关主题

浏览历史

端到端双通道特征重标定DenseNet图像分类 被引量：13

参考文献2

二级参考文献71

共引文献594

同被引文献93

引证文献13

二级引证文献184

相关作者

相关机构

相关主题

浏览历史

端到端双通道特征重标定DenseNet图像分类被引量：13