A Fine-Grained RecognitionModel based on Discriminative Region Localization and Efficient Second-Order Feature Encoding

下载PDF

导出

摘要 Discriminative region localization and efficient feature encoding are crucial for fine-grained object recognition.However,existing data augmentation methods struggle to accurately locate discriminative regions in complex backgrounds,small target objects,and limited training data,leading to poor recognition.Fine-grained images exhibit“small inter-class differences,”and while second-order feature encoding enhances discrimination,it often requires dual Convolutional Neural Networks(CNN),increasing training time and complexity.This study proposes a model integrating discriminative region localization and efficient second-order feature encoding.By ranking feature map channels via a fully connected layer,it selects high-importance channels to generate an enhanced map,accurately locating discriminative regions.Cropping and erasing augmentations further refine recognition.To improve efficiency,a novel second-order feature encoding module generates an attention map from the fourth convolutional group of Residual Network 50 layers(ResNet-50)and multiplies it with features from the fifth group,producing second-order features while reducing dimensionality and training time.Experiments on Caltech-University of California,San Diego Birds-200-2011(CUB-200-2011),Stanford Car,and Fine-Grained Visual Classification of Aircraft(FGVC Aircraft)datasets show state-of-the-art accuracy of 88.9%,94.7%,and 93.3%,respectively.

作者 Xiaorui Zhang Yingying Wang Wei Sun Shiyu Zhou Haoming Zhang Pengpai Wang

机构地区 College of Computer and Information Engineering School of Software School of Automation School of Computer Science

出处《Computers, Materials & Continua》 2026年第4期946-965,共20页 计算机、材料和连续体(英文)

基金 supported,in part,by the National Nature Science Foundation of China under Grant 62272236,62376128 and 62306139 the Natural Science Foundation of Jiangsu Province under Grant BK20201136,BK20191401.

关键词 Fine-grained recognition feature encoding data augmentation second-order feature discriminative regions

分类号 TP391.41 [自动化与计算机技术—计算机应用技术]

引文网络
相关文献

1Peipei Zhao,Siyan Yang,Wei Ding,Ruyi Liu,Wentian Xin,Xiangzeng Liu,Qiguang Miao.Learning multi-scale attention network for fine-grained visual classification[J].Journal of Information and Intelligence,2025,3(6):492-503.
2YAN Jie,WEI Yingmei,XIE Yuxiang,GONG Quanzhi,ZOU Shiwei,LUAN Xidao.The brief self-attention module for lightweight convolution neural networks[J].Journal of Systems Engineering and Electronics,2025,36(6):1389-1397.
3王新艳,肖子亚,李勇,李腾,郭延吉.基于LASSO-Cox回归构建主动脉夹层保守治疗预后预测模型[J].中国循证心血管医学杂志,2026,18(2):178-183.
4朱学文,杨春玲,崔银,宋佳.A型主动脉夹层麻醉诱导后低血压预测模型构建[J].医学研究杂志,2026,55(1):122-126.
5张振兴,杨任农,李永林,左家亮,胡利平,陈双艳.基于双重注意力生成对抗网络的文本到图像生成[J].系统工程与电子技术,2026,48(1):34-43.
6Wei Yao.Winter Games Warm Hearts[J].Beijing Review,2026,69(8):46-47.
7Fawad Salam Khan,Noman Hasany,Muzammil Ahmad Khan,Shayan Abbas,Sajjad Ahmed,Muhammad Zorain,Wai Yie Leong,Susama Bagchi,Sanjoy Kumar Debnath.Boruta-LSTMAE:Feature-Enhanced Depth Image Denoising for 3D Recognition[J].Computers, Materials & Continua,2026,87(4):2181-2206.
8Qinzhen Fang,Dongliang Peng,Lu Zeng,Zixuan Jiang.Improved YOLO11 for Maglev Train Foreign Object Detection[J].Journal on Artificial Intelligence,2025,7(1):469-484.
9李宁,程旭,梁河雷,张超,刘帅,张军国.多模态融合的输电线路涉鸟故障鸟类智能识别[J].实验室研究与探索,2026,45(2):30-36.
10顾广华,孙文星,伊柏宇.基于多码深度特征融合生成对抗网络的文本生成图像方法[J].电子与信息学报,2026,48(1):287-296.

Computers, Materials & Continua

2026年第4期

浏览历史

内容加载中请稍等...

A Fine-Grained RecognitionModel based on Discriminative Region Localization and Efficient Second-Order Feature Encoding

相关作者

相关机构

相关主题

浏览历史