Head-Body Guided Deep Learning Framework for Dog Breed Recognition
Authors: Noman Khan, Afnan, (+1 more author), Mi Young Lee, Jakyoung Min. Computers, Materials & Continua, 2025, No. 11, pp. 2935-2958 (24 pages)
Fine-grained dog breed classification presents significant challenges due to subtle inter-class differences, pose variations, and intra-class diversity. To address these complexities and the limitations of traditional handcrafted approaches, a novel and efficient two-stage Deep Learning (DL) framework tailored for robust fine-grained classification is proposed. In the first stage, a lightweight object detector, YOLO v8N (You Only Look Once Version 8 Nano), is fine-tuned to localize both the head and the full body of the dog in each image. In the second stage, a dual-stream Vision Transformer (ViT) architecture independently processes the detected head and body regions, enabling the extraction of region-specific, complementary features. This dual-path approach improves feature discriminability by capturing localized cues that are vital for distinguishing visually similar breeds. The proposed framework introduces several key innovations: (1) a modular and lightweight head-body detection pipeline that balances accuracy with computational efficiency, (2) a region-aware ViT model that leverages spatial attention for enhanced fine-grained recognition, and (3) a training scheme incorporating advanced augmentations and structured supervision to maximize generalization. These contributions collectively enhance model performance while maintaining deployment efficiency. Extensive experiments conducted on the Tsinghua Dogs dataset validate the effectiveness of the approach. The model achieves an accuracy of 90.04%, outperforming existing State-of-the-Art (SOTA) methods across all key evaluation metrics. Furthermore, statistical significance testing confirms the robustness of the observed improvements over multiple baselines. The proposed method presents an effective solution for breed recognition tasks and shows strong potential for broader applications, including pet surveillance, veterinary diagnostics, and cross-species classification. Notably, it achieved an accuracy of 96.85% on the Oxford-IIIT Pet dataset, demonstrating its robustness across different species and breeds.
Keywords: animal science; computer vision; dog breed; deep learning; recognition
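
The two-stage flow described in the abstract (localize head and body, then encode each region in its own stream) can be illustrated with a minimal, dependency-free sketch. Everything named here is hypothetical: `Box`, `crop`, `toy_encoder`, and `dual_stream_features` are illustrative stand-ins, the boxes are supplied by hand rather than by a detector, the encoder is a trivial statistic rather than a ViT, and fusing the two streams by concatenation is an assumption, since the abstract does not say how the features are combined.

```python
from dataclasses import dataclass
from typing import Callable, List

# Toy stand-in for an image: rows of grayscale pixel values.
Image = List[List[float]]
Encoder = Callable[[Image], List[float]]


@dataclass
class Box:
    """Half-open pixel box [x0, x1) x [y0, y1), as a detector might return."""
    x0: int
    y0: int
    x1: int
    y1: int


def crop(image: Image, box: Box) -> Image:
    """Cut the region covered by `box` out of `image`."""
    return [row[box.x0:box.x1] for row in image[box.y0:box.y1]]


def toy_encoder(region: Image) -> List[float]:
    """Placeholder for one ViT stream: returns [mean, max] of the region."""
    pixels = [p for row in region for p in row]
    return [sum(pixels) / len(pixels), max(pixels)]


def dual_stream_features(
    image: Image,
    head_box: Box,
    body_box: Box,
    head_encoder: Encoder = toy_encoder,
    body_encoder: Encoder = toy_encoder,
) -> List[float]:
    """Encode head and body crops independently, then fuse the results.

    Concatenation is an assumed fusion rule, chosen only for illustration.
    """
    head_features = head_encoder(crop(image, head_box))
    body_features = body_encoder(crop(image, body_box))
    return head_features + body_features


if __name__ == "__main__":
    # 4x4 image with pixel values 1..16; the boxes are hand-picked.
    image = [[float(r * 4 + c + 1) for c in range(4)] for r in range(4)]
    print(dual_stream_features(image, Box(0, 0, 2, 2), Box(0, 0, 4, 4)))
```

In a real system the two boxes would come from the fine-tuned detector and each encoder would be a trained network; keeping the encoders as separate arguments mirrors the abstract's point that the head and body regions are processed independently.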