期刊文献+

YOLOv12-enhanced:multi-scale attention and edge information fusion for industrial valve nozzle detection

在线阅读 下载PDF
导出
摘要 Accurate valve nozzle detection is an important component of industrial visual inspection systems;however,structural complexity,scale variation,illumination fluctuation,and partial occlusion remain challenging factors that affect detection stability.This study presents YOLOv12-Enhanced,a refined singlestage detection framework developed for industrial valve nozzle scenarios.The proposed approach incorporates three architectural modifications:a RepViT backbone to enhance hierarchical feature representation through structural re-parameterization and global–local modeling,a Spatial Pyramid Pooling Fast(SPPF)module combined with C2PSA attention to strengthen multi-scale contextual feature extraction,and a Global Edge Information Fusion(GEIF)module to integrate shallow edge information with deep semantic features for improved boundary alignment.Experimental evaluation on the Pascal Visual Object Classes(VOC)dataset shows that the proposed model achieves 71.0%mAP50 and 54.4%mAP50–95 under identical training conditions,exceeding the baseline YOLOv12n.Ablation experiments further demonstrate that each module contributes incremental performance gains.Evaluation on a self-constructed valve nozzle dataset consisting of 500 real industrial images indicates stable detection behavior under varying illumination and partial occlusion conditions.The experimental findings suggest that the proposed structural refinements provide a balanced enhancement in feature representation and localization precision while maintaining comparable computational complexity.
出处 《Advances in Engineering Innovation》 2026年第3期80-91,共12页 工程创新进展(英文)

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部