
Collaborative detection network for sensitive targets and abnormal human behaviour in public places
Abstract  To address the prominent problems of large parameter counts, high computational cost, poor localisation ability, and low recognition accuracy in multi-task recognition models, we design a lightweight collaborative recognition network, LightYOLOv11s. In the backbone and neck of the network, a multi-scale convolution module based on the coordinate attention mechanism, CAConv, is proposed to capture features of targets at multiple scales, with the attention mechanism strengthening semantic understanding and improving localisation precision. The fused neck features are passed to both the Detect and Pose modules in the network head, so that the model decouples target and human-behaviour outputs on top of shared feature extraction and achieves efficient collaborative recognition. A joint loss function is designed to dynamically adjust its weight parameters according to the number of targets and human behaviours in the image, balancing the recognition accuracy of the two tasks. After training, layer-adaptive magnitude-based pruning (LAMP) is applied to remove redundant information and streamline the network structure; in addition, channel-wise knowledge distillation (CWD) normalises the channel activation maps of the teacher network so that the student network accurately learns the teacher's key channel features, optimising the model's predictions. Experimental results show that LightYOLOv11s improves on all four key indicators: F1-score, mAP@0.5, model parameter count, and computational overhead. In target detection, compared with the baseline YOLOv11s, the F1-score and mAP@0.5 increase by 2.62% and 3.48% respectively, while the parameter count decreases by 53.92% and the computational overhead by 55.78%. In human behaviour recognition, compared with the baseline YOLOv11sPose, the F1-score and mAP@0.5 increase by 9.66% and 9.97% respectively, while the parameter count decreases by 55.25% and the computational overhead by 57.74%. While streamlining the network structure, LightYOLOv11s achieves more accurate collaborative recognition of targets and human behaviour, meeting the needs of lightweight deployment. Edge-device deployment experiments were carried out on NPU, GPU, and CPU cluster architectures and compared with test results from the AutoDL server platform, confirming that mobile devices offer significant advantages in recognition precision, inference speed, portability of deployment, and power-bank energy storage.
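The abstract states only that the joint loss dynamically reweights the detection and pose tasks by the number of targets and human behaviours in the image; the exact formula is not given. The following is a purely hypothetical illustration of count-based weighting (the function name, the additive smoothing term `eps`, and the normalisation are all assumptions, not the authors' method):

```python
def joint_loss(det_loss, pose_loss, n_objects, n_persons, eps=1.0):
    """Hypothetical count-based dynamic weighting: the task with more
    instances in the image gets a larger weight, so neither task is
    drowned out; eps keeps both weights nonzero in empty images."""
    w_det = n_objects + eps
    w_pose = n_persons + eps
    total = w_det + w_pose
    # normalise so the weights sum to 1 and the loss scale stays stable
    return (w_det * det_loss + w_pose * pose_loss) / total
```

With equal counts the two per-task losses are simply averaged; as one count grows, its task dominates the gradient proportionally.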
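LAMP's layer-adaptive scoring is what lets a single global threshold prune all layers without hand-tuned per-layer sparsity. A minimal NumPy sketch of the published LAMP rule (score of a weight = its squared magnitude divided by the sum of squared magnitudes of all weights in the same layer that are at least as large), applied as one-shot global magnitude pruning; this is an illustration of the scoring rule, not the paper's actual pruning pipeline for YOLOv11s:

```python
import numpy as np

def lamp_scores(weights):
    """LAMP score per weight: w_i^2 / sum of w_j^2 over all j in the
    same layer with |w_j| >= |w_i|. The layer's largest weight always
    scores exactly 1, making scores comparable across layers."""
    w2 = weights.ravel() ** 2
    order = np.argsort(w2)                      # ascending magnitude
    sorted_w2 = w2[order]
    denom = np.cumsum(sorted_w2[::-1])[::-1]    # suffix sums
    scores = np.empty_like(w2)
    scores[order] = sorted_w2 / denom
    return scores.reshape(weights.shape)

def lamp_prune(layers, sparsity):
    """Zero the globally lowest-scoring fraction `sparsity` of weights."""
    all_scores = np.concatenate([lamp_scores(w).ravel() for w in layers])
    k = int(sparsity * all_scores.size)
    if k == 0:
        return [w.copy() for w in layers]
    threshold = np.partition(all_scores, k - 1)[k - 1]
    return [np.where(lamp_scores(w) <= threshold, 0.0, w) for w in layers]
```

Because every layer's top weight scores 1.0, no layer is pruned away entirely at moderate sparsities, which is the "layer-adaptive" property.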
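The CWD step normalises each teacher channel's activation map into a spatial probability distribution that the student must match. A minimal NumPy sketch, assuming the common channel-wise distillation formulation (spatial softmax per channel with a temperature, then KL(teacher || student)); the temperature value and which layers are distilled are assumptions, since the abstract does not specify them:

```python
import numpy as np

def cwd_loss(student, teacher, tau=4.0):
    """Channel-wise distillation loss for activation maps of shape
    (C, H, W): each channel is flattened and softmax-normalised over
    its H*W spatial positions, and the temperature-scaled KL divergence
    KL(teacher || student) is averaged over channels."""
    c = student.shape[0]
    s = student.reshape(c, -1) / tau
    t = teacher.reshape(c, -1) / tau
    # numerically stable log-softmax over the spatial dimension
    s_log = s - s.max(axis=1, keepdims=True)
    s_log = s_log - np.log(np.exp(s_log).sum(axis=1, keepdims=True))
    t_log = t - t.max(axis=1, keepdims=True)
    t_log = t_log - np.log(np.exp(t_log).sum(axis=1, keepdims=True))
    t_prob = np.exp(t_log)
    kl = (t_prob * (t_log - s_log)).sum(axis=1)   # per-channel KL
    return tau ** 2 * kl.mean()
```

Normalising per channel (rather than per pixel) emphasises where each channel activates most strongly, which is why the student learns the teacher's salient channel features rather than raw magnitudes.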
Authors  Meng Qixiang; Gao Zhilin; Wang Jintao; Kou Qiqi; Bu Fanliang (School of Information Network Security, People's Public Security University of China, Beijing 100038, China; School of Computer Science and Technology, China University of Mining and Technology, Xuzhou, Jiangsu 221116, China)
Source  Opto-Electronic Engineering (《光电工程》, PKU Core), 2025, Issue 8, pp. 126-147 (22 pages)
Funding  National Natural Science Foundation of China (52204177).
Keywords  object detection; behavior recognition; model pruning; knowledge distillation; portable deployment; YOLOv11