针对基于You Only Look Once v2算法的目标检测存在精度低及稳健性差的问题,提出一种车辆目标实时检测的You Only Look Once v2优化算法;该算法以You Only Look Once v2算法为基础,通过增加网络深度,增强特征提取能力,同时,通过添加残...针对基于You Only Look Once v2算法的目标检测存在精度低及稳健性差的问题,提出一种车辆目标实时检测的You Only Look Once v2优化算法;该算法以You Only Look Once v2算法为基础,通过增加网络深度,增强特征提取能力,同时,通过添加残差模块,解决网络深度增加带来的梯度消失或弥散问题;该方法将网络结构中低层特征与高层特征进行融合,提升对小目标车辆的检测精度。结果表明,通过在KITTI数据集上进行测试,优化后的算法在检测速度不变的情况下,提高了车辆目标检测精度,平均精度达到0.94,同时提升了小目标检测的准确性。展开更多
This paper proposes SW-YOLO(StarNet Weighted-Conv YOLO),a lightweight human pose estimation network for edge devices.Current mainstream pose estimation algorithms are computationally inefficient and have poor feature ...This paper proposes SW-YOLO(StarNet Weighted-Conv YOLO),a lightweight human pose estimation network for edge devices.Current mainstream pose estimation algorithms are computationally inefficient and have poor feature capture capabilities for complex poses and occlusion scenarios.This work introduces a lightweight backbone architecture that integrates WConv(Weighted Convolution)and StarNet modules to address these issues.Leveraging StarNet’s superior capabilities in multi-level feature fusion and long-range dependency modeling,this architecture enhances the model’s spatial perception of human joint structures and contextual information integration.These improvements significantly enhance robustness in complex scenarios involving occlusion and deformation.Additionally,the introduction of WConv convolution operations,based on weight recalibration and receptive field optimization,dynamically adjusts feature importance during convolution.This reduces redundant computations while maintaining or enhancing feature representation capabilities at an extremely low computational cost.Consequently,SW-YOLO substantially reduces model complexity and inference latency while preserving high accuracy,significantly outperforming existing lightweight networks.展开更多
Impact craters are important for understanding the evolution of lunar geologic and surface erosion rates,among other functions.However,the morphological characteristics of these micro impact craters are not obvious an...Impact craters are important for understanding the evolution of lunar geologic and surface erosion rates,among other functions.However,the morphological characteristics of these micro impact craters are not obvious and they are numerous,resulting in low detection accuracy by deep learning models.Therefore,we proposed a new multi-scale fusion crater detection algorithm(MSF-CDA)based on the YOLO11 to improve the accuracy of lunar impact crater detection,especially for small craters with a diameter of<1 km.Using the images taken by the LROC(Lunar Reconnaissance Orbiter Camera)at the Chang’e-4(CE-4)landing area,we constructed three separate datasets for craters with diameters of 0-70 m,70-140 m,and>140 m.We then trained three submodels separately with these three datasets.Additionally,we designed a slicing-amplifying-slicing strategy to enhance the ability to extract features from small craters.To handle redundant predictions,we proposed a new Non-Maximum Suppression with Area Filtering method to fuse the results in overlapping targets within the multi-scale submodels.Finally,our new MSF-CDA method achieved high detection performance,with the Precision,Recall,and F1 score having values of 0.991,0.987,and 0.989,respectively,perfectly addressing the problems induced by the lesser features and sample imbalance of small craters.Our MSF-CDA can provide strong data support for more in-depth study of the geological evolution of the lunar surface and finer geological age estimations.This strategy can also be used to detect other small objects with lesser features and sample imbalance problems.We detected approximately 500,000 impact craters in an area of approximately 214 km2 around the CE-4 landing area.By statistically analyzing the new data,we updated the distribution function of the number and diameter of impact craters.Finally,we identified the most suitable lighting conditions for detecting impact crater targets by analyzing the effect of different lighting conditions on the detection accuracy.展开更多
针对少精、弱精患者的家用精子检测仪嵌入式部署需求,提出一种基于改进you only look once(YOLO)v8n的轻量化精子检测算法.通过对YOLOv8n模型进行结构优化,在Neck层引入高效多分支尺度特征金字塔网络(EMBSFPN),在提高精度的同时保证了...针对少精、弱精患者的家用精子检测仪嵌入式部署需求,提出一种基于改进you only look once(YOLO)v8n的轻量化精子检测算法.通过对YOLOv8n模型进行结构优化,在Neck层引入高效多分支尺度特征金字塔网络(EMBSFPN),在提高精度的同时保证了模型的轻量化.在检测头部分采用轻量级共享可变形卷积检测(LSDECD)头替换原来的检测头,大大减少了模型的参数量和运算量.实验结果表明,改进后的算法在精子检测任务上实现了良好的性能,平均精度提高了2.3%,模型运算量减少了26.8%,为嵌入式系统上的精子检测应用提供了一种有效的解决方案.展开更多
针对目前遥感图像小目标检测任务中易出现漏检和误检的问题,提出一种SCS-YOLO[SMCA+CSC+SIoU(shape-aware intersection over union loss)-you only look once]的遥感图像小目标检测算法。首先,针对遥感图像中目标小而聚集的问题,构建...针对目前遥感图像小目标检测任务中易出现漏检和误检的问题,提出一种SCS-YOLO[SMCA+CSC+SIoU(shape-aware intersection over union loss)-you only look once]的遥感图像小目标检测算法。首先,针对遥感图像中目标小而聚集的问题,构建空间多尺度卷积注意力(spatial multi-scale convolutional attention,SMCA),提升模型对空间和通道信息的特征提取能力;其次,针对深层网络传递时小目标语义信息容易丢失的问题,设计聚合亚像素卷积(concentrated sub-pixel convolution,CSC),采用多尺度聚合特征提取方法,增强了网络对语义信息的提取能力;最后,将SIoU损失函数替代原模型中的CIoU(complete intersection over union loss)损失函数,加快了网络的收敛速度。SCS-YOLO模型在RSOD和NWPU VHR-10数据集上,平均精确率的平均值(mAP)分别达到97%和90.9%,相较于原模型分别提升了2.2%和2.7%,可见该方法在遥感图像小目标检测任务中的有效性。展开更多
Loess landslides are one of the geological hazards prevalent in mountainous areas of Loess Plateau,seriously threatening people's lives and property safety.Accurate identification of landslides is a prerequisite f...Loess landslides are one of the geological hazards prevalent in mountainous areas of Loess Plateau,seriously threatening people's lives and property safety.Accurate identification of landslides is a prerequisite for reducing the risk of landslide hazards.Traditional landslide interpretation methods often have the disadvantage of being laborious and difficult to use on a large scale compared with the recently developed deep learning-based landslide detection methods.In this study,we propose an improved deep learning model,landslide detectionyou only look once(LD-YOLO),based on the existing you only look once(YOLO)model for the intelligent identification of old and new landslides in loess areas.Specifically,remote sensing images of landslides in Baoji City,Shaanxi Province,China are acquired from the Google Earth Engine platform.The landslide images of Baoji City(excluding Qianyang County)are used to establish a loess landslide dataset for training the model.The landslide data of Qianyang County is used to verify the detection performance of the model.The focal and efficient IoU(Focal-EIoU)loss function and efficient channel attention(ECA)mechanism are incorporated into the 7th version of YOLO(YOLOv7)model to construct the LD-YOLO model,which makes it more suitable for the landslide detection task.The experiments yielded an improved LD-YOLO model with average precision of 92.05%,precision of 92.31%,recall of 90.28%,and F1-score of 91.28%for loess landslide detection.The landslides in Qianyang County were divided into two test sets,new landslides and old landslides,which were used to test the detection performance of LD-YOLO for both types of landslides.The results show that LD-YOLO detects old landslides with a detection precision of 82.75%and a recall of 80%.When detecting new landslides,the detection precision is 94.29%and the recall is 91.67%.It indicates that our proposed LD-YOLO model has strong detection performance for both new and old landslides in loess areas.Through a proposed solution that can realize the accurate detection of landslides in loess areas,this paper provides a valuable reference for the application of deep learning methods in landslide identification.展开更多
During a sea firing training,the intelligent detection of projectile-induced water column targets in a firing video is the prerequisite for and critical to the automatic calculation of miss distance,while the correct ...During a sea firing training,the intelligent detection of projectile-induced water column targets in a firing video is the prerequisite for and critical to the automatic calculation of miss distance,while the correct and precise calculation of miss distance is directly affected by the accuracy,false alarm rate and time delay of detection.After analyzing the characteristics of projectile-induced water columns,an accurate detection algorithm for time backtracked projectile-induced water columns based on the improved you only look once(YOLO)network is put forward.The capability and accuracy of detecting projectileinduced water column targets with the conventional YOLO network are improved by optimizing the anchor box through K-means clustering and embedding the squeeze and excitation(SE)attention module.The detection area is limited by adopting a sea-sky line detection algorithm based on gray level co-occurrence matrix(GLCM),so as to effectively eliminate such disturbances as ocean waves and ship wakes,and lower the false alarm rate of projectile-induced water column detection.The improved algorithm increases the mAP50 of water column detection by 30.3%.On the basis of correct detection,a time backtracking algorithm is designed with mean shift to track images containing projectile-induced water column in reverse time sequence.It accurately detects a projectile-induced water column at the time of its initial appearance as well as its pixel position in images,and considerably reduces detection delay,so as to provide the support for the automatic,accurate,and real-time calculation of miss distance.展开更多
To avoid suffering gouge and transient overshooting in high speed cutting machining, a novel parametefized curve interpolator model with velocity look-ahead algorithm is proposed. Based on a prearrangement step interp...To avoid suffering gouge and transient overshooting in high speed cutting machining, a novel parametefized curve interpolator model with velocity look-ahead algorithm is proposed. Based on a prearrangement step interpolation algorithm for parameterized curves and considering high curvature points, parameterized curve tool path is divided into acceleration segments and deceleration segments by look-ahead algorithm. Under condition of characteristics of acceleration and deceleration stored in control system, deceleration before high curvature points and acceleration after high curvature points are realized in real-time in high speed cutting machining. Based on new parameterized curve interpolator model with velocity look-ahead algorithm, a real cubic spline is machined simulativly. The simulation results show that velocity look-ahead algorithm improves velocity changing more smoothly.展开更多
为了综合分析YOLO(You Only Look Once)算法在提升交通安全性和效率方面的重要作用,从“人-车-路”3个核心要素的角度,对YOLO算法在交通目标检测中的发展和研究现状进行系统性地总结.概述了YOLO算法常用的评价指标,详细阐述了这些指标...为了综合分析YOLO(You Only Look Once)算法在提升交通安全性和效率方面的重要作用,从“人-车-路”3个核心要素的角度,对YOLO算法在交通目标检测中的发展和研究现状进行系统性地总结.概述了YOLO算法常用的评价指标,详细阐述了这些指标在交通场景中的实际意义.对YOLO算法的核心架构进行概述,追溯了该算法的发展历程,分析各个版本迭代中的优化和改进措施.从“人-车-路”3种交通目标的视角出发,梳理并论述了采用YOLO算法进行交通目标检测的研究现状及应用情况.分析目前YOLO算法在交通目标检测中存在的局限性和挑战,提出相应的改进方法,展望未来的研究重点,为道路交通的智能化发展提供了研究参考.展开更多
To avoid colliding with trees during its operation,a lawn mower robot must detect the trees.Existing tree detection methods suffer from low detection accuracy(missed detection)and the lack of a lightweight model.In th...To avoid colliding with trees during its operation,a lawn mower robot must detect the trees.Existing tree detection methods suffer from low detection accuracy(missed detection)and the lack of a lightweight model.In this study,a dataset of trees was constructed on the basis of a real lawn environment.According to the theory of channel incremental depthwise convolution and residual suppression,the Embedded-A module is proposed,which expands the depth of the feature map twice to form a residual structure to improve the lightweight degree of the model.According to residual fusion theory,the Embedded-B module is proposed,which improves the accuracy of feature-map downsampling by depthwise convolution and pooling fusion.The Embedded YOLO object detection network is formed by stacking the embedded modules and the fusion of feature maps of different resolutions.Experimental results on the testing set show that the Embedded YOLO tree detection algorithm has 84.17%and 69.91%average precision values respectively for trunk and spherical tree,and 77.04% mean average precision value.The number of convolution parameters is 1.78×10^(6),and the calculation amount is 3.85 billion float operations per second.The size of weight file is 7.11MB,and the detection speed can reach 179 frame/s.This study provides a theoretical basis for the lightweight application of the object detection algorithm based on deep learning for lawn mower robots.展开更多
文摘针对基于You Only Look Once v2算法的目标检测存在精度低及稳健性差的问题,提出一种车辆目标实时检测的You Only Look Once v2优化算法;该算法以You Only Look Once v2算法为基础,通过增加网络深度,增强特征提取能力,同时,通过添加残差模块,解决网络深度增加带来的梯度消失或弥散问题;该方法将网络结构中低层特征与高层特征进行融合,提升对小目标车辆的检测精度。结果表明,通过在KITTI数据集上进行测试,优化后的算法在检测速度不变的情况下,提高了车辆目标检测精度,平均精度达到0.94,同时提升了小目标检测的准确性。
文摘This paper proposes SW-YOLO(StarNet Weighted-Conv YOLO),a lightweight human pose estimation network for edge devices.Current mainstream pose estimation algorithms are computationally inefficient and have poor feature capture capabilities for complex poses and occlusion scenarios.This work introduces a lightweight backbone architecture that integrates WConv(Weighted Convolution)and StarNet modules to address these issues.Leveraging StarNet’s superior capabilities in multi-level feature fusion and long-range dependency modeling,this architecture enhances the model’s spatial perception of human joint structures and contextual information integration.These improvements significantly enhance robustness in complex scenarios involving occlusion and deformation.Additionally,the introduction of WConv convolution operations,based on weight recalibration and receptive field optimization,dynamically adjusts feature importance during convolution.This reduces redundant computations while maintaining or enhancing feature representation capabilities at an extremely low computational cost.Consequently,SW-YOLO substantially reduces model complexity and inference latency while preserving high accuracy,significantly outperforming existing lightweight networks.
基金the National Key Research and Development Program of China(Grant No.2022YFF0711400)which provided valuable financial support and resources for my research and made it possible for me to deeply explore the unknown mysteries in the field of lunar geologythe National Space Science Data Center Youth Open Project(Grant No.NSSDC2302001),which has not only facilitated the smooth progress of my research,but has also built a platform for me to communicate and cooperate with experts in the field.
文摘Impact craters are important for understanding the evolution of lunar geologic and surface erosion rates,among other functions.However,the morphological characteristics of these micro impact craters are not obvious and they are numerous,resulting in low detection accuracy by deep learning models.Therefore,we proposed a new multi-scale fusion crater detection algorithm(MSF-CDA)based on the YOLO11 to improve the accuracy of lunar impact crater detection,especially for small craters with a diameter of<1 km.Using the images taken by the LROC(Lunar Reconnaissance Orbiter Camera)at the Chang’e-4(CE-4)landing area,we constructed three separate datasets for craters with diameters of 0-70 m,70-140 m,and>140 m.We then trained three submodels separately with these three datasets.Additionally,we designed a slicing-amplifying-slicing strategy to enhance the ability to extract features from small craters.To handle redundant predictions,we proposed a new Non-Maximum Suppression with Area Filtering method to fuse the results in overlapping targets within the multi-scale submodels.Finally,our new MSF-CDA method achieved high detection performance,with the Precision,Recall,and F1 score having values of 0.991,0.987,and 0.989,respectively,perfectly addressing the problems induced by the lesser features and sample imbalance of small craters.Our MSF-CDA can provide strong data support for more in-depth study of the geological evolution of the lunar surface and finer geological age estimations.This strategy can also be used to detect other small objects with lesser features and sample imbalance problems.We detected approximately 500,000 impact craters in an area of approximately 214 km2 around the CE-4 landing area.By statistically analyzing the new data,we updated the distribution function of the number and diameter of impact craters.Finally,we identified the most suitable lighting conditions for detecting impact crater targets by analyzing the effect of different lighting conditions on the detection accuracy.
文摘针对少精、弱精患者的家用精子检测仪嵌入式部署需求,提出一种基于改进you only look once(YOLO)v8n的轻量化精子检测算法.通过对YOLOv8n模型进行结构优化,在Neck层引入高效多分支尺度特征金字塔网络(EMBSFPN),在提高精度的同时保证了模型的轻量化.在检测头部分采用轻量级共享可变形卷积检测(LSDECD)头替换原来的检测头,大大减少了模型的参数量和运算量.实验结果表明,改进后的算法在精子检测任务上实现了良好的性能,平均精度提高了2.3%,模型运算量减少了26.8%,为嵌入式系统上的精子检测应用提供了一种有效的解决方案.
文摘针对目前遥感图像小目标检测任务中易出现漏检和误检的问题,提出一种SCS-YOLO[SMCA+CSC+SIoU(shape-aware intersection over union loss)-you only look once]的遥感图像小目标检测算法。首先,针对遥感图像中目标小而聚集的问题,构建空间多尺度卷积注意力(spatial multi-scale convolutional attention,SMCA),提升模型对空间和通道信息的特征提取能力;其次,针对深层网络传递时小目标语义信息容易丢失的问题,设计聚合亚像素卷积(concentrated sub-pixel convolution,CSC),采用多尺度聚合特征提取方法,增强了网络对语义信息的提取能力;最后,将SIoU损失函数替代原模型中的CIoU(complete intersection over union loss)损失函数,加快了网络的收敛速度。SCS-YOLO模型在RSOD和NWPU VHR-10数据集上,平均精确率的平均值(mAP)分别达到97%和90.9%,相较于原模型分别提升了2.2%和2.7%,可见该方法在遥感图像小目标检测任务中的有效性。
基金the Huainan Normal University Natural Science Research(Grants No.2022XJYB034)the Fundamental Research Funds for the Central Universities,CHD(Grants No.300102352506)the Natural Science Foundation of Anhui Colleges(Grants No.KJ2020A0313)。
文摘Loess landslides are one of the geological hazards prevalent in mountainous areas of Loess Plateau,seriously threatening people's lives and property safety.Accurate identification of landslides is a prerequisite for reducing the risk of landslide hazards.Traditional landslide interpretation methods often have the disadvantage of being laborious and difficult to use on a large scale compared with the recently developed deep learning-based landslide detection methods.In this study,we propose an improved deep learning model,landslide detectionyou only look once(LD-YOLO),based on the existing you only look once(YOLO)model for the intelligent identification of old and new landslides in loess areas.Specifically,remote sensing images of landslides in Baoji City,Shaanxi Province,China are acquired from the Google Earth Engine platform.The landslide images of Baoji City(excluding Qianyang County)are used to establish a loess landslide dataset for training the model.The landslide data of Qianyang County is used to verify the detection performance of the model.The focal and efficient IoU(Focal-EIoU)loss function and efficient channel attention(ECA)mechanism are incorporated into the 7th version of YOLO(YOLOv7)model to construct the LD-YOLO model,which makes it more suitable for the landslide detection task.The experiments yielded an improved LD-YOLO model with average precision of 92.05%,precision of 92.31%,recall of 90.28%,and F1-score of 91.28%for loess landslide detection.The landslides in Qianyang County were divided into two test sets,new landslides and old landslides,which were used to test the detection performance of LD-YOLO for both types of landslides.The results show that LD-YOLO detects old landslides with a detection precision of 82.75%and a recall of 80%.When detecting new landslides,the detection precision is 94.29%and the recall is 91.67%.It indicates that our proposed LD-YOLO model has strong detection performance for both new and old landslides in loess areas.Through a proposed solution that can realize the accurate detection of landslides in loess areas,this paper provides a valuable reference for the application of deep learning methods in landslide identification.
基金supported by the National Natural Science Foundation of China(51679247)。
文摘During a sea firing training,the intelligent detection of projectile-induced water column targets in a firing video is the prerequisite for and critical to the automatic calculation of miss distance,while the correct and precise calculation of miss distance is directly affected by the accuracy,false alarm rate and time delay of detection.After analyzing the characteristics of projectile-induced water columns,an accurate detection algorithm for time backtracked projectile-induced water columns based on the improved you only look once(YOLO)network is put forward.The capability and accuracy of detecting projectileinduced water column targets with the conventional YOLO network are improved by optimizing the anchor box through K-means clustering and embedding the squeeze and excitation(SE)attention module.The detection area is limited by adopting a sea-sky line detection algorithm based on gray level co-occurrence matrix(GLCM),so as to effectively eliminate such disturbances as ocean waves and ship wakes,and lower the false alarm rate of projectile-induced water column detection.The improved algorithm increases the mAP50 of water column detection by 30.3%.On the basis of correct detection,a time backtracking algorithm is designed with mean shift to track images containing projectile-induced water column in reverse time sequence.It accurately detects a projectile-induced water column at the time of its initial appearance as well as its pixel position in images,and considerably reduces detection delay,so as to provide the support for the automatic,accurate,and real-time calculation of miss distance.
基金Special Project for Key Mechatronic Equipment of Zhejiang Province,China (No.2006Cl1067)Science & Technology Project of Zhejiang Province,China (No. 2005E10049)
文摘To avoid suffering gouge and transient overshooting in high speed cutting machining, a novel parametefized curve interpolator model with velocity look-ahead algorithm is proposed. Based on a prearrangement step interpolation algorithm for parameterized curves and considering high curvature points, parameterized curve tool path is divided into acceleration segments and deceleration segments by look-ahead algorithm. Under condition of characteristics of acceleration and deceleration stored in control system, deceleration before high curvature points and acceleration after high curvature points are realized in real-time in high speed cutting machining. Based on new parameterized curve interpolator model with velocity look-ahead algorithm, a real cubic spline is machined simulativly. The simulation results show that velocity look-ahead algorithm improves velocity changing more smoothly.
文摘为了综合分析YOLO(You Only Look Once)算法在提升交通安全性和效率方面的重要作用,从“人-车-路”3个核心要素的角度,对YOLO算法在交通目标检测中的发展和研究现状进行系统性地总结.概述了YOLO算法常用的评价指标,详细阐述了这些指标在交通场景中的实际意义.对YOLO算法的核心架构进行概述,追溯了该算法的发展历程,分析各个版本迭代中的优化和改进措施.从“人-车-路”3种交通目标的视角出发,梳理并论述了采用YOLO算法进行交通目标检测的研究现状及应用情况.分析目前YOLO算法在交通目标检测中存在的局限性和挑战,提出相应的改进方法,展望未来的研究重点,为道路交通的智能化发展提供了研究参考.
基金the National Natural Science Foundation of China (No.51275223)。
文摘To avoid colliding with trees during its operation,a lawn mower robot must detect the trees.Existing tree detection methods suffer from low detection accuracy(missed detection)and the lack of a lightweight model.In this study,a dataset of trees was constructed on the basis of a real lawn environment.According to the theory of channel incremental depthwise convolution and residual suppression,the Embedded-A module is proposed,which expands the depth of the feature map twice to form a residual structure to improve the lightweight degree of the model.According to residual fusion theory,the Embedded-B module is proposed,which improves the accuracy of feature-map downsampling by depthwise convolution and pooling fusion.The Embedded YOLO object detection network is formed by stacking the embedded modules and the fusion of feature maps of different resolutions.Experimental results on the testing set show that the Embedded YOLO tree detection algorithm has 84.17%and 69.91%average precision values respectively for trunk and spherical tree,and 77.04% mean average precision value.The number of convolution parameters is 1.78×10^(6),and the calculation amount is 3.85 billion float operations per second.The size of weight file is 7.11MB,and the detection speed can reach 179 frame/s.This study provides a theoretical basis for the lightweight application of the object detection algorithm based on deep learning for lawn mower robots.