期刊文献+
共找到8篇文章
< 1 >
每页显示 20 50 100
YOLO-MFD:Remote Sensing Image Object Detection with Multi-Scale Fusion Dynamic Head
1
作者 Zhongyuan Zhang Wenqiu Zhu 《Computers, Materials & Continua》 SCIE EI 2024年第5期2547-2563,共17页
Remote sensing imagery,due to its high altitude,presents inherent challenges characterized by multiple scales,limited target areas,and intricate backgrounds.These inherent traits often lead to increased miss and false... Remote sensing imagery,due to its high altitude,presents inherent challenges characterized by multiple scales,limited target areas,and intricate backgrounds.These inherent traits often lead to increased miss and false detection rates when applying object recognition algorithms tailored for remote sensing imagery.Additionally,these complexities contribute to inaccuracies in target localization and hinder precise target categorization.This paper addresses these challenges by proposing a solution:The YOLO-MFD model(YOLO-MFD:Remote Sensing Image Object Detection withMulti-scale Fusion Dynamic Head).Before presenting our method,we delve into the prevalent issues faced in remote sensing imagery analysis.Specifically,we emphasize the struggles of existing object recognition algorithms in comprehensively capturing critical image features amidst varying scales and complex backgrounds.To resolve these issues,we introduce a novel approach.First,we propose the implementation of a lightweight multi-scale module called CEF.This module significantly improves the model’s ability to comprehensively capture important image features by merging multi-scale feature information.It effectively addresses the issues of missed detection and mistaken alarms that are common in remote sensing imagery.Second,an additional layer of small target detection heads is added,and a residual link is established with the higher-level feature extraction module in the backbone section.This allows the model to incorporate shallower information,significantly improving the accuracy of target localization in remotely sensed images.Finally,a dynamic head attentionmechanism is introduced.This allows themodel to exhibit greater flexibility and accuracy in recognizing shapes and targets of different sizes.Consequently,the precision of object detection is significantly improved.The trial results show that the YOLO-MFD model shows improvements of 6.3%,3.5%,and 2.5%over the original YOLOv8 model in Precision,map@0.5 and map@0.5:0.95,separately.These results illustrate the clear advantages of the method. 展开更多
关键词 Object detection YOLOv8 MULTI-SCALE attention mechanism dynamic detection head
在线阅读 下载PDF
DAFPN-YOLO: An Improved UAV-Based Object Detection Algorithm Based on YOLOv8s
2
作者 Honglin Wang Yaolong Zhang Cheng Zhu 《Computers, Materials & Continua》 2025年第5期1929-1949,共21页
UAV-based object detection is rapidly expanding in both civilian and military applications,including security surveillance,disaster assessment,and border patrol.However,challenges such as small objects,occlusions,comp... UAV-based object detection is rapidly expanding in both civilian and military applications,including security surveillance,disaster assessment,and border patrol.However,challenges such as small objects,occlusions,complex backgrounds,and variable lighting persist due to the unique perspective of UAV imagery.To address these issues,this paper introduces DAFPN-YOLO,an innovative model based on YOLOv8s(You Only Look Once version 8s).Themodel strikes a balance between detection accuracy and speed while reducing parameters,making itwell-suited for multi-object detection tasks from drone perspectives.A key feature of DAFPN-YOLO is the enhanced Drone-AFPN(Adaptive Feature Pyramid Network),which adaptively fuses multi-scale features to optimize feature extraction and enhance spatial and small-object information.To leverage Drone-AFPN’smulti-scale capabilities fully,a dedicated 160×160 small-object detection head was added,significantly boosting detection accuracy for small targets.In the backbone,the C2f_Dual(Cross Stage Partial with Cross-Stage Feature Fusion Dual)module and SPPELAN(Spatial Pyramid Pooling with Enhanced LocalAttentionNetwork)modulewere integrated.These components improve feature extraction and information aggregationwhile reducing parameters and computational complexity,enhancing inference efficiency.Additionally,Shape-IoU(Shape Intersection over Union)is used as the loss function for bounding box regression,enabling more precise shape-based object matching.Experimental results on the VisDrone 2019 dataset demonstrate the effectiveness ofDAFPN-YOLO.Compared to YOLOv8s,the proposedmodel achieves a 5.4 percentage point increase inmAP@0.5,a 3.8 percentage point improvement in mAP@0.5:0.95,and a 17.2%reduction in parameter count.These results highlight DAFPN-YOLO’s advantages in UAV-based object detection,offering valuable insights for applying deep learning to UAV-specific multi-object detection tasks. 展开更多
关键词 YOLOv8 UAV-based object detection AFPN small-object detection head SPPELAN DualConv loss function
在线阅读 下载PDF
SDH-FCOS:An Efficient Neural Network for Defect Detection in Urban Underground Pipelines 被引量:3
3
作者 Bin Zhou Bo Li +2 位作者 Wenfei Lan Congwen Tian Wei Yao 《Computers, Materials & Continua》 SCIE EI 2024年第1期633-652,共20页
Urban underground pipelines are an important infrastructure in cities,and timely investigation of problems in underground pipelines can help ensure the normal operation of cities.Owing to the growing demand for defect... Urban underground pipelines are an important infrastructure in cities,and timely investigation of problems in underground pipelines can help ensure the normal operation of cities.Owing to the growing demand for defect detection in urban underground pipelines,this study developed an improved defect detection method for urban underground pipelines based on fully convolutional one-stage object detector(FCOS),called spatial pyramid pooling-fast(SPPF)feature fusion and dual detection heads based on FCOS(SDH-FCOS)model.This study improved the feature fusion component of the model network based on FCOS,introduced an SPPF network structure behind the last output feature layer of the backbone network,fused the local and global features,added a top-down path to accelerate the circulation of shallowinformation,and enriched the semantic information acquired by shallow features.The ability of the model to detect objects with multiple morphologies was strengthened by introducing dual detection heads.The experimental results using an open dataset of underground pipes show that the proposed SDH-FCOS model can recognize underground pipe defects more accurately;the average accuracy was improved by 2.7% compared with the original FCOS model,reducing the leakage rate to a large extent and achieving real-time detection.Also,our model achieved a good trade-off between accuracy and speed compared with other mainstream methods.This proved the effectiveness of the proposed model. 展开更多
关键词 Urban underground pipelines defect detection SDH-FCOS feature fusion SPPF dual detection heads
在线阅读 下载PDF
Ghost-YOLO v8:An Attention-Guided Enhanced Small Target Detection Algorithm for Floating Litter on Water Surfaces 被引量:1
4
作者 Zhongmin Huangfu Shuqing Li Luoheng Yan 《Computers, Materials & Continua》 SCIE EI 2024年第9期3713-3731,共19页
Addressing the challenges in detecting surface floating litter in artificial lakes,including complex environments,uneven illumination,and susceptibility to noise andweather,this paper proposes an efficient and lightwe... Addressing the challenges in detecting surface floating litter in artificial lakes,including complex environments,uneven illumination,and susceptibility to noise andweather,this paper proposes an efficient and lightweight Ghost-YOLO(You Only Look Once)v8 algorithm.The algorithmintegrates advanced attention mechanisms and a smalltarget detection head to significantly enhance detection performance and efficiency.Firstly,an SE(Squeeze-and-Excitation)mechanism is incorporated into the backbone network to fortify the extraction of resilient features and precise target localization.This mechanism models feature channel dependencies,enabling adaptive adjustment of channel importance,thereby improving recognition of floating litter targets.Secondly,a 160×160 small-target detection layer is designed in the feature fusion neck to mitigate semantic information loss due to varying target scales.This design enhances the fusion of deep and shallow semantic information,improving small target feature representation and enabling better capture and identification of tiny floating litter.Thirdly,to balance performance and efficiency,the GhostConv module replaces part of the conventional convolutions in the feature fusion neck.Additionally,a novel C2fGhost(CSPDarknet53 to 2-Stage Feature Pyramid Networks Ghost)module is introduced to further reduce network parameters.Lastly,to address the challenge of occlusion,a newloss function,WIoU(Wise Intersection over Union)v3 incorporating a flexible and non-monotonic concentration approach,is adopted to improve detection rates for surface floating litter.The outcomes of the experiments demonstrate that the Ghost-YOLO v8 model proposed in this paper performs well in the dataset Marine,significantly enhances precision and recall by 3.3 and 7.6 percentage points,respectively,in contrast with the base model,mAP@0.5 and mAP 0.5:0.95 improve by 5.3 and 4.4 percentage points and reduces the computational volume by 1.88MB,the FPS value hardly decreases,and the efficient real-time identification of floating debris on the water’s surface can be achieved costeffectively. 展开更多
关键词 YOLO v8 surface floating litter target detection attention mechanism small target detection head ghostnet loss function
在线阅读 下载PDF
Improved YOLOv5-Based Inland River Floating Garbage Detection Model
5
作者 HU Wen-hao SI Zhan-jun +1 位作者 SHI Jin-yu YANG Ke 《印刷与数字媒体技术研究》 CAS 北大核心 2024年第5期195-204,共10页
Detection of floating garbage in inland rivers is crucial for water environmental protection,as it effectively reduces ecological damage and ensures the safety of water resources.To address the inefficiency of traditi... Detection of floating garbage in inland rivers is crucial for water environmental protection,as it effectively reduces ecological damage and ensures the safety of water resources.To address the inefficiency of traditional cleanup methods and the challenges in detecting small targets,an improved YOLOv5 object detection model was proposed in this study.In order to enhance the model’s sensitivity to small targets and mitigate the impact of redundant information on detection performance,a bi-level routing attention mechanism was introduced and embedded into the backbone network.Additionally,a multi-scale detection head was incorporated into the model,allowing for more comprehensive coverage of floating garbage of various sizes through multi-scale feature extraction and detection.The Focal-EIoU loss function was also employed to optimize the model parameters,improving localization accuracy.Experimental results on the publicly available FloW_Img dataset demonstrated that the improved YOLOv5 model outperforms the original YOLOv5 model in terms of precision and recall,achieving a mAP(mean average precision)of 86.12%,with significant improvements and faster convergence. 展开更多
关键词 Floatinggarbage YOLOv5 Attentionmechanism Multi-scale detection head Focal-EIoU
在线阅读 下载PDF
The Practice of a Method of Self-Study Students Counting in Classrooms Based on Head Detection in Colleges and Universities
6
作者 Man Liu Lei Yu 《Journal of Computer and Communications》 2022年第4期51-62,共12页
In order to solve the problem that it is difficult for students to find self-study classrooms because of the limited classroom resources, combined with the current situation of informatization in colleges and universi... In order to solve the problem that it is difficult for students to find self-study classrooms because of the limited classroom resources, combined with the current situation of informatization in colleges and universities, a feasible method of students counting in classrooms based on head detection is proposed. This method first collects the scene images in the classroom at regular intervals based on the existing examination monitoring system, and then uses the offline trained AdaBoost cascade detector to detect the head candidate region in the images. Then, the trained CNN-SVM model is used to further identify the head, and finally the identification results are processed and the number of students in the classrooms is counted. The test and practice show that the query system for the idle situation of self-study classrooms constructed by coordinating the classroom seat capacity, classroom scheduling data and the students counting in the classroom based on the above method can easily query the current crowded degree of the students in the classrooms, which plays a good guiding role for students to find self-study classrooms. The method has strong reference and promotion significance for solving similar problems in other universities. 展开更多
关键词 head detection ADABOOST Students Counting CLASSROOM Colleges and Universities
在线阅读 下载PDF
Efficient forest fire detection based on an improved YOLO model 被引量:2
7
作者 Lei Cao Zirui Shen Sheng Xu 《Visual Intelligence》 2024年第1期240-246,共7页
A forest fire is a natural disaster characterized by rapid spread,difficulty in extinguishing,and widespread destruction,which requires an efficient response.Existing detection methods fail to balance global and local... A forest fire is a natural disaster characterized by rapid spread,difficulty in extinguishing,and widespread destruction,which requires an efficient response.Existing detection methods fail to balance global and local fire features,resulting in the false detection of small or hidden fires.In this paper,we propose a novel detection technique based on an improved YOLO v5 model to enhance the visual representation of forest fires and retain more information about global interactions.We add a plug-and-play global attention mechanism to improve the efficiency of neck and backbone feature extraction of the YOLO v5 model.Then,a re-parameterized convolutional module is designed,and a decoupled detection head is used to accelerate the convergence speed.Finally,a weighted bi-directional feature pyramid network(BiFPN)is introduced to merge feature information for local information processing.In the evaluation,we use the complete intersection over union(CIoU)loss function to optimize the multi-task loss for different kinds of forest fires.Experiments show that the precision,recall,and mean average precision are increased by 4.2%,3.8%,and 4.6%,respectively,compared with the classic YOLO v5 model.In particular,the mAP@0.5:0.95 is 2.2% higher than the other detection methods,while meeting the requirements of real-time detection. 展开更多
关键词 Deep learning Forest fire detection Attention mechanism Decoupled detection head Re-parameterized convolution
在线阅读 下载PDF
CSNet:A Count-Supervised Network via Multiscale MLP-Mixer for Wheat Ear Counting
8
作者 Yaoxi Li Xingcai Wu +4 位作者 Qi Wang Zhixun Pei Kejun Zhao Panfeng Chen Gefei Hao 《Plant Phenomics》 CSCD 2024年第4期995-1009,共15页
Wheat is the most widely grown crop in the world,and its yield is closely related to global food security.The number of ears is important for wheat breeding and yield estimation.Therefore,automated wheat ear counting ... Wheat is the most widely grown crop in the world,and its yield is closely related to global food security.The number of ears is important for wheat breeding and yield estimation.Therefore,automated wheat ear counting techniques are essential for breeding high-yield varieties and increasing grain yield.However,all existing methods require position-level annotation for training,implying that a large amount of labor is required for annotation,limiting the application and development of deep learning technology in the agricultural field.To address this problem,we propose a count-supervised multiscale perceptive wheat counting network(CSNet,count-supervised network),which aims to achieve accurate counting of wheat ears using quantity information.In particular,in the absence of location information,CSNet adopts MLP-Mixer to construct a multiscale perception module with a global receptive field that implements the learning of small target attention maps between wheat ear features.We conduct comparative experiments on a publicly available global wheat head detection dataset,showing that the proposed count-supervised strategy outperforms existing position-supervised methods in terms of mean absolute error(MAE)and root mean square error(RMSE).This superior performance indicates that the proposed approach has a positive impact on improving ear counts and reducing labeling costs,demonstrating its great potential for agricultural counting tasks.The code is available at . 展开更多
关键词 global wheat head detection dataset mean absolute error deep learning technology wheat ear counting multiscale perception root mean square error count supervised network mlp mixer
原文传递
上一页 1 下一页 到第
使用帮助 返回顶部