期刊文献+
共找到588篇文章
< 1 2 30 >
每页显示 20 50 100
EHDC-YOLO: Enhancing Object Detection for UAV Imagery via Multi-Scale Edge and Detail Capture
1
作者 Zhiyong Deng Yanchen Ye Jiangling Guo 《Computers, Materials & Continua》 2026年第1期1665-1682,共18页
With the rapid expansion of drone applications,accurate detection of objects in aerial imagery has become crucial for intelligent transportation,urban management,and emergency rescue missions.However,existing methods ... With the rapid expansion of drone applications,accurate detection of objects in aerial imagery has become crucial for intelligent transportation,urban management,and emergency rescue missions.However,existing methods face numerous challenges in practical deployment,including scale variation handling,feature degradation,and complex backgrounds.To address these issues,we propose Edge-enhanced and Detail-Capturing You Only Look Once(EHDC-YOLO),a novel framework for object detection in Unmanned Aerial Vehicle(UAV)imagery.Based on the You Only Look Once version 11 nano(YOLOv11n)baseline,EHDC-YOLO systematically introduces several architectural enhancements:(1)a Multi-Scale Edge Enhancement(MSEE)module that leverages multi-scale pooling and edge information to enhance boundary feature extraction;(2)an Enhanced Feature Pyramid Network(EFPN)that integrates P2-level features with Cross Stage Partial(CSP)structures and OmniKernel convolutions for better fine-grained representation;and(3)Dynamic Head(DyHead)with multi-dimensional attention mechanisms for enhanced cross-scale modeling and perspective adaptability.Comprehensive experiments on the Vision meets Drones for Detection(VisDrone-DET)2019 dataset demonstrate that EHDC-YOLO achieves significant improvements,increasing mean Average Precision(mAP)@0.5 from 33.2%to 46.1%(an absolute improvement of 12.9 percentage points)and mAP@0.5:0.95 from 19.5%to 28.0%(an absolute improvement of 8.5 percentage points)compared with the YOLOv11n baseline,while maintaining a reasonable parameter count(2.81 M vs the baseline’s 2.58 M).Further ablation studies confirm the effectiveness of each proposed component,while visualization results highlight EHDC-YOLO’s superior performance in detecting objects and handling occlusions in complex drone scenarios. 展开更多
关键词 UAV imagery object detection multi-scale feature fusion edge enhancement detail preservation YOLO feature pyramid network attention mechanism
在线阅读 下载PDF
Optimized Convolutional Neural Networks with Multi-Scale Pyramid Feature Integration for Efficient Traffic Light Detection in Intelligent Transportation Systems 被引量:1
2
作者 Yahia Said Yahya Alassaf +2 位作者 Refka Ghodhbani Taoufik Saidani Olfa Ben Rhaiem 《Computers, Materials & Continua》 2025年第2期3005-3018,共14页
Transportation systems are experiencing a significant transformation due to the integration of advanced technologies, including artificial intelligence and machine learning. In the context of intelligent transportatio... Transportation systems are experiencing a significant transformation due to the integration of advanced technologies, including artificial intelligence and machine learning. In the context of intelligent transportation systems (ITS) and Advanced Driver Assistance Systems (ADAS), the development of efficient and reliable traffic light detection mechanisms is crucial for enhancing road safety and traffic management. This paper presents an optimized convolutional neural network (CNN) framework designed to detect traffic lights in real-time within complex urban environments. Leveraging multi-scale pyramid feature maps, the proposed model addresses key challenges such as the detection of small, occluded, and low-resolution traffic lights amidst complex backgrounds. The integration of dilated convolutions, Region of Interest (ROI) alignment, and Soft Non-Maximum Suppression (Soft-NMS) further improves detection accuracy and reduces false positives. By optimizing computational efficiency and parameter complexity, the framework is designed to operate seamlessly on embedded systems, ensuring robust performance in real-world applications. Extensive experiments using real-world datasets demonstrate that our model significantly outperforms existing methods, providing a scalable solution for ITS and ADAS applications. This research contributes to the advancement of Artificial Intelligence-driven (AI-driven) pattern recognition in transportation systems and offers a mathematical approach to improving efficiency and safety in logistics and transportation networks. 展开更多
关键词 Intelligent transportation systems(ITS) traffic light detection multi-scale pyramid feature maps advanced driver assistance systems(ADAS) real-time detection AI in transportation
在线阅读 下载PDF
Pyramid Pooling-Based Vision Transformer for Tool Condition Recognition
3
作者 ZHENG Kun LI Yonglin +2 位作者 GU Xinyan DING Zhiying ZHU Haihua 《Transactions of Nanjing University of Aeronautics and Astronautics》 2025年第3期322-336,共15页
This study focuses on tool condition recognition through data-driven approaches to enhance the intelligence level of computerized numerical control(CNC)machining processes and improve tool utilization efficiency.Tradi... This study focuses on tool condition recognition through data-driven approaches to enhance the intelligence level of computerized numerical control(CNC)machining processes and improve tool utilization efficiency.Traditional tool monitoring methods that rely on empirical knowledge or limited mathematical models struggle to adapt to complex and dynamic machining environments.To address this,we implement real-time tool condition recognition by introducing deep learning technology.Aiming to the insufficient recognition accuracy,we propose a pyramid pooling-based vision Transformer network(P2ViT-Net)method for tool condition recognition.Using images as input effectively mitigates the issue of low-dimensional signal features.We enhance the vision Transformer(ViT)framework for image classification by developing the P2ViT model and adapt it to tool condition recognition.Experimental results demonstrate that our improved P2ViT model achieves 94.4%recognition accuracy,showing a 10%improvement over conventional ViT and outperforming all comparative convolutional neural network models. 展开更多
关键词 tool condition recognition TRANSFORMER pyramid pooling deep convolutional neural network
在线阅读 下载PDF
A multivariate grey incidence model for different scale data based on spatial pyramid pooling 被引量:7
4
作者 ZHANG Ke CUI Le YIN Yao 《Journal of Systems Engineering and Electronics》 SCIE EI CSCD 2020年第4期770-779,共10页
In order to solve the problem that existing multivariate grey incidence models cannot be applied to time series on different scales, a new model is proposed based on spatial pyramid pooling.Firstly, local features of ... In order to solve the problem that existing multivariate grey incidence models cannot be applied to time series on different scales, a new model is proposed based on spatial pyramid pooling.Firstly, local features of multivariate time series on different scales are pooled and aggregated by spatial pyramid pooling to construct n levels feature pooling matrices on the same scale. Secondly,Deng's multivariate grey incidence model is introduced to measure the degree of incidence between feature pooling matrices at each level. Thirdly, grey incidence degrees at each level are integrated into a global incidence degree. Finally, the performance of the proposed model is verified on two data sets compared with a variety of algorithms. The results illustrate that the proposed model is more effective and efficient than other similarity measure algorithms. 展开更多
关键词 grey system spatial pyramid pooling grey incidence multivariate time series
在线阅读 下载PDF
DrownACB-YOLO:an Improved YOLO for Drowning Detection in Swimming Pools
5
作者 ZENG Xiaoya XU Wujun ZHANG Xiunian 《Journal of Donghua University(English Edition)》 2025年第4期417-424,共8页
With the rise in drowning accidents in swimming pools,the demand for the precision and speed in artificial intelligence(AI)drowning detection methods has become increasingly crucial.Here,an improved YOLO-based method,... With the rise in drowning accidents in swimming pools,the demand for the precision and speed in artificial intelligence(AI)drowning detection methods has become increasingly crucial.Here,an improved YOLO-based method,named DrownACB-YOLO,for drowning detection in swimming pools is proposed.Since existing methods focus on the drowned state,a transition label is added to the original dataset to provide timely alerts.Following this expanded dataset,two improvements are implemented in the original YOLOv5.Firstly,the spatial pyramid pooling(SPP)module and the default upsampling operator are replaced by the atrous spatial pyramid pooling(ASPP)module and the content-aware reassembly of feature(CARAFE)module,respectively.Secondly,the cross stage partial bottleneck with three convolutions(C3)module at the end of the backbone is replaced with the bottleneck transformer(BotNet)module.The results of comparison experiments demonstrate that DrownACB-YOLO performs better than other models. 展开更多
关键词 drowning detection YOLO atrous spatial pyramid pooling(ASPP) content-aware reassembly of feature(CARAFE)
在线阅读 下载PDF
基于空间通道自适应特征的肝脏病理图像分割网络
6
作者 王建宇 王朝立 +1 位作者 孙占全 刘晓虹 《电子科技》 2026年第1期9-17,共9页
针对肝脏病理图像中病变区域与周围组织相似度高、对比度低以及边界模糊等问题,文中提出了一个基于空间通道自适应特征的肝脏病理分割网络。通过混合校准注意力使网络能够自适应地选择经空间和通道校准过的特征信息,有利于编码器捕获与... 针对肝脏病理图像中病变区域与周围组织相似度高、对比度低以及边界模糊等问题,文中提出了一个基于空间通道自适应特征的肝脏病理分割网络。通过混合校准注意力使网络能够自适应地选择经空间和通道校准过的特征信息,有利于编码器捕获与肝脏病灶相关的重要特征,并在编码器最深层引入空洞空间金字塔池化模块来弥补高级特征所缺失的多尺度信息,提高模型的分割精度。在私有肝脏数据集、公开肝脏数据集以及其他两种公开病理数据集对所提网络进行对比实验和消融实验。实验结果表明,相较于其他方法,所提网络的分割结果较佳,且有效解决了肝细胞癌分割问题。 展开更多
关键词 肝细胞癌 病理图像 编解码架构 混合校准注意力模块 空间注意力 通道注意力 空洞空间金字塔池化模块 多尺度信息
在线阅读 下载PDF
基于多层次特征融合和注意力机制的无人机图像小目标检测算法 被引量:1
7
作者 张信佳 王芳 《计算机工程》 北大核心 2026年第2期148-157,共10页
无人机(UAV)航拍图像中的目标通常具有尺度密集、易被遮挡且多为小目标等特点,这导致检测过程中容易出现漏检和误检。为应对上述挑战,基于YOLOv5s提出了针对小目标检测的SNA-YOLOv5s算法。首先,引入空间深度转换卷积(SPD-Conv)模块替换... 无人机(UAV)航拍图像中的目标通常具有尺度密集、易被遮挡且多为小目标等特点,这导致检测过程中容易出现漏检和误检。为应对上述挑战,基于YOLOv5s提出了针对小目标检测的SNA-YOLOv5s算法。首先,引入空间深度转换卷积(SPD-Conv)模块替换原模型的跨步卷积层,避免细节信息丢失,增强小目标特征提取能力;其次,设计新型平均快速空间金字塔池化(AGSPPF)模块,引入平均池化操作缓解池化层在提取特征信息的同时会导致部分信息丢失的问题,提升模型的特征提取能力;再次,新增针对小目标的大尺度检测分支,捕捉浅层特征中丰富的细节信息,提升模型对小目标的检测能力;最后,将归一化注意力机制(NAM)嵌入骨干网络,对特征信息进行加权处理,抑制无效的特征信息。在VisDrone2019数据集和NWPU VHR-10数据集上的训练测试结果表明,该算法的均值平均精度(mAP)分别达到了42.3%和96.5%,与基线模型YOLOv5s相比分别提高了8.4和2.6百分点。通过与其他基于深度学习的主流模型对比实验,进一步验证了该模型的鲁棒性和精确性。 展开更多
关键词 YOLOv5s模型 小目标检测 空间深度转换卷积 空间金字塔池化 归一化注意力机制
在线阅读 下载PDF
改进YOLOv5的道路小目标检测算法
8
作者 王海涛 裴树军 +1 位作者 裴洪扬 许靖 《哈尔滨理工大学学报》 北大核心 2026年第1期35-46,共12页
针对目前自动驾驶领域目标检测算法在对道路小目标检测时经常出现漏检、误检的问题,提出基于YOLOv5s的目标检测算法SCE-YOLOv5。首先,借鉴SPPCSPC的思想,对主干网络中的SPPF层进行了重新设计,确保在提高模型精度和感受野的同时获得速度... 针对目前自动驾驶领域目标检测算法在对道路小目标检测时经常出现漏检、误检的问题,提出基于YOLOv5s的目标检测算法SCE-YOLOv5。首先,借鉴SPPCSPC的思想,对主干网络中的SPPF层进行了重新设计,确保在提高模型精度和感受野的同时获得速度的提升。其次,Neck部分的上采样算子被替换为CARAFE,能够在较大的感受野内聚合上下文信息,避免上采样过程中部分特征信息缺失。最后,在每一层检测头前都引入EMA注意力机制,对全局信息进行编码,提取更多的特征信息。实验结果表明:在KITTI数据集和Kaggle数据集上,改进后的算法与原算法相比,m AP值有所提高,并且检测速度也分别达到了90帧/秒和61帧/秒,具备较高的实时性,可以满足自动驾驶道路目标检测的需求。 展开更多
关键词 道路目标检测 注意力机制 YOLOv5 空间金字塔池化 特征上采样
在线阅读 下载PDF
基于跨尺度特征融合的内窥镜图像增强算法
9
作者 刘旭阳 蔡芸 蒋林 《现代电子技术》 北大核心 2026年第1期34-40,共7页
临床医学的内窥镜图像由于在成像过程中存在补充光源不均匀和人体组织粘液反光的问题,出现大量曝光过度等图像质量较低的现象。现有基于深度学习的图像增强算法由于仅采用固定尺寸的特征融合方式,导致特征提取能力较低、增强效果较差。... 临床医学的内窥镜图像由于在成像过程中存在补充光源不均匀和人体组织粘液反光的问题,出现大量曝光过度等图像质量较低的现象。现有基于深度学习的图像增强算法由于仅采用固定尺寸的特征融合方式,导致特征提取能力较低、增强效果较差。为改善这一问题,文中构建了基于跨尺度特征融合的内窥镜图像增强算法,通过构建CM卷积模块实现高性能特征提取,同时采用SPPF金字塔池化模块实现对特征图不同尺度的池化操作,并且在网络不同尺度的网络层之间引入跨尺度特征融合(CFF)模块,实现多尺度特征融合和上下文信息传播,从而大幅提高图像细节捕捉能力和图像质量。实验结果表明,文中算法在PSNR、SSIM指标均高于现有算法,其中PSNR指标提高了9.9%,SSIM指标提高了15.4%,可以实现高质量内窥镜图像增强任务。 展开更多
关键词 内窥镜图像 深度特征融合 CFF 曝光异常 图像增强算法 金字塔池化模块
在线阅读 下载PDF
基于多尺度金字塔池化的自适应无参考图像质量评价
10
作者 吴雪松 陈媛媛 周涛 《计算机工程》 北大核心 2026年第3期107-118,共12页
在图像质量评价(IQA)领域,无参考质量评价方法在处理真实场景下的失真图像时展现了巨大的应用价值和未来发展潜力,然而真实环境中的失真图像具有高度的多样性和复杂性,增加了相关评价算法设计的难度。近年来,深度学习技术在图像分类、... 在图像质量评价(IQA)领域,无参考质量评价方法在处理真实场景下的失真图像时展现了巨大的应用价值和未来发展潜力,然而真实环境中的失真图像具有高度的多样性和复杂性,增加了相关评价算法设计的难度。近年来,深度学习技术在图像分类、目标检测以及图像分割等细分领域均取得了令人瞩目的成果。这些进展推动科研人员将深度神经网络(DNN)技术引入IQA中。DNN凭借其出色的特征提取和学习能力,为真实环境中的失真IQA带来了创新性的解决方案和显著的进步。但是,现有方法在处理真实场景图像质量描述时仍存在一定的局限性,特别是在应对图像内容多样性方面。此外,许多基于DNN的IQA方法需要对输入图像进行缩放或裁剪以固定分辨率,这往往会破坏图像的原始结构和内容,从而影响质量评估的准确性和泛化能力。为了解决这些问题,提出一种基于多尺度金字塔池化的自适应无参考图像质量评价方法(MSPP-IQA)。MSPP-IQA允许直接使用原始尺寸的图像进行质量评估,无需任何图像预处理,通过引入图像内容理解模块和注意力模块,模仿人类视觉系统(HVS)的工作原理,同时感知全局高级特征和局部低级特征。实验结果表明,相较于当前主流方法,MSPP-IQA在真实失真和合成失真数据集上均表现出良好的性能。这一实验结果充分证明了MSPP-IQA在应对真实失真IQA挑战方面的有效性和优越性。 展开更多
关键词 无参考图像质量评价 真实失真 多尺度特征融合 空间金字塔池化 注意力机制
在线阅读 下载PDF
轻量化几何流形深度网络的自闭症诊断方法
11
作者 吴金颖 马慧彬 《现代信息科技》 2026年第3期70-75,共6页
自闭症谱系障碍(Autism Spectrum Disorder,ASD)是一类常见的中枢神经发育障碍,其临床诊断存在主观性强、准确性不足的问题。为提升功能性磁共振成像(functional Magnetic Resonance Imaging,fMRI)在计算机辅助诊断中的性能,文章构建了... 自闭症谱系障碍(Autism Spectrum Disorder,ASD)是一类常见的中枢神经发育障碍,其临床诊断存在主观性强、准确性不足的问题。为提升功能性磁共振成像(functional Magnetic Resonance Imaging,fMRI)在计算机辅助诊断中的性能,文章构建了一种轻量化几何流形深度网络(Lightweight Geometric Manifold Deep Network,LGMD-Net)。该网络采用多通道二维残差结构结合医学几何先验获取三维信息,通过轻量化残差块实现高效特征提取;在特征降维阶段,引入空间金字塔池化捕获多尺度空间特征,并结合流形混合(Manifold Mixup)技术实现特征增强与正则化。实验在ABIDE数据集上开展,结果表明,该方法在分类准确率、F1值及AUC等指标上均优于卷积神经网络、Transformer及图神经网络模型,同时在参数规模、推理时间与显存占用方面具备显著优势。研究结果验证了LGMD-Net在提升诊断准确性及工程化应用中的可行性。 展开更多
关键词 自闭症谱系障碍 功能磁共振成像 轻量化网络 空间金字塔池化 流形混合
在线阅读 下载PDF
SCVi-Net:一种基于混合模型的视网膜血管分割方法
12
作者 茅纪慧 姜尚格日乐 江旻珊 《光学仪器》 2026年第1期30-42,共13页
现有视网膜血管分割方法通常受限于局部感受野,难以有效捕获全局信息。此外,血管结构在不同尺度下的形态差异较大,使得多尺度特征融合变得困难。为了解决上述问题,提出了一种高效的视网膜血管分割模型SCVi-Net。该模型在U-Net的基础上... 现有视网膜血管分割方法通常受限于局部感受野,难以有效捕获全局信息。此外,血管结构在不同尺度下的形态差异较大,使得多尺度特征融合变得困难。为了解决上述问题,提出了一种高效的视网膜血管分割模型SCVi-Net。该模型在U-Net的基础上改进了跳跃连接,引入一个新的空间通道联合注意力模块,通过自适应调整空间和通道权重,增强了特征提取能力。通过在编码器最深层加入视觉Transformer模块,SCVi-Net的全局信息捕获能力得到了提升。空洞空间金字塔池化模块能有效提取多尺度特征,可增强网络的鲁棒性。侧边多尺度融合模块通过融合多个侧边输出,优化了训练过程,从而提升了血管区域的分割精度。为评估模型的优越性,在DRIVE、CHASEDB1和STARE数据集上进行了对比实验,结果表明,SCVi-Net在复杂视网膜血管图像中具有较好的分割精度和鲁棒性。 展开更多
关键词 视网膜血管分割 U-Net 联合注意力机制 Transformer 空洞卷积 SCVi-Net 医学图像处理
在线阅读 下载PDF
基于ASPP-UNet的地震波阻抗反演方法
13
作者 岳碧波 颜鹏 +1 位作者 杜彦志 周强 《石油地球物理勘探》 北大核心 2026年第1期1-16,共16页
深度学习以其强大的非线性映射问题处理能力而在地震波阻抗反演中得到了广泛的关注。常规深度学习地震波阻抗反演方法存在对标记数据过于依赖,尤其当训练测井数据不足时存在反演模型局部特征的提取能力下降、精度不足的问题。为此,提出... 深度学习以其强大的非线性映射问题处理能力而在地震波阻抗反演中得到了广泛的关注。常规深度学习地震波阻抗反演方法存在对标记数据过于依赖,尤其当训练测井数据不足时存在反演模型局部特征的提取能力下降、精度不足的问题。为此,提出了一种基于空洞空间金字塔池化的U-Net网络(ASPP-UNet)地震波阻抗反演方法,利用金字塔池化方法增强U-Net网络的多尺度特征提取能力,据此利用地震记录数据和少数测井数据构建训练集。为了验证所提方法的有效性,将其应用在Marmousi2和SEAM两种公开测试数据的地震波阻抗反演中,每组测试试验均与CNN、U-Net、Attention-UNet三种深度学习地震波阻抗反演结果进行对比。实验结果均表明,在同等实验条件下,该方法得到的单道波阻抗反演结果高频细节成分更丰富,反演波阻抗剖面在层间及断层处纵向连接平滑;反演结果对标记数据的依赖性低,在远离训练测井位置处信息丢失最少,表现为反演结果剖面的道间横向连续性好,各项统计指标均优于其他三种对比方法。为进一步验证所提方法的可行性,将其应用于四川省东部实际勘探数据地震波阻抗反演,其准确度均优于上述三种对比方法,所得波阻抗剖面与实际地质特征更吻合,波阻抗误差最小。 展开更多
关键词 波阻抗反演 ASPP-UNet 空洞卷积 空洞空间金字塔池化
在线阅读 下载PDF
基于语义增强和尺度感知的光伏组件红外图像缺陷检测
14
作者 潘战国 洪文龙 《广东电力》 北大核心 2026年第1期34-45,共12页
针对光伏组件红外缺陷图像中背景干扰强、目标尺度差异大、小缺陷难分辨导致检测精度低的问题,提出一种基于语义增强与空间金字塔网络(semantic enhancement and spatial pyramid network,SESPNet)的红外图像光伏组件缺陷检测算法。首先... 针对光伏组件红外缺陷图像中背景干扰强、目标尺度差异大、小缺陷难分辨导致检测精度低的问题,提出一种基于语义增强与空间金字塔网络(semantic enhancement and spatial pyramid network,SESPNet)的红外图像光伏组件缺陷检测算法。首先,构建一种语义信息增强模块并嵌入骨干网络,融合全局与局部语义信息,增强特征表达能力,抑制复杂背景噪音的干扰;其次,采用空间注意金字塔池化模块替代YOLOv10中原本的空间金字塔池化模块,通过局部和全局特征信息的加权融合,增强对多尺度缺陷的感知能力;最后,在颈部网络构建多尺度通道注意力机制,通过建立不同通道之间的信息交互,进一步提升对小尺度特征信息的提取能力。使用自制光伏组件红外缺陷数据集开展实验,结果表明:SESPNet的平均精度均值P_(mA)达到92.1%,检测速度达到62.4帧/s,显著优于其它主流检测算法。嵌入式环境下的对照实验结果证明,SESPNet在受限计算资源上仍具备出色的实时性与检测性能。 展开更多
关键词 光伏组件 目标检测 嵌入式系统 YOLOv10 语义信息增强模块 空间注意金字塔池化 多尺度通道注意力机制
在线阅读 下载PDF
联合注意力机制和金字塔池化的U型视网膜血管分割网络
15
作者 陆鹏程 程科 +1 位作者 陈思 姚壮 《计算机与数字工程》 2026年第1期231-236,共6页
血管是眼底视网膜主要解剖结构,眼底视网膜血管分割图像已被广泛用于心脑血管和视网膜疾病的判断,因此,合适的眼底视网膜血管分割方法对于视网膜疾病的检测有着重要的意义。联合注意力机制和金字塔池化的U型视网膜血管分割网络——APU-N... 血管是眼底视网膜主要解剖结构,眼底视网膜血管分割图像已被广泛用于心脑血管和视网膜疾病的判断,因此,合适的眼底视网膜血管分割方法对于视网膜疾病的检测有着重要的意义。联合注意力机制和金字塔池化的U型视网膜血管分割网络——APU-Net,以U-Net作为主干网络,下采样替换成金字塔池化网络,级联时加入注意力机制模块,提高了对细微血管的分割能力。算法在CHASEDB1数据集上进行测试,较U-Net及其他主流视网膜血管分割算法在准确性、敏感性和特异率指标上均有一定的提高。 展开更多
关键词 U-Net 金字塔池化 注意力机制 图像分割
在线阅读 下载PDF
基于模块化AI模组的火灾图像智能检测
16
作者 杨涛 汪友杰 王伟 《齐鲁工业大学学报》 2026年第1期57-64,共8页
火灾监测对减少生命财产损失至关重要,但传统方法在复杂环境中存在实时性与准确性不足的问题。本文提出一种基于改进YOLOv5s的轻量化火灾图像检测算法,结合边缘计算技术优化监测系统。通过引入卷积块注意力模块(CBAM)增强特征学习能力,... 火灾监测对减少生命财产损失至关重要,但传统方法在复杂环境中存在实时性与准确性不足的问题。本文提出一种基于改进YOLOv5s的轻量化火灾图像检测算法,结合边缘计算技术优化监测系统。通过引入卷积块注意力模块(CBAM)增强特征学习能力,采用多孔空间金字塔池化(ASPP)扩大模型感受野,并利用EIoU Loss损失函数加速收敛、提升回归精度。实验表明,改进后模型的火灾识别率提高至94%,精确率与召回率分别达到94.2%和92.4%。通过将系统搭载在模块化AI模组上,直接处理视频数据,避免了云端传输延迟,显著提升了检测实时性。该方法为复杂场景下的火灾监测提供了高效解决方案,对提升应急响应能力具有重要意义。 展开更多
关键词 边缘计算 火灾监测 神经网络 卷积块注意力模块 多孔空间金字塔池化
在线阅读 下载PDF
Bidirectional parallel multi-branch convolution feature pyramid network for target detection in aerial images of swarm UAVs 被引量:4
17
作者 Lei Fu Wen-bin Gu +3 位作者 Wei Li Liang Chen Yong-bao Ai Hua-lei Wang 《Defence Technology(防务技术)》 SCIE EI CAS CSCD 2021年第4期1531-1541,共11页
In this paper,based on a bidirectional parallel multi-branch feature pyramid network(BPMFPN),a novel one-stage object detector called BPMFPN Det is proposed for real-time detection of ground multi-scale targets by swa... In this paper,based on a bidirectional parallel multi-branch feature pyramid network(BPMFPN),a novel one-stage object detector called BPMFPN Det is proposed for real-time detection of ground multi-scale targets by swarm unmanned aerial vehicles(UAVs).First,the bidirectional parallel multi-branch convolution modules are used to construct the feature pyramid to enhance the feature expression abilities of different scale feature layers.Next,the feature pyramid is integrated into the single-stage object detection framework to ensure real-time performance.In order to validate the effectiveness of the proposed algorithm,experiments are conducted on four datasets.For the PASCAL VOC dataset,the proposed algorithm achieves the mean average precision(mAP)of 85.4 on the VOC 2007 test set.With regard to the detection in optical remote sensing(DIOR)dataset,the proposed algorithm achieves 73.9 mAP.For vehicle detection in aerial imagery(VEDAI)dataset,the detection accuracy of small land vehicle(slv)targets reaches 97.4 mAP.For unmanned aerial vehicle detection and tracking(UAVDT)dataset,the proposed BPMFPN Det achieves the mAP of 48.75.Compared with the previous state-of-the-art methods,the results obtained by the proposed algorithm are more competitive.The experimental results demonstrate that the proposed algorithm can effectively solve the problem of real-time detection of ground multi-scale targets in aerial images of swarm UAVs. 展开更多
关键词 Aerial images Object detection Feature pyramid networks multi-scale feature fusion Swarm UAVs
在线阅读 下载PDF
Neighborhood fusion-based hierarchical parallel feature pyramid network for object detection 被引量:3
18
作者 Mo Lingfei Hu Shuming 《Journal of Southeast University(English Edition)》 EI CAS 2020年第3期252-263,共12页
In order to improve the detection accuracy of small objects,a neighborhood fusion-based hierarchical parallel feature pyramid network(NFPN)is proposed.Unlike the layer-by-layer structure adopted in the feature pyramid... In order to improve the detection accuracy of small objects,a neighborhood fusion-based hierarchical parallel feature pyramid network(NFPN)is proposed.Unlike the layer-by-layer structure adopted in the feature pyramid network(FPN)and deconvolutional single shot detector(DSSD),where the bottom layer of the feature pyramid network relies on the top layer,NFPN builds the feature pyramid network with no connections between the upper and lower layers.That is,it only fuses shallow features on similar scales.NFPN is highly portable and can be embedded in many models to further boost performance.Extensive experiments on PASCAL VOC 2007,2012,and COCO datasets demonstrate that the NFPN-based SSD without intricate tricks can exceed the DSSD model in terms of detection accuracy and inference speed,especially for small objects,e.g.,4%to 5%higher mAP(mean average precision)than SSD,and 2%to 3%higher mAP than DSSD.On VOC 2007 test set,the NFPN-based SSD with 300×300 input reaches 79.4%mAP at 34.6 frame/s,and the mAP can raise to 82.9%after using the multi-scale testing strategy. 展开更多
关键词 computer vision deep convolutional neural network object detection hierarchical parallel feature pyramid network multi-scale feature fusion
在线阅读 下载PDF
Intelligent identification of oceanic eddies in remote sensing data via Dual-Pyramid UNet 被引量:2
19
作者 Nan Zhao Baoxiang Huang +2 位作者 Xinmin Zhang Linyao Ge Ge Chen 《Atmospheric and Oceanic Science Letters》 CSCD 2023年第4期29-36,共8页
海洋涡旋是大洋中重要的组成部分,对海洋能量和物质的输送至关重要.海洋涡旋的检测和表征无论是对于海洋气象学,海洋声学还是海洋生物学等领域都具有重要的研究价值.本文基于UNet架构,并结合金字塔分割注意力(PSA)模块和空洞空间卷积池... 海洋涡旋是大洋中重要的组成部分,对海洋能量和物质的输送至关重要.海洋涡旋的检测和表征无论是对于海洋气象学,海洋声学还是海洋生物学等领域都具有重要的研究价值.本文基于UNet架构,并结合金字塔分割注意力(PSA)模块和空洞空间卷积池化金字塔(ASPP)构造了Dual-Pyramid UNet模型,以平面异常和海表面温度数据中进行海洋涡旋的识别.实验在北大西洋和南大西洋两个涡旋活跃区域进行并选用多个评价指标对识别结果进行评价以证明模型的优异性能. 展开更多
关键词 海洋涡旋识别 深度学习 金字塔分割注意 空洞空间卷积池化金字塔 U型网络架构
在线阅读 下载PDF
IMTNet:Improved Multi-Task Copy-Move Forgery Detection Network with Feature Decoupling and Multi-Feature Pyramid
20
作者 Huan Wang Hong Wang +2 位作者 Zhongyuan Jiang Qing Qian Yong Long 《Computers, Materials & Continua》 SCIE EI 2024年第9期4603-4620,共18页
Copy-Move Forgery Detection(CMFD)is a technique that is designed to identify image tampering and locate suspicious areas.However,the practicality of the CMFD is impeded by the scarcity of datasets,inadequate quality a... Copy-Move Forgery Detection(CMFD)is a technique that is designed to identify image tampering and locate suspicious areas.However,the practicality of the CMFD is impeded by the scarcity of datasets,inadequate quality and quantity,and a narrow range of applicable tasks.These limitations significantly restrict the capacity and applicability of CMFD.To overcome the limitations of existing methods,a novel solution called IMTNet is proposed for CMFD by employing a feature decoupling approach.Firstly,this study formulates the objective task and network relationship as an optimization problem using transfer learning.Furthermore,it thoroughly discusses and analyzes the relationship between CMFD and deep network architecture by employing ResNet-50 during the optimization solving phase.Secondly,a quantitative comparison between fine-tuning and feature decoupling is conducted to evaluate the degree of similarity between the image classification and CMFD domains by the enhanced ResNet-50.Finally,suspicious regions are localized using a feature pyramid network with bottom-up path augmentation.Experimental results demonstrate that IMTNet achieves faster convergence,shorter training times,and favorable generalization performance compared to existingmethods.Moreover,it is shown that IMTNet significantly outperforms fine-tuning based approaches in terms of accuracy and F_(1). 展开更多
关键词 Image copy-move detection feature decoupling multi-scale feature pyramids passive forensics
在线阅读 下载PDF
上一页 1 2 30 下一页 到第
使用帮助 返回顶部