301 articles found
Enhanced Cutaneous Melanoma Segmentation in Dermoscopic Images Using a Dual U-Net Framework with Multi-Path Convolution Block Attention Module and SE-Res-Conv
1
Authors: Kun Lan, Feiyang Gao, Xiaoliang Jiang, Jianzhen Cheng, Simon Fong. Computers, Materials & Continua, 2025, No. 9, pp. 4805-4824 (20 pages).
With the continuous development of artificial intelligence and machine learning techniques, effective methods have emerged to support dermatologists in skin cancer detection. However, accurately segmenting melanomas in dermoscopic images remains challenging because of objects that can interfere with human observation, such as bubbles and scales. To address these challenges, we propose a dual U-Net framework for skin melanoma segmentation. In our proposed architecture, we introduce several innovative components that aim to enhance the performance and capabilities of the traditional U-Net. First, we establish a novel framework that links two simplified U-Nets, enabling more comprehensive information exchange and feature integration throughout the network. Second, after cascading the second U-Net, we introduce a skip connection between the decoder and encoder networks and incorporate a modified receptive field block (MRFB), which is designed to capture multi-scale spatial information. Third, to further enhance feature representation, we add a multi-path convolution block attention module (MCBAM) to the first two layers of the first U-Net encoder and integrate a new squeeze-and-excitation (SE) mechanism with residual connections in the second U-Net. To illustrate the performance of our proposed model, we conducted comprehensive experiments on widely recognized skin datasets. On the ISIC-2017 dataset, the IoU of our proposed model increased from 0.6406 to 0.6819 and the Dice coefficient increased from 0.7625 to 0.8023. On the ISIC-2018 dataset, the IoU of the proposed model improved from 0.7138 to 0.7709, while the Dice coefficient increased from 0.8285 to 0.8665. Furthermore, generalization experiments conducted on the jaw cyst dataset from Quzhou People's Hospital further verified the outstanding segmentation performance of the proposed model. These findings collectively affirm the potential of our approach as a valuable tool in supporting clinical decision-making in skin cancer detection, as well as advancing research in medical image analysis.
Keywords: Dual U-Net; skin lesion segmentation; squeeze-and-excitation; modified receptive field block; multi-path convolution block attention module
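The IoU and Dice coefficient quoted in the abstract above are overlap measures between a predicted mask and a ground-truth mask. The following is a minimal sketch of how they are usually computed for a single pair of binary masks; the function name `iou_and_dice` and the toy masks are illustrative assumptions, not taken from the paper.

```python
def iou_and_dice(pred, target):
    """Return (IoU, Dice) for two binary masks given as nested lists of 0/1."""
    inter = union = pred_sum = target_sum = 0
    for pred_row, target_row in zip(pred, target):
        for p, t in zip(pred_row, target_row):
            inter += p & t        # pixel set in both masks
            union += p | t        # pixel set in at least one mask
            pred_sum += p
            target_sum += t
    if union == 0:                # both masks empty: treat as a perfect match
        return 1.0, 1.0
    return inter / union, 2 * inter / (pred_sum + target_sum)


# Hypothetical 2 x 3 masks, for illustration only.
pred = [[1, 1, 0],
        [0, 1, 0]]
target = [[1, 0, 0],
          [0, 1, 1]]
iou, dice = iou_and_dice(pred, target)
print(iou, dice)
```

For a single pair of masks the two measures are tied by Dice = 2·IoU / (1 + IoU). That identity need not hold for dataset-level averages such as those reported in the abstract.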
ANC: Attention Network for COVID-19 Explainable Diagnosis Based on Convolutional Block Attention Module (Cited by: 10)
2
Authors: Yudong Zhang, Xin Zhang, Weiguo Zhu. Computer Modeling in Engineering & Sciences (SCIE, EI), 2021, No. 6, pp. 1037-1058 (22 pages).
Aim: To diagnose COVID-19 more efficiently and more accurately, this study proposed a novel attention network for COVID-19 (ANC). Methods: Two datasets were used in this study. An 18-way data augmentation was proposed to avoid overfitting. Then, the convolutional block attention module (CBAM) was integrated into our model, the structure of which was fine-tuned. Finally, Grad-CAM was used to provide an explainable diagnosis. Results: The accuracies of our ANC method on the two datasets are 96.32% ± 1.06% and 96.00% ± 1.03%, respectively. Conclusions: The proposed ANC method is superior to 9 state-of-the-art approaches.
Keywords: Deep learning; convolutional block attention module; attention mechanism; COVID-19; explainable diagnosis
MobileNet network optimization based on convolutional block attention module (Cited by: 3)
3
Authors: ZHAO Shuxu, MEN Shiyao, YUAN Lin. Journal of Measurement Science and Instrumentation (CAS, CSCD), 2022, No. 2, pp. 225-234 (10 pages).
Deep learning technology is widely used in computer vision. Generally, a large amount of data is used to train the model weights in deep learning, so as to obtain a model with higher accuracy. However, massive data and complex model structures require more computing resources. Since people generally can only carry and use mobile and portable devices in application scenarios, neural networks face limits on computing resources, size, and power consumption. Therefore, the efficient lightweight model MobileNet is used as the base network for optimization in this study. First, the accuracy of the MobileNet model is improved by adding methods such as the convolutional block attention module (CBAM) and expansion (dilated) convolution. Then, the MobileNet model is compressed using pruning and weight quantization algorithms based on weight magnitude. Afterwards, methods such as Python crawlers and data augmentation are employed to create a garbage classification dataset. Based on the above model optimization strategy, a garbage classification mobile application is deployed on mobile phones and Raspberry Pi devices, making garbage classification more convenient.
Keywords: MobileNet; convolutional block attention module (CBAM); model pruning and quantization; edge machine learning
Traffic Sign Recognition for Autonomous Vehicle Using Optimized YOLOv7 and Convolutional Block Attention Module (Cited by: 2)
4
Authors: P. Kuppusamy, M. Sanjay, P. V. Deepashree, C. Iwendi. Computers, Materials & Continua (SCIE, EI), 2023, No. 10, pp. 445-466 (22 pages).
The infrastructure and construction of roads are crucial for the economic and social development of a region, but traffic-related challenges like accidents and congestion persist. Artificial Intelligence (AI) and Machine Learning (ML) have been used in road infrastructure and construction, particularly with Internet of Things (IoT) devices. Object detection in computer vision also plays a key role in improving road infrastructure and addressing traffic-related problems. This study aims to use You Only Look Once version 7 (YOLOv7), which it describes as the most optimized object-detection algorithm, together with the Convolutional Block Attention Module (CBAM) to detect and identify traffic signs, and to analyze effective combinations of optimizers such as Adaptive Moment Estimation (Adam), Root Mean Squared Propagation (RMSprop), and Stochastic Gradient Descent (SGD) with YOLOv7. Using a portion of the German traffic sign data for training, the study investigates the feasibility of adopting smaller datasets while maintaining high accuracy. The model proposed in this study not only improves traffic safety by detecting traffic signs but also has the potential to contribute to the rapid development of autonomous vehicle systems. The results showed an accuracy of 99.7% when using a batch size of 8 and the Adam optimizer. This high level of accuracy demonstrates the effectiveness of the proposed model for the image classification task of traffic sign recognition.
Keywords: Object detection; traffic sign detection; YOLOv7; convolutional block attention module; road sign detection; ADAM
Attention-Augmented YOLOv8 with Ghost Convolution for Real-Time Vehicle Detection in Intelligent Transportation Systems
5
Authors: Syed Sajid Ullah, Muhammad Zunair Zamir, Ahsan Ishfaq, Salman Khan. Journal on Artificial Intelligence, 2025, No. 1, pp. 255-274 (20 pages).
Accurate vehicle detection is essential for autonomous driving, traffic monitoring, and intelligent transportation systems. This paper presents an enhanced YOLOv8n model that incorporates the Ghost Module, the Convolutional Block Attention Module (CBAM), and Deformable Convolutional Networks v2 (DCNv2). The Ghost Module streamlines feature generation to reduce redundancy, CBAM applies channel and spatial attention to improve feature focus, and DCNv2 enables adaptability to geometric variations in vehicle shapes. These components work together to improve both accuracy and computational efficiency. Evaluated on the KITTI dataset, the proposed model achieves 95.4% mAP@0.5, an 8.97% gain over standard YOLOv8n, along with 96.2% precision, 93.7% recall, and a 94.93% F1-score. Comparative analysis with seven state-of-the-art detectors demonstrates consistent superiority in key performance metrics. An ablation study is also conducted to quantify the individual and combined contributions of the Ghost Module, CBAM, and DCNv2, highlighting their effectiveness in improving detection performance. By addressing feature redundancy, attention refinement, and spatial adaptability, the proposed model offers a robust and scalable solution for vehicle detection across diverse traffic scenarios.
Keywords: YOLOv8n; vehicle detection; deformable convolutional networks (DCNv2); ghost module; convolutional block attention module (CBAM); attention mechanisms
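The F1-score in the abstract above is the harmonic mean of precision and recall, so the three reported figures can be cross-checked. This is a small worked check that uses only the precision and recall stated in the abstract; the helper name `f1_score` is an illustrative choice.

```python
def f1_score(precision, recall):
    """Harmonic mean of precision and recall."""
    return 2 * precision * recall / (precision + recall)


# Precision and recall as reported in the abstract (96.2% and 93.7%).
f1 = f1_score(0.962, 0.937)
print(f"{f1:.4f}")  # 0.9493
```

The result agrees with the reported 94.93% F1-score.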
Transmission Facility Detection with Feature-Attention Multi-Scale Robustness Network and Generative Adversarial Network
6
Authors: Yunho Na, Munsu Jeon, Seungmin Joo, Junsoo Kim, Ki-Yong Oh, Min Ku Kim, Joon-Young Park. Computer Modeling in Engineering & Sciences, 2025, No. 7, pp. 1013-1044 (32 pages).
This paper proposes an automated detection framework for transmission facilities using a feature-attention multi-scale robustness network (FAMSR-Net) with high-fidelity virtual images. The proposed framework exhibits three key characteristics. First, virtual images of the transmission facilities generated using StyleGAN2-ADA are co-trained with real images. This enables the neural network to learn various features of transmission facilities and improves detection performance. Second, the convolutional block attention module is deployed in FAMSR-Net to effectively extract features from images and construct multi-dimensional feature maps, enabling the neural network to perform precise object detection in various environments. Third, an effective bounding box optimization method called Scylla-IoU is deployed in FAMSR-Net, considering the intersection over union, center point distance, angle, and shape of the bounding box. This enables accurate detection of power facilities of various sizes. Extensive experiments demonstrated that FAMSR-Net outperforms other neural networks in detecting power facilities. FAMSR-Net also achieved the highest detection accuracy when virtual images of the transmission facilities were co-trained in the training phase. The proposed framework is effective for the scheduled operation and maintenance of transmission facilities because an optical camera is currently the most promising tool for unmanned aerial vehicles. This ultimately contributes to improved inspection efficiency, reduced maintenance risks, and more reliable power delivery across extensive transmission facilities.
Keywords: Object detection; virtual image; transmission facility; convolutional block attention module; Scylla-IoU
F-Net: breast cancerous lesion region segmentation based on improved U-Net
7
Authors: DENG Xiangyu, PAN Lihao, DANG Zhiyan. Optoelectronics Letters, 2025, No. 12, pp. 761-768 (8 pages).
To address the challenge of breast cancer region segmentation, we improved the U-Net. The convolutional block attention module with prioritized attention (CBAM-PA) and dilated transformer (Dformer) modules were designed to replace the convolutional layers on the encoding side of the base U-Net, the input logic of the U-Net was improved by dynamically adjusting the input size of each layer, and the short connections in the U-Net were replaced with cross-layer connections to enhance the image restoration capability on the decoding side. On the breast ultrasound images (BUSI) dataset, we obtain a Dice coefficient of 0.8031 and an intersection-over-union (IoU) value of 0.7362. The experimental results show that the proposed enhancement method effectively improves the accuracy and quality of breast cancer lesion region segmentation.
Keywords: input logic; cross-layer connections; short connections; breast cancer; image restoration capability; convolutional block attention module; breast cancer region segmentation; convolutional layers
Marine organism classification method based on hierarchical multi-scale attention mechanism
8
Authors: XU Haotian, CHENG Yuanzhi, ZHAO Dong, XIE Peidong. Optoelectronics Letters, 2025, No. 6, pp. 354-361 (8 pages).
We propose a model based on a hierarchical multi-scale attention mechanism in response to the low accuracy of existing oceanic biological image classification methods and the inefficiency of manual classification. Firstly, the hierarchical efficient multi-scale attention (H-EMA) module is designed for lightweight feature extraction, achieving outstanding performance at a relatively low cost. Secondly, an improved EfficientNetV2 block is used to better integrate information from different scales and enhance inter-layer message passing. Furthermore, introducing the convolutional block attention module (CBAM) enhances the model's perception of critical features, optimizing its generalization ability. Lastly, Focal Loss is introduced to adjust the weights of hard-to-classify samples to address the issue of imbalanced categories in the dataset, further improving the model's performance. The model achieved 96.11% accuracy on the intertidal marine organism dataset of Nanji Islands and 84.78% accuracy on the CIFAR-100 dataset, demonstrating its strong generalization ability to meet the demands of oceanic biological image classification.
Keywords: integrate information; different scales; hierarchical multi-scale attention; lightweight feature extraction; focal loss; EfficientNetV2; marine organism classification; oceanic biological image classification methods; convolutional block attention module
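The abstract above uses Focal Loss to re-weight samples under class imbalance. Below is a minimal sketch of the standard single-sample form, FL(p_t) = -α (1 - p_t)^γ log(p_t); the default values `gamma=2.0` and `alpha=1.0` are common illustrative choices and are not stated in the abstract.

```python
import math


def focal_loss(p_true, gamma=2.0, alpha=1.0):
    """Focal loss for one sample; p_true is the predicted probability of the true class."""
    return -alpha * (1.0 - p_true) ** gamma * math.log(p_true)


easy = focal_loss(0.9)                # confidently correct sample
hard = focal_loss(0.1)                # badly misclassified sample
ce_easy = focal_loss(0.9, gamma=0.0)  # gamma = 0 reduces to plain cross-entropy

print(easy, hard, ce_easy)
```

With γ = 2 the easy sample's loss is scaled by (1 - 0.9)² ≈ 0.01 relative to cross-entropy, so training is dominated by the harder samples.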
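Lithology identification method for rock images based on a lightweight convolutional neural network
9
Authors: 刘善伟, 马志伟, 魏世清, 魏忠勇. 《地质科技通报》 (PKU Core), 2026, No. 1, pp. 360-370 (11 pages).
Lithology identification is an important step in oil and gas exploration and development, and it provides important guidance for locating exploration targets, evaluating reservoirs, and building reservoir models. Traditional manual lithology identification is time-consuming and labor-intensive, while classical deep learning models achieve high accuracy but have large parameter counts. To improve identification accuracy while reducing the number of parameters, so that the model suits real-time lithology identification, the authors first collected 3016 rock images covering 8 rock types, including dolomite and sandstone, to build a lithology identification dataset. They then took the lightweight convolutional neural network ShuffleNetV2 as the base network and proposed the Rock-ShuffleNetV2 lithology identification model (RSHFNet). The model integrates the convolutional block attention module (CBAM), a hybrid attention mechanism, and a multi-scale feature fusion module (MSF) into the base network to strengthen feature extraction and improve identification performance, and it optimizes the number of stacked ShuffleNetV2 units to reduce the parameter count. The results show that RSHFNet reaches an accuracy of 87.21%, 4.98% higher than the base model; the number of parameters and the floating-point operations fall to 869702 and 0.93×10^8, which are 0.67 and 0.63 times those of the base model, respectively; and the overall performance of RSHFNet is clearly better than that of existing convolutional neural networks. The RSHFNet model offers high identification accuracy and good generalization while being more lightweight, providing a new approach to real-time lithology identification in the field.
Keywords: lithology identification; ShuffleNetV2 network; hybrid attention mechanism module; multi-scale feature fusion module; convolutional neural network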
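Improved ACGAN fault diagnosis method based on GRU and convolutional attention
10
Authors: 彭朝琴, 李奇聪, 张海尼, 吴红, 马云鹏. 《航空学报》 (PKU Core), 2026, No. 2, pp. 318-332 (15 pages).
Because electromechanical servo systems (EMA) have few fault data samples in practical applications, the classification performance of fault diagnosis methods suffers. To address fault diagnosis of electromechanical servo systems when fault data are scarce, an improved auxiliary classifier generative adversarial network (ACGAN) fault diagnosis method based on gated recurrent units (GRU) and convolutional attention is designed, which can stably generate high-quality data for each fault class. First, the Wasserstein distance and a gradient penalty are introduced into ACGAN to optimize the loss function and improve the stability of adversarial training. Second, GRU and the convolutional block attention module (CBAM) are added to the generator and discriminator to strengthen the extraction of key features and temporal features, overcoming the limitations of convolutional networks on time-series data and improving the quality of generated samples. Finally, the classifier and discriminator share network parameters, and the classifier is fine-tuned on a balanced dataset to further improve diagnostic performance. Using a purpose-built EMA test bench, an imbalanced experimental dataset consisting of a large amount of normal data and a small amount of fault data was obtained; comparison and ablation experiments verified the effectiveness and superiority of the proposed method.
Keywords: electromechanical servo system; gated recurrent unit; convolutional attention module; fault diagnosis; auxiliary classifier generative adversarial network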
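Research on a high-speed railway track defect detection model based on improved YOLOv8n
11
Authors: 张瑜, 任亚雪, 李龙凯, 王孟昭. 《无线互联科技》, 2026, No. 4, pp. 34-38 (5 pages).
To address the challenges in intelligent detection of high-speed railway track defects, namely the difficulty of balancing accuracy and speed, a high miss rate for small targets, and interference from complex backgrounds, this article proposes a detection model based on an improved YOLOv8n. The model introduces the Convolutional Block Attention Module (CBAM) into the backbone network, using channel and spatial attention to strengthen the focus on key defect features and suppress background interference. It also replaces the original Spatial Pyramid Pooling-Fast (SPPF) structure with the Spatial Pyramid Pooling Fast-Cross Stage Partial Channel (SPPFCSPC) module, which uses parallel multi-scale pooling and a split-and-merge feature fusion mechanism to reduce computational complexity while improving gradient flow, enhancing feature fusion for small-scale defects. Experimental results show that the improved model reaches an mAP@0.5 of 0.524 on the test set, 24.2% higher than the original YOLOv8n, with a recall of 0.487 and an inference speed of 92 fps. It achieves a good balance among precision, recall, and real-time performance and offers an effective technical solution for intelligent operation and maintenance of high-speed railways.
Keywords: high-speed railway track defect detection; YOLOv8n; attention mechanism; convolutional block attention module; deep learning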
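The abstract above relies on CBAM's channel and spatial attention. As a rough illustration of the channel half only, the sketch below pools each channel by average and by maximum, passes both descriptors through a shared two-layer MLP, and gates the channels with a sigmoid. The weights `w1` and `w2`, the toy feature map, and all function names are hypothetical; the spatial-attention half and any trained parameters are omitted.

```python
import math


def sigmoid(x):
    return 1.0 / (1.0 + math.exp(-x))


def mlp(vec, w1, w2):
    """Shared two-layer MLP without biases: ReLU(w1 @ vec), then w2 @ hidden."""
    hidden = [max(0.0, sum(w * v for w, v in zip(row, vec))) for row in w1]
    return [sum(w * h for w, h in zip(row, hidden)) for row in w2]


def channel_attention(feature_map, w1, w2):
    """Scale each channel of a C x H x W feature map by a CBAM-style channel weight."""
    flat = [[v for row in channel for v in row] for channel in feature_map]
    avg_desc = [sum(vals) / len(vals) for vals in flat]   # global average pooling
    max_desc = [max(vals) for vals in flat]               # global max pooling
    logits = [a + m for a, m in zip(mlp(avg_desc, w1, w2), mlp(max_desc, w1, w2))]
    weights = [sigmoid(z) for z in logits]
    scaled = [[[v * w for v in row] for row in channel]
              for channel, w in zip(feature_map, weights)]
    return scaled, weights


# Hypothetical 2-channel, 2 x 2 feature map and hand-picked MLP weights.
feature_map = [[[1.0, 2.0], [3.0, 4.0]],
               [[0.0, 0.0], [0.0, 1.0]]]
w1 = [[1.0, -1.0]]        # 2 channels -> 1 hidden unit
w2 = [[1.0], [-1.0]]      # 1 hidden unit -> 2 channels
scaled, weights = channel_attention(feature_map, w1, w2)
print(weights)
```

In a real network the MLP weights are learned, and the channel-refined map is then passed to a spatial attention step that this sketch leaves out.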
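Lightweight emitter identification based on one-dimensional depthwise separable convolution
12
Authors: 孙文鑫, 孟华, 杨佳煌, 周礼亮. 《太赫兹科学与电子信息学报》, 2026, No. 1, pp. 89-97 (9 pages).
In specific emitter identification based on deep neural networks, network depth keeps increasing to achieve good identification performance, causing the number of model parameters and the computational complexity to grow explosively and making deployment on compute-constrained edge devices difficult. To this end, this paper proposes a network architecture (ODCNet) based on one-dimensional depthwise separable convolution and a one-dimensional convolutional block attention module. By combining depthwise convolution and pointwise convolution, the one-dimensional depthwise separable convolution effectively reduces the number of parameters and the computational complexity; the lightweight one-dimensional convolutional block attention module effectively improves model performance and safeguards identification capability. Experiments show that ODCNet's identification performance is comparable to that of MobileNet V3, while its parameter count is only 11.27% of MobileNet V3's, its computational complexity is 17.49% of MobileNet V3's, and its inference time is reduced to 50% of MobileNet V3's.
Keywords: emitter identification; model lightweighting; depthwise separable convolution; convolutional block attention module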
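The parameter saving from combining depthwise and pointwise convolution, as described in the abstract above, can be illustrated with a simple weight count. The layer sizes below (64 input channels, 128 output channels, kernel size 7) are hypothetical and unrelated to ODCNet's actual configuration; biases are ignored.

```python
def conv1d_params(c_in, c_out, kernel_size):
    """Weights in a standard 1D convolution (bias ignored)."""
    return c_in * c_out * kernel_size


def depthwise_separable_conv1d_params(c_in, c_out, kernel_size):
    """Weights in a depthwise + pointwise 1D convolution pair (bias ignored)."""
    depthwise = c_in * kernel_size   # one k-tap filter per input channel
    pointwise = c_in * c_out         # kernel-size-1 convolution that mixes channels
    return depthwise + pointwise


standard = conv1d_params(64, 128, 7)                        # 57344
separable = depthwise_separable_conv1d_params(64, 128, 7)   # 448 + 8192 = 8640
print(standard, separable, separable / standard)
```

The ratio equals 1/c_out + 1/kernel_size, roughly 0.15 in this example, which is why the saving grows with wider layers and longer kernels.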
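CAM-ResNet: remote sensing image classification of land use types based on ResNet
13
Authors: 王梓鉴, 方建军, 袁涌博, 刘泽宇. 《北京联合大学学报》, 2026, No. 1, pp. 49-54 (6 pages).
In land use type image classification, traditional methods are limited in feature extraction accuracy and classification precision and struggle to meet practical needs. Taking ResNet50 as the core architecture, this paper introduces the convolutional block attention module (CBAM) to strengthen feature extraction and adopts automatic mixed precision (AMP) to improve computational efficiency, building the CAM-ResNet network. Experimental results show that the overall accuracy of CAM-ResNet reaches 98.19%, 10.16 percentage points higher than the original network. Ablation experiments further show that the CBAM attention mechanism significantly strengthens the model's feature extraction and that AMP training speeds up convergence. CAM-ResNet shows effectiveness and superiority in remote sensing image classification of land use types.
Keywords: land use; ResNet50; convolutional block attention module (CBAM); automatic mixed precision (AMP); image classification; convolutional neural network (CNN)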
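AD MRI image classification method based on CBAM enhancement and multi-scale feature fusion
14
Authors: 韦建武, 王宋, 任建禹, 肖叶子, 邵梅, 韩刚. 《西安邮电大学学报》, 2026, No. 2, pp. 109-117 (9 pages).
To address insufficient global feature modeling, underuse of multi-scale pathological information, and class imbalance in existing deep-learning-based methods for classifying high-resolution magnetic resonance imaging (MRI) images of Alzheimer's disease (AD), an improved densely connected network (DenseNet) model that combines the convolutional block attention module with multi-scale transition layers is proposed. Specifically, under the DenseNet121 architecture, a Convolutional Block Attention Module (CBAM) is integrated at the end of each dense block to focus on pathological changes in key brain regions; a multi-scale Transition layer is designed to better fuse and preserve multi-scale pathological information during downsampling; and the focal loss function is used to mitigate class imbalance. Experiments on the OASIS-1 dataset show a classification accuracy of 90.91%, an F1 score of 92.12%, and a recall of 94.00%, significantly better than MobileNetV2, VGG16, and the conventional DenseNet model. The model can improve the accuracy of AD staging diagnosis, performs especially well in reducing the missed-diagnosis rate for dementia and in identifying mild dementia (the golden intervention window), can support early clinical intervention, and has potential for clinical translation.
Keywords: Alzheimer's disease; magnetic resonance imaging image classification; multi-scale feature fusion; densely connected network; deep learning; convolutional block attention module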
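Semantic segmentation of UAV orthophotos based on an improved U-Net3+ model
15
Authors: 姜磊, 梁聪, 赵旭, 王鹏, 闫文凯, 杨宏鼎, 吴继忠. 《测绘通报》 (PKU Core), 2026, No. 2, pp. 137-143 (7 pages).
To address the insufficient level of feature abstraction and the redundancy of cross-scale features when U-Net3+ is applied to semantic segmentation of UAV orthophotos, this paper proposes an improved U-Net3+ model. The improved model adopts ResNet50, a deep convolutional neural network based on the residual network architecture, as the feature extraction backbone, and introduces the convolutional block attention module (CBAM) as a lightweight attention mechanism. Experimental results show that the overall accuracy, mean intersection over union, and F1 score of the improved U-Net3+ are 8.3%, 2.6%, and 1.9% higher than those of the original U-Net3+, respectively, and better than those of mainstream semantic segmentation models including FCN, U-Net, U-Net++, and the DeepLab series; the improved model shows stronger feature discrimination and higher accuracy in typical scenes. Introducing only ResNet50 or only CBAM does not achieve the best results, whereas combining ResNet50 and CBAM significantly strengthens recognition in complex scenes. The segmentation accuracy of the improved U-Net3+ is clearly better, offering an effective technical solution for semantic segmentation of UAV orthophotos.
Keywords: UAV orthophoto; semantic segmentation; U-Net3+; ResNet50; convolutional attention module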
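Pedestrian abnormal behavior detection based on a multi-scale enhanced generative adversarial network
16
Authors: 冯爽爽, 范莎, 邓超. 《计算机工程与应用》 (PKU Core), 2026, No. 4, pp. 190-200 (11 pages).
To address sample imbalance, large variations in target scale, and background interference in pedestrian abnormal behavior detection, a detection method based on a multi-scale enhanced generative adversarial network is proposed. An improved U-Net serves as the generator backbone, and the convolutional block attention module is embedded in the generator's decoding process to reduce background interference and to extract and reconstruct features of the detection targets. A multi-scale feature fusion enhancement module is introduced into the generator's skip connections to refine the representation of different targets, and a multi-scale memory feature enhancement module, designed in combination with a memory module, improves the reconstruction of normal behavior. The loss function is further improved and combined with an anomaly score to detect abnormal pedestrian behavior. On the UCSD Ped2, CUHK Avenue, and ShanghaiTech datasets, the algorithm achieves AUCs of 98.6%, 88.7%, and 77.3%, respectively, showing good detection performance for abnormal pedestrian behavior across different scenes.
Keywords: pedestrian abnormal behavior detection; generative adversarial network; convolutional block attention module; multi-scale enhancement module; anomaly score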
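Multi-step channel prediction with a pseudo-3D convolutional attention network
17
Authors: 陶静, 侯萌, 彭薇, 张国彦, 戴佳明, 刘卫明, 王海东, 王臻. 《电子与信息学报》 (PKU Core), 2026, No. 1, pp. 394-403 (10 pages).
Existing channel prediction for massive MIMO mostly assumes wide-sense stationarity and mostly uses single-step prediction. In non-stationary scenarios, single-step results easily become invalid, and frequent iteration significantly raises pilot overhead. To this end, this paper builds a joint time-frequency multi-step prediction framework that combines pseudo-3D convolution (P3D) with an attention module. The scheme replaces 3D convolution with pseudo-3D convolution to extract features of channel state information (CSI) efficiently in the time and frequency domains, and adds hybrid channel and spatial attention (CBAM) to strengthen the network's representation of global dependencies and thereby improve prediction accuracy. Experiments on measured channels show that the method has a clear advantage in multi-step prediction. In addition, using a transfer learning approach, the method is extended smoothly from single-antenna to multi-antenna scenarios.
Keywords: massive MIMO; multi-step channel state information prediction; pseudo-3D convolution; hybrid attention; joint time-frequency features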
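Classification method for high-dimensional imbalanced data based on an attention neural network
18
Authors: 董嘉轩, 俞卫琴. 《兰州文理学院学报(自然科学版)》, 2026, No. 1, pp. 38-43, 55 (7 pages).
To solve the classification of high-dimensional imbalanced data, which is common in big data, a classification method combining an attention neural network and a support vector machine is proposed. First, the convolutional block attention module is introduced into a convolutional neural network for dimensionality reduction, embedding the high-dimensional data into a low-dimensional space, with focal loss used to optimize feature extraction. Then, on the reduced dataset, hybrid sampling that combines synthetic minority oversampling with random undersampling yields a balanced dataset, which is classified with a support vector machine. Experimental results show that the method improves recall, precision, and F1 score on 4 high-dimensional image datasets.
Keywords: convolutional neural network; convolutional block attention module; focal loss; support vector machine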
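Intelligent fire image detection based on a modular AI module
19
Authors: 杨涛, 汪友杰, 王伟. 《齐鲁工业大学学报》, 2026, No. 1, pp. 57-64 (8 pages).
Fire monitoring is crucial to reducing loss of life and property, but traditional methods lack real-time performance and accuracy in complex environments. This paper proposes a lightweight fire image detection algorithm based on an improved YOLOv5s and uses edge computing to optimize the monitoring system. The convolutional block attention module (CBAM) is introduced to strengthen feature learning, atrous spatial pyramid pooling (ASPP) is adopted to enlarge the model's receptive field, and the EIoU loss is used to speed up convergence and improve regression accuracy. Experiments show that the improved model raises the fire recognition rate to 94%, with precision and recall of 94.2% and 92.4%, respectively. Deploying the system on a modular AI module that processes video data directly avoids cloud transmission latency and significantly improves real-time detection. The method offers an efficient solution for fire monitoring in complex scenes and is important for improving emergency response capability.
Keywords: edge computing; fire monitoring; neural network; convolutional block attention module; atrous spatial pyramid pooling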
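The EIoU loss mentioned in the abstract above is commonly defined as 1 - IoU plus three penalties: the squared center distance, the squared width difference, and the squared height difference, each normalized by the smallest enclosing box. The sketch below follows that common definition for two axis-aligned boxes with positive area; it is an illustrative assumption and not the paper's implementation.

```python
def eiou_loss(box, gt):
    """EIoU loss for two axis-aligned boxes given as (x1, y1, x2, y2)."""
    x1, y1, x2, y2 = box
    gx1, gy1, gx2, gy2 = gt
    w, h = x2 - x1, y2 - y1
    gw, gh = gx2 - gx1, gy2 - gy1

    # Intersection over union.
    inter_w = max(0.0, min(x2, gx2) - max(x1, gx1))
    inter_h = max(0.0, min(y2, gy2) - max(y1, gy1))
    inter = inter_w * inter_h
    iou = inter / (w * h + gw * gh - inter)

    # Width and height of the smallest box enclosing both boxes.
    cw = max(x2, gx2) - min(x1, gx1)
    ch = max(y2, gy2) - min(y1, gy1)

    # Squared distance between the two box centers.
    center_dist2 = ((x1 + x2 - gx1 - gx2) / 2) ** 2 + ((y1 + y2 - gy1 - gy2) / 2) ** 2

    return (1.0 - iou
            + center_dist2 / (cw ** 2 + ch ** 2)   # center-distance penalty
            + (w - gw) ** 2 / cw ** 2              # width penalty
            + (h - gh) ** 2 / ch ** 2)             # height penalty


# Hypothetical boxes: same size, shifted by one unit in x and y.
loss = eiou_loss((0.0, 0.0, 2.0, 2.0), (1.0, 1.0, 3.0, 3.0))
print(loss)
```

Identical boxes give a loss of 0, and the width and height terms penalize size mismatch directly, which is the property usually credited with faster box regression.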
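Fault line selection method for distribution networks based on BMF-GADF and an improved Swin Transformer
20
Authors: 吴小欢, 沈景贵, 张欣, 胡裕民, 徐烨玲, 石明玉. 《综合智慧能源》, 2026, No. 2, pp. 86-95 (10 pages).
Because fault features are weak when a single-phase-to-ground fault occurs in a small-current distribution system, existing fault line selection methods suffer from low accuracy and weak robustness. To this end, a distribution network fault line selection method based on Butterworth mean filtering-Gramian angular difference field (BMF-GADF) and an improved Swin Transformer is proposed. The method combines BMF with GADF to convert the zero-sequence current into feature-enhanced GADF images, and the image samples are fed into the improved Swin Transformer for feature extraction. The improved Swin Transformer adds a convolutional attention mechanism with parallel modules to the original architecture, enabling more accurate adaptive feature selection and improving model accuracy. A Softmax classifier then selects the faulty line. Test results show that the method reaches a line selection accuracy of 98.96% and, compared with other fault line selection methods, has higher accuracy and better noise robustness, providing a new solution for fault line selection in distribution networks.
Keywords: fault line selection; Gramian angular difference field; convolutional attention mechanism; sliding-window transformer; feature extraction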
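The abstract above converts a one-dimensional current signal into a Gramian angular difference field (GADF) image. A minimal sketch of the usual GADF construction follows: rescale the series to [-1, 1], map each value to an angle with arccos, and fill the matrix with sin(φ_i - φ_j). The Butterworth mean filtering step is omitted, and the input series is a made-up example that must not be constant.

```python
import math


def gadf(series):
    """Gramian angular difference field of a non-constant 1D series."""
    lo, hi = min(series), max(series)
    scaled = [2 * (x - lo) / (hi - lo) - 1 for x in series]    # rescale to [-1, 1]
    # Clamp before acos to guard against floating-point overshoot.
    phi = [math.acos(max(-1.0, min(1.0, s))) for s in scaled]
    return [[math.sin(a - b) for b in phi] for a in phi]


# Hypothetical 3-sample signal, for illustration only.
image = gadf([0.0, 1.0, 2.0])
for row in image:
    print([round(v, 3) for v in row])
```

The resulting matrix has a zero diagonal and is antisymmetric, so the sign of each entry records the direction of change between two time steps.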