期刊文献+
共找到1,268篇文章
< 1 2 64 >
每页显示 20 50 100
Human Activity Recognition Using a CNN with an Enhanced Convolutional Block Attention Module
1
作者 HU Biling TONG Yu 《Wuhan University Journal of Natural Sciences》 2026年第1期10-24,共15页
WiFi-based human activity recognition(HAR)provides a non-intrusive approach for ubiquitous monitoring;however,achieving both high accuracy and robustness simultaneously remains a significant challenge.This paper propo... WiFi-based human activity recognition(HAR)provides a non-intrusive approach for ubiquitous monitoring;however,achieving both high accuracy and robustness simultaneously remains a significant challenge.This paper proposes a Convolutional Neural Network with Enhanced Convolutional Block Attention Module(CNN-ECBAM)framework.The approach systematically converts raw Channel State Information(CSI)into pseudo-color images,effectively preserving essential signal characteristics for deep neural network processing.The core innovation is an Enhanced Convolutional Block Attention Module(ECBAM),tailored to CSI data characteristics,which integrates Efficient Channel Attention(ECA)and Multi-Scale Spatial Attention(MSSA).By employing learnable adaptive fusion weights,it achieves dynamic synergy between channel and spatial features,enabling the network to capture highly discriminative spatiotemporal patterns.The ECBAM module is integrated into a unified Convolutional Neural Network(CNN)to form the overall CNN-ECBAM model.Experimental results on the UT-HAR and NTU-Fi_HAR datasets demonstrate that CNN-ECBAM achieves competitive performance in recognition accuracy and outperforms mainstream baseline models.Specifically,it attains 99.20%accuracy on UT-HAR(surpassing ResNet-18 at 98.60%)and achieves 100%accuracy on NTU-Fi_HAR(exceeding GAF-CNN at 99.62%).These results validate the effectiveness of the proposed method for high-precision and reliable WiFi-based HAR. 展开更多
关键词 human activity recognition deep learning channel state information Enhanced convolutional Block Attention module(ECBAM) pseudo-color images
原文传递
Enhanced Cutaneous Melanoma Segmentation in Dermoscopic Images Using a Dual U-Net Framework with Multi-Path Convolution Block Attention Module and SE-Res-Conv
2
作者 Kun Lan Feiyang Gao +2 位作者 Xiaoliang Jiang Jianzhen Cheng Simon Fong 《Computers, Materials & Continua》 2025年第9期4805-4824,共20页
With the continuous development of artificial intelligence and machine learning techniques,there have been effective methods supporting the work of dermatologist in the field of skin cancer detection.However,object si... With the continuous development of artificial intelligence and machine learning techniques,there have been effective methods supporting the work of dermatologist in the field of skin cancer detection.However,object significant challenges have been presented in accurately segmenting melanomas in dermoscopic images due to the objects that could interfere human observations,such as bubbles and scales.To address these challenges,we propose a dual U-Net network framework for skin melanoma segmentation.In our proposed architecture,we introduce several innovative components that aim to enhance the performance and capabilities of the traditional U-Net.First,we establish a novel framework that links two simplified U-Nets,enabling more comprehensive information exchange and feature integration throughout the network.Second,after cascading the second U-Net,we introduce a skip connection between the decoder and encoder networks,and incorporate a modified receptive field block(MRFB),which is designed to capture multi-scale spatial information.Third,to further enhance the feature representation capabilities,we add a multi-path convolution block attention module(MCBAM)to the first two layers of the first U-Net encoding,and integrate a new squeeze-and-excitation(SE)mechanism with residual connections in the second U-Net.To illustrate the performance of our proposed model,we conducted comprehensive experiments on widely recognized skin datasets.On the ISIC-2017 dataset,the IoU value of our proposed model increased from 0.6406 to 0.6819 and the Dice coefficient increased from 0.7625 to 0.8023.On the ISIC-2018 dataset,the IoU value of proposed model also improved from 0.7138 to 0.7709,while the Dice coefficient increased from 0.8285 to 0.8665.Furthermore,the generalization experiments conducted on the jaw cyst dataset from Quzhou People’s Hospital further verified the outstanding segmentation performance of the proposed model.These findings collectively affirm the potential of our approach as a valuable tool in supporting clinical decision-making in the field of skin cancer detection,as well as advancing research in medical image analysis. 展开更多
关键词 Dual U-Net skin lesion segmentation squeeze-and-excitation modified receptive field block multi-path convolution block attention module
在线阅读 下载PDF
An attention module integrated hybrid model for recognizing microseismic signals induced by high-pressure grouting in deep rock layers
3
作者 Yongshu Zhang Lianchong Li +2 位作者 Wenqiang Mu Jian Chen Peng Chen 《International Journal of Mining Science and Technology》 2026年第3期595-613,共19页
Microseismic(MS)monitoring is an effective technique to detect mining-induced rock fractures.However,recognizing grouting-induced signals is challenging due to complex geological conditions in deep rock plates.Therefo... Microseismic(MS)monitoring is an effective technique to detect mining-induced rock fractures.However,recognizing grouting-induced signals is challenging due to complex geological conditions in deep rock plates.Therefore,a hybrid model(WM-ResNet50)integrating data enhancement,a deep convolutional neural network(CNN),and convolutional block attention modules(CBAM)was proposed.Firstly,an MS system was established at the Xieqiao coal mine in Anhui Province,China.MS waveforms and injection parameters were acquired during grouting.Secondly,signals were categorized based on time-frequency characteristics to build a dataset,which was divided into training,validation,and test sets at a ratio of 4:1:1.Subsequently,the performance of WM-ResNet50 was evaluated based on indices such as individual precision,total accuracy,recall,and loss function.The results indicated that WMResNet50 achieved an average recognition accuracy of 94.38%,surpassing that of a simple CNN(90.04%),ResNet18(91.72%),and ResNet50(92.48%).Finally,WM-ResNet50 was applied to monitor the whole process at laboratory tests and field cases.Both results affirmed the feasibility and effectiveness of MS inversion in predicting actual slurry diffusion ranges within deep rock layers.By comparison,it was revealed that the MS sources classified by WM-ResNet50 matched grouting records well.A solution to address insufficient diffusion under long-borehole grouting has been proposed.WM-ResNet50′s accuracy was validated through in-situ coring and XRD analysis for cement-based hydration products.This study provides a beneficial reference for similar rock signal processing and in-field grouting practices. 展开更多
关键词 Attention module convolutional neural network Microseismic ROCK Grouting-induced signals Slurry diffusion
在线阅读 下载PDF
An RMD-YOLOv11 Approach for Typical Defect Detection of PV Modules
4
作者 Tao Geng Shuaibing Li +3 位作者 Yunyun Yun Yongqiang Kang Hongwei Li unmin Zhu 《Computers, Materials & Continua》 2026年第3期1804-1822,共19页
In order to address the challenges posed by complex background interference,high miss-detection rates of micro-scale defects,and limited model deployment efficiency in photovoltaic(PV)module defect detection,this pape... In order to address the challenges posed by complex background interference,high miss-detection rates of micro-scale defects,and limited model deployment efficiency in photovoltaic(PV)module defect detection,this paper proposes an efficient detection framework based on an improved YOLOv11 architecture.First,a Re-parameterized Convolution(RepConv)module is integrated into the backbone to enhance the model’s sensitivity to fine-grained defects—such as micro-cracks and hot spots—while maintaining high inference efficiency.Second,a Multi-Scale Feature Fusion Convolutional Block Attention Mechanism(MSFF-CBAM)is designed to guide the network toward critical defect regions by jointly modeling channel-wise and spatial attention.This mechanism effectively strengthens the specificity and robustness of feature representations.Third,a lightweight Dynamic Sampling Module(DySample)is employed to replace conventional upsampling operations,thereby improving the localization accuracy of small-scale defect targets.Experimental evaluations conducted on the PVEL-AD dataset demonstrate that the proposed RMDYOLOv11 model surpasses the baseline YOLOv11 in terms of mean Average Precision(mAP)@0.5,Precision,and Recall,achieving respective improvements of 4.70%,1.51%,and 5.50%.The model also exhibits notable advantages in inference speed and model compactness.Further validation on the ELPV dataset confirms the model’s generalization capability,showing respective performance gains of 1.99%,2.28%,and 1.45%across the same metrics.Overall,the enhanced model significantly improves the accuracy of micro-defect identification on PV module surfaces,effectively reducing both false negatives and false positives.This advancement provides a robust and reliable technical foundation for automated PV module defect detection. 展开更多
关键词 Photovoltaic(PV)modules YOLOv11 re-parameterization convolution attention mechanism dynamic upsampling
在线阅读 下载PDF
ANC: Attention Network for COVID-19 Explainable Diagnosis Based on Convolutional Block Attention Module 被引量:10
5
作者 Yudong Zhang Xin Zhang Weiguo Zhu 《Computer Modeling in Engineering & Sciences》 SCIE EI 2021年第6期1037-1058,共22页
Aim: To diagnose COVID-19 more efficiently and more correctly, this study proposed a novel attention network forCOVID-19 (ANC). Methods: Two datasets were used in this study. An 18-way data augmentation was proposed t... Aim: To diagnose COVID-19 more efficiently and more correctly, this study proposed a novel attention network forCOVID-19 (ANC). Methods: Two datasets were used in this study. An 18-way data augmentation was proposed toavoid overfitting. Then, convolutional block attention module (CBAM) was integrated to our model, the structureof which is fine-tuned. Finally, Grad-CAM was used to provide an explainable diagnosis. Results: The accuracyof our ANC methods on two datasets are 96.32% ± 1.06%, and 96.00% ± 1.03%, respectively. Conclusions: Thisproposed ANC method is superior to 9 state-of-the-art approaches. 展开更多
关键词 Deep learning convolutional block attention module attention mechanism COVID-19 explainable diagnosis
在线阅读 下载PDF
MobileNet network optimization based on convolutional block attention module 被引量:3
6
作者 ZHAO Shuxu MEN Shiyao YUAN Lin 《Journal of Measurement Science and Instrumentation》 CAS CSCD 2022年第2期225-234,共10页
Deep learning technology is widely used in computer vision.Generally,a large amount of data is used to train the model weights in deep learning,so as to obtain a model with higher accuracy.However,massive data and com... Deep learning technology is widely used in computer vision.Generally,a large amount of data is used to train the model weights in deep learning,so as to obtain a model with higher accuracy.However,massive data and complex model structures require more calculating resources.Since people generally can only carry and use mobile and portable devices in application scenarios,neural networks have limitations in terms of calculating resources,size and power consumption.Therefore,the efficient lightweight model MobileNet is used as the basic network in this study for optimization.First,the accuracy of the MobileNet model is improved by adding methods such as the convolutional block attention module(CBAM)and expansion convolution.Then,the MobileNet model is compressed by using pruning and weight quantization algorithms based on weight size.Afterwards,methods such as Python crawlers and data augmentation are employed to create a garbage classification data set.Based on the above model optimization strategy,the garbage classification mobile terminal application is deployed on mobile phones and raspberry pies,realizing completing the garbage classification task more conveniently. 展开更多
关键词 MobileNet convolutional block attention module(CBAM) model pruning and quantization edge machine learning
在线阅读 下载PDF
Traffic Sign Recognition for Autonomous Vehicle Using Optimized YOLOv7 and Convolutional Block Attention Module 被引量:2
7
作者 P.Kuppusamy M.Sanjay +1 位作者 P.V.Deepashree C.Iwendi 《Computers, Materials & Continua》 SCIE EI 2023年第10期445-466,共22页
The infrastructure and construction of roads are crucial for the economic and social development of a region,but traffic-related challenges like accidents and congestion persist.Artificial Intelligence(AI)and Machine ... The infrastructure and construction of roads are crucial for the economic and social development of a region,but traffic-related challenges like accidents and congestion persist.Artificial Intelligence(AI)and Machine Learning(ML)have been used in road infrastructure and construction,particularly with the Internet of Things(IoT)devices.Object detection in Computer Vision also plays a key role in improving road infrastructure and addressing trafficrelated problems.This study aims to use You Only Look Once version 7(YOLOv7),Convolutional Block Attention Module(CBAM),the most optimized object-detection algorithm,to detect and identify traffic signs,and analyze effective combinations of adaptive optimizers like Adaptive Moment estimation(Adam),Root Mean Squared Propagation(RMSprop)and Stochastic Gradient Descent(SGD)with the YOLOv7.Using a portion of German traffic signs for training,the study investigates the feasibility of adopting smaller datasets while maintaining high accuracy.The model proposed in this study not only improves traffic safety by detecting traffic signs but also has the potential to contribute to the rapid development of autonomous vehicle systems.The study results showed an impressive accuracy of 99.7%when using a batch size of 8 and the Adam optimizer.This high level of accuracy demonstrates the effectiveness of the proposed model for the image classification task of traffic sign recognition. 展开更多
关键词 Object detection traffic sign detection YOLOv7 convolutional block attention module road sign detection ADAM
在线阅读 下载PDF
Yetter-Drinfel'd Module and Convolution Module
8
作者 张良云 王栓宏 《Northeastern Mathematical Journal》 CSCD 2002年第1期13-18,共6页
In this paper, we first give a sufficient and necessary condition for a Hopf algebra to be a Yetter-Drinfel'd module, and prove that the finite dual of a Yetter-Drinfel'd module is still a Yetter-Drinfel'd... In this paper, we first give a sufficient and necessary condition for a Hopf algebra to be a Yetter-Drinfel'd module, and prove that the finite dual of a Yetter-Drinfel'd module is still a Yetter-Drinfel'd module. Finally, we introduce a concept of convolution module. 展开更多
关键词 braided Hopf algebra convolution algebra convolution module Yetter-Drinfel'd module
在线阅读 下载PDF
A Convolutional and Transformer Based Deep Neural Network for Automatic Modulation Classification 被引量:6
9
作者 Shanchuan Ying Sai Huang +3 位作者 Shuo Chang Zheng Yang Zhiyong Feng Ningyan Guo 《China Communications》 SCIE CSCD 2023年第5期135-147,共13页
Automatic modulation classification(AMC)aims at identifying the modulation of the received signals,which is a significant approach to identifying the target in military and civil applications.In this paper,a novel dat... Automatic modulation classification(AMC)aims at identifying the modulation of the received signals,which is a significant approach to identifying the target in military and civil applications.In this paper,a novel data-driven framework named convolutional and transformer-based deep neural network(CTDNN)is proposed to improve the classification performance.CTDNN can be divided into four modules,i.e.,convolutional neural network(CNN)backbone,transition module,transformer module,and final classifier.In the CNN backbone,a wide and deep convolution structure is designed,which consists of 1×15 convolution kernels and intensive cross-layer connections instead of traditional 1×3 kernels and sequential connections.In the transition module,a 1×1 convolution layer is utilized to compress the channels of the previous multi-scale CNN features.In the transformer module,three self-attention layers are designed for extracting global features and generating the classification vector.In the classifier,the final decision is made based on the maximum a posterior probability.Extensive simulations are conducted,and the result shows that our proposed CTDNN can achieve superior classification performance than traditional deep models. 展开更多
关键词 automatic modulation classification deep neural network convolutional neural network TRANSFORMER
在线阅读 下载PDF
Attention-Augmented YOLOv8 with Ghost Convolution for Real-Time Vehicle Detection in Intelligent Transportation Systems
10
作者 Syed Sajid Ullah Muhammad Zunair Zamir +1 位作者 Ahsan Ishfaq Salman Khan 《Journal on Artificial Intelligence》 2025年第1期255-274,共20页
Accurate vehicle detection is essential for autonomous driving,traffic monitoring,and intelligent transportation systems.This paper presents an enhanced YOLOv8n model that incorporates the Ghost Module,Convolutional B... Accurate vehicle detection is essential for autonomous driving,traffic monitoring,and intelligent transportation systems.This paper presents an enhanced YOLOv8n model that incorporates the Ghost Module,Convolutional Block Attention Module(CBAM),and Deformable Convolutional Networks v2(DCNv2).The Ghost Module streamlines feature generation to reduce redundancy,CBAM applies channel and spatial attention to improve feature focus,and DCNv2 enables adaptability to geometric variations in vehicle shapes.These components work together to improve both accuracy and computational efficiency.Evaluated on the KITTI dataset,the proposed model achieves 95.4%mAP@0.5—an 8.97% gain over standard YOLOv8n—along with 96.2% precision,93.7% recall,and a 94.93%F1-score.Comparative analysis with seven state-of-the-art detectors demonstrates consistent superiority in key performance metrics.An ablation study is also conducted to quantify the individual and combined contributions of GhostModule,CBAM,and DCNv2,highlighting their effectiveness in improving detection performance.By addressing feature redundancy,attention refinement,and spatial adaptability,the proposed model offers a robust and scalable solution for vehicle detection across diverse traffic scenarios. 展开更多
关键词 YOLOv8n vehicle detection deformable convolutional networks(DCNv2) ghost module convolutional block attention module(CBAM) attention mechanisms
在线阅读 下载PDF
AG-GCN: Vehicle Re-Identification Based on Attention-Guided Graph Convolutional Network
11
作者 Ya-Jie Sun Li-Wei Qiao Sai Ji 《Computers, Materials & Continua》 2025年第7期1769-1785,共17页
Vehicle re-identification involves matching images of vehicles across varying camera views.The diversity of camera locations along different roadways leads to significant intra-class variation and only minimal inter-c... Vehicle re-identification involves matching images of vehicles across varying camera views.The diversity of camera locations along different roadways leads to significant intra-class variation and only minimal inter-class similarity in the collected vehicle images,which increases the complexity of re-identification tasks.To tackle these challenges,this study proposes AG-GCN(Attention-Guided Graph Convolutional Network),a novel framework integrating several pivotal components.Initially,AG-GCN embeds a lightweight attention module within the ResNet-50 structure to learn feature weights automatically,thereby improving the representation of vehicle features globally by highlighting salient features and suppressing extraneous ones.Moreover,AG-GCN adopts a graph-based structure to encapsulate deep local features.A graph convolutional network then amalgamates these features to understand the relationships among vehicle-related characteristics.Subsequently,we amalgamate feature maps from both the attention and graph-based branches for a more comprehensive representation of vehicle features.The framework then gauges feature similarities and ranks them,thus enhancing the accuracy of vehicle re-identification.Comprehensive qualitative and quantitative analyses on two publicly available datasets verify the efficacy of AG-GCN in addressing intra-class and inter-class variability issues. 展开更多
关键词 Vehicle re-identification a lightweight attention module global features local features graph convolution network
在线阅读 下载PDF
A Multi-Task Learning Framework for Joint Sub-Nyquist Wideband Spectrum Sensing and Modulation Recognition
12
作者 Dong Xin Stefanos Bakirtzis +1 位作者 Zhang Jiliang Zhang Jie 《China Communications》 2025年第1期128-138,共11页
The utilization of millimeter-wave frequencies and cognitive radio(CR)are promising ways to increase the spectral efficiency of wireless communication systems.However,conventional CR spectrum sensing techniques entail... The utilization of millimeter-wave frequencies and cognitive radio(CR)are promising ways to increase the spectral efficiency of wireless communication systems.However,conventional CR spectrum sensing techniques entail sampling the received signal at a Nyquist rate,and they are not viable for wideband signals due to their high cost.This paper expounds on how sub-Nyquist sampling in conjunction with deep learning can be leveraged to remove this limitation.To this end,we propose a multi-task learning(MTL)framework using convolutional neural networks for the joint inference of the underlying narrowband signal number,their modulation scheme,and their location in a wideband spectrum.We demonstrate the effectiveness of the proposed framework for real-world millimeter-wave wideband signals collected by physical devices,exhibiting a 91.7% accuracy in the joint inference task when considering up to two narrowband signals over a wideband spectrum.Ultimately,the proposed data-driven approach enables on-the-fly wideband spectrum sensing,combining accuracy,and computational efficiency,which are indispensable for CR and opportunistic networking. 展开更多
关键词 automated modulation classification cognitive radio convolutional neural networks deep learning spectrum sensing sub-Nyquist sampling
在线阅读 下载PDF
Recognition of intrapulse modulation mode in radar signal with BRN-EST
13
作者 Yan Cheng Ke Mei Hao Zeng 《Journal of Electronic Science and Technology》 2025年第4期113-122,共10页
Neural network-based methods for intrapulse modulation recognition in radar signals have demonstrated significant improvements in classification accuracy.However,these approaches often rely on complex network structur... Neural network-based methods for intrapulse modulation recognition in radar signals have demonstrated significant improvements in classification accuracy.However,these approaches often rely on complex network structures,resulting in high computational resource requirements that limit their practical deployment in real-world settings.To address this issue,this paper proposes a bottleneck residual network with efficient soft-thresholding(BRN-EST)network,which integrates multiple lightweight design strategies and noise-reduction modules to maintain high recognition accuracy while significantly reducing computational complexity.Experimental results on the classical low-probability-of-intercept(LPI)radar signal dataset demonstrate that BRN-EST achieves comparable accuracy to state-of-the-art methods while reducing computational complexity by approximately 50%. 展开更多
关键词 Attention mechanism convolutional neural network Low probability of intercept radar Recognition of intrapulse modulation
在线阅读 下载PDF
基于轻量级卷积神经网络的岩石图像岩性识别方法
14
作者 刘善伟 马志伟 +1 位作者 魏世清 魏忠勇 《地质科技通报》 北大核心 2026年第1期360-370,共11页
岩性识别是油气勘探和开发过程中的重要环节,对于油气勘探定位、储层评价以及储层模型建立具有重要的指导意义。但传统的人工岩性识别方法耗时耗力,经典的深度学习模型虽然识别精度高,但模型的参数量较大,为了提高模型识别精度,同时降... 岩性识别是油气勘探和开发过程中的重要环节,对于油气勘探定位、储层评价以及储层模型建立具有重要的指导意义。但传统的人工岩性识别方法耗时耗力,经典的深度学习模型虽然识别精度高,但模型的参数量较大,为了提高模型识别精度,同时降低模型的参数量,使模型适用于岩性实时识别工作,首先收集了白云岩、砂岩等8种岩石共3016张岩石图像构建岩性识别数据集,然后以轻量型卷积神经网络ShuffleNetV2模型为基础网络,提出了一种Rock-ShuffleNetV2岩性识别模型(RSHFNet模型)。模型中将混合注意力机制模块(convolutional block attention module,简称CBAM)以及多尺度特征融合模块(multi-scale feature fusion module,简称MSF)融入基础网络中以加强模型的特征提取能力,提升模型识别性能,并优化模型中ShuffleNetV2单元的堆叠次数以减少模型参数量。结果表明:与基础模型相比,RSHFNet模型的准确率达到了87.21%,提高了4.98%;同时,模型参数量与浮点运算量分别降低到了869702个,0.93×108,分别是基础模型的0.67,0.63倍,模型参数量明显降低;并且RSHFNet模型的综合性能明显优于现有的卷积神经网络。RSHFNet岩性识别模型具有较高的识别精度和较好的泛化能力,同时更加的轻量化,为实现野外实时的岩性识别工作提供了新思路。 展开更多
关键词 岩性识别 ShuffleNetV2网络 混合注意力机制模块 多尺度特征融合模块 卷积神经网络
在线阅读 下载PDF
基于改进卷积神经网络的水体分割方法
15
作者 张永宏 孙岩 +2 位作者 田伟 马光义 朱灵龙 《计算机应用与软件》 北大核心 2026年第2期164-174,188,共12页
由于遥感图像中水体具有复杂的多尺度特征,传统方法在提取水体过程中容易产生误判和漏判现象。针对这一问题,提出一种融合局部和全局信息的新网络结构。该网络首先在编码端设计一个带有注意机制的残差模块,用于获取每个位置特征的全局... 由于遥感图像中水体具有复杂的多尺度特征,传统方法在提取水体过程中容易产生误判和漏判现象。针对这一问题,提出一种融合局部和全局信息的新网络结构。该网络首先在编码端设计一个带有注意机制的残差模块,用于获取每个位置特征的全局和局部信息,采用多路径扩张卷积实现多尺度水体特征提取。为了提高水体边界处的分割精度,在网络解码端设计细化注意力融合模块。实验结果显示该网络的召回率、精准率、F1-scores分别为95.78%、94.24%、93.75%,与传统卷积神经网络相比,评价指标分别提高1.56百分点、1.72百分点、1.62百分点。 展开更多
关键词 水体分割 全局注意力机制 多路径扩张卷积 局部和全局信息
在线阅读 下载PDF
改进YOLOv8n算法的船舶工业钢材表面缺陷检测
16
作者 刘鹏 侯博文 +2 位作者 王彩霞 姜晓娇 丛海芳 《兵工学报》 北大核心 2026年第3期35-49,共15页
为提高船舶工业中钢材表面缺陷检测的准确性,针对现有YOLOv8n算法在特征提取能力不足、检测精度低以及特征融合不充分等问题,提出一种基于改进YOLOv8n的钢材表面缺陷检测方法。构建高效视觉空间金字塔池化增强层聚合网络(Efficient Visi... 为提高船舶工业中钢材表面缺陷检测的准确性,针对现有YOLOv8n算法在特征提取能力不足、检测精度低以及特征融合不充分等问题,提出一种基于改进YOLOv8n的钢材表面缺陷检测方法。构建高效视觉空间金字塔池化增强层聚合网络(Efficient Vision Transformer-Spatial Pyramid Pooling with Enhanced Layer Aggregation Network,EfficientViT-SPPELAN),以增强多维度特征提取能力;设计多尺度时空卷积(Multi-Scale Spatial-Temporal Convolution,MSSTConv)实现多尺度特征融合;在此基础上构建多尺度时空(Multi-Scale Spatial-Temporal,MSST)模块以获取丰富的上下文信息,提高缺陷定位精度并降低计算复杂度,从而提升算法的推理效率。基于东北大学表面缺陷数据集(Northeastern University Surface Defect Dataset,NEU-DET)和镀锌钢10类缺陷检测数据集(Galvanized Steel 10-category Defect Detection Dataset,GC10-DET)两个数据集的实验结果表明,所提方法的检测精准度相较于原始YOLOv8n算法分别提升6.8%和5.7%,均值平均精确率mAP@0.5分别提高3.7%和7.9%;每秒帧数(Frames Per Second,FPS)分别达到189帧/s和142帧/s。研究结果表明,该方法在提升检测精度的同时保持较高计算效率,能够有效完成船舶钢材表面缺陷的定位和类别识别,满足工业场景对检测精度与实时性的需求。 展开更多
关键词 缺陷检测 YOLOv8n算法 多尺度时空模块 多尺度时空卷积 分组注意力
在线阅读 下载PDF
基于GRU和卷积注意力的改进ACGAN故障诊断方法
17
作者 彭朝琴 李奇聪 +2 位作者 张海尼 吴红 马云鹏 《航空学报》 北大核心 2026年第2期318-332,共15页
由于机电伺服系统(EMA)在实际应用中故障数据样本少,会影响故障诊断方法的分类效果。针对故障数据缺失下机电伺服系统的故障诊断问题,设计了一种基于门控循环单元(GRU)和卷积注意力的改进辅助分类生成对抗网络(ACGAN)故障诊断方法,能够... 由于机电伺服系统(EMA)在实际应用中故障数据样本少,会影响故障诊断方法的分类效果。针对故障数据缺失下机电伺服系统的故障诊断问题,设计了一种基于门控循环单元(GRU)和卷积注意力的改进辅助分类生成对抗网络(ACGAN)故障诊断方法,能够稳定地生成各故障类别高质量数据。首先,在ACGAN中引入Wasserstein距离与梯度惩罚,优化损失函数,提升对抗训练稳定性。其次,在生成器和判别器中加入GRU和卷积注意力模块(CBAM),增强网络对关键特征和时序特征的提取能力,克服了卷积网络在处理时序数据时的局限性,提高了生成样本的质量。最后,通过共享分类器与判别器网络参数,利用平衡数据集微调分类器,进一步提高模型的诊断性能。基于搭建的EMA实验台,得到由大量正常数据与少量故障数据组成的不平衡实验数据集,通过对比和消融实验,验证了所提方法的有效性和优越性。 展开更多
关键词 机电伺服系统 门控循环单元 卷积注意力模块 故障诊断 辅助分类生成对抗网络
原文传递
基于MTAM-LSTM的采煤工作面支架载荷预测方法
18
作者 张杰 杨科 范超尘 《中国安全科学学报》 北大核心 2026年第3期144-152,共9页
为有效预测液压支架载荷、评估支架运行状态,提出一种基于多尺度卷积时间注意力模块(MTAM)-长短时记忆(LSTM)神经网络的液压支架载荷预测模型。首先,采用自适应噪声完备集合经验模态分解(CEEMDAN)算法分解支架载荷数据获取本征模态分量... 为有效预测液压支架载荷、评估支架运行状态,提出一种基于多尺度卷积时间注意力模块(MTAM)-长短时记忆(LSTM)神经网络的液压支架载荷预测模型。首先,采用自适应噪声完备集合经验模态分解(CEEMDAN)算法分解支架载荷数据获取本征模态分量,基于K-L散度准则剔除本征模态分量中的冗余分量形成支架载荷预测输入序列;其次,建立MTAM捕捉支架载荷变化特征,静态注意力生成数据特征信息的注意力权重,动态注意力优化不同序列特征的关注度,并引入残差学习保持特征信号的完整性;然后,利用LSTM构建特征信息与支架载荷之间的深层依赖关系,实现支架载荷数据的超前预测;最后,选取陕西某冲击地压矿井402102工作面液压支架载荷数据进行实证分析,对比不同模型均方根误差、决定系数和平均绝对误差3种评价指标,结果表明:MTAMLSTM模型的均方根误差(RMSE)和平均绝对误差(MAE)均明显小于对比模型,RMSE整体降低0.16~0.45,MAE降低0.16~0.45,不同场景下决定系数R^(2)达到0.91,验证了MTAM-LSTM的预测准确率和模型泛化能力。 展开更多
关键词 多尺度卷积时间注意力模块(MTAM) 长短时记忆神经网络(LSTM) 采煤工作面 载荷预测 液压支架 泛化能力
原文传递
基于双重信息注意力机制的电力设备热成像超分辨率重建
19
作者 赵洪山 李忠航 +2 位作者 林诗雨 王晓盼 杨伟新 《中国电机工程学报》 北大核心 2026年第4期1384-1395,I0007,共13页
针对当前电力设备红外图像分辨率低和温度分布模糊问题,提出一种基于局部和全局信息注意力生成对抗网络(local and global information attention generative adversarial network,LGIA-GAN)的超分辨率重建方法。首先,使用门控权重单元... 针对当前电力设备红外图像分辨率低和温度分布模糊问题,提出一种基于局部和全局信息注意力生成对抗网络(local and global information attention generative adversarial network,LGIA-GAN)的超分辨率重建方法。首先,使用门控权重单元融合多种卷积输出构建细节增强融合卷积,增加重要信息在输出特征图的占比;其次,搭建双注意力模块,对图像长距离像素依赖关系建模并捕获空间和通道维度信息;然后,构造生成对抗网络,使网络关注电力设备红外图像局部纹理细节和全局轮廓信息;最后,通过实验证明,LGIA-GAN在数据集上的峰值信噪比和结构相似度分别为30.266dB和0.9197,重建时间为0.120s,明显优于其他几种GAN算法,并在主观视觉上重建效果更好。所提方法能够有效提升电力设备热成像分辨率,对电力设备故障诊断具有支撑作用。 展开更多
关键词 电力设备红外图像 超分辨率重建 细节增强融合卷积 双注意力模块 局部和全局信息
原文传递
基于卷积块注意力机制改进的Transformer的负荷分解
20
作者 王涛 王雨娟 +2 位作者 纳春宁 尹泽昊 周雨笛 《电工电气》 2026年第3期7-13,54,共8页
电力系统源网荷优化调度和需求响应的实现,要求负荷侧更精细化的用电信息。非侵入式负载监测(NILM)是获取用电设备工作状态和各个设备消耗功率的有效技术。目前非侵入式负荷分解采用的Transformer模型会导致提取输入信号局部特征的能力... 电力系统源网荷优化调度和需求响应的实现,要求负荷侧更精细化的用电信息。非侵入式负载监测(NILM)是获取用电设备工作状态和各个设备消耗功率的有效技术。目前非侵入式负荷分解采用的Transformer模型会导致提取输入信号局部特征的能力不足,且存在对电器开关状态准确识别能力不足、分解误差大等问题。提出了一种融合简单线性注意力机制(SLA)、卷积块注意力机制(CBAM)改进的Transformer模型,用于非侵入式负荷分解的方法。其中SLA机制具备强大的局部特征提取能力,CBAM被嵌入Transformer模型的前馈神经网络(FFN),以提高关键特征的表达能力,降低特征冗余度。通过采用公开数据集UK-DALE验证该模型性能,并与两种先进模型做性能对比,证明了该模型在非侵入式负荷分解方面的优越性。 展开更多
关键词 非侵入式负荷监测 Transformer模型 简单线性注意力机制 卷积块注意力机制 负荷分解
在线阅读 下载PDF
上一页 1 2 64 下一页 到第
使用帮助 返回顶部