186 articles found
1. Magnetic Resonance Image Super-Resolution Based on GAN and Multi-Scale Residual Dense Attention Network
Authors: GUAN Chunling, YU Suping, XU Wujun, FAN Hong. Journal of Donghua University (English Edition), 2025, No. 4, pp. 435-441 (7 pages)
The application of image super-resolution (SR) has brought significant assistance in the medical field, aiding doctors to make more precise diagnoses. However, relying solely on a convolutional neural network (CNN) for image SR may lead to issues such as blurry details and excessive smoothness. To address these limitations, we propose an algorithm based on the generative adversarial network (GAN) framework. In the generator network, convolutions of three different sizes connected by a residual dense structure are used to extract detailed features, and an attention mechanism that combines both channel and spatial information is applied to concentrate computing power on crucial areas. In the discriminator network, using InstanceNorm to normalize tensors speeds up training while retaining feature information. The experimental results demonstrate that our algorithm achieves a higher peak signal-to-noise ratio (PSNR) and structural similarity index measure (SSIM) than other methods, resulting in improved visual quality.
Keywords: magnetic resonance (MR); image super-resolution (SR); attention mechanism; generative adversarial network (GAN); multi-scale convolution
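
The abstract above reports results in terms of PSNR. As a reader aid, here is a minimal sketch of the standard PSNR definition, 10·log10(MAX² / MSE), applied to two flat lists of pixel values. The function name `psnr` and the default peak value of 255 are assumptions made for illustration; the paper's own evaluation code is not shown in this listing.

```python
import math


def psnr(reference, estimate, max_value=255.0):
    """Peak signal-to-noise ratio in dB between two equal-length pixel lists.

    Illustrative helper only; max_value=255 assumes 8-bit pixel data.
    """
    if len(reference) != len(estimate) or not reference:
        raise ValueError("inputs must be non-empty and of equal length")
    # Mean squared error between the reference and the reconstruction.
    mse = sum((r - e) ** 2 for r, e in zip(reference, estimate)) / len(reference)
    if mse == 0:
        return math.inf  # identical images: PSNR is unbounded
    return 10.0 * math.log10(max_value ** 2 / mse)
```

A higher value means the reconstruction is closer to the reference; identical inputs give infinity under this definition.
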
2. SIM-Net: A Multi-Scale Attention-Guided Deep Learning Framework for High-Precision PCB Defect Detection
Authors: Ping Fang, Mengjun Tong. Computers, Materials & Continua, 2026, No. 4, pp. 1754-1770 (17 pages)
Defect detection in printed circuit boards (PCB) remains challenging due to the difficulty of identifying small-scale defects, the inefficiency of conventional approaches, and interference from complex backgrounds. To address these issues, this paper proposes SIM-Net, an enhanced detection framework derived from YOLOv11. The model integrates SPDConv to preserve fine-grained features for small object detection, introduces a novel convolutional partial attention module (C2PAM) to suppress redundant background information and highlight salient regions, and employs a multi-scale fusion network (MFN) with a multi-grain contextual module (MGCT) to strengthen contextual representation and accelerate inference. Experimental evaluations demonstrate that SIM-Net achieves 92.4% mAP, 92% accuracy, and 89.4% recall with an inference speed of 75.1 FPS, outperforming existing state-of-the-art methods. These results confirm the robustness and real-time applicability of SIM-Net for PCB defect inspection.
Keywords: deep learning; small object detection; PCB defect detection; attention mechanism; multi-scale fusion network
3. MSADCN: Multi-Scale Attentional Densely Connected Network for Automated Bone Age Assessment
Authors: Yanjun Yu, Lei Yu, Huiqi Wang, Haodong Zheng, Yi Deng. Computers, Materials & Continua (SCIE, EI), 2024, No. 2, pp. 2225-2243 (19 pages). Cited by: 1
Bone age assessment (BAA) helps doctors determine how a child’s bones grow and develop in clinical medicine. Traditional BAA methods rely on clinician expertise, leading to time-consuming predictions and inaccurate results. Most deep learning-based BAA methods feed extracted critical points of images into the network by providing additional annotations. This operation is costly and subjective. To address these problems, we propose a multi-scale attentional densely connected network (MSADCN) in this paper. MSADCN constructs a multi-scale dense connectivity mechanism, which can avoid overfitting, obtain local features effectively, and prevent gradient vanishing even with limited training data. First, MSADCN designs multi-scale structures in the densely connected network to extract fine-grained features at different scales. Then, coordinate attention is embedded to focus on critical features and automatically locate the regions of interest (ROI) without additional annotation. In addition, to improve the model’s generalization, transfer learning is applied to train the proposed MSADCN on the public dataset IMDB-WIKI, and the obtained pre-trained weights are loaded onto the Radiological Society of North America (RSNA) dataset. Finally, label distribution learning (LDL) and expectation regression techniques are introduced into our model to exploit the correlation between hand bone images of different ages, which yields stable age estimates. Extensive experiments confirm that our model converges more efficiently and obtains a mean absolute error (MAE) of 4.64 months, outperforming some state-of-the-art BAA methods.
Keywords: bone age assessment; deep learning; attentional densely connected network; multi-scale
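
The abstract above mentions label distribution learning (LDL) combined with expectation regression. The sketch below illustrates the general idea only: a single age label is softened into a distribution over discrete age bins, and a prediction is read off as the expected value of a predicted distribution. The Gaussian shape, the `sigma` value, the month-level bins, and all function names are assumptions for illustration, not details taken from the paper.

```python
import math


def softmax(logits):
    """Turn raw scores into a probability distribution (numerically stable)."""
    m = max(logits)
    exps = [math.exp(z - m) for z in logits]
    total = sum(exps)
    return [e / total for e in exps]


def gaussian_label_distribution(true_age, ages, sigma=2.0):
    """Soft target: normalized Gaussian centred on the true age (assumed form)."""
    weights = [math.exp(-((a - true_age) ** 2) / (2 * sigma ** 2)) for a in ages]
    total = sum(weights)
    return [w / total for w in weights]


def expected_age(probs, ages):
    """Expectation regression: the estimate is the mean of the distribution."""
    return sum(p * a for p, a in zip(probs, ages))


ages = list(range(0, 229))                      # hypothetical bins, in months
target = gaussian_label_distribution(100, ages)
estimate = expected_age(target, ages)           # close to 100 by symmetry
```

Taking the expectation instead of the arg-max bin lets neighbouring ages contribute to the estimate, which is one common motivation for pairing the two techniques.
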
4. M2ANet: Multi-branch and multi-scale attention network for medical image segmentation
Authors: Wei Xue, Chuanghui Chen, Xuan Qi, Jian Qin, Zhen Tang, Yongsheng He. Chinese Physics B, 2025, No. 8, pp. 547-559 (13 pages). Cited by: 1
Medical image segmentation technologies based on convolutional neural networks (CNNs) have been widely used because of their strong representation and generalization abilities. However, because CNNs cannot effectively capture global information from images, they can easily lose contours and textures in segmentation results. Note that the transformer model can effectively capture long-range dependencies in the image and, furthermore, that combining the CNN and the transformer can effectively extract both local details and global contextual features of the image. Motivated by this, we propose a multi-branch and multi-scale attention network (M2ANet) for medical image segmentation, whose architecture consists of three components. In the first component, we construct an adaptive multi-branch patch module for parallel extraction of image features to reduce the information loss caused by downsampling. In the second component, we apply a residual block to the well-known convolutional block attention module to enhance the network’s ability to recognize important image features and to alleviate gradient vanishing. In the third component, we design a multi-scale feature fusion module, in which we adopt adaptive average pooling and position encoding to enhance contextual features, and then introduce multi-head attention to further enrich the feature representation. Finally, we validate the effectiveness and feasibility of the proposed M2ANet through comparative experiments on four benchmark medical image segmentation datasets, particularly with respect to preserving contours and textures.
Keywords: medical image segmentation; convolutional neural network; multi-branch attention; multi-scale feature fusion
5. Multi-Scale Attention-Based Deep Neural Network for Brain Disease Diagnosis
Authors: Yin Liang, Gaoxu Xu, Sadaqat ur Rehman. Computers, Materials & Continua (SCIE, EI), 2022, No. 9, pp. 4645-4661 (17 pages). Cited by: 1
Whole-brain functional connectivity (FC) patterns obtained from resting-state functional magnetic resonance imaging (rs-fMRI) have been widely used in the diagnosis of brain disorders such as autism spectrum disorder (ASD). Recently, an increasing number of studies have focused on employing deep learning techniques to analyze FC patterns for brain disease classification. However, the high dimensionality of the FC features and the interpretation of deep learning results are issues that need to be addressed in FC-based brain disease classification. In this paper, we propose a multi-scale attention-based deep neural network (MSA-DNN) model to classify FC patterns for ASD diagnosis. The model is implemented by adding a flexible multi-scale attention (MSA) module to an auto-encoder-based backbone DNN; the module extracts multi-scale features of the FC patterns and changes the level of attention for different FCs through continuous learning. Our model reinforces the weights of important FC features while suppressing unimportant FCs, which ensures the sparsity of the model weights and enhances model interpretability. We performed systematic experiments on a large multi-site ASD dataset with both ten-fold and leave-one-site-out cross-validation. Results showed that our model outperformed classical methods in brain disease classification and revealed robust inter-site prediction performance. We also localized important FC features and brain regions associated with ASD classification. Overall, our study further promotes biomarker detection and computer-aided classification for ASD diagnosis, and the proposed MSA module is flexible and easy to implement in other classification networks.
Keywords: autism spectrum disorder diagnosis; resting-state fMRI; deep neural network; functional connectivity; multi-scale attention module
6. MSSTNet: Multi-scale facial videos pulse extraction network based on separable spatiotemporal convolution and dimension separable attention
Authors: Changchen ZHAO, Hongsheng WANG, Yuanjing FENG. Virtual Reality & Intelligent Hardware, 2023, No. 2, pp. 124-141 (18 pages)
Background: The use of remote photoplethysmography (rPPG) to estimate blood volume pulse in a noncontact manner has been an active research topic in recent years. Existing methods are primarily based on a single-scale region of interest (ROI). However, some noise signals that are not easily separated in a single-scale space can be easily separated in a multi-scale space. Also, existing spatiotemporal networks mainly focus on local spatiotemporal information and do not emphasize temporal information, which is crucial in pulse extraction problems, resulting in insufficient spatiotemporal feature modelling. Methods: Here, we propose a multi-scale facial video pulse extraction network based on separable spatiotemporal convolution (SSTC) and dimension separable attention (DSAT). First, to solve the problem of a single-scale ROI, we constructed a multi-scale feature space for initial signal separation. Second, SSTC and DSAT were designed for efficient spatiotemporal correlation modeling, which increased the information interaction between the long-span time and space dimensions; this placed more emphasis on temporal features. Results: The signal-to-noise ratio (SNR) of the proposed network reached 9.58 dB on the PURE dataset and 6.77 dB on the UBFC-rPPG dataset, outperforming state-of-the-art algorithms. Conclusions: The results showed that fusing multi-scale signals yielded better results than methods based on only single-scale signals. The proposed SSTC and dimension-separable attention mechanism will contribute to more accurate pulse signal extraction.
Keywords: remote photoplethysmography; heart rate; separable spatiotemporal convolution; dimension separable attention; multi-scale; neural network
7. EHDC-YOLO: Enhancing Object Detection for UAV Imagery via Multi-Scale Edge and Detail Capture
Authors: Zhiyong Deng, Yanchen Ye, Jiangling Guo. Computers, Materials & Continua, 2026, No. 1, pp. 1665-1682 (18 pages)
With the rapid expansion of drone applications, accurate detection of objects in aerial imagery has become crucial for intelligent transportation, urban management, and emergency rescue missions. However, existing methods face numerous challenges in practical deployment, including scale variation handling, feature degradation, and complex backgrounds. To address these issues, we propose Edge-enhanced and Detail-Capturing You Only Look Once (EHDC-YOLO), a novel framework for object detection in Unmanned Aerial Vehicle (UAV) imagery. Based on the You Only Look Once version 11 nano (YOLOv11n) baseline, EHDC-YOLO systematically introduces several architectural enhancements: (1) a Multi-Scale Edge Enhancement (MSEE) module that leverages multi-scale pooling and edge information to enhance boundary feature extraction; (2) an Enhanced Feature Pyramid Network (EFPN) that integrates P2-level features with Cross Stage Partial (CSP) structures and OmniKernel convolutions for better fine-grained representation; and (3) Dynamic Head (DyHead) with multi-dimensional attention mechanisms for enhanced cross-scale modeling and perspective adaptability. Comprehensive experiments on the Vision meets Drones for Detection (VisDrone-DET) 2019 dataset demonstrate that EHDC-YOLO achieves significant improvements, increasing mean Average Precision (mAP)@0.5 from 33.2% to 46.1% (an absolute improvement of 12.9 percentage points) and mAP@0.5:0.95 from 19.5% to 28.0% (an absolute improvement of 8.5 percentage points) compared with the YOLOv11n baseline, while maintaining a reasonable parameter count (2.81 M vs the baseline’s 2.58 M). Further ablation studies confirm the effectiveness of each proposed component, while visualization results highlight EHDC-YOLO’s superior performance in detecting objects and handling occlusions in complex drone scenarios.
Keywords: UAV imagery; object detection; multi-scale feature fusion; edge enhancement; detail preservation; YOLO; feature pyramid network; attention mechanism
8. Attention-enhanced multi-time scale LSTM for soft sensor modeling of corn starch liquefaction
Authors: Yu Zhuang, Zhongyi Zhang, Jin Tao, Yi Li, Fan Li, Yu Wang, Lei Zhang, Jian Du. Chinese Journal of Chemical Engineering, 2026, No. 1, pp. 132-144 (13 pages)
Data-driven deep learning modeling has been increasingly applied to quality prediction in complex chemical processes. However, the data show complex temporal features due to different residence times and strong coupling relationships among chemical entities. This study proposes a multi-scale temporal feature extraction module to extract local dynamic temporal features across different time scales and combines it with long short-term memory (LSTM) networks to capture global temporal patterns, thereby taking full advantage of the available data. In addition, variable-wise channel attention is integrated into the model to focus attention on the essential parts of the feature maps and improve predictive performance. Furthermore, by analyzing the attention weights, the model quickly identifies the key variables that significantly affect the predictions. Finally, the model is applied to a real corn starch liquefaction process and achieves accurate product quality prediction with an R² value of 0.9392, which represents a 4% to 9% improvement over traditional models and demonstrates the superiority of the proposed approach.
Keywords: multi-scale dilated causal convolution; neural networks; soft sensor; systems engineering; attention mechanism; biochemical engineering
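
The keywords of the entry above name multi-scale dilated causal convolution, while the abstract only speaks of a multi-scale temporal feature extraction module. The sketch below shows what a dilated causal 1D convolution computes in general, and how running it at several dilation rates gives views of a series at several time scales. The function names, the zero padding for missing history, and the dilation rates (1, 2, 4) are assumptions for illustration and may differ from the module used in the paper.

```python
def dilated_causal_conv1d(signal, kernel, dilation=1):
    """y[t] = sum_k kernel[k] * signal[t - k * dilation].

    Samples before the start of the series are treated as zero, so y[t]
    never depends on values later than t (the causal property).
    """
    out = []
    for t in range(len(signal)):
        acc = 0.0
        for k, w in enumerate(kernel):
            idx = t - k * dilation
            if idx >= 0:
                acc += w * signal[idx]
        out.append(acc)
    return out


def multi_scale_features(signal, kernel, dilations=(1, 2, 4)):
    """One output series per dilation rate (hypothetical multi-scale view)."""
    return [dilated_causal_conv1d(signal, kernel, d) for d in dilations]
```

With kernel `[1, 1]`, dilation 1 sums each sample with its immediate predecessor, while dilation 2 sums it with the sample two steps back, so larger dilations look further into the past without adding weights.
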
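9. Question Classification for a Rice Question-Answering System Based on Attention_DenseCNN
Authors: 王郝日钦, 吴华瑞, 冯帅, 刘志超, 许童羽. 《农业机械学报》 (EI, CAS, CSCD, PKU Core), 2021, No. 7, pp. 237-243 (7 pages). Cited by: 16
To classify rice-related questions in the question-and-answer community of the "中国农技推广APP" (China agricultural technology extension app) quickly and automatically, a rice text classification method based on Attention_DenseCNN is proposed. According to the characteristics of rice text, the Word2vec method is used to process and analyze the text data and, together with an agricultural word-segmentation dictionary, to vectorize it; Word2vec effectively addresses the high dimensionality and sparsity of text. Dense connections are established between the upstream and downstream convolutional blocks of a convolutional neural network (CNN) and combined with an attention mechanism, so that keyword features in the text are fully expressed and the classification model extracts text features more precisely, which improves classification precision. Experiments show that the Attention_DenseCNN-based rice question classification model improves the utilization of text features, reduces feature loss, and classifies rice question text quickly and accurately. Its classification precision and F1 score are 95.6% and 94.9%, respectively, a clear improvement over seven other neural network question classification methods.
Keywords: rice question classification; natural language processing; densely connected convolutional neural network; attention mechanism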
10. A Coarse to Fine Thin Cloud Removal Network with Pyramid Non-local Attention
Authors: GUAN Wang, TIAN Zhenkai, MA Tao, ZHAO Lingyuan, XIE Shizhe, YAN Jin, DU Yang, ZOU Yunkun. Transactions of Nanjing University of Aeronautics and Astronautics, 2025, No. 5, pp. 589-600 (12 pages)
In remote sensing imagery, approximately 67% of the data are affected by cloud cover, which significantly increases the difficulty of image classification, recognition, and other downstream interpretation tasks. To address the randomness of cloud distribution and the non-uniformity of cloud thickness, we propose a coarse-to-fine thin cloud removal architecture. In the coarse-level declouding network, we introduce a multi-scale attention mechanism, pyramid non-local attention (PNA). By integrating global context with local detail information, it specifically addresses the image quality degradation caused by uncertainty in cloud distribution. In the fine-level declouding stage, we focus on the impact of cloud thickness on declouding results (primarily manifested as insufficient detail information). A carefully designed residual dense module significantly enhances the extraction and utilization of feature details. Our approach thus restores lost local texture features on top of the coarse-level results, substantially improving declouding quality. To evaluate the effectiveness of our cloud removal technology and attention mechanism, we conducted comprehensive analyses on publicly available datasets. Results demonstrate that our method achieves state-of-the-art performance across a wide range of techniques.
Keywords: channel attention; thin cloud removal network; pyramid non-local attention (PNA); remote sensing image; residual dense connection
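11. Rolling Bearing Fault Diagnosis Method Based on CWT-IDenseNet
Authors: 贾广飞, 梁汉文, 杨金秋, 武哲, 韩雨欣. 《河北科技大学学报》 (PKU Core), 2025, No. 2, pp. 129-140 (12 pages). Cited by: 1
To address the incomplete information carried by one-dimensional signals and the overfitting of DenseNet under variable operating conditions, a rolling bearing fault diagnosis method, CWT-IDenseNet, is proposed based on continuous wavelet transform (CWT) time-frequency images and an improved densely connected convolutional network (improved DenseNet, IDenseNet). First, the one-dimensional vibration signal is converted into a two-dimensional time-frequency image by CWT. Second, DenseNet is improved: the ReLU activation function in the first convolutional block of DenseNet is replaced with the smoother Swish activation function, and a style-based recalibration module (SRM) and a convolutional block attention module (CBAM) are introduced into the network. SRM attends to feature-channel weights, while CBAM strengthens feature representation along both the channel and spatial dimensions; together these changes yield IDenseNet. Finally, the two-dimensional time-frequency images are fed into the IDenseNet model for feature extraction and fault diagnosis, and the model's Softmax layer outputs the diagnosis result. The results show that the average fault recognition accuracy of the proposed method reaches 97.80% under both constant and variable operating conditions, and 99.44% in the transfer learning model. CWT-IDenseNet effectively improves the generalization ability of the model, shows clear advantages under constant and variable operating conditions, and provides a reference for improving the accuracy and reliability of rolling bearing fault diagnosis.
Keywords: mechanical dynamics and vibration; rolling bearing fault diagnosis; continuous wavelet transform; densely connected convolutional network; attention mechanism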
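
The abstract above replaces ReLU with the Swish activation on the grounds that Swish is smoother. A minimal sketch of the two functions, using the common definition swish(x) = x · sigmoid(x), makes the difference concrete. The function names are illustrative, and the scaling parameter sometimes written as β is assumed to be 1 here; the abstract does not say which variant the paper uses.

```python
import math


def relu(x):
    """ReLU: zero for negative inputs, identity otherwise (kink at x = 0)."""
    return max(0.0, x)


def sigmoid(x):
    """Logistic function, written to avoid overflow for large |x|."""
    if x >= 0:
        return 1.0 / (1.0 + math.exp(-x))
    z = math.exp(x)
    return z / (1.0 + z)


def swish(x):
    """Swish with beta = 1: x * sigmoid(x), smooth everywhere."""
    return x * sigmoid(x)
```

Unlike ReLU, Swish lets small negative inputs through with a small negative output instead of clamping them to zero, and it has no corner at the origin.
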
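12. Friction and Lubrication State Recognition of Dry Gas Seals Using an Improved DenseNet
Authors: 张帅, 丁雪兴, 王世鹏, 力宁, 张兰霞. 《振动与冲击》 (PKU Core), 2025, No. 4, pp. 313-321 (9 pages). Cited by: 2
To overcome the difficulty of measuring end-face contact state parameters (film thickness, end-face opening time) during dry gas seal operation, a method is proposed for recognizing the friction and lubrication state of dry gas seal end faces that fuses a self-attention mechanism with a densely connected network (DenseNet-convolutional block attention module, DenseNet-CBAM). Based on the Stribeck curve and the operating behavior of dry gas seals, the friction and lubrication states that may occur at the end face are analyzed: fluid lubrication, boundary lubrication, and mixed lubrication. Acoustic emission signals are collected by an acoustic emission sensor while the seal system runs. Filtering, time-domain analysis, and frequency-domain analysis yield the feature components that characterize each friction and lubrication state, and three-dimensional continuous wavelet transform (3D-CWT) time-frequency maps are obtained. Finally, the deep learning model DenseNet-CBAM classifies the time-frequency maps to recognize the friction and lubrication state of the seal system. Compared with other two-dimensional time-frequency feature maps as input, the 3D-CWT time-frequency maps improve recognition accuracy. Compared with other deep learning models, the method also recognizes the friction and lubrication state of dry gas seals with high accuracy, reaching 99.27%.
Keywords: dry gas seal; densely connected network; self-attention mechanism; acoustic emission; state recognition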
13. Disease Recognition of Apple Leaf Using Lightweight Multi-Scale Network with ECANet
Authors: Helong Yu, Xianhe Cheng, Ziqing Li, Qi Cai, Chunguang Bi. Computer Modeling in Engineering & Sciences (SCIE, EI), 2022, No. 9, pp. 711-738 (28 pages). Cited by: 4
To address the difficulty of identifying apple diseases in the natural environment and the low application rate of deep learning recognition networks, a lightweight ResNet (LW-ResNet) model for apple disease recognition is proposed. Based on the deep residual network (ResNet18), a multi-scale feature extraction layer is constructed with group convolution to compress the model and improve the extraction of lesion features of different sizes. The identity mapping structure is improved to reduce information loss, and the efficient channel attention module (ECANet) is introduced to suppress noise from complex backgrounds. The experimental results show that the average precision, recall, and F1-score of LW-ResNet on the test set are 97.80%, 97.92%, and 97.85%, respectively. The parameter memory is 2.32 MB, which is 94% less than that of ResNet18. Compared with the classic lightweight networks SqueezeNet and MobileNetV2, LW-ResNet has clear advantages in recognition performance, speed, parameter memory requirement, and time complexity. The proposed model offers low computational cost, low storage cost, strong real-time performance, high identification accuracy, and strong practicability, and can meet the needs of real-time apple leaf disease identification on resource-constrained devices.
Keywords: apple disease recognition; deep residual network; multi-scale feature; efficient channel attention module; lightweight network
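14. Fault Line Selection Method for Distribution Networks Based on Gramian Angular Summation Field and Improved DenseNet
Authors: 方豪, 魏业文, 张子洵, 张轲钦. 《电力系统及其自动化学报》 (PKU Core), 2025, No. 9, pp. 109-118 (10 pages)
When a single-phase grounding fault occurs in a small-current grounding system of a distribution network, the fault features are weak. To address the low accuracy and poor generalization of existing fault line selection methods, a method based on the Gramian angular summation field and an improved densely connected convolutional neural network is proposed. First, the Gramian angular summation field converts the one-dimensional zero-sequence current signal of each line into a two-dimensional image. Then, a densely connected convolutional neural network fused with the SENet attention mechanism extracts fault feature information from the image, and a random forest algorithm identifies the faulted line. Finally, a random search algorithm optimizes the model parameters to obtain the optimal fused fault line selection model. Simulation results show that, compared with other fault line selection methods, the proposed method performs well in selection accuracy, noise immunity, and generalization ability, and provides a new approach to fault line selection in small-current grounding systems.
Keywords: small-current grounding system; fault line selection; densely connected convolutional neural network; Gramian angular summation field; SENet attention mechanism; random forest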
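
The first step described in the abstract above, turning a one-dimensional signal into a two-dimensional image with a Gramian angular summation field (GASF), can be sketched as follows. This uses the common formulation: rescale the series to [-1, 1], map each value to an angle with arccos, and set pixel (i, j) to cos(φᵢ + φⱼ). The function names and the min-max rescaling are assumptions for illustration; the paper's exact preprocessing of the zero-sequence current is not given in this listing.

```python
import math


def rescale(series):
    """Min-max rescale a series to the interval [-1, 1]."""
    lo, hi = min(series), max(series)
    if hi == lo:
        raise ValueError("series must contain at least two distinct values")
    return [(2 * x - hi - lo) / (hi - lo) for x in series]


def gasf(series):
    """Gramian angular summation field: image[i][j] = cos(phi_i + phi_j)."""
    scaled = rescale(series)
    # Clamp before arccos to guard against floating-point drift past +/-1.
    phi = [math.acos(max(-1.0, min(1.0, v))) for v in scaled]
    return [[math.cos(a + b) for b in phi] for a in phi]


image = gasf([0.0, 1.0, 2.0])   # 3 samples give a 3 x 3 image
```

The resulting matrix is symmetric, and its diagonal equals 2x² − 1 for each rescaled sample x, so the original series can be read back from the image up to the rescaling.
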
15. LMACNet: Lightweight Multi-Scale Attention Convolutional Network for Glass Insulator Defects
Authors: Mingsen Li, Yongfeng Zhang, Xianlong Lv. Complex System Modeling and Simulation, 2026, No. 1, pp. 24-39 (16 pages)
Glass insulator defect detection faces challenges such as noisy data, small defect targets, and low accuracy. Most current methods aimed at improving the accuracy of insulator defect detection inevitably increase model complexity and require additional computational resources. To address this issue, this paper proposes a Lightweight Multi-scale Attention Convolutional Network (LMACNet). A general Spatial-Channel Cross Attention (SCCA) is proposed to perform cross-spatial learning on multi-scale features. SCCA-Mobile Inverted Bottleneck Convolution (SCCA-MBConv) is designed for lightweight feature extraction. Additionally, we improve the Cross Stage Partial Bottleneck with 2 convolutions (C2f) module to enhance multi-scale feature fusion and high-level semantic feature learning. Specifically, an auxiliary network is designed to train the lightweight model, alleviating the information bottleneck and ensuring sufficient gradient flow. Extensive experiments were conducted on the public Vietnam Public Merged dataset of Broken Glass Insulator (VPMBGI) and a custom dataset. The results demonstrate that LMACNet achieves detection performance comparable to other related detection models while having a parameter count of only 2.54 × 10^6 and requiring 3.01 × 10^9 operations. LMACNet is efficient and practical.
Keywords: auxiliary branch; deep learning; glass insulator defect; lightweight; Lightweight Multi-scale Attention Convolutional Network (LMACNet)
16. Siamese Dense Pixel-Level Fusion Network for Real-Time UAV Tracking
Authors: Zhenyu Huang, Gun Li, Xudong Sun, Yong Chen, Jie Sun, Zhangsong Ni, Yang Yang. Computers, Materials & Continua (SCIE, EI), 2023, No. 9, pp. 3219-3238 (20 pages). Cited by: 1
Onboard visual object tracking in unmanned aerial vehicles (UAVs) has attracted much interest due to its versatility. Meanwhile, due to their high precision, Siamese networks are becoming a hot spot in visual object tracking. However, most Siamese trackers fail to balance tracking accuracy and time within the limited onboard computational resources of UAVs. To meet the tracking precision and real-time requirements, this paper proposes a Siamese dense pixel-level network for UAV object tracking named SiamDPL. Specifically, the Siamese network extracts features of the search region and the template region through a parameter-shared backbone network, then performs correlation matching to obtain the candidate region with high similarity. To improve the matching of template and search features, this paper designs a dense pixel-level feature fusion module that enhances matching ability through pixel-wise correlation and enriches feature diversity through dense connection. An attention module composed of self-attention and channel attention is introduced to learn global context information and selectively emphasize the target feature region in the spatial and channel dimensions. In addition, a target localization module is designed to improve target location accuracy. Compared with other advanced trackers, experiments on two public benchmarks, UAV123@10fps and UAV20L from the unmanned air vehicle123 (UAV123) dataset, show that SiamDPL achieves superior performance and low complexity with a running speed of 100.1 fps on an NVIDIA TITAN RTX.
Keywords: Siamese network; UAV object tracking; dense pixel-level feature fusion; attention module; target localization
17. Attention-based neural network for end-to-end music separation
Authors: Jing Wang, Hanyue Liu, Haorong Ying, Chuhan Qiu, Jingxin Li, Muhammad Shahid Anwar. CAAI Transactions on Intelligence Technology (SCIE, EI), 2023, No. 2, pp. 355-363 (9 pages). Cited by: 1
End-to-end separation algorithms, which perform very well in speech separation, have not been used effectively in music separation. Moreover, since music signals are often dual-channel data with a high sampling rate, how to model long-sequence data and make rational use of the relevant information between channels is also an urgent problem. To solve these problems, the performance of the end-to-end music separation algorithm is enhanced by improving the network structure. Our main contributions include the following: (1) A more reasonable densely connected U-Net is designed to capture the long-term characteristics of music, such as main melody and tone. (2) On this basis, multi-head attention and a dual-path transformer are introduced in the separation module. Channel attention units are applied recursively on the feature map of each layer of the network, enabling the network to perform long-sequence separation. Experimental results show that, after the introduction of channel attention, the performance of the proposed algorithm improves consistently over the baseline system. On the MUSDB18 dataset, the average score of the separated audio exceeds that of the current best-performing music separation algorithm based on the time-frequency domain (T-F domain).
Keywords: channel attention; densely connected network; end-to-end music separation
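18. Weld Defect Recognition Algorithm for Cruise Ship Thin Plates Based on the CAM-DenseNet Model
Authors: 黎林发, 王岳. 《造船技术》, 2025, No. 1, pp. 78-84 (7 pages)
Welds in cruise ship thin plates have relatively small penetration depth and width, the base metal differs little from the weld region, and surface weld defects are hard to distinguish. To locate weld positions accurately, a weld defect recognition algorithm for cruise ship thin plates is proposed that integrates the Coordinate Attention Module (CAM) into Densely Connected Convolutional Networks (DenseNet), establishing the CAM-DenseNet model. The ReLU activation function in the network is replaced with the more stable ReLU6, and a Bayesian optimization algorithm is used to optimize and select the hyperparameter combination of the CAM-DenseNet model. Red Green Blue (RGB) images of thin-plate welds were captured by camera in the welding workshop to build a custom weld defect dataset, which is divided by defect type into five classes: dents, pores, burrs, surface cracks, and defect-free. Test results show that the CAM-DenseNet model performs very well in recognizing weld defects in cruise ship thin plates.
Keywords: cruise ship; thin plate; weld defect; recognition algorithm; deep learning; densely connected convolutional network; coordinate attention module; CAM-DenseNet model; activation function; Bayesian optimization algorithm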
19. Lightweight Human Pose Estimation Based on Multi-Attention Mechanism
Authors: LIN Xiao, LU Meichen, GAO Mufeng, LI Yan. Journal of Shanghai Jiaotong University (Science), 2025, No. 5, pp. 899-910 (12 pages)
Human pose estimation has received much attention from the research community because of its wide range of applications. However, current approaches to pose estimation are usually complex and computationally intensive, and feature loss during feature fusion is a particular problem. To address these problems, we propose a lightweight human pose estimation network based on a multi-attention mechanism (LMANet). In our method, network parameters are significantly reduced by lightweighting the bottleneck blocks of the high-resolution network with depth-wise separable convolution. We also introduce a multi-attention mechanism to improve prediction accuracy: a channel attention module is added in the initial stage of the network to enhance local cross-channel information interaction. More importantly, we inject a spatial cross-awareness module in the multi-scale feature fusion stage to reduce the spatial information loss during feature extraction. Extensive experiments on the COCO2017 and MPII datasets show that LMANet achieves higher prediction accuracy with fewer network parameters and less computation. Compared with the high-resolution network HRNet, the number of parameters and the computational complexity of the network are reduced by 67% and 73%, respectively.
Keywords: human pose estimation; attention mechanisms; multi-scale feature fusion; high-resolution networks
20. Attention-Based Multi-scale CNN and LSTM Model for Remaining Useful Life Estimation
Authors: DUAN Jiajun, LU Zhong, DU Zhiqiang. Transactions of Nanjing University of Aeronautics and Astronautics, 2025, No. S1, pp. 64-77 (14 pages)
Current aero-engine life prediction methods typically focus on single-scale degradation features, and existing methods are not comprehensive enough to capture the relationships within time series data. To address this problem, we propose a novel remaining useful life (RUL) estimation method based on the attention mechanism. Our approach designs a two-layer multi-scale feature extraction module that integrates degradation features at different scales. These features are then processed in parallel by a self-attention module and a three-layer long short-term memory (LSTM) network, which together capture long-term dependencies and adaptively weigh important features. The integration of degradation patterns from both components into the attention module enhances the model’s ability to capture long-term dependencies. Visualizing the attention module’s weight matrices further improves model interpretability. Experimental results on the C-MAPSS dataset demonstrate that our approach outperforms existing state-of-the-art methods.
Keywords: attention mechanism; convolutional neural network (CNN); long short-term memory (LSTM); multi-scale feature extraction