期刊文献+
共找到42篇文章
< 1 2 3 >
每页显示 20 50 100
Improved multi-scale inverse bottleneck residual network based on triplet parallel attention for apple leaf disease identification 被引量:3
1
作者 Lei Tang Jizheng Yi Xiaoyao Li 《Journal of Integrative Agriculture》 SCIE CAS CSCD 2024年第3期901-922,共22页
Accurate diagnosis of apple leaf diseases is crucial for improving the quality of apple production and promoting the development of the apple industry. However, apple leaf diseases do not differ significantly from ima... Accurate diagnosis of apple leaf diseases is crucial for improving the quality of apple production and promoting the development of the apple industry. However, apple leaf diseases do not differ significantly from image texture and structural information. The difficulties in disease feature extraction in complex backgrounds slow the related research progress. To address the problems, this paper proposes an improved multi-scale inverse bottleneck residual network model based on a triplet parallel attention mechanism, which is built upon ResNet-50, while improving and combining the inception module and ResNext inverse bottleneck blocks, to recognize seven types of apple leaf(including six diseases of alternaria leaf spot, brown spot, grey spot, mosaic, rust, scab, and one healthy). First, the 3×3 convolutions in some of the residual modules are replaced by multi-scale residual convolutions, the convolution kernels of different sizes contained in each branch of the multi-scale convolution are applied to extract feature maps of different sizes, and the outputs of these branches are multi-scale fused by summing to enrich the output features of the images. Second, the global layer-wise dynamic coordinated inverse bottleneck structure is used to reduce the network feature loss. The inverse bottleneck structure makes the image information less lossy when transforming from different dimensional feature spaces. The fusion of multi-scale and layer-wise dynamic coordinated inverse bottlenecks makes the model effectively balances computational efficiency and feature representation capability, and more robust with a combination of horizontal and vertical features in the fine identification of apple leaf diseases. Finally, after each improved module, a triplet parallel attention module is integrated with cross-dimensional interactions among channels through rotations and residual transformations, which improves the parallel search efficiency of important features and the recognition rate of the network with relatively small computational costs while the dimensional dependencies are improved. To verify the validity of the model in this paper, we uniformly enhance apple leaf disease images screened from the public data sets of Plant Village, Baidu Flying Paddle, and the Internet. The final processed image count is 14,000. The ablation study, pre-processing comparison, and method comparison are conducted on the processed datasets. The experimental results demonstrate that the proposed method reaches 98.73% accuracy on the adopted datasets, which is 1.82% higher than the classical ResNet-50 model, and 0.29% better than the apple leaf disease datasets before preprocessing. It also achieves competitive results in apple leaf disease identification compared to some state-ofthe-art methods. 展开更多
关键词 multi-scale module inverse bottleneck structure triplet parallel attention apple leaf disease
在线阅读 下载PDF
Multi-Scale Attention-Based Deep Neural Network for Brain Disease Diagnosis 被引量:1
2
作者 Yin Liang Gaoxu Xu Sadaqat ur Rehman 《Computers, Materials & Continua》 SCIE EI 2022年第9期4645-4661,共17页
Whole brain functional connectivity(FC)patterns obtained from resting-state functional magnetic resonance imaging(rs-fMRI)have been widely used in the diagnosis of brain disorders such as autism spectrum disorder(ASD)... Whole brain functional connectivity(FC)patterns obtained from resting-state functional magnetic resonance imaging(rs-fMRI)have been widely used in the diagnosis of brain disorders such as autism spectrum disorder(ASD).Recently,an increasing number of studies have focused on employing deep learning techniques to analyze FC patterns for brain disease classification.However,the high dimensionality of the FC features and the interpretation of deep learning results are issues that need to be addressed in the FC-based brain disease classification.In this paper,we proposed a multi-scale attention-based deep neural network(MSA-DNN)model to classify FC patterns for the ASD diagnosis.The model was implemented by adding a flexible multi-scale attention(MSA)module to the auto-encoder based backbone DNN,which can extract multi-scale features of the FC patterns and change the level of attention for different FCs by continuous learning.Our model will reinforce the weights of important FC features while suppress the unimportant FCs to ensure the sparsity of the model weights and enhance the model interpretability.We performed systematic experiments on the large multi-sites ASD dataset with both ten-fold and leaveone-site-out cross-validations.Results showed that our model outperformed classical methods in brain disease classification and revealed robust intersite prediction performance.We also localized important FC features and brain regions associated with ASD classification.Overall,our study further promotes the biomarker detection and computer-aided classification for ASD diagnosis,and the proposed MSA module is flexible and easy to implement in other classification networks. 展开更多
关键词 Autism spectrum disorder diagnosis resting-state fMRI deep neural network functional connectivity multi-scale attention module
在线阅读 下载PDF
Two Stages Segmentation Algorithm of Breast Tumor in DCE-MRI Based on Multi-Scale Feature and Boundary Attention Mechanism
3
作者 Bing Li Liangyu Wang +3 位作者 Xia Liu Hongbin Fan Bo Wang Shoudi Tong 《Computers, Materials & Continua》 SCIE EI 2024年第7期1543-1561,共19页
Nuclearmagnetic resonance imaging of breasts often presents complex backgrounds.Breast tumors exhibit varying sizes,uneven intensity,and indistinct boundaries.These characteristics can lead to challenges such as low a... Nuclearmagnetic resonance imaging of breasts often presents complex backgrounds.Breast tumors exhibit varying sizes,uneven intensity,and indistinct boundaries.These characteristics can lead to challenges such as low accuracy and incorrect segmentation during tumor segmentation.Thus,we propose a two-stage breast tumor segmentation method leveraging multi-scale features and boundary attention mechanisms.Initially,the breast region of interest is extracted to isolate the breast area from surrounding tissues and organs.Subsequently,we devise a fusion network incorporatingmulti-scale features and boundary attentionmechanisms for breast tumor segmentation.We incorporate multi-scale parallel dilated convolution modules into the network,enhancing its capability to segment tumors of various sizes through multi-scale convolution and novel fusion techniques.Additionally,attention and boundary detection modules are included to augment the network’s capacity to locate tumors by capturing nonlocal dependencies in both spatial and channel domains.Furthermore,a hybrid loss function with boundary weight is employed to address sample class imbalance issues and enhance the network’s boundary maintenance capability through additional loss.Themethod was evaluated using breast data from 207 patients at RuijinHospital,resulting in a 6.64%increase in Dice similarity coefficient compared to the benchmarkU-Net.Experimental results demonstrate the superiority of the method over other segmentation techniques,with fewer model parameters. 展开更多
关键词 Dynamic contrast-enhanced magnetic resonance imaging(DCE-MRI) breast tumor segmentation multi-scale dilated convolution boundary attention the hybrid loss function with boundary weight
在线阅读 下载PDF
Learning multi-scale attention network for fine-grained visual classification
4
作者 Peipei Zhao Siyan Yang +4 位作者 Wei Ding Ruyi Liu Wentian Xin Xiangzeng Liu Qiguang Miao 《Journal of Information and Intelligence》 2025年第6期492-503,共12页
Fine-grained visual classification(FGVC)is a very challenging task due to distinguishing subcategories under the same super-category.Recent works mainly localize discriminative image regions and capture subtle inter-c... Fine-grained visual classification(FGVC)is a very challenging task due to distinguishing subcategories under the same super-category.Recent works mainly localize discriminative image regions and capture subtle inter-class differences by utilizing attention-based methods.However,at the same layer,most attention-based works only consider large-scale attention blocks with the same size as feature maps,and they ignore small-scale attention blocks that are smaller than feature maps.To distinguish subcategories,it is important to exploit small local regions.In this work,a novel multi-scale attention network(MSANet)is proposed to capture large and small regions at the same layer in fine-grained visual classification.Specifically,a novel multi-scale attention layer(MSAL)is proposed,which generates multiple groups in each feature maps to capture different-scale discriminative regions.The groups based on large-scale regions can exploit global features and the groups based on the small-scale regions can extract local subtle features.Then,a simple feature fusion strategy is utilized to fully integrate global features and local subtle features to mine information that are more conducive to FGVC.Comprehensive experiments in Caltech-UCSD Birds-200-2011(CUB),FGVC-Aircraft(AIR)and Stanford Cars(Cars)datasets show that our method achieves the competitive performances,which demonstrate its effectiveness. 展开更多
关键词 Fine-grained visual classification multi-scale attention network multi-scale attention module Feature fusion strategy
原文传递
Real-time detection network for tiny traffic sign using multi-scale attention module 被引量:16
5
作者 YANG TingTing TONG Chao 《Science China(Technological Sciences)》 SCIE EI CAS CSCD 2022年第2期396-406,共11页
As one of the key technologies of intelligent vehicles, traffic sign detection is still a challenging task because of the tiny size of its target object. To address the challenge, we present a novel detection network ... As one of the key technologies of intelligent vehicles, traffic sign detection is still a challenging task because of the tiny size of its target object. To address the challenge, we present a novel detection network improved from yolo-v3 for the tiny traffic sign with high precision in real-time. First, a visual multi-scale attention module(MSAM), a light-weight yet effective module, is devised to fuse the multi-scale feature maps with channel weights and spatial masks. It increases the representation power of the network by emphasizing useful features and suppressing unnecessary ones. Second, we exploit effectively fine-grained features about tiny objects from the shallower layers through modifying backbone Darknet-53 and adding one prediction head to yolo-v3. Finally, a receptive field block is added into the neck of the network to broaden the receptive field. Experiments prove the effectiveness of our network in both quantitative and qualitative aspects. The m AP@0.5 of our network reaches 0.965 and its detection speed is55.56 FPS for 512 × 512 images on the challenging Tsinghua-Tencent 100 k(TT100 k) dataset. 展开更多
关键词 tiny object detection traffic sign detection multi-scale attention module REAL-TIME
原文传递
Disease Recognition of Apple Leaf Using Lightweight Multi-Scale Network with ECANet 被引量:4
6
作者 Helong Yu Xianhe Cheng +2 位作者 Ziqing Li Qi Cai Chunguang Bi 《Computer Modeling in Engineering & Sciences》 SCIE EI 2022年第9期711-738,共28页
To solve the problem of difficulty in identifying apple diseases in the natural environment and the low application rate of deep learning recognition networks,a lightweight ResNet(LW-ResNet)model for apple disease rec... To solve the problem of difficulty in identifying apple diseases in the natural environment and the low application rate of deep learning recognition networks,a lightweight ResNet(LW-ResNet)model for apple disease recognition is proposed.Based on the deep residual network(ResNet18),the multi-scale feature extraction layer is constructed by group convolution to realize the compression model and improve the extraction ability of different sizes of lesion features.By improving the identity mapping structure to reduce information loss.By introducing the efficient channel attention module(ECANet)to suppress noise from a complex background.The experimental results show that the average precision,recall and F1-score of the LW-ResNet on the test set are 97.80%,97.92%and 97.85%,respectively.The parameter memory is 2.32 MB,which is 94%less than that of ResNet18.Compared with the classic lightweight networks SqueezeNet and MobileNetV2,LW-ResNet has obvious advantages in recognition performance,speed,parameter memory requirement and time complexity.The proposed model has the advantages of low computational cost,low storage cost,strong real-time performance,high identification accuracy,and strong practicability,which can meet the needs of real-time identification task of apple leaf disease on resource-constrained devices. 展开更多
关键词 Apple disease recognition deep residual network multi-scale feature efficient channel attention module lightweight network
在线阅读 下载PDF
基于改进卷积神经网络的水体分割方法
7
作者 张永宏 孙岩 +2 位作者 田伟 马光义 朱灵龙 《计算机应用与软件》 北大核心 2026年第2期164-174,188,共12页
由于遥感图像中水体具有复杂的多尺度特征,传统方法在提取水体过程中容易产生误判和漏判现象。针对这一问题,提出一种融合局部和全局信息的新网络结构。该网络首先在编码端设计一个带有注意机制的残差模块,用于获取每个位置特征的全局... 由于遥感图像中水体具有复杂的多尺度特征,传统方法在提取水体过程中容易产生误判和漏判现象。针对这一问题,提出一种融合局部和全局信息的新网络结构。该网络首先在编码端设计一个带有注意机制的残差模块,用于获取每个位置特征的全局和局部信息,采用多路径扩张卷积实现多尺度水体特征提取。为了提高水体边界处的分割精度,在网络解码端设计细化注意力融合模块。实验结果显示该网络的召回率、精准率、F1-scores分别为95.78%、94.24%、93.75%,与传统卷积神经网络相比,评价指标分别提高1.56百分点、1.72百分点、1.62百分点。 展开更多
关键词 水体分割 全局注意力机制 多路径扩张卷积 局部和全局信息
在线阅读 下载PDF
基于改进YOLOv10的多尺度舰船目标图像检测算法
8
作者 刘兆华 王中训 +2 位作者 贺鹏飞 刘宁波 孙艳丽 《海军工程大学学报》 北大核心 2026年第1期45-52,共8页
针对红外与可见光舰船图像分辨率低、纹理细节欠佳以及舰船目标尺度变化大等问题,本文提出了一种基于改进YOLOv10的多尺度舰船目标图像检测算法。首先,为了提高模型的特征提取能力,在骨干网络中加入了大型可分离核注意力模块;然后,为了... 针对红外与可见光舰船图像分辨率低、纹理细节欠佳以及舰船目标尺度变化大等问题,本文提出了一种基于改进YOLOv10的多尺度舰船目标图像检测算法。首先,为了提高模型的特征提取能力,在骨干网络中加入了大型可分离核注意力模块;然后,为了适应舰船目标尺寸变化大的问题,在颈部网络中添加了多尺度扩张注意力模块,提高了模型的多尺度检测能力;最后,引入了考虑边界框形状的损失函数,提高了模型对小目标的检测能力。在采集的红外与可见光舰船图像数据集上实验结果表明:改进后的算法在增加较少参数量的基础上平均精度均值较原有模型提高了1.2%,平均精度提高了1.9%,显著提高了模型的多尺度目标检测能力。 展开更多
关键词 多尺度目标检测 红外与可见光图像 YOLOv10 大型可分离核注意力模块 多尺度扩张注意力模块
在线阅读 下载PDF
基于改进ShuffleNet V2网络的路面类型识别
9
作者 张缓缓 冯屹轩 吴宏超 《江苏大学学报(自然科学版)》 北大核心 2026年第1期48-54,共7页
针对路面类型识别模型体积大、精确度低的问题,提出基于改进ShuffleNet V2网络的路面类型识别模型.在ShuffleNet V2网络结构中添加高效通道注意力(ECA)模块,通过注意力机制实现跨通道信息交互,并能根据输入的通道数量调整卷积核的大小;... 针对路面类型识别模型体积大、精确度低的问题,提出基于改进ShuffleNet V2网络的路面类型识别模型.在ShuffleNet V2网络结构中添加高效通道注意力(ECA)模块,通过注意力机制实现跨通道信息交互,并能根据输入的通道数量调整卷积核的大小;使用LeakyRelu函数替代ReLU函数,避免激活函数失效;引入由膨胀卷积组成的模块,在图像分辨率不变的同时,获取更广泛的图像信息,以提高模型的特征提取能力及泛化能力;根据路面类型的分类特点,调整各个模块的堆叠次数和网络的整体架构,降低模型的运算量和参数量.将改进后的算法在道路表面分类数据集(RSCD)上进行验证.结果表明:改进后的ShuffleNet V2模型参数量为4.67×10^(6)个,比原模型减少了1.4×10^(5)个;准确率为95.53%,比改进前提高了0.71百分点;推理时间减少了31%,新模型提高了对路面类型识别的准确率和响应速度. 展开更多
关键词 路面类型识别 卷积神经网络 ShuffleNet模型 ECA注意力机制 膨胀卷积模块 轻量化模型
在线阅读 下载PDF
Hard-rock tunnel lithology identification using multiscale dilated convolutional attention network based on tunnel face images 被引量:1
10
作者 Wenjun ZHANG Wuqi ZHANG +5 位作者 Gaole ZHANG Jun HUANG Minggeng LI Xiaohui WANG Fei YE Xiaoming GUAN 《Frontiers of Structural and Civil Engineering》 SCIE EI CSCD 2023年第12期1796-1812,共17页
For real-time classification of rock-masses in hard-rock tunnels,quick determination of the rock lithology on the tunnel face during construction is essential.Motivated by current breakthroughs in artificial intellige... For real-time classification of rock-masses in hard-rock tunnels,quick determination of the rock lithology on the tunnel face during construction is essential.Motivated by current breakthroughs in artificial intelligence technology in machine vision,a new automatic detection approach for classifying tunnel lithology based on tunnel face images was developed.The method benefits from residual learning for training a deep convolutional neural network(DCNN),and a multi-scale dilated convolutional attention block is proposed.The block with different dilation rates can provide various receptive fields,and thus it can extract multi-scale features.Moreover,the attention mechanism is utilized to select the salient features adaptively and further improve the performance of the model.In this study,an initial image data set made up of photographs of tunnel faces consisting of basalt,granite,siltstone,and tuff was first collected.After classifying and enhancing the training,validation,and testing data sets,a new image data set was generated.A comparison of the experimental findings demonstrated that the suggested approach outperforms previous classifiers in terms of various indicators,including accuracy,precision,recall,F1-score,and computing time.Finally,a visualization analysis was performed to explain the process of the network in the classification of tunnel lithology through feature extraction.Overall,this study demonstrates the potential of using artificial intelligence methods for in situ rock lithology classification utilizing geological images of the tunnel face. 展开更多
关键词 hard-rock tunnel face intelligent lithology identification multi-scale dilated convolutional attention network image classification deep learning
原文传递
Attention-enhanced multi-time scale LSTM for soft sensor modeling of corn starch liquefaction
11
作者 Yu Zhuang Zhongyi Zhang +5 位作者 Jin Tao Yi Li Fan Li Yu Wang Lei Zhang Jian Du 《Chinese Journal of Chemical Engineering》 2026年第1期132-144,共13页
Data-driven deep learning modeling has been increasingly applied to quality prediction in complex chemical processes.However,the data show complex temporal features due to different residence times and strong coupling... Data-driven deep learning modeling has been increasingly applied to quality prediction in complex chemical processes.However,the data show complex temporal features due to different residence times and strong coupling relationships among chemical entities.This study proposes a multi-scale temporal feature extraction module to extract local dynamic temporal features across different time scales and combines it with long short-term memory(LSTM)networks to capture global temporal patterns,thereby taking full advantage of available data.In addition,variable-wise channel attention is integrated into the model to enhance attention on the essential parts of the feature maps and improve predictive performance.Furthermore,by analyzing the attention weights,the model quickly identifies the key variables that significantly affect the predictions.Finally,the model is applied to a real corn starch liquefaction process and achieves an accurate product quality prediction with an R^(2) value of 0.9392,which represents a 4%to 9%improvement over traditional models and demonstrates the superiority of the proposed approach. 展开更多
关键词 multi-scale dilated causal convolution Neural networks Soft sensor Systems engineering attention mechanism Biochemical engineering
在线阅读 下载PDF
基于改进SegFormer的超声影像分割方法研究
12
作者 王高才 杨满 张晓洁 《现代电子技术》 北大核心 2025年第23期17-24,共8页
针对SegFormer网络分层的Transformer编码器逐级递减图像分辨率,提取图像多尺度特征的特点,文中提出一种边缘提取模块(EEM),该模块能更好地注意到超声影像的边界信息。为了解决超声图像中出现的复杂边界问题,在SegFormer编码层和解码层... 针对SegFormer网络分层的Transformer编码器逐级递减图像分辨率,提取图像多尺度特征的特点,文中提出一种边缘提取模块(EEM),该模块能更好地注意到超声影像的边界信息。为了解决超声图像中出现的复杂边界问题,在SegFormer编码层和解码层之间加入空洞空间卷积金字塔池化(ASPP)模块,确保其在获取较大感受野的同时不会发生网格效应。结合EEM和ASPP模块的SegFormer能够在保持低级边缘细节特征的同时进一步提取抽象特征,使得超声影像分割的精度进一步提升。将EEM连接在SegFormer编码器之前,再和解码器进行特征融合之后,与ASPP模块的输出特征做拼接并预测分割掩码,在甲状腺结节超声影像、乳腺结节超声影像和白细胞数据集上取得了较好的效果。经实验验证,该模型的mIoU分别提高了1.25%、2.2%和0.13%,证明了改进方法的有效性。 展开更多
关键词 图像分割 改进的SegFormer 超声影像 混合空洞卷积 混合通道注意力 边缘提取模块
在线阅读 下载PDF
改进U-Net模型的隧道掌子面图像语义分割研究
13
作者 陈登峰 程静 +1 位作者 赵蕾 何拓航 《防灾减灾工程学报》 北大核心 2025年第4期776-783,共8页
隧道掌子面岩体结构是判断岩土工程地质条件、制定施工和支护方案、预防塌方及涌水等事故的直观依据。将U-Net模型应用于掌子面岩体结构图像分割与识别时,下采样过程中缩小图像尺寸会导致岩体部分细节信息丢失,上采样过程中将低层特征... 隧道掌子面岩体结构是判断岩土工程地质条件、制定施工和支护方案、预防塌方及涌水等事故的直观依据。将U-Net模型应用于掌子面岩体结构图像分割与识别时,下采样过程中缩小图像尺寸会导致岩体部分细节信息丢失,上采样过程中将低层特征传递到高层的跳跃连接导致特征映射过大。因此,提出加入空洞空间卷积池化金字塔模块ASPP和卷积注意力模块CBAM的改进U-Net模型。在U-Net模型的跳跃连接过程中加ASPP,利用不同膨胀率的空洞卷积捕获不同尺度的上下文信息,融合不同感受野的信息,从而更全面的理解图像内容;U-Net模型的下采样过程中加入CBAM,使网络模型更加关注有用的特征,从而增强特征的表达能力。实验结果表明,改进的网络模型相较于原始U-Net模型分割和识别性能有显著提升,在某隧道工程掌子面岩体图像数据集上Precision达到93.04%,mIoU达到74.98%,mPA达到78.89%。 展开更多
关键词 隧道掌子面 图像语义分割 卷积注意力模块 空洞空间卷积池化金字塔模块
原文传递
基于YOLOv4算法的口罩佩戴检测 被引量:2
14
作者 杨震山 丁昕苗 +2 位作者 范新磊 姜成云 王觅甲 《计算机应用与软件》 北大核心 2025年第2期181-189,共9页
针对公共场所口罩佩戴检测所面临的实时性、多目标多姿态以及面部遮挡等挑战,设计一种基于YOLOv4的口罩佩戴检测方法。该方法一方面通过引入空洞卷积和非对称卷积等思想,结合RFB设计特征加强模块RFB-s,并替换YOLOv4中的空间金字塔结构,... 针对公共场所口罩佩戴检测所面临的实时性、多目标多姿态以及面部遮挡等挑战,设计一种基于YOLOv4的口罩佩戴检测方法。该方法一方面通过引入空洞卷积和非对称卷积等思想,结合RFB设计特征加强模块RFB-s,并替换YOLOv4中的空间金字塔结构,扩大了模型感受野,同时降低了网络参数量。另一方面,增加了注意力模块,提升了模型信息处理能力。通过在自建的口罩佩戴检测数据集和开源数据集上的实验,对比不同网络结构和不同算法情况下的mAP值和运行速度,验证了该算法在口罩佩戴检测性能上的提升。 展开更多
关键词 新型冠状病毒 口罩 YOLOv4 空洞卷积 注意力模块 数据集
在线阅读 下载PDF
基于3D-CA-GAN的岩石体纹理合成技术
15
作者 段炼 冯云 +4 位作者 花卫华 陈启浩 刘修国 张坤 付伟 《地球科学》 北大核心 2025年第11期4499-4513,共15页
基于二维样本(深度学习)的体纹理合成是一种重要的岩石体纹理生成途径,目前岩石体纹理合成存在无法长距离依赖和颜色失真的问题.提出一种基于三维坐标注意力生成对抗网络(3D-Coordinate Attention Generative Adversarial Network,简称3... 基于二维样本(深度学习)的体纹理合成是一种重要的岩石体纹理生成途径,目前岩石体纹理合成存在无法长距离依赖和颜色失真的问题.提出一种基于三维坐标注意力生成对抗网络(3D-Coordinate Attention Generative Adversarial Network,简称3D-CA-GAN)的创新方法.通过将坐标注意力机制(Coordinate Attention,简称CA)扩展至三维空间,结合内容感知上采样模块和多尺度判别器,实现了矿物颗粒空间分布的高保真建模.实验表明,该方法在SSIM(0.773)、PSNR(提升24.92%)和LPIPS(降低0.110)等指标上显著优于现有技术,消融实验进一步验证3D-CA模块使方向性纹理的SSIM提升14.69%.本研究为地质建模提供了具有真实感纹理合成的新解决方案,其三维注意力框架对通用生成任务具有借鉴意义. 展开更多
关键词 岩石 体纹理 混合空洞卷积 注意力模块 3D-CA-GAN 三维建模
原文传递
An improved multiscale fusion dense network with efficient multiscale attention mechanism for apple leaf disease identification 被引量:1
16
作者 Dandan DAI Hui LIU 《Frontiers of Agricultural Science and Engineering》 2025年第2期173-189,共17页
With the development of smart agriculture,accurately identifying crop diseases through visual recognition techniques instead of by eye has been a significant challenge.This study focused on apple leaf disease,which is... With the development of smart agriculture,accurately identifying crop diseases through visual recognition techniques instead of by eye has been a significant challenge.This study focused on apple leaf disease,which is closely related to the final yield of apples.A multiscale fusion dense network combined with an efficient multiscale attention(EMA)mechanism called Incept_EMA_DenseNet was developed to better identify eight complex apple leaf disease images.Incept_EMA_DenseNet consists of three crucial parts:the inception module,which substituted the convolution layer with multiscale fusion methods in the shallow feature extraction layer;the EMA mechanism,which is used for obtaining appropriate weights of different dense blocks;and the improved DenseNet based on DenseNet_121.Specifically,to find appropriate multiscale fusion methods,the residual module and inception module were compared to determine the performance of each technique,and Incept_EMA_DenseNet achieved an accuracy of 95.38%.Second,this work used three attention mechanisms,and the efficient multiscale attention mechanism obtained the best performance.Third,the convolution layers and bottlenecks were modified without performance degradation,reducing half of the computational load compared with the original models.Incept_EMA_DenseNet,as proposed in this paper,has an accuracy of 96.76%,being 2.93%,3.44%,and 4.16%better than Resnet50,DenseNet_121 and GoogLeNet,respectively,proved to be reliable and beneficial,and can effectively and conveniently assist apple growers with leaf disease identification in the field. 展开更多
关键词 Incept_EMA_DenseNet multi-scale fusion module efficient multiscale attention mechanism apple leaf disease identification
原文传递
Deep Learning-Based Algorithm for Robust Object Detection in Flooded and Rainy Environments
17
作者 Pengfei Wang Jiwu Sun +4 位作者 Lu Lu Hongchen Li Hongzhe Liu Cheng Xu Yongqiang Liu 《Computers, Materials & Continua》 2025年第8期2883-2903,共21页
Flooding and heavy rainfall under extreme weather conditions pose significant challenges to target detection algorithms.Traditional methods often struggle to address issues such as image blurring,dynamic noise interfe... Flooding and heavy rainfall under extreme weather conditions pose significant challenges to target detection algorithms.Traditional methods often struggle to address issues such as image blurring,dynamic noise interference,and variations in target scale.Conventional neural network(CNN)-based target detection approaches face notable limitations in such adverse weather scenarios,primarily due to the fixed geometric sampling structures that hinder adaptability to complex backgrounds and dynamically changing object appearances.To address these challenges,this paper proposes an optimized YOLOv9 model incorporating an improved deformable convolutional network(DCN)enhanced with a multi-scale dilated attention(MSDA)mechanism.Specifically,the DCN module enhances themodel’s adaptability to target deformation and noise interference by adaptively adjusting the sampling grid positions,while also integrating feature amplitude modulation to further improve robustness.Additionally,theMSDA module is introduced to capture contextual features acrossmultiple scales,effectively addressing issues related to target occlusion and scale variation commonly encountered in flood-affected environments.Experimental evaluations are conducted on the ISE-UFDS and UA-DETRAC datasets.The results demonstrate that the proposedmodel significantly outperforms state-of-the-art methods in key evaluation metrics,including precision,recall,F1-score,and mAP(Mean Average Precision).Notably,the model exhibits superior robustness and generalization performance under simulated severe weather conditions,offering reliable technical support for disaster emergency response systems.This study contributes to enhancing the accuracy and real-time capabilities of flood early warning systems,thereby supporting more effective disaster mitigation strategies. 展开更多
关键词 YOLO vehicle detection FLOOD deformable convolutional networks multi-scale dilated attention
暂未订购
融合高效卷积注意力的时域卷积网络短期负荷预测模型
18
作者 孙东磊 李文升 +1 位作者 梁露 张智晟 《山东科技大学学报(自然科学版)》 北大核心 2025年第5期83-90,共8页
为避免时域卷积网络中膨胀卷积结构导致的负荷信息不连续现象,进一步提升预测模型对重要负荷特征的提取能力,本研究提出一种融合高效卷积注意力模块的混合膨胀卷积改进时域卷积网络(ECBAM-HTCN)的短期负荷预测模型。该模型以具备并行计... 为避免时域卷积网络中膨胀卷积结构导致的负荷信息不连续现象,进一步提升预测模型对重要负荷特征的提取能力,本研究提出一种融合高效卷积注意力模块的混合膨胀卷积改进时域卷积网络(ECBAM-HTCN)的短期负荷预测模型。该模型以具备并行计算能力的时域卷积网络为基础学习负荷数据特征,通过构建混合膨胀卷积层改进时域卷积网络残差块,利用不同膨胀系数的卷积自适应地捕获不同距离下全部负荷数据,避免信息不连续;同时,引入能够自适应调整卷积核大小的一维卷积改进传统卷积注意力模块,高效捕获负荷数据空间和通道两个维度的重要信息。基于实际电网负荷数据仿真实验表明,在短期负荷预测任务中,所提出的ECBAM-HTCN模型具有较高的预测精度和较好的稳定性。 展开更多
关键词 短期负荷预测 时域卷积网络 混合膨胀卷积 高效卷积注意力模块
在线阅读 下载PDF
临床教学中基于空洞卷积优化注意力模块的超声影像分割算法
19
作者 王晓珍 《自动化与仪器仪表》 2025年第9期59-62,67,共5页
随着智能技术的发展,临床教学采用深度学习技术已逐渐成为普遍现象之一。为了有效地处理超声影像中存在的多尺度和长距离依赖问题,研究设计了一种新型空洞卷积模块,并在U-Net网络跳跃连接中增加注意力模块。结果表明,该方法的平均交并比... 随着智能技术的发展,临床教学采用深度学习技术已逐渐成为普遍现象之一。为了有效地处理超声影像中存在的多尺度和长距离依赖问题,研究设计了一种新型空洞卷积模块,并在U-Net网络跳跃连接中增加注意力模块。结果表明,该方法的平均交并比、F1值分别为86.3%、87.6%。对比原始网络分别提高了15.2%、19.1%。该方法的超声影像分割任务准确性有所提高,有效地增强了模型的性能。其对于复杂和模糊边界的肿瘤区域,具有较好的鲁棒性。这不仅提升了乳腺超声影像处理的效率,也为临床教学中的智能化应用提供了有力支持,有助于推动临床教学向更加智能化、精准化的方向发展。 展开更多
关键词 超声影像 乳腺 U-net 注意力模块 空洞卷积
原文传递
基于循环生成对抗网络的人脸素描合成 被引量:4
20
作者 葛延良 孙笑笑 +2 位作者 张乔 王冬梅 王肖肖 《吉林大学学报(理学版)》 CAS 北大核心 2022年第4期897-905,共9页
针对当前卷积神经网络通常以降低感受野为条件获得多尺度图像特征,以及很难捕获各特征通道之间重要关系的问题,结合循环生成对抗网络结构的特点提出一种新的多尺度自注意力机制的循环生成对抗网络.首先,在生成器中使用VGG16模块组成U-Ne... 针对当前卷积神经网络通常以降低感受野为条件获得多尺度图像特征,以及很难捕获各特征通道之间重要关系的问题,结合循环生成对抗网络结构的特点提出一种新的多尺度自注意力机制的循环生成对抗网络.首先,在生成器中使用VGG16模块组成U-Net结构网络,以增强对图像特征信息的提取,同时对网络中的下采样和上采样进行改进,以提高特征分辨率,获取更多的细节信息;其次,设计多尺度特征聚合模块,采用不同采样率的多个并行空洞卷积,整合了不同尺度上的空间信息,在保持图像较大感受野的同时,多比例地捕捉图像信息;最后,为捕获空间维度和通道维度中的特征依赖关系,设计像素自注意力模块对空间维度和通道维度上的语义依赖关系进行建模,以增强图像特征的表现能力,提高生成素描图像的质量. 展开更多
关键词 深度学习 循环生成对抗网络 空洞卷积 多尺度特征聚合模块 像素自注意力模块
在线阅读 下载PDF
上一页 1 2 3 下一页 到第
使用帮助 返回顶部