期刊文献+
共找到153篇文章
< 1 2 8 >
每页显示 20 50 100
AMSFuse:Adaptive Multi-Scale Feature Fusion Network for Diabetic Retinopathy Classification
1
作者 Chengzhang Zhu Ahmed Alasri +5 位作者 Tao Xu Yalong Xiao Abdulrahman Noman Raeed Alsabri Xuanchu Duan Monir Abdullah 《Computers, Materials & Continua》 2025年第3期5153-5167,共15页
Globally,diabetic retinopathy(DR)is the primary cause of blindness,affecting millions of people worldwide.This widespread impact underscores the critical need for reliable and precise diagnostic techniques to ensure p... Globally,diabetic retinopathy(DR)is the primary cause of blindness,affecting millions of people worldwide.This widespread impact underscores the critical need for reliable and precise diagnostic techniques to ensure prompt diagnosis and effective treatment.Deep learning-based automated diagnosis for diabetic retinopathy can facilitate early detection and treatment.However,traditional deep learning models that focus on local views often learn feature representations that are less discriminative at the semantic level.On the other hand,models that focus on global semantic-level information might overlook critical,subtle local pathological features.To address this issue,we propose an adaptive multi-scale feature fusion network called(AMSFuse),which can adaptively combine multi-scale global and local features without compromising their individual representation.Specifically,our model incorporates global features for extracting high-level contextual information from retinal images.Concurrently,local features capture fine-grained details,such as microaneurysms,hemorrhages,and exudates,which are critical for DR diagnosis.These global and local features are adaptively fused using a fusion block,followed by an Integrated Attention Mechanism(IAM)that refines the fused features by emphasizing relevant regions,thereby enhancing classification accuracy for DR classification.Our model achieves 86.3%accuracy on the APTOS dataset and 96.6%RFMiD,both of which are comparable to state-of-the-art methods. 展开更多
关键词 Diabetic retinopathy multi-scale feature fusion global features local features integrated attention mechanism retinal images
暂未订购
MSFResNet:A ResNeXt50 model based on multi-scale feature fusion for wild mushroom identification
2
作者 YANG Yang JU Tao +1 位作者 YANG Wenjie ZHAO Yuyang 《Journal of Measurement Science and Instrumentation》 2025年第1期66-74,共9页
To solve the problems of redundant feature information,the insignificant difference in feature representation,and low recognition accuracy of the fine-grained image,based on the ResNeXt50 model,an MSFResNet network mo... To solve the problems of redundant feature information,the insignificant difference in feature representation,and low recognition accuracy of the fine-grained image,based on the ResNeXt50 model,an MSFResNet network model is proposed by fusing multi-scale feature information.Firstly,a multi-scale feature extraction module is designed to obtain multi-scale information on feature images by using different scales of convolution kernels.Meanwhile,the channel attention mechanism is used to increase the global information acquisition of the network.Secondly,the feature images processed by the multi-scale feature extraction module are fused with the deep feature images through short links to guide the full learning of the network,thus reducing the loss of texture details of the deep network feature images,and improving network generalization ability and recognition accuracy.Finally,the validity of the MSFResNet model is verified using public datasets and applied to wild mushroom identification.Experimental results show that compared with ResNeXt50 network model,the accuracy of the MSFResNet model is improved by 6.01%on the FGVC-Aircraft common dataset.It achieves 99.13%classification accuracy on the wild mushroom dataset,which is 0.47%higher than ResNeXt50.Furthermore,the experimental results of the thermal map show that the MSFResNet model significantly reduces the interference of background information,making the network focus on the location of the main body of wild mushroom,which can effectively improve the accuracy of wild mushroom identification. 展开更多
关键词 multi-scale feature fusion attention mechanism ResNeXt50 wild mushroom identification deep learning
在线阅读 下载PDF
M2ANet:Multi-branch and multi-scale attention network for medical image segmentation
3
作者 Wei Xue Chuanghui Chen +3 位作者 Xuan Qi Jian Qin Zhen Tang Yongsheng He 《Chinese Physics B》 2025年第8期547-559,共13页
Convolutional neural networks(CNNs)-based medical image segmentation technologies have been widely used in medical image segmentation because of their strong representation and generalization abilities.However,due to ... Convolutional neural networks(CNNs)-based medical image segmentation technologies have been widely used in medical image segmentation because of their strong representation and generalization abilities.However,due to the inability to effectively capture global information from images,CNNs can easily lead to loss of contours and textures in segmentation results.Notice that the transformer model can effectively capture the properties of long-range dependencies in the image,and furthermore,combining the CNN and the transformer can effectively extract local details and global contextual features of the image.Motivated by this,we propose a multi-branch and multi-scale attention network(M2ANet)for medical image segmentation,whose architecture consists of three components.Specifically,in the first component,we construct an adaptive multi-branch patch module for parallel extraction of image features to reduce information loss caused by downsampling.In the second component,we apply residual block to the well-known convolutional block attention module to enhance the network’s ability to recognize important features of images and alleviate the phenomenon of gradient vanishing.In the third component,we design a multi-scale feature fusion module,in which we adopt adaptive average pooling and position encoding to enhance contextual features,and then multi-head attention is introduced to further enrich feature representation.Finally,we validate the effectiveness and feasibility of the proposed M2ANet method through comparative experiments on four benchmark medical image segmentation datasets,particularly in the context of preserving contours and textures. 展开更多
关键词 medical image segmentation convolutional neural network multi-branch attention multi-scale feature fusion
原文传递
Attention Guided Multi Scale Feature Fusion Network for Automatic Prostate Segmentation
4
作者 Yuchun Li Mengxing Huang +1 位作者 Yu Zhang Zhiming Bai 《Computers, Materials & Continua》 SCIE EI 2024年第2期1649-1668,共20页
The precise and automatic segmentation of prostate magnetic resonance imaging(MRI)images is vital for assisting doctors in diagnosing prostate diseases.In recent years,many advanced methods have been applied to prosta... The precise and automatic segmentation of prostate magnetic resonance imaging(MRI)images is vital for assisting doctors in diagnosing prostate diseases.In recent years,many advanced methods have been applied to prostate segmentation,but due to the variability caused by prostate diseases,automatic segmentation of the prostate presents significant challenges.In this paper,we propose an attention-guided multi-scale feature fusion network(AGMSF-Net)to segment prostate MRI images.We propose an attention mechanism for extracting multi-scale features,and introduce a 3D transformer module to enhance global feature representation by adding it during the transition phase from encoder to decoder.In the decoder stage,a feature fusion module is proposed to obtain global context information.We evaluate our model on MRI images of the prostate acquired from a local hospital.The relative volume difference(RVD)and dice similarity coefficient(DSC)between the results of automatic prostate segmentation and ground truth were 1.21%and 93.68%,respectively.To quantitatively evaluate prostate volume on MRI,which is of significant clinical significance,we propose a unique AGMSF-Net.The essential performance evaluation and validation experiments have demonstrated the effectiveness of our method in automatic prostate segmentation. 展开更多
关键词 Prostate segmentation multi-scale attention 3D Transformer feature fusion MRI
在线阅读 下载PDF
A Lightweight Multiscale Feature Fusion Network for Solar Cell Defect Detection
5
作者 Xiaoyun Chen Lanyao Zhang +3 位作者 Xiaoling Chen Yigang Cen Linna Zhang Fugui Zhang 《Computers, Materials & Continua》 SCIE EI 2025年第1期521-542,共22页
Solar cell defect detection is crucial for quality inspection in photovoltaic power generation modules.In the production process,defect samples occur infrequently and exhibit random shapes and sizes,which makes it cha... Solar cell defect detection is crucial for quality inspection in photovoltaic power generation modules.In the production process,defect samples occur infrequently and exhibit random shapes and sizes,which makes it challenging to collect defective samples.Additionally,the complex surface background of polysilicon cell wafers complicates the accurate identification and localization of defective regions.This paper proposes a novel Lightweight Multiscale Feature Fusion network(LMFF)to address these challenges.The network comprises a feature extraction network,a multi-scale feature fusion module(MFF),and a segmentation network.Specifically,a feature extraction network is proposed to obtain multi-scale feature outputs,and a multi-scale feature fusion module(MFF)is used to fuse multi-scale feature information effectively.In order to capture finer-grained multi-scale information from the fusion features,we propose a multi-scale attention module(MSA)in the segmentation network to enhance the network’s ability for small target detection.Moreover,depthwise separable convolutions are introduced to construct depthwise separable residual blocks(DSR)to reduce the model’s parameter number.Finally,to validate the proposed method’s defect segmentation and localization performance,we constructed three solar cell defect detection datasets:SolarCells,SolarCells-S,and PVEL-S.SolarCells and SolarCells-S are monocrystalline silicon datasets,and PVEL-S is a polycrystalline silicon dataset.Experimental results show that the IOU of our method on these three datasets can reach 68.5%,51.0%,and 92.7%,respectively,and the F1-Score can reach 81.3%,67.5%,and 96.2%,respectively,which surpasses other commonly usedmethods and verifies the effectiveness of our LMFF network. 展开更多
关键词 Defect segmentation multi-scale feature fusion multi-scale attention depthwise separable residual block
在线阅读 下载PDF
CGMISeg:Context-Guided Multi-Scale Interactive for Efficient Semantic Segmentation
6
作者 Ze Wang Jin Qin +1 位作者 Chuhua Huang Yongjun Zhang 《Computers, Materials & Continua》 2025年第9期5811-5829,共19页
Semantic segmentation has made significant breakthroughs in various application fields,but achieving both accurate and efficient segmentation with limited computational resources remains a major challenge.To this end,... Semantic segmentation has made significant breakthroughs in various application fields,but achieving both accurate and efficient segmentation with limited computational resources remains a major challenge.To this end,we propose CGMISeg,an efficient semantic segmentation architecture based on a context-guided multi-scale interaction strategy,aiming to significantly reduce computational overhead while maintaining segmentation accuracy.CGMISeg consists of three core components:context-aware attention modulation,feature reconstruction,and crossinformation fusion.Context-aware attention modulation is carefully designed to capture key contextual information through channel and spatial attention mechanisms.The feature reconstruction module reconstructs contextual information from different scales,modeling key rectangular areas by capturing critical contextual information in both horizontal and vertical directions,thereby enhancing the focus on foreground features.The cross-information fusion module aims to fuse the reconstructed high-level features with the original low-level features during upsampling,promoting multi-scale interaction and enhancing the model’s ability to handle objects at different scales.We extensively evaluated CGMISeg on ADE20K,Cityscapes,and COCO-Stuff,three widely used datasets benchmarks,and the experimental results show that CGMISeg exhibits significant advantages in segmentation performance,computational efficiency,and inference speed,clearly outperforming several mainstream methods,including SegFormer,Feedformer,and SegNext.Specifically,CGMISeg achieves 42.9%mIoU(Mean Intersection over Union)and 15.7 FPS(Frames Per Second)on the ADE20K dataset with 3.8 GFLOPs(Giga Floating-point Operations Per Second),outperforming Feedformer and SegNeXt by 3.7%and 1.8%in mIoU,respectively,while also offering reduced computational complexity and faster inference.CGMISeg strikes an excellent balance between accuracy and efficiency,significantly enhancing both computational and inference performance while maintaining high precision,showcasing exceptional practical value and strong potential for widespread applications. 展开更多
关键词 Semantic segmentation context-aware attention modulation feature reconstruction cross-information fusion
在线阅读 下载PDF
Revolutionizing anemia detection:integrative machine learning models and advanced attention mechanisms
7
作者 Muhammad Ramzan Jinfang Sheng +2 位作者 Muhammad Usman Saeed Bin Wang Faisal Z.Duraihem 《Visual Computing for Industry,Biomedicine,and Art》 2024年第1期183-195,共13页
This study addresses the critical issue of anemia detection using machine learning(ML)techniques.Although a widespread blood disorder with significant health implications,anemia often remains undetected.This necessita... This study addresses the critical issue of anemia detection using machine learning(ML)techniques.Although a widespread blood disorder with significant health implications,anemia often remains undetected.This necessitates timely and efficient diagnostic methods,as traditional approaches that rely on manual assessment are time-consuming and subjective.The present study explored the application of ML-particularly classification models,such as logistic regression,decision trees,random forest,support vector machines,Naïve Bayes,and k-nearest neighbors-in conjunction with innovative models incorporating attention modules and spatial attention to detect anemia.The proposed models demonstrated promising results,achieving high accuracy,precision,recall,and F1 scores for both textual and image datasets.In addition,an integrated approach that combines textual and image data was found to outperform the individual modalities.Specifically,the proposed AlexNet Multiple Spatial Attention model achieved an exceptional accuracy of 99.58%,emphasizing its potential to revolutionize automated anemia detection.The results of ablation studies confirm the significance of key components-including the blue-green-red,multiple,and spatial attentions-in enhancing model performance.Overall,this study presents a comprehensive and innovative framework for noninvasive anemia detection,contributing valuable insights to the field. 展开更多
关键词 ANEMIA NONINVASIVE MULTIMODAL feature fusion attention module
在线阅读 下载PDF
Grasp Detection with Hierarchical Multi-Scale Feature Fusion and Inverted Shuffle Residual
8
作者 Wenjie Geng Zhiqiang Cao +3 位作者 Peiyu Guan Fengshui Jing Min Tan Junzhi Yu 《Tsinghua Science and Technology》 SCIE EI CAS CSCD 2024年第1期244-256,共13页
Grasp detection plays a critical role for robot manipulation.Mainstream pixel-wise grasp detection networks with encoder-decoder structure receive much attention due to good accuracy and efficiency.However,they usuall... Grasp detection plays a critical role for robot manipulation.Mainstream pixel-wise grasp detection networks with encoder-decoder structure receive much attention due to good accuracy and efficiency.However,they usually transmit the high-level feature in the encoder to the decoder,and low-level features are neglected.It is noted that low-level features contain abundant detail information,and how to fully exploit low-level features remains unsolved.Meanwhile,the channel information in high-level feature is also not well mined.Inevitably,the performance of grasp detection is degraded.To solve these problems,we propose a grasp detection network with hierarchical multi-scale feature fusion and inverted shuffle residual.Both low-level and high-level features in the encoder are firstly fused by the designed skip connections with attention module,and the fused information is then propagated to corresponding layers of the decoder for in-depth feature fusion.Such a hierarchical fusion guarantees the quality of grasp prediction.Furthermore,an inverted shuffle residual module is created,where the high-level feature from encoder is split in channel and the resultant split features are processed in their respective branches.By such differentiation processing,more high-dimensional channel information is kept,which enhances the representation ability of the network.Besides,an information enhancement module is added before the encoder to reinforce input information.The proposed method attains 98.9%and 97.8%in image-wise and object-wise accuracy on the Cornell grasping dataset,respectively,and the experimental results verify the effectiveness of the method. 展开更多
关键词 grasp detection hierarchical multi-scale feature fusion skip connections with attention inverted shuffle residual
原文传递
An improved multiscale fusion dense network with efficient multiscale attention mechanism for apple leaf disease identification 被引量:1
9
作者 Dandan DAI Hui LIU 《Frontiers of Agricultural Science and Engineering》 2025年第2期173-189,共17页
With the development of smart agriculture,accurately identifying crop diseases through visual recognition techniques instead of by eye has been a significant challenge.This study focused on apple leaf disease,which is... With the development of smart agriculture,accurately identifying crop diseases through visual recognition techniques instead of by eye has been a significant challenge.This study focused on apple leaf disease,which is closely related to the final yield of apples.A multiscale fusion dense network combined with an efficient multiscale attention(EMA)mechanism called Incept_EMA_DenseNet was developed to better identify eight complex apple leaf disease images.Incept_EMA_DenseNet consists of three crucial parts:the inception module,which substituted the convolution layer with multiscale fusion methods in the shallow feature extraction layer;the EMA mechanism,which is used for obtaining appropriate weights of different dense blocks;and the improved DenseNet based on DenseNet_121.Specifically,to find appropriate multiscale fusion methods,the residual module and inception module were compared to determine the performance of each technique,and Incept_EMA_DenseNet achieved an accuracy of 95.38%.Second,this work used three attention mechanisms,and the efficient multiscale attention mechanism obtained the best performance.Third,the convolution layers and bottlenecks were modified without performance degradation,reducing half of the computational load compared with the original models.Incept_EMA_DenseNet,as proposed in this paper,has an accuracy of 96.76%,being 2.93%,3.44%,and 4.16%better than Resnet50,DenseNet_121 and GoogLeNet,respectively,proved to be reliable and beneficial,and can effectively and conveniently assist apple growers with leaf disease identification in the field. 展开更多
关键词 Incept_EMA_DenseNet multi-scale fusion module efficient multiscale attention mechanism apple leaf disease identification
原文传递
Disease Recognition of Apple Leaf Using Lightweight Multi-Scale Network with ECANet 被引量:4
10
作者 Helong Yu Xianhe Cheng +2 位作者 Ziqing Li Qi Cai Chunguang Bi 《Computer Modeling in Engineering & Sciences》 SCIE EI 2022年第9期711-738,共28页
To solve the problem of difficulty in identifying apple diseases in the natural environment and the low application rate of deep learning recognition networks,a lightweight ResNet(LW-ResNet)model for apple disease rec... To solve the problem of difficulty in identifying apple diseases in the natural environment and the low application rate of deep learning recognition networks,a lightweight ResNet(LW-ResNet)model for apple disease recognition is proposed.Based on the deep residual network(ResNet18),the multi-scale feature extraction layer is constructed by group convolution to realize the compression model and improve the extraction ability of different sizes of lesion features.By improving the identity mapping structure to reduce information loss.By introducing the efficient channel attention module(ECANet)to suppress noise from a complex background.The experimental results show that the average precision,recall and F1-score of the LW-ResNet on the test set are 97.80%,97.92%and 97.85%,respectively.The parameter memory is 2.32 MB,which is 94%less than that of ResNet18.Compared with the classic lightweight networks SqueezeNet and MobileNetV2,LW-ResNet has obvious advantages in recognition performance,speed,parameter memory requirement and time complexity.The proposed model has the advantages of low computational cost,low storage cost,strong real-time performance,high identification accuracy,and strong practicability,which can meet the needs of real-time identification task of apple leaf disease on resource-constrained devices. 展开更多
关键词 Apple disease recognition deep residual network multi-scale feature efficient channel attention module lightweight network
在线阅读 下载PDF
基于改进Swin Transformer的人脸活体检测 被引量:2
11
作者 王旭光 卜辰宇 时泽宇 《中国测试》 北大核心 2025年第6期31-39,共9页
随着人脸识别技术的发展,人脸活体检测作为人脸识别系统的安全保障变得更加重要。但当前主流的人脸活体检测模型仅针对特定的检测场景及欺诈攻击方式,面对未知攻击的鲁棒性和泛化能力较差。为此,该文提出一种改进的Swin Transformer模型... 随着人脸识别技术的发展,人脸活体检测作为人脸识别系统的安全保障变得更加重要。但当前主流的人脸活体检测模型仅针对特定的检测场景及欺诈攻击方式,面对未知攻击的鲁棒性和泛化能力较差。为此,该文提出一种改进的Swin Transformer模型,即CDCSwin-T(central difference convolution Swin Transformer)模型。该模型以Swin Transformer为主干,利用其滑动窗口注意力机制提取人脸全局信息,同时引入中心差分卷积(central difference convolution,CDC)模块提取人脸局部信息,加强主干模型捕获真假人脸差异的能力,从而增强其面对未知攻击的鲁棒性;另外在主干模型中引入瓶颈注意力模块,引导模型关注人脸关键信息,加速模型训练;最终将主干模型不同阶段的多尺度信息进行自适应融合,进一步提升该文模型的泛化能力。CDCSwin-T模型在OULU-NPU数据集4个协议上的平均分类错误率(ACER)分别为0.2%,1.1%,(1.1±0.6)%,(2.8±1.4)%,在CASIA-MFSD和REPLAYATTACK数据集跨库测试上的半错误率(HTER)分别为14.1%,22.9%,均优于当前的主流模型,表明其面对未知攻击的鲁棒性和泛化能力均有所提升。 展开更多
关键词 人脸活体检测 Swin Transformer 瓶颈注意力模块 特征融合
在线阅读 下载PDF
融合注意力机制和快速网络的口罩人脸检测算法 被引量:1
12
作者 兰红 王恪 陈子怡 《微电子学与计算机》 2025年第5期81-92,共12页
口罩人脸检测是智能监控系统中的关键部分,在城市管理和公共卫生安全等方面有着重要意义。针对口罩人脸检测在处理光照条件、遮挡等问题的图像时出现的漏检和检测不准确问题,提出了FSR-YOLOv8口罩人脸检测算法。该算法通过在骨干网络中... 口罩人脸检测是智能监控系统中的关键部分,在城市管理和公共卫生安全等方面有着重要意义。针对口罩人脸检测在处理光照条件、遮挡等问题的图像时出现的漏检和检测不准确问题,提出了FSR-YOLOv8口罩人脸检测算法。该算法通过在骨干网络中融合了优化的FasterNeXt,从而提高了检测速度和减少了参数量。同时,通过在特征融合点间加入改进的SPCBAM注意力机制,模型能更有效地提取关键特征,减少背景干扰。此外,构建了RFB_HDC_BD模块显著提高了特征语义信息的利用率。在同等情况下,FSR-YOLOv8优于YOLOv8以及其他主流算法。该模型在公开的AIZOO数据集上的m_(AP)值达到了96.2%,在自制数据集上的m_(AP)值达到了89.2%,且模型参数量相比于YOLOv8模型降低了16%,参数量减低的同时具有较高的精确度。 展开更多
关键词 口罩人脸检测 注意力机制 快速网络 特征融合模块 YOLOv8n
在线阅读 下载PDF
CSD-YOLOv8的输电线路故障目标检测
13
作者 马旭 王锐 +6 位作者 邓军 常驰 郝帅 李添麒 刘峥岐 李国亮 赵晴 《西安科技大学学报》 北大核心 2025年第2期383-392,共10页
针对无人机巡检输电线路过程中待检测目标受复杂背景干扰、故障目标部分遮挡以及目标多尺度造成传统算法难以准确检测的问题,提出一种基于CSD-YOLOv8的输电线路故障目标检测方法。首先,以YOLOv8网络作为基础框架,并在其主干网络中引入... 针对无人机巡检输电线路过程中待检测目标受复杂背景干扰、故障目标部分遮挡以及目标多尺度造成传统算法难以准确检测的问题,提出一种基于CSD-YOLOv8的输电线路故障目标检测方法。首先,以YOLOv8网络作为基础框架,并在其主干网络中引入空间金字塔池化将不同尺度特征进行融合;然后,在检测网络头部中引入深度可分离卷积,并将其与交叉卷积连接模块结合,实现对部分遮挡目标的准确检测;此外,设计基于通道注意力机制的特征融合模块对不同层级特征进行加权融合,提高复杂背景下故障目标特征信息提取能力;最后,利用某电力巡检部门近5年的巡检数据对所提出算法进行验证。结果表明:相比于4种经典对比算法,所提方法在对12种故障类型检测效果的综合指标最好,平均检测精度为94.7%,召回率为93.0%。与此同时,所提算法具有较好的实时性,对于分辨率为1280×720的图像检测速度为45帧/s,为输电线路的智能巡检奠定了坚实的理论基础。 展开更多
关键词 YOLOv8 多尺度检测 通道注意力机制 特征融合 深度可分离模块
在线阅读 下载PDF
噪声背景下梅尔频率倒谱系数与多注意力网络在电机故障诊断中的应用
14
作者 宋恩哲 朱仁杰 +2 位作者 靖海国 姚崇 柯赟 《哈尔滨工程大学学报》 北大核心 2025年第3期475-485,共11页
针对电机实际工作过程中存在噪声干扰导致故障诊断精度下降的问题,本文提出了一种基于梅尔频率倒谱系数动态特征与多注意力融合卷积神经网络的故障诊断方法。通过梅尔频率倒谱系数动态特征提取噪声信号中的低频信息,并结合卷积注意力模... 针对电机实际工作过程中存在噪声干扰导致故障诊断精度下降的问题,本文提出了一种基于梅尔频率倒谱系数动态特征与多注意力融合卷积神经网络的故障诊断方法。通过梅尔频率倒谱系数动态特征提取噪声信号中的低频信息,并结合卷积注意力模块的自适应调节能力及多特征融合策略进一步减少噪声对故障诊断的干扰。通过电机台架数据验证了该方法在噪声条件下诊断的可行性,然而该方法受梅尔频率倒谱系数参数与网络结构的直接影响,因此具体分析了不同参数条件对抗噪性能的影响。实验结果表明:在信噪比-10 dB噪声背景下,梅尔频率倒谱系数动态特征与多注意力融合卷积神经网络相结合的故障诊断方法仍保持90%以上的诊断精度。 展开更多
关键词 电机 故障诊断 噪声环境 梅尔频率倒谱系数 卷积神经网络 多尺度 卷积注意力模块 特征融合
在线阅读 下载PDF
多尺度融合增强与注意力机制结合的图像语义分割
15
作者 刘书刚 杜昊东 王洪涛 《计算机应用与软件》 北大核心 2025年第6期225-233,278,共10页
针对当前图像语义分割中分割效率不高与分割边界不连续问题,提出一种多尺度融合增强与注意力机制结合的语义分割算法。该算法对原有DeepLabv3+网络结构进行改进,在编码器部分提出一种特征提取增强网络结构,充分利用相邻层各个尺度的特... 针对当前图像语义分割中分割效率不高与分割边界不连续问题,提出一种多尺度融合增强与注意力机制结合的语义分割算法。该算法对原有DeepLabv3+网络结构进行改进,在编码器部分提出一种特征提取增强网络结构,充分利用相邻层各个尺度的特征信息进行融合,在解码器末端使用改进的轻量化卷积注意力模块,使得对于物体边界分割更加充分。通过在Pascal VOC2007和Cityscapes数据集上进行实验验证,结果表明该方法较原有网络的精确度有显著的提高。 展开更多
关键词 语义分割 特征融合增强 注意力模块 编码器 上采样
在线阅读 下载PDF
基于改进TransUNet的肺部图像分割
16
作者 石勇涛 邱康齐 +1 位作者 柳迪 杜威 《现代电子技术》 北大核心 2025年第15期27-36,共10页
语义分割作为肺部影像分析的关键步骤,其准确率直接关系进一步的图像分析和治疗决策。面对肺部器官不规则外形、模糊边界以及噪声等问题,传统分割方法存在边界分割精确度不高、易出现误差等问题。针对这些挑战,文中提出一种基于多尺度... 语义分割作为肺部影像分析的关键步骤,其准确率直接关系进一步的图像分析和治疗决策。面对肺部器官不规则外形、模糊边界以及噪声等问题,传统分割方法存在边界分割精确度不高、易出现误差等问题。针对这些挑战,文中提出一种基于多尺度边缘特征融合的神经网络(MSB-AffTransU2Net)用于肺部图像的分割。首先,替换了TransUNet中的编解码器,采用U2-Net的RSU模块来增强特征提取的性能;然后,使用注意力特征融合机制替换原本的Concat方法,以减少模型参数并且提升特征的融合效果;接着,加入了多尺度特征提取器以及边界引导的上下文聚合模块,以融合提取更加精确的肺部边缘特征;最后,为优化模型损失函数,采纳了Dice损失与交叉熵损失,创建了一个新颖的损失函数。在COVID-19 Radiography Database的COVID类数据集上验证了所提算法的有效性。实验结果证明,MSB-AffTransU2Net在COVID数据集上的前景交并比(pIoU)和平均准确率(mAcc)与TransUNet算法相比,分别提高了3.03%和0.72%,证明了所提算法的有效性。 展开更多
关键词 COVID-19 肺部图像分割 TransUNet 边缘特征 边界引导的上下文聚合模块 注意力特征融合
在线阅读 下载PDF
基于深度学习的车道线检测算法
17
作者 岳永恒 赵志浩 《华南理工大学学报(自然科学版)》 北大核心 2025年第9期22-30,共9页
针对智能车辆在复杂场景下的车道线检测准确性问题,该文提出了一种融合多尺度空间注意力机制和路径聚合网络(PANet)的车道线检测算法。该算法首先引入行锚框UFLD车道线检测模型,并结合深度可分离卷积的特征金字塔增强模块PANet,以实现... 针对智能车辆在复杂场景下的车道线检测准确性问题,该文提出了一种融合多尺度空间注意力机制和路径聚合网络(PANet)的车道线检测算法。该算法首先引入行锚框UFLD车道线检测模型,并结合深度可分离卷积的特征金字塔增强模块PANet,以实现图像的多尺度特征提取;接着,网络框架中设计多尺度空间注意力模块,且引入SimAM轻量级注意力机制,以增强对目标特征的聚焦能力;然后,设计自适应特征融合模块,通过智能调整不同尺度特征图的融合权重,对PANet输出的特征图进行跨尺度融合,以提升网络对复杂特征的提取能力。在TuSimple数据集上的实验结果表明,所提算法的检测精度为96.84%,较原算法提升了1.02个百分点,优于传统的主流算法;在CULane数据集上的实验结果表明,所提算法的F_(1)值为72.74%,优于传统的主流算法,较原算法提升了4.34个百分点,尤其在强光和阴影等极端场景下的检测性能提升显著,说明所提算法在复杂场景下具有优异的检测能力;实时性测试结果显示,所提算法的推理速度达118.0 f/s,满足智能车辆的实时性需求。 展开更多
关键词 车道线检测 深度学习 多尺度空间注意力机制 自适应特征融合
在线阅读 下载PDF
基于特征分治与融合的铁路扣件轻量化实时检测模型
18
作者 鄢化彪 林初欣 +3 位作者 黄绿娥 李东丽 刘词波 徐方奇 《北京交通大学学报》 北大核心 2025年第3期56-67,共12页
为解决嵌入式设备实时处理海量铁路扣件视觉图像数据时无法兼顾精确度与检测速度的问题,提出一种基于特征分治与融合的轻量化实时检测模型.首先,利用基于空间与通道特征的分治混合注意力模块强化模型的特征提取能力,降低图像中复杂背景... 为解决嵌入式设备实时处理海量铁路扣件视觉图像数据时无法兼顾精确度与检测速度的问题,提出一种基于特征分治与融合的轻量化实时检测模型.首先,利用基于空间与通道特征的分治混合注意力模块强化模型的特征提取能力,降低图像中复杂背景对目标的干扰;其次,提出一种二重分治特征融合方法,提升对不同大小目标的检测能力,同时在检测头(YOLO Head)的代价体构建方面,引入可变焦距损失函数(Varifocal Loss,VFL)代替YOLOX-Nano检测头的二值交叉熵损失函数,提高轻量化实时检测的精度;再次,使用随机Alpha-IoU(RAL)损失函数动态调整参数,延缓算法的收敛速度从而优化模型的训练曲线,避免模型训练过程陷入局部最优解;最后,采集10233个检测目标并划分为6种类型,选择YOLOX-Nano、Faster R-CNN及YOLOv8n等主流目标检测模型作为对比进行实验.实验结果表明:所提模型的每秒帧数(Frames Per Second,FPS)为60.24,平均精度(Average Precision,AP)为83.40%,较基线模型提高了3.24%;参数量为2.31 M,较YOLOX-Tiny减少54.08%,浮点数计算量为1.99 G,较YOLOX-Tiny减少69.15%.研究成果可为轻量级实时检测模型与计算系统提供参考. 展开更多
关键词 轻量级嵌入式系统 分治混合注意力模块 分治特征融合 代价体构建
在线阅读 下载PDF
基于频率与注意力机制的图像去雾算法
19
作者 王军 孟儒君 程勇 《计算机系统应用》 2025年第1期161-170,共10页
由于大气雾和气溶胶的存在,图像能见度显著下降且色彩失真,给高级图像识别带来极大困难.现有的图像去雾算法常存在过度增强、细节丢失和去雾不充分等问题.针对过度增强和去雾不充分的问题,本文提出了一种基于频率和注意力机制的图像去... 由于大气雾和气溶胶的存在,图像能见度显著下降且色彩失真,给高级图像识别带来极大困难.现有的图像去雾算法常存在过度增强、细节丢失和去雾不充分等问题.针对过度增强和去雾不充分的问题,本文提出了一种基于频率和注意力机制的图像去雾算法(frequency and attention mechanism of the image dehazing network,FANet).该算法采用编码器-解码器结构,通过构建双分支频率提取模块获取全局和局部的高低频信息.构建频率融合模块调整高低频信息的权重占比,并在下采样过程中引入附加通道-像素模块和通道-像素注意力模块,以优化去雾效果.实验结果显示,FANet在SOTS-indoor数据集上的PSNR和SSIM分别为40.07 dB和0.9958,在SOTS-outdoor数据集上分别为39.77 dB和0.9958.同时,该算法也在HSTS和Haze4k测试集上取得了不错的结果,与其他去雾算法相比有效缓解了颜色失真和去雾不彻底等问题. 展开更多
关键词 图像去雾 双分支频率提取模块 注意力机制 特征融合 编码器-解码器结构
在线阅读 下载PDF
改进YOLOv7-Tiny的道路裂缝检测算法 被引量:3
20
作者 王启涵 刘超 《计算机工程与应用》 北大核心 2025年第10期372-380,共9页
道路裂缝检测是道路工程中的重要环节。针对现阶段道路裂缝检测算法中准确度低、效率低的问题,提出了一种基于YOLOv7-Tiny的轻量型道路裂缝检测算法YOLOv7-TPSF。引入部分卷积PConv,对原网络中耗参量较多的3×3卷积层进行部分替换,... 道路裂缝检测是道路工程中的重要环节。针对现阶段道路裂缝检测算法中准确度低、效率低的问题,提出了一种基于YOLOv7-Tiny的轻量型道路裂缝检测算法YOLOv7-TPSF。引入部分卷积PConv,对原网络中耗参量较多的3×3卷积层进行部分替换,降低模型的参数量,提升模型的训练速度;结合特征融合网络BiFusion Neck与加权特征金字塔BiFPN的优点,提出了新的特征融合模块Bi-FusFPN,减少网络计算量,强化多尺度特征的融合能力;在输出端添加无参注意力机制SimAM,进一步提高大、中、小三类目标的检测能力。实验结果表明,YOLOv7-TPSF算法相较于YOLOv7-Tiny算法,网络参数量与计算量分别减少了31.7%、34.6%,准确度与检测速度分别提高了3.7%、9.7%,一定程度上满足了道路裂缝检测准确性与实时性的需求。 展开更多
关键词 道路裂缝检测 YOLOv7-Tiny 轻量型 注意力机制 特征融合模块Bi-FusFPN
在线阅读 下载PDF
上一页 1 2 8 下一页 到第
使用帮助 返回顶部