期刊文献+
共找到169篇文章
< 1 2 9 >
每页显示 20 50 100
YOLO-SPDNet:Multi-Scale Sequence and Attention-Based Tomato Leaf Disease Detection Model
1
作者 Meng Wang Jinghan Cai +6 位作者 Wenzheng Liu Xue Yang Jingjing Zhang Qiangmin Zhou Fanzhen Wang Hang Zhang Tonghai Liu 《Phyton-International Journal of Experimental Botany》 2026年第1期290-308,共19页
Tomato is a major economic crop worldwide,and diseases on tomato leaves can significantly reduce both yield and quality.Traditional manual inspection is inefficient and highly subjective,making it difficult to meet th... Tomato is a major economic crop worldwide,and diseases on tomato leaves can significantly reduce both yield and quality.Traditional manual inspection is inefficient and highly subjective,making it difficult to meet the requirements of early disease identification in complex natural environments.To address this issue,this study proposes an improved YOLO11-based model,YOLO-SPDNet(Scale Sequence Fusion,Position-Channel Attention,and Dual Enhancement Network).The model integrates the SEAM(Self-Ensembling Attention Mechanism)semantic enhancement module,the MLCA(Mixed Local Channel Attention)lightweight attention mechanism,and the SPA(Scale-Position-Detail Awareness)module composed of SSFF(Scale Sequence Feature Fusion),TFE(Triple Feature Encoding),and CPAM(Channel and Position Attention Mechanism).These enhancements strengthen fine-grained lesion detection while maintaining model lightweightness.Experimental results show that YOLO-SPDNet achieves an accuracy of 91.8%,a recall of 86.5%,and an mAP@0.5 of 90.6%on the test set,with a computational complexity of 12.5 GFLOPs.Furthermore,the model reaches a real-time inference speed of 987 FPS,making it suitable for deployment on mobile agricultural terminals and online monitoring systems.Comparative analysis and ablation studies further validate the reliability and practical applicability of the proposed model in complex natural scenes. 展开更多
关键词 Tomato disease detection YOLO multi-scale feature fusion attention mechanism lightweight model
在线阅读 下载PDF
AMSFuse:Adaptive Multi-Scale Feature Fusion Network for Diabetic Retinopathy Classification
2
作者 Chengzhang Zhu Ahmed Alasri +5 位作者 Tao Xu Yalong Xiao Abdulrahman Noman Raeed Alsabri Xuanchu Duan Monir Abdullah 《Computers, Materials & Continua》 2025年第3期5153-5167,共15页
Globally,diabetic retinopathy(DR)is the primary cause of blindness,affecting millions of people worldwide.This widespread impact underscores the critical need for reliable and precise diagnostic techniques to ensure p... Globally,diabetic retinopathy(DR)is the primary cause of blindness,affecting millions of people worldwide.This widespread impact underscores the critical need for reliable and precise diagnostic techniques to ensure prompt diagnosis and effective treatment.Deep learning-based automated diagnosis for diabetic retinopathy can facilitate early detection and treatment.However,traditional deep learning models that focus on local views often learn feature representations that are less discriminative at the semantic level.On the other hand,models that focus on global semantic-level information might overlook critical,subtle local pathological features.To address this issue,we propose an adaptive multi-scale feature fusion network called(AMSFuse),which can adaptively combine multi-scale global and local features without compromising their individual representation.Specifically,our model incorporates global features for extracting high-level contextual information from retinal images.Concurrently,local features capture fine-grained details,such as microaneurysms,hemorrhages,and exudates,which are critical for DR diagnosis.These global and local features are adaptively fused using a fusion block,followed by an Integrated Attention Mechanism(IAM)that refines the fused features by emphasizing relevant regions,thereby enhancing classification accuracy for DR classification.Our model achieves 86.3%accuracy on the APTOS dataset and 96.6%RFMiD,both of which are comparable to state-of-the-art methods. 展开更多
关键词 Diabetic retinopathy multi-scale feature fusion global features local features integrated attention mechanism retinal images
暂未订购
MSFResNet:A ResNeXt50 model based on multi-scale feature fusion for wild mushroom identification
3
作者 YANG Yang JU Tao +1 位作者 YANG Wenjie ZHAO Yuyang 《Journal of Measurement Science and Instrumentation》 2025年第1期66-74,共9页
To solve the problems of redundant feature information,the insignificant difference in feature representation,and low recognition accuracy of the fine-grained image,based on the ResNeXt50 model,an MSFResNet network mo... To solve the problems of redundant feature information,the insignificant difference in feature representation,and low recognition accuracy of the fine-grained image,based on the ResNeXt50 model,an MSFResNet network model is proposed by fusing multi-scale feature information.Firstly,a multi-scale feature extraction module is designed to obtain multi-scale information on feature images by using different scales of convolution kernels.Meanwhile,the channel attention mechanism is used to increase the global information acquisition of the network.Secondly,the feature images processed by the multi-scale feature extraction module are fused with the deep feature images through short links to guide the full learning of the network,thus reducing the loss of texture details of the deep network feature images,and improving network generalization ability and recognition accuracy.Finally,the validity of the MSFResNet model is verified using public datasets and applied to wild mushroom identification.Experimental results show that compared with ResNeXt50 network model,the accuracy of the MSFResNet model is improved by 6.01%on the FGVC-Aircraft common dataset.It achieves 99.13%classification accuracy on the wild mushroom dataset,which is 0.47%higher than ResNeXt50.Furthermore,the experimental results of the thermal map show that the MSFResNet model significantly reduces the interference of background information,making the network focus on the location of the main body of wild mushroom,which can effectively improve the accuracy of wild mushroom identification. 展开更多
关键词 multi-scale feature fusion attention mechanism ResNeXt50 wild mushroom identification deep learning
在线阅读 下载PDF
M2ANet:Multi-branch and multi-scale attention network for medical image segmentation 被引量:1
4
作者 Wei Xue Chuanghui Chen +3 位作者 Xuan Qi Jian Qin Zhen Tang Yongsheng He 《Chinese Physics B》 2025年第8期547-559,共13页
Convolutional neural networks(CNNs)-based medical image segmentation technologies have been widely used in medical image segmentation because of their strong representation and generalization abilities.However,due to ... Convolutional neural networks(CNNs)-based medical image segmentation technologies have been widely used in medical image segmentation because of their strong representation and generalization abilities.However,due to the inability to effectively capture global information from images,CNNs can easily lead to loss of contours and textures in segmentation results.Notice that the transformer model can effectively capture the properties of long-range dependencies in the image,and furthermore,combining the CNN and the transformer can effectively extract local details and global contextual features of the image.Motivated by this,we propose a multi-branch and multi-scale attention network(M2ANet)for medical image segmentation,whose architecture consists of three components.Specifically,in the first component,we construct an adaptive multi-branch patch module for parallel extraction of image features to reduce information loss caused by downsampling.In the second component,we apply residual block to the well-known convolutional block attention module to enhance the network’s ability to recognize important features of images and alleviate the phenomenon of gradient vanishing.In the third component,we design a multi-scale feature fusion module,in which we adopt adaptive average pooling and position encoding to enhance contextual features,and then multi-head attention is introduced to further enrich feature representation.Finally,we validate the effectiveness and feasibility of the proposed M2ANet method through comparative experiments on four benchmark medical image segmentation datasets,particularly in the context of preserving contours and textures. 展开更多
关键词 medical image segmentation convolutional neural network multi-branch attention multi-scale feature fusion
原文传递
Attention Guided Multi Scale Feature Fusion Network for Automatic Prostate Segmentation
5
作者 Yuchun Li Mengxing Huang +1 位作者 Yu Zhang Zhiming Bai 《Computers, Materials & Continua》 SCIE EI 2024年第2期1649-1668,共20页
The precise and automatic segmentation of prostate magnetic resonance imaging(MRI)images is vital for assisting doctors in diagnosing prostate diseases.In recent years,many advanced methods have been applied to prosta... The precise and automatic segmentation of prostate magnetic resonance imaging(MRI)images is vital for assisting doctors in diagnosing prostate diseases.In recent years,many advanced methods have been applied to prostate segmentation,but due to the variability caused by prostate diseases,automatic segmentation of the prostate presents significant challenges.In this paper,we propose an attention-guided multi-scale feature fusion network(AGMSF-Net)to segment prostate MRI images.We propose an attention mechanism for extracting multi-scale features,and introduce a 3D transformer module to enhance global feature representation by adding it during the transition phase from encoder to decoder.In the decoder stage,a feature fusion module is proposed to obtain global context information.We evaluate our model on MRI images of the prostate acquired from a local hospital.The relative volume difference(RVD)and dice similarity coefficient(DSC)between the results of automatic prostate segmentation and ground truth were 1.21%and 93.68%,respectively.To quantitatively evaluate prostate volume on MRI,which is of significant clinical significance,we propose a unique AGMSF-Net.The essential performance evaluation and validation experiments have demonstrated the effectiveness of our method in automatic prostate segmentation. 展开更多
关键词 Prostate segmentation multi-scale attention 3D Transformer feature fusion MRI
在线阅读 下载PDF
EHDC-YOLO: Enhancing Object Detection for UAV Imagery via Multi-Scale Edge and Detail Capture
6
作者 Zhiyong Deng Yanchen Ye Jiangling Guo 《Computers, Materials & Continua》 2026年第1期1665-1682,共18页
With the rapid expansion of drone applications,accurate detection of objects in aerial imagery has become crucial for intelligent transportation,urban management,and emergency rescue missions.However,existing methods ... With the rapid expansion of drone applications,accurate detection of objects in aerial imagery has become crucial for intelligent transportation,urban management,and emergency rescue missions.However,existing methods face numerous challenges in practical deployment,including scale variation handling,feature degradation,and complex backgrounds.To address these issues,we propose Edge-enhanced and Detail-Capturing You Only Look Once(EHDC-YOLO),a novel framework for object detection in Unmanned Aerial Vehicle(UAV)imagery.Based on the You Only Look Once version 11 nano(YOLOv11n)baseline,EHDC-YOLO systematically introduces several architectural enhancements:(1)a Multi-Scale Edge Enhancement(MSEE)module that leverages multi-scale pooling and edge information to enhance boundary feature extraction;(2)an Enhanced Feature Pyramid Network(EFPN)that integrates P2-level features with Cross Stage Partial(CSP)structures and OmniKernel convolutions for better fine-grained representation;and(3)Dynamic Head(DyHead)with multi-dimensional attention mechanisms for enhanced cross-scale modeling and perspective adaptability.Comprehensive experiments on the Vision meets Drones for Detection(VisDrone-DET)2019 dataset demonstrate that EHDC-YOLO achieves significant improvements,increasing mean Average Precision(mAP)@0.5 from 33.2%to 46.1%(an absolute improvement of 12.9 percentage points)and mAP@0.5:0.95 from 19.5%to 28.0%(an absolute improvement of 8.5 percentage points)compared with the YOLOv11n baseline,while maintaining a reasonable parameter count(2.81 M vs the baseline’s 2.58 M).Further ablation studies confirm the effectiveness of each proposed component,while visualization results highlight EHDC-YOLO’s superior performance in detecting objects and handling occlusions in complex drone scenarios. 展开更多
关键词 UAV imagery object detection multi-scale feature fusion edge enhancement detail preservation YOLO feature pyramid network attention mechanism
在线阅读 下载PDF
A Lightweight Multiscale Feature Fusion Network for Solar Cell Defect Detection
7
作者 Xiaoyun Chen Lanyao Zhang +3 位作者 Xiaoling Chen Yigang Cen Linna Zhang Fugui Zhang 《Computers, Materials & Continua》 SCIE EI 2025年第1期521-542,共22页
Solar cell defect detection is crucial for quality inspection in photovoltaic power generation modules.In the production process,defect samples occur infrequently and exhibit random shapes and sizes,which makes it cha... Solar cell defect detection is crucial for quality inspection in photovoltaic power generation modules.In the production process,defect samples occur infrequently and exhibit random shapes and sizes,which makes it challenging to collect defective samples.Additionally,the complex surface background of polysilicon cell wafers complicates the accurate identification and localization of defective regions.This paper proposes a novel Lightweight Multiscale Feature Fusion network(LMFF)to address these challenges.The network comprises a feature extraction network,a multi-scale feature fusion module(MFF),and a segmentation network.Specifically,a feature extraction network is proposed to obtain multi-scale feature outputs,and a multi-scale feature fusion module(MFF)is used to fuse multi-scale feature information effectively.In order to capture finer-grained multi-scale information from the fusion features,we propose a multi-scale attention module(MSA)in the segmentation network to enhance the network’s ability for small target detection.Moreover,depthwise separable convolutions are introduced to construct depthwise separable residual blocks(DSR)to reduce the model’s parameter number.Finally,to validate the proposed method’s defect segmentation and localization performance,we constructed three solar cell defect detection datasets:SolarCells,SolarCells-S,and PVEL-S.SolarCells and SolarCells-S are monocrystalline silicon datasets,and PVEL-S is a polycrystalline silicon dataset.Experimental results show that the IOU of our method on these three datasets can reach 68.5%,51.0%,and 92.7%,respectively,and the F1-Score can reach 81.3%,67.5%,and 96.2%,respectively,which surpasses other commonly usedmethods and verifies the effectiveness of our LMFF network. 展开更多
关键词 Defect segmentation multi-scale feature fusion multi-scale attention depthwise separable residual block
在线阅读 下载PDF
Lightweight Human Pose Estimation Based on Multi-Attention Mechanism
8
作者 LIN Xiao LU Meichen +1 位作者 GAO Mufeng LI Yan 《Journal of Shanghai Jiaotong university(Science)》 2025年第5期899-910,共12页
Human pose estimation has received much attention from the research community because of its wide range of applications.However,current research for pose estimation is usually complex and computationally intensive,esp... Human pose estimation has received much attention from the research community because of its wide range of applications.However,current research for pose estimation is usually complex and computationally intensive,especially the feature loss problems in the feature fusion process.To address the above problems,we propose a lightweight human pose estimation network based on multi-attention mechanism(LMANet).In our method,network parameters can be significantly reduced by lightweighting the bottleneck blocks with depth-wise separable convolution on the high-resolution networks.After that,we also introduce a multi-attention mechanism to improve the model prediction accuracy,and the channel attention module is added in the initial stage of the network to enhance the local cross-channel information interaction.More importantly,we inject spatial crossawareness module in the multi-scale feature fusion stage to reduce the spatial information loss during feature extraction.Extensive experiments on COCO2017 dataset and MPII dataset show that LMANet can guarantee a higher prediction accuracy with fewer network parameters and computational effort.Compared with the highresolution network HRNet,the number of parameters and the computational complexity of the network are reduced by 67%and 73%,respectively. 展开更多
关键词 human pose estimation attention mechanisms multi-scale feature fusion high-resolution networks
原文传递
CGMISeg:Context-Guided Multi-Scale Interactive for Efficient Semantic Segmentation
9
作者 Ze Wang Jin Qin +1 位作者 Chuhua Huang Yongjun Zhang 《Computers, Materials & Continua》 2025年第9期5811-5829,共19页
Semantic segmentation has made significant breakthroughs in various application fields,but achieving both accurate and efficient segmentation with limited computational resources remains a major challenge.To this end,... Semantic segmentation has made significant breakthroughs in various application fields,but achieving both accurate and efficient segmentation with limited computational resources remains a major challenge.To this end,we propose CGMISeg,an efficient semantic segmentation architecture based on a context-guided multi-scale interaction strategy,aiming to significantly reduce computational overhead while maintaining segmentation accuracy.CGMISeg consists of three core components:context-aware attention modulation,feature reconstruction,and crossinformation fusion.Context-aware attention modulation is carefully designed to capture key contextual information through channel and spatial attention mechanisms.The feature reconstruction module reconstructs contextual information from different scales,modeling key rectangular areas by capturing critical contextual information in both horizontal and vertical directions,thereby enhancing the focus on foreground features.The cross-information fusion module aims to fuse the reconstructed high-level features with the original low-level features during upsampling,promoting multi-scale interaction and enhancing the model’s ability to handle objects at different scales.We extensively evaluated CGMISeg on ADE20K,Cityscapes,and COCO-Stuff,three widely used datasets benchmarks,and the experimental results show that CGMISeg exhibits significant advantages in segmentation performance,computational efficiency,and inference speed,clearly outperforming several mainstream methods,including SegFormer,Feedformer,and SegNext.Specifically,CGMISeg achieves 42.9%mIoU(Mean Intersection over Union)and 15.7 FPS(Frames Per Second)on the ADE20K dataset with 3.8 GFLOPs(Giga Floating-point Operations Per Second),outperforming Feedformer and SegNeXt by 3.7%and 1.8%in mIoU,respectively,while also offering reduced computational complexity and faster inference.CGMISeg strikes an excellent balance between accuracy and efficiency,significantly enhancing both computational and inference performance while maintaining high precision,showcasing exceptional practical value and strong potential for widespread applications. 展开更多
关键词 Semantic segmentation context-aware attention modulation feature reconstruction cross-information fusion
在线阅读 下载PDF
Disease Recognition of Apple Leaf Using Lightweight Multi-Scale Network with ECANet 被引量:4
10
作者 Helong Yu Xianhe Cheng +2 位作者 Ziqing Li Qi Cai Chunguang Bi 《Computer Modeling in Engineering & Sciences》 SCIE EI 2022年第9期711-738,共28页
To solve the problem of difficulty in identifying apple diseases in the natural environment and the low application rate of deep learning recognition networks,a lightweight ResNet(LW-ResNet)model for apple disease rec... To solve the problem of difficulty in identifying apple diseases in the natural environment and the low application rate of deep learning recognition networks,a lightweight ResNet(LW-ResNet)model for apple disease recognition is proposed.Based on the deep residual network(ResNet18),the multi-scale feature extraction layer is constructed by group convolution to realize the compression model and improve the extraction ability of different sizes of lesion features.By improving the identity mapping structure to reduce information loss.By introducing the efficient channel attention module(ECANet)to suppress noise from a complex background.The experimental results show that the average precision,recall and F1-score of the LW-ResNet on the test set are 97.80%,97.92%and 97.85%,respectively.The parameter memory is 2.32 MB,which is 94%less than that of ResNet18.Compared with the classic lightweight networks SqueezeNet and MobileNetV2,LW-ResNet has obvious advantages in recognition performance,speed,parameter memory requirement and time complexity.The proposed model has the advantages of low computational cost,low storage cost,strong real-time performance,high identification accuracy,and strong practicability,which can meet the needs of real-time identification task of apple leaf disease on resource-constrained devices. 展开更多
关键词 Apple disease recognition deep residual network multi-scale feature efficient channel attention module lightweight network
在线阅读 下载PDF
Revolutionizing anemia detection:integrative machine learning models and advanced attention mechanisms
11
作者 Muhammad Ramzan Jinfang Sheng +2 位作者 Muhammad Usman Saeed Bin Wang Faisal Z.Duraihem 《Visual Computing for Industry,Biomedicine,and Art》 2024年第1期183-195,共13页
This study addresses the critical issue of anemia detection using machine learning(ML)techniques.Although a widespread blood disorder with significant health implications,anemia often remains undetected.This necessita... This study addresses the critical issue of anemia detection using machine learning(ML)techniques.Although a widespread blood disorder with significant health implications,anemia often remains undetected.This necessitates timely and efficient diagnostic methods,as traditional approaches that rely on manual assessment are time-consuming and subjective.The present study explored the application of ML-particularly classification models,such as logistic regression,decision trees,random forest,support vector machines,Naïve Bayes,and k-nearest neighbors-in conjunction with innovative models incorporating attention modules and spatial attention to detect anemia.The proposed models demonstrated promising results,achieving high accuracy,precision,recall,and F1 scores for both textual and image datasets.In addition,an integrated approach that combines textual and image data was found to outperform the individual modalities.Specifically,the proposed AlexNet Multiple Spatial Attention model achieved an exceptional accuracy of 99.58%,emphasizing its potential to revolutionize automated anemia detection.The results of ablation studies confirm the significance of key components-including the blue-green-red,multiple,and spatial attentions-in enhancing model performance.Overall,this study presents a comprehensive and innovative framework for noninvasive anemia detection,contributing valuable insights to the field. 展开更多
关键词 ANEMIA NONINVASIVE MULTIMODAL feature fusion attention module
在线阅读 下载PDF
Learning multi-scale attention network for fine-grained visual classification
12
作者 Peipei Zhao Siyan Yang +4 位作者 Wei Ding Ruyi Liu Wentian Xin Xiangzeng Liu Qiguang Miao 《Journal of Information and Intelligence》 2025年第6期492-503,共12页
Fine-grained visual classification(FGVC)is a very challenging task due to distinguishing subcategories under the same super-category.Recent works mainly localize discriminative image regions and capture subtle inter-c... Fine-grained visual classification(FGVC)is a very challenging task due to distinguishing subcategories under the same super-category.Recent works mainly localize discriminative image regions and capture subtle inter-class differences by utilizing attention-based methods.However,at the same layer,most attention-based works only consider large-scale attention blocks with the same size as feature maps,and they ignore small-scale attention blocks that are smaller than feature maps.To distinguish subcategories,it is important to exploit small local regions.In this work,a novel multi-scale attention network(MSANet)is proposed to capture large and small regions at the same layer in fine-grained visual classification.Specifically,a novel multi-scale attention layer(MSAL)is proposed,which generates multiple groups in each feature maps to capture different-scale discriminative regions.The groups based on large-scale regions can exploit global features and the groups based on the small-scale regions can extract local subtle features.Then,a simple feature fusion strategy is utilized to fully integrate global features and local subtle features to mine information that are more conducive to FGVC.Comprehensive experiments in Caltech-UCSD Birds-200-2011(CUB),FGVC-Aircraft(AIR)and Stanford Cars(Cars)datasets show that our method achieves the competitive performances,which demonstrate its effectiveness. 展开更多
关键词 Fine-grained visual classification multi-scale attention network multi-scale attention module feature fusion strategy
原文传递
A hierarchical framework for cervical cell classification using attention-based multi-scale local binary convolutional neural networks
13
作者 Tao Wan Lei Cao +2 位作者 Yulan Jin Dong Chen Zengchang Qin 《Medicine in Novel Technology and Devices》 2025年第3期213-228,共16页
Traditional classification methods for cervical cells heavily rely on manual feature extraction,constraining their versatility due to the intricacies of cytology images.Although deep learning approaches offer remarkab... Traditional classification methods for cervical cells heavily rely on manual feature extraction,constraining their versatility due to the intricacies of cytology images.Although deep learning approaches offer remarkable po-tential,they often sacrifice domain-specific knowledge,particularly the morphological patterns characterizing various cell subtypes during automated feature extraction.To bridge this gap,we introduce a novel hierarchical framework that integrates robust features from color,texture,and morphology with latent representations discovered by an improved attention-based multi-scale local binary convolutional neural networks(MS-LBCNN),designed to facilitate powerful feature extraction mechanism.We enhance the standard 6-class Bethesda system(TBS)classification by incorporating a coarse-to-refine fusion strategy,which optimizes the classification pro-cess.The proposed method is uniquely equipped to manage the complexities present in both individual and clustered cell images.Upon rigorous evaluation across three independent data cohorts,our method consistently surpassed existing state-of-the-art techniques.The experimental results indicated the potential of our method in enhancing the development of automation-aided diagnostic systems,and bolstering both the accuracy and ef-ficiency of cytology screening procedures. 展开更多
关键词 Cervical cell classification multi-scale local binary convolutional neural networks attention mechanism The Bethesda system feature fusion
在线阅读 下载PDF
基于轻量级卷积神经网络的岩石图像岩性识别方法
14
作者 刘善伟 马志伟 +1 位作者 魏世清 魏忠勇 《地质科技通报》 北大核心 2026年第1期360-370,共11页
岩性识别是油气勘探和开发过程中的重要环节,对于油气勘探定位、储层评价以及储层模型建立具有重要的指导意义。但传统的人工岩性识别方法耗时耗力,经典的深度学习模型虽然识别精度高,但模型的参数量较大,为了提高模型识别精度,同时降... 岩性识别是油气勘探和开发过程中的重要环节,对于油气勘探定位、储层评价以及储层模型建立具有重要的指导意义。但传统的人工岩性识别方法耗时耗力,经典的深度学习模型虽然识别精度高,但模型的参数量较大,为了提高模型识别精度,同时降低模型的参数量,使模型适用于岩性实时识别工作,首先收集了白云岩、砂岩等8种岩石共3016张岩石图像构建岩性识别数据集,然后以轻量型卷积神经网络ShuffleNetV2模型为基础网络,提出了一种Rock-ShuffleNetV2岩性识别模型(RSHFNet模型)。模型中将混合注意力机制模块(convolutional block attention module,简称CBAM)以及多尺度特征融合模块(multi-scale feature fusion module,简称MSF)融入基础网络中以加强模型的特征提取能力,提升模型识别性能,并优化模型中ShuffleNetV2单元的堆叠次数以减少模型参数量。结果表明:与基础模型相比,RSHFNet模型的准确率达到了87.21%,提高了4.98%;同时,模型参数量与浮点运算量分别降低到了869702个,0.93×108,分别是基础模型的0.67,0.63倍,模型参数量明显降低;并且RSHFNet模型的综合性能明显优于现有的卷积神经网络。RSHFNet岩性识别模型具有较高的识别精度和较好的泛化能力,同时更加的轻量化,为实现野外实时的岩性识别工作提供了新思路。 展开更多
关键词 岩性识别 ShuffleNetV2网络 混合注意力机制模块 多尺度特征融合模块 卷积神经网络
在线阅读 下载PDF
融合轻量化和注意力机制的口罩人脸检测算法
15
作者 兰红 王恪 陈子怡 《计算机工程与应用》 北大核心 2026年第1期274-284,共11页
口罩人脸检测是智能监控系统中的关键部分,在城市管理和公共卫生安全方面有重要意义。针对口罩人脸检测在处理小目标(远处目标所显示的图像很小)、光照条件、佩戴口罩人脸展示不同方向等问题在复杂场景图像时出现的漏检和检测不准确问题... 口罩人脸检测是智能监控系统中的关键部分,在城市管理和公共卫生安全方面有重要意义。针对口罩人脸检测在处理小目标(远处目标所显示的图像很小)、光照条件、佩戴口罩人脸展示不同方向等问题在复杂场景图像时出现的漏检和检测不准确问题,提出IM-YOLO口罩人脸检测算法,并调整模型深度,设计并构建了轻量化IM-YOLO模型来满足口罩人脸检测的各类复杂场景下的实时性需求。针对参数量过高的问题,构建了融合多头注意力和空间注意力的MC注意力模块和轻量化模块MCB。设计了CGFPN结构来充分融合低层与高层之间的多尺度特征信息。构建RHM模块来提高特征语义信息的利用率和减少特征冗余。引入Inner-IoU损失函数来提升模型整体性能。在同等情况下,IM-YOLO优于YOLOv8以及其他主流算法。并且该模型在AIZOO数据集上的mAP值达到了96.2%,在自制数据集上的mAP值达到了89.0%,且模型参数量相比于YOLOv8降低了40%,适用于当前智能监控系统中的口罩人脸检测。 展开更多
关键词 口罩人脸检测 注意力机制 轻量化 特征融合模块 YOLOv8n
在线阅读 下载PDF
Grasp Detection with Hierarchical Multi-Scale Feature Fusion and Inverted Shuffle Residual
16
作者 Wenjie Geng Zhiqiang Cao +3 位作者 Peiyu Guan Fengshui Jing Min Tan Junzhi Yu 《Tsinghua Science and Technology》 SCIE EI CAS CSCD 2024年第1期244-256,共13页
Grasp detection plays a critical role for robot manipulation.Mainstream pixel-wise grasp detection networks with encoder-decoder structure receive much attention due to good accuracy and efficiency.However,they usuall... Grasp detection plays a critical role for robot manipulation.Mainstream pixel-wise grasp detection networks with encoder-decoder structure receive much attention due to good accuracy and efficiency.However,they usually transmit the high-level feature in the encoder to the decoder,and low-level features are neglected.It is noted that low-level features contain abundant detail information,and how to fully exploit low-level features remains unsolved.Meanwhile,the channel information in high-level feature is also not well mined.Inevitably,the performance of grasp detection is degraded.To solve these problems,we propose a grasp detection network with hierarchical multi-scale feature fusion and inverted shuffle residual.Both low-level and high-level features in the encoder are firstly fused by the designed skip connections with attention module,and the fused information is then propagated to corresponding layers of the decoder for in-depth feature fusion.Such a hierarchical fusion guarantees the quality of grasp prediction.Furthermore,an inverted shuffle residual module is created,where the high-level feature from encoder is split in channel and the resultant split features are processed in their respective branches.By such differentiation processing,more high-dimensional channel information is kept,which enhances the representation ability of the network.Besides,an information enhancement module is added before the encoder to reinforce input information.The proposed method attains 98.9%and 97.8%in image-wise and object-wise accuracy on the Cornell grasping dataset,respectively,and the experimental results verify the effectiveness of the method. 展开更多
关键词 grasp detection hierarchical multi-scale feature fusion skip connections with attention inverted shuffle residual
原文传递
基于SCE-YOLO网络的遥感小目标检测
17
作者 付佳俊 丁胜 +1 位作者 刘小明 李琛 《计算机工程与设计》 北大核心 2026年第1期235-243,共9页
为解决遥感图像密集小目标检测问题,提出基于YOLOv8n改进的空间上下文强化网络SCE-YOLO。在特征提取和特征融合之间构建一个高效的空间上下文增强模块SCEM来增强模型的局部特征信息和全局空间上下文感知能力;提出CSRC加强对特征的通道... 为解决遥感图像密集小目标检测问题,提出基于YOLOv8n改进的空间上下文强化网络SCE-YOLO。在特征提取和特征融合之间构建一个高效的空间上下文增强模块SCEM来增强模型的局部特征信息和全局空间上下文感知能力;提出CSRC加强对特征的通道和空间的关注度,来设计特征加权融合模块FWFM;使用加权损失函数降低微小目标对IoU的敏感度,提升微小目标的召回率。在自制的数据集RSOD、公共的AI-TOD微小遥感数据集上进行对比实验,实验结果表明,提出的算法在遥感小目标检测上具有良好的性能。 展开更多
关键词 遥感小目标检测 YOLOv8n 空间上下文强化模块 注意力机制 通道空间加权 特征加权融合模块 加权损失函数
在线阅读 下载PDF
基于CBAM增强与多尺度特征融合的AD MRI图像分类方法
18
作者 韦建武 王宋 +3 位作者 任建禹 肖叶子 邵梅 韩刚 《西安邮电大学学报》 2026年第2期109-117,共9页
为解决现有基于深度学习的阿尔茨海默病(Alzheimer's Disease,AD)高分辨率磁共振成像(Magnetic Resonance Imaging,MRI)图像分类方法中全局特征建模不足、多尺度病理信息利用不充分及类别不平衡问题,提出融合卷积块注意力模块与多... 为解决现有基于深度学习的阿尔茨海默病(Alzheimer's Disease,AD)高分辨率磁共振成像(Magnetic Resonance Imaging,MRI)图像分类方法中全局特征建模不足、多尺度病理信息利用不充分及类别不平衡问题,提出融合卷积块注意力模块与多尺度过渡层的改进密集连接网络(DenseNet)模型。具体方法为:在DenseNet121架构下,于各密集块末端集成卷积块注意力模块(Convolutional Block Attention Module,CBAM)以聚焦关键脑区病理变化,设计多尺度Transition层优化下采样时多尺度病理信息融合与保全,采用焦点损失函数缓解类别不平衡。OASIS-1数据集实验显示,该模型分类准确率90.91%、F1值92.12%、召回率94.00%,显著优于MobileNetV2、VGG16及传统DenseNet模型。其能提升AD分期诊断精度,尤其在降低痴呆漏诊率、识别轻度痴呆(黄金干预窗口)上表现突出,可为临床早期干预提供支撑,具备临床转化潜力。 展开更多
关键词 阿尔茨海默病 磁共振成像图像分类 多尺度特征融合 密集连接网络 深度学习 卷积块注意力模块
在线阅读 下载PDF
基于改进的YOLOv8尺度自适应大渣检测算法研究
19
作者 徐福斌 殷浩然 +1 位作者 任莉芸 李哲 《自动化技术与应用》 2026年第3期128-132,146,共6页
针对工业场景中大渣块的检测,研究提出一种结合尺度自适应模块和特征融合自注意力机制的改进YOLOv8模型。该方法通过引入深度估计子网络,利用目标的深度信息动态调整锚框的大小,以提高模型在不同尺度下的检测能力。此外,特征融合的自注... 针对工业场景中大渣块的检测,研究提出一种结合尺度自适应模块和特征融合自注意力机制的改进YOLOv8模型。该方法通过引入深度估计子网络,利用目标的深度信息动态调整锚框的大小,以提高模型在不同尺度下的检测能力。此外,特征融合的自注意力机制进一步增强了对显著目标特征的关注,减少了背景噪声的干扰。研究还使用了自制的数据集,该数据集包含在工业排渣机场景下采集的不同深度、尺度和光照条件下的大渣块样本,以评估所提方法的性能。实验结果表明,改进后的YOLOv8模型在复杂场景中表现出显著的性能提升,在平均精度、准确率和召回率方面均优于其他主流检测模型,达到了98.1%的mAP。这验证了所提方法在复杂工业场景中检测大渣块的有效性与鲁棒性。 展开更多
关键词 图像处理 尺度自适应模块 特征融合自注意力 工业熔渣检测 深度估计 目标检测
在线阅读 下载PDF
基于深度学习的舰船关键部位检测算法
20
作者 王瑶 胥辉旗 +1 位作者 曹司磊 王磊 《系统工程与电子技术》 北大核心 2026年第2期410-421,共12页
针对当前缺乏舰船关键部位的检测算法、对应数据集,检测算法精度速度无法平衡及网络对舰船位置尺度变换鲁棒性不强等技术难题,构建一种三维特征增强和不同尺度特征结合的无锚框的舰船关键部位选择方法,基于相似度特征模块的深层聚合分... 针对当前缺乏舰船关键部位的检测算法、对应数据集,检测算法精度速度无法平衡及网络对舰船位置尺度变换鲁棒性不强等技术难题,构建一种三维特征增强和不同尺度特征结合的无锚框的舰船关键部位选择方法,基于相似度特征模块的深层聚合分割算法,实现对舰船关键部位的精准高效检测。首先,通过引入感受野模块实现网络多尺度特征融合,提升检测精度。然后,通过并入基于相似度的注意力模块提升对有用目标信息的关注度;通过使用可变形卷积实现对不同层的特征信息进行聚合,有效提升网络的泛化能力和表达能力。最后,在不使用锚框的前提下,通过目标中心点预测,再回归得到中心点偏移、目标角度、尺度信息,提升检测速度。分别在自建数据集及Pascal视觉对象类别(Visual Object Classes,VOC)数据集上进行对比实验,充分证明了所提网络对舰船关键部位检测的精准性及时效性,能够为反舰装备实现外科手术式打击提供可行技术途径及理论支撑。 展开更多
关键词 舰船目标 关键部位 相似度注意力机制 特征融合 精准选择
在线阅读 下载PDF
上一页 1 2 9 下一页 到第
使用帮助 返回顶部