期刊文献+
共找到62篇文章
< 1 2 4 >
每页显示 20 50 100
Deep Learning Approach for COVID-19 Detection in Computed Tomography Images 被引量:2
1
作者 Mohamad Mahmoud Al Rahhal Yakoub Bazi +2 位作者 Rami M.Jomaa Mansour Zuair Naif Al Ajlan 《Computers, Materials & Continua》 SCIE EI 2021年第5期2093-2110,共18页
With the rapid spread of the coronavirus disease 2019(COVID-19)worldwide,the establishment of an accurate and fast process to diagnose the disease is important.The routine real-time reverse transcription-polymerase ch... With the rapid spread of the coronavirus disease 2019(COVID-19)worldwide,the establishment of an accurate and fast process to diagnose the disease is important.The routine real-time reverse transcription-polymerase chain reaction(rRT-PCR)test that is currently used does not provide such high accuracy or speed in the screening process.Among the good choices for an accurate and fast test to screen COVID-19 are deep learning techniques.In this study,a new convolutional neural network(CNN)framework for COVID-19 detection using computed tomography(CT)images is proposed.The EfficientNet architecture is applied as the backbone structure of the proposed network,in which feature maps with different scales are extracted from the input CT scan images.In addition,atrous convolution at different rates is applied to these multi-scale feature maps to generate denser features,which facilitates in obtaining COVID-19 findings in CT scan images.The proposed framework is also evaluated in this study using a public CT dataset containing 2482 CT scan images from patients of both classes(i.e.,COVID-19 and non-COVID-19).To augment the dataset using additional training examples,adversarial examples generation is performed.The proposed system validates its superiority over the state-of-the-art methods with values exceeding 99.10%in terms of several metrics,such as accuracy,precision,recall,and F1.The proposed system also exhibits good robustness,when it is trained using a small portion of data(20%),with an accuracy of 96.16%. 展开更多
关键词 COVID-19 deep learning computed tomography multi-scale features atrous convolution adversarial examples
在线阅读 下载PDF
Feature Fusion-Based Deep Learning Network to Recognize Table Tennis Actions
2
作者 Chih-Ta Yen Tz-Yun Chen +1 位作者 Un-Hung Chen Guo-Chang WangZong-Xian Chen 《Computers, Materials & Continua》 SCIE EI 2023年第1期83-99,共17页
A system for classifying four basic table tennis strokes using wearable devices and deep learning networks is proposed in this study.The wearable device consisted of a six-axis sensor,Raspberry Pi 3,and a power bank.M... A system for classifying four basic table tennis strokes using wearable devices and deep learning networks is proposed in this study.The wearable device consisted of a six-axis sensor,Raspberry Pi 3,and a power bank.Multiple kernel sizes were used in convolutional neural network(CNN)to evaluate their performance for extracting features.Moreover,a multiscale CNN with two kernel sizes was used to perform feature fusion at different scales in a concatenated manner.The CNN achieved recognition of the four table tennis strokes.Experimental data were obtained from20 research participants who wore sensors on the back of their hands while performing the four table tennis strokes in a laboratory environment.The data were collected to verify the performance of the proposed models for wearable devices.Finally,the sensor and multi-scale CNN designed in this study achieved accuracy and F1 scores of 99.58%and 99.16%,respectively,for the four strokes.The accuracy for five-fold cross validation was 99.87%.This result also shows that the multi-scale convolutional neural network has better robustness after fivefold cross validation. 展开更多
关键词 Wearable devices deep learning six-axis sensor feature fusion multi-scale convolutional neural networks action recognit
在线阅读 下载PDF
MewCDNet: A Wavelet-Based Multi-Scale Interaction Network for Efficient Remote Sensing Building Change Detection
3
作者 Jia Liu Hao Chen +5 位作者 Hang Gu Yushan Pan Haoran Chen Erlin Tian Min Huang Zuhe Li 《Computers, Materials & Continua》 2026年第1期687-710,共24页
Accurate and efficient detection of building changes in remote sensing imagery is crucial for urban planning,disaster emergency response,and resource management.However,existing methods face challenges such as spectra... Accurate and efficient detection of building changes in remote sensing imagery is crucial for urban planning,disaster emergency response,and resource management.However,existing methods face challenges such as spectral similarity between buildings and backgrounds,sensor variations,and insufficient computational efficiency.To address these challenges,this paper proposes a novel Multi-scale Efficient Wavelet-based Change Detection Network(MewCDNet),which integrates the advantages of Convolutional Neural Networks and Transformers,balances computational costs,and achieves high-performance building change detection.The network employs EfficientNet-B4 as the backbone for hierarchical feature extraction,integrates multi-level feature maps through a multi-scale fusion strategy,and incorporates two key modules:Cross-temporal Difference Detection(CTDD)and Cross-scale Wavelet Refinement(CSWR).CTDD adopts a dual-branch architecture that combines pixel-wise differencing with semanticaware Euclidean distance weighting to enhance the distinction between true changes and background noise.CSWR integrates Haar-based Discrete Wavelet Transform with multi-head cross-attention mechanisms,enabling cross-scale feature fusion while significantly improving edge localization and suppressing spurious changes.Extensive experiments on four benchmark datasets demonstrate MewCDNet’s superiority over comparison methods:achieving F1 scores of 91.54%on LEVIR,93.70%on WHUCD,and 64.96%on S2Looking for building change detection.Furthermore,MewCDNet exhibits optimal performance on the multi-class⋅SYSU dataset(F1:82.71%),highlighting its exceptional generalization capability. 展开更多
关键词 Remote sensing change detection deep learning wavelet transform multi-scale
在线阅读 下载PDF
Transfer learning framework for multi-scale crack type classification with sparse microseismic networks
4
作者 Arnold Yuxuan Xie Bing QLi 《International Journal of Mining Science and Technology》 SCIE EI CAS CSCD 2024年第2期167-178,共12页
Rock fracture mechanisms can be inferred from moment tensors(MT)inverted from microseismic events.However,MT can only be inverted for events whose waveforms are acquired across a network of sensors.This is limiting fo... Rock fracture mechanisms can be inferred from moment tensors(MT)inverted from microseismic events.However,MT can only be inverted for events whose waveforms are acquired across a network of sensors.This is limiting for underground mines where the microseismic stations often lack azimuthal coverage.Thus,there is a need for a method to invert fracture mechanisms using waveforms acquired by a sparse microseismic network.Here,we present a novel,multi-scale framework to classify whether a rock crack contracts or dilates based on a single waveform.The framework consists of a deep learning model that is initially trained on 2400000+manually labelled field-scale seismic and microseismic waveforms acquired across 692 stations.Transfer learning is then applied to fine-tune the model on 300000+MT-labelled labscale acoustic emission waveforms from 39 individual experiments instrumented with different sensor layouts,loading,and rock types in training.The optimal model achieves over 86%F-score on unseen waveforms at both the lab-and field-scale.This model outperforms existing empirical methods in classification of rock fracture mechanisms monitored by a sparse microseismic network.This facilitates rapid assessment of,and early warning against,various rock engineering hazard such as induced earthquakes and rock bursts. 展开更多
关键词 multi-scale Fracture processes Microseismic Acoustic emission Source mechanism deep learning
在线阅读 下载PDF
MSFResNet:A ResNeXt50 model based on multi-scale feature fusion for wild mushroom identification
5
作者 YANG Yang JU Tao +1 位作者 YANG Wenjie ZHAO Yuyang 《Journal of Measurement Science and Instrumentation》 2025年第1期66-74,共9页
To solve the problems of redundant feature information,the insignificant difference in feature representation,and low recognition accuracy of the fine-grained image,based on the ResNeXt50 model,an MSFResNet network mo... To solve the problems of redundant feature information,the insignificant difference in feature representation,and low recognition accuracy of the fine-grained image,based on the ResNeXt50 model,an MSFResNet network model is proposed by fusing multi-scale feature information.Firstly,a multi-scale feature extraction module is designed to obtain multi-scale information on feature images by using different scales of convolution kernels.Meanwhile,the channel attention mechanism is used to increase the global information acquisition of the network.Secondly,the feature images processed by the multi-scale feature extraction module are fused with the deep feature images through short links to guide the full learning of the network,thus reducing the loss of texture details of the deep network feature images,and improving network generalization ability and recognition accuracy.Finally,the validity of the MSFResNet model is verified using public datasets and applied to wild mushroom identification.Experimental results show that compared with ResNeXt50 network model,the accuracy of the MSFResNet model is improved by 6.01%on the FGVC-Aircraft common dataset.It achieves 99.13%classification accuracy on the wild mushroom dataset,which is 0.47%higher than ResNeXt50.Furthermore,the experimental results of the thermal map show that the MSFResNet model significantly reduces the interference of background information,making the network focus on the location of the main body of wild mushroom,which can effectively improve the accuracy of wild mushroom identification. 展开更多
关键词 multi-scale feature fusion attention mechanism ResNeXt50 wild mushroom identification deep learning
在线阅读 下载PDF
基于Mamba与注意力机制的三阴性乳腺癌超声图像分类方法
6
作者 杨颖 宋元冰 +3 位作者 张一峰 吴蓉 杜宇 郑祎 《海军军医大学学报》 北大核心 2026年第1期37-45,共9页
目的探讨一种基于Mamba与注意力机制的混合神经网络架构(MECSA-Net)在三阴性乳腺癌(TNBC)超声图像分类中的应用效果。方法回顾性收集1059幅乳腺超声图像,其中TNBC图像166幅,非TNBC图像893幅。采用图像增强技术缓解类别不平衡问题。提出... 目的探讨一种基于Mamba与注意力机制的混合神经网络架构(MECSA-Net)在三阴性乳腺癌(TNBC)超声图像分类中的应用效果。方法回顾性收集1059幅乳腺超声图像,其中TNBC图像166幅,非TNBC图像893幅。采用图像增强技术缓解类别不平衡问题。提出轻量级混合神经网络架构MECSA-Net,其特征提取模块为高效混洗感知块(SAEffBlock),由状态空间建模分支(SSM-Branch)与轻量卷积分支(EffConvBranch)组成。在分类器前端引入多尺度空洞融合注意力(MDFA)模块,以提升模型对多尺度结构的感知能力和上下文信息建模能力。结果在TNBC分类任务中,MECSA-Net准确率为93.9%、精确率为94.4%、F1分数为93.9%、AUC为0.976,整体性能优于ResNet-18、ResNet-50、EfficientNet-B0、ViT-Base和MedMamba-T等主流模型。混淆矩阵分析显示,该模型对TNBC与非TNBC样本均具备较高的识别准确性和较低的误判率。消融实验进一步验证了EffConvBranch与MDFA模块在局部纹理建模与多尺度结构判别中的关键作用,显著增强了模型的分类性能与鲁棒性。结论MECSA-Net在TNBC超声图像分类中表现出优异的准确性与鲁棒性,具备良好的临床应用前景,可为TNBC术前智能辅助诊断提供技术支持。 展开更多
关键词 三阴性乳腺癌 超声图像 深度学习 mamba 状态空间建模 注意力机制
原文传递
基于Mamba-UNet架构的3D MRI脑肿瘤分割方法
7
作者 张野 牛大田 《计算机应用研究》 北大核心 2026年第1期305-312,共8页
多模态MRI脑肿瘤影像的精准分割对脑癌临床诊疗及预后评估至关重要。针对卷积神经网络在捕获全局上下文信息和建立长远程依赖关系方面存在的局限性,提出了基于Mamba与U-Net融合架构的PhC-ToMamba分割模型。模型在瓶颈层嵌入了ToM模块旨... 多模态MRI脑肿瘤影像的精准分割对脑癌临床诊疗及预后评估至关重要。针对卷积神经网络在捕获全局上下文信息和建立长远程依赖关系方面存在的局限性,提出了基于Mamba与U-Net融合架构的PhC-ToMamba分割模型。模型在瓶颈层嵌入了ToM模块旨在有效建模高维特征的全局信息,通过从三个方向计算特征依赖关系并交互,提取更适用于三维图像的全局特征信息;此外,为进一步提升全局特征的提取能力,提出了一种新的多面体卷积(PhConv),并将其嵌入至编码器中,显著扩大了感受野,并提升对重点目标区域的特征提取能力,有效解决了当前主流脑肿瘤图像分割模型对全局信息感知的局限性问题,增强了对关键区域的关注度。在BraTS 2021和MSD Task01_BrainTumor数据集上进行了广泛的实验。实验结果显示,PhC-ToMamba在整个肿瘤、肿瘤核心和增强肿瘤分割任务中的Dice系数分别达到了95.05%/90.46%、94.53%/89.91%和90.74%/75.91%。与其他先进方法相比,PhC-ToMamba在分割精度和参数效率方面展现了优越性,为脑肿瘤分割任务提供稳健的解决方案,从而提高了诊断准确性。 展开更多
关键词 深度学习 MRI脑肿瘤分割 多面体卷积 三维U-Net mamba
在线阅读 下载PDF
基于近红外光谱结合MS-Mamba模型的塑料包装分类方法
8
作者 管艺博 杜欣芸 张志霞 《包装工程》 北大核心 2026年第3期198-209,共12页
目的针对常见塑料包装材料外观相似、成分差异微小且传统检测方法效率低、易受人为因素影响等问题,探索近红外光谱技术结合深度学习模型在塑料包装材料快速识别中的应用潜力。方法首先利用近红外光谱仪对PET、PE、PP、PVC、PLA和PBS等... 目的针对常见塑料包装材料外观相似、成分差异微小且传统检测方法效率低、易受人为因素影响等问题,探索近红外光谱技术结合深度学习模型在塑料包装材料快速识别中的应用潜力。方法首先利用近红外光谱仪对PET、PE、PP、PVC、PLA和PBS等典型塑料包装材料进行光谱数据采集。采用Savitzky-Golay平滑、标准正态变量变换(SNV)及多元散射校正(MSC)多步策略对原始光谱进行预处理,以消除噪声与散射干扰。在此基础上,构建了一种基于多尺度Mamba(Multi-scale mamba,MS-Mamba)的分类模型,该模型利用状态空间模型(SSM)的线性复杂度优势,通过多尺度卷积支路与门控融合机制,同时捕捉光谱序列的局部纹理特征与全局长程依赖。为验证模型性能,将所提方法与传统机器学习模型(PCA-SVM、PLS-DA)及深度学习模型(LSTM、Transformer等)进行对比,并以准确率、精确率、召回率和F1分数作为评价指标。结果实验结果表明,MS-Mamba模型在分类精度与稳定性方面均显著优于传统机器学习(PCA-SVM,PLS-DA)及主流深度学习模型(ResNet,Transformer)。其中测试集准确率、精确率、召回率和F1分数分别达到99.91%、99.89%、99.88%以及99.88,且在区分PE与PP等高度相似材料时表现出极高的鲁棒性。结论本文方法能够实现塑料包装材料的快速、无损和高精度识别,为包装材料的自动化检测与绿色回收提供了一种可行的技术路径。 展开更多
关键词 近红外光谱 塑料包装分类 多尺度mamba 深度学习 无损识别
在线阅读 下载PDF
A multi-scale convolutional auto-encoder and its application in fault diagnosis of rolling bearings 被引量:12
9
作者 Ding Yunhao Jia Minping 《Journal of Southeast University(English Edition)》 EI CAS 2019年第4期417-423,共7页
Aiming at the difficulty of fault identification caused by manual extraction of fault features of rotating machinery,a one-dimensional multi-scale convolutional auto-encoder fault diagnosis model is proposed,based on ... Aiming at the difficulty of fault identification caused by manual extraction of fault features of rotating machinery,a one-dimensional multi-scale convolutional auto-encoder fault diagnosis model is proposed,based on the standard convolutional auto-encoder.In this model,the parallel convolutional and deconvolutional kernels of different scales are used to extract the features from the input signal and reconstruct the input signal;then the feature map extracted by multi-scale convolutional kernels is used as the input of the classifier;and finally the parameters of the whole model are fine-tuned using labeled data.Experiments on one set of simulation fault data and two sets of rolling bearing fault data are conducted to validate the proposed method.The results show that the model can achieve 99.75%,99.3%and 100%diagnostic accuracy,respectively.In addition,the diagnostic accuracy and reconstruction error of the one-dimensional multi-scale convolutional auto-encoder are compared with traditional machine learning,convolutional neural networks and a traditional convolutional auto-encoder.The final results show that the proposed model has a better recognition effect for rolling bearing fault data. 展开更多
关键词 fault diagnosis deep learning convolutional auto-encoder multi-scale convolutional kernel feature extraction
在线阅读 下载PDF
Multi-Scale Mixed Attention Tea Shoot Instance Segmentation Model 被引量:1
10
作者 Dongmei Chen Peipei Cao +5 位作者 Lijie Yan Huidong Chen Jia Lin Xin Li Lin Yuan Kaihua Wu 《Phyton-International Journal of Experimental Botany》 SCIE 2024年第2期261-275,共15页
Tea leaf picking is a crucial stage in tea production that directly influences the quality and value of the tea.Traditional tea-picking machines may compromise the quality of the tea leaves.High-quality teas are often... Tea leaf picking is a crucial stage in tea production that directly influences the quality and value of the tea.Traditional tea-picking machines may compromise the quality of the tea leaves.High-quality teas are often handpicked and need more delicate operations in intelligent picking machines.Compared with traditional image processing techniques,deep learning models have stronger feature extraction capabilities,and better generalization and are more suitable for practical tea shoot harvesting.However,current research mostly focuses on shoot detection and cannot directly accomplish end-to-end shoot segmentation tasks.We propose a tea shoot instance segmentation model based on multi-scale mixed attention(Mask2FusionNet)using a dataset from the tea garden in Hangzhou.We further analyzed the characteristics of the tea shoot dataset,where the proportion of small to medium-sized targets is 89.9%.Our algorithm is compared with several mainstream object segmentation algorithms,and the results demonstrate that our model achieves an accuracy of 82%in recognizing the tea shoots,showing a better performance compared to other models.Through ablation experiments,we found that ResNet50,PointRend strategy,and the Feature Pyramid Network(FPN)architecture can improve performance by 1.6%,1.4%,and 2.4%,respectively.These experiments demonstrated that our proposed multi-scale and point selection strategy optimizes the feature extraction capability for overlapping small targets.The results indicate that the proposed Mask2FusionNet model can perform the shoot segmentation in unstructured environments,realizing the individual distinction of tea shoots,and complete extraction of the shoot edge contours with a segmentation accuracy of 82.0%.The research results can provide algorithmic support for the segmentation and intelligent harvesting of premium tea shoots at different scales. 展开更多
关键词 Tea shoots attention mechanism multi-scale feature extraction instance segmentation deep learning
在线阅读 下载PDF
Multi-Scale and Multi-Channel Networks for CSI Feedback in Massive MIMO System
11
作者 Bingyang Cheng Jianing Zhao Yu Hu 《Journal of Computer and Communications》 2021年第10期132-141,共10页
<div style="text-align:justify;"> In the frequency division duplex (FDD) mode of the massive MIMO system, the system needs to perform coding through channel state information (CSI) to obtain performanc... <div style="text-align:justify;"> In the frequency division duplex (FDD) mode of the massive MIMO system, the system needs to perform coding through channel state information (CSI) to obtain performance gains. However, the number of antennas of the base station has been greatly increased, resulting in a rapid increase in the overhead for the user terminal to feedback CSI to the base station. In this article, we propose a method based on multi-task CNN to achieve compression and reconstruction of channel state information through a multi-scale and multi-channel convolutional neural network. We also introduce a dynamic learning rate model to improve the accuracy of channel state information reconstruction. The simulation results show that compared with the original CsiNet and other work, the proposed CSI feedback network has better reconstruction performance. </div> 展开更多
关键词 deep learning multi-scale MULTI-CHANNEL Massive MIMO
在线阅读 下载PDF
基于Mamba模型的区域电价预测方法
12
作者 白晓磊 张雪元 +2 位作者 王智永 全力 刘欣 《计算机应用》 北大核心 2025年第S2期137-142,共6页
针对电力市场中电价预测精度和效率的提升需求,提出一种基于S-Mamba2(Simple-Mamba-2)模型的区域电价预测方法,以解决复杂场景中进行电价预测时的计算资源消耗高以及效率低等问题。所提方法在Mamba-2模型的基础上,引入双向Mamba-2模块... 针对电力市场中电价预测精度和效率的提升需求,提出一种基于S-Mamba2(Simple-Mamba-2)模型的区域电价预测方法,以解决复杂场景中进行电价预测时的计算资源消耗高以及效率低等问题。所提方法在Mamba-2模型的基础上,引入双向Mamba-2模块和前馈神经网络(FFN)编码层,从而有效捕捉电价历史数据中的尖峰特性、季节性规律、变量的内在互相关特性(VC)和电价的时序依赖(TD)特性。在澳大利亚电力市场运营商(AEMO)和2014年全球电力能源预测竞赛(GEFCom2014)提供的电价预测数据集上的实验结果表明,相较于T iransformer和TimeDiffusion等模型,S-Mamba2模型提升了预测性能,预测准确率最高达到97.88%。可见,所提方法为电力市场的效率提升、交易风险降低以及资源配置优化提供了有力的技术支持。 展开更多
关键词 mamba模型 状态空间模型 深度学习 电价预测
在线阅读 下载PDF
基于FDC-Mamba的关龙胆根茎实例分割与表型参数提取
13
作者 崔红光 刘海涛 +3 位作者 马有泽 黄文忠 李宏博 王铁军 《农业机械学报》 北大核心 2025年第10期500-511,共12页
针对关龙胆根茎中茎痕与残留茎基表型特征高度相似,且根茎尺寸小、形态复杂,导致图像分割特征提取困难、识别精度不足等问题,本文提出了焦点调制-动态检测头-上下文引导-Mamba(Focal Modulation-DyHead seg-Context Guided-Mamba,FDC-Ma... 针对关龙胆根茎中茎痕与残留茎基表型特征高度相似,且根茎尺寸小、形态复杂,导致图像分割特征提取困难、识别精度不足等问题,本文提出了焦点调制-动态检测头-上下文引导-Mamba(Focal Modulation-DyHead seg-Context Guided-Mamba,FDC-Mamba)关龙胆根茎实例分割模型。首先,为解决关龙胆相邻根丝边界模糊、缠绕部位重叠问题,引入目标检测型Mamba(Object detection Mamba,ODMamba)主干网络补充纹理细节,加强结构一致性;其次,通过融合Focal Modulation与Context Guided结构部分,增强多尺度感知能力和细节分割能力;最后,将DyHead结构结合辅助检测头(Auxiliary Head)训练策略,开发一种用于实例分割新训练结构DyHead seg,提高信息传递效率、优化学习过程。与其他常用实例分割模型(YOLO系列、Mask R-CNN、PointRend、HTC、SOLOv2、RT-DETR、HYPER)、不同特征金字塔架构模块(RepBN、AIFI、LSKA)、不同下采样结构模块(SRFD、ADown、CARAFE、EUCB、Gold-YOLO、HWD、PSConv、SODConv、WaveletPool)在关龙胆根茎数据集上进行对比,改进后模型完成了对关龙胆根茎实例分割,在根茎边缘和细小区域定位方面具有更高准确度,Box类型和Mask类型精度P、AP50、AP95分别提升6.52、5.09、5.44个百分点和4.49、2.68、1.16个百分点。基于分割结果,提出了关龙胆根长、根部粗细度、含杂率和色度4种表型参数提取方法。试验结果表明,所提出模型分割精度(Mask类型P)达87.12%,比基线模型高4.49个百分点。关龙胆表型参数提取结果与人工测量结果相对误差均在5%以内。本文对以关龙胆为代表的根茎类中药材表型特征提取具有较高的准确性,可为后续炮制工艺与装备研发奠定基础。 展开更多
关键词 关龙胆 实例分割 表型参数提取 深度学习 FDC-mamba
在线阅读 下载PDF
DFFMamba:A Novel Remote Sensing Change Detection Method with Difference Feature Fusion Mamba
14
作者 PENG Daifeng DONG Fengxu GUAN Haiyan 《Transactions of Nanjing University of Aeronautics and Astronautics》 2025年第6期728-748,共21页
Change detection(CD)plays a crucial role in numerous fields,where both convolutional neural networks(CNNs)and Transformers have demonstrated exceptional performance in CD tasks.However,CNNs suffer from limited recepti... Change detection(CD)plays a crucial role in numerous fields,where both convolutional neural networks(CNNs)and Transformers have demonstrated exceptional performance in CD tasks.However,CNNs suffer from limited receptive fields,hindering their ability to capture global features,while Transformers are constrained by high computational complexity.Recently,Mamba architecture,which is based on state space models(SSMs),has shown powerful global modeling capabilities while achieving linear computational complexity.Although some researchers have incorporated Mamba into CD tasks,the existing Mamba⁃based remote sensing CD methods struggle to effectively perceive the inherent locality of changed regions when flattening and scanning remote sensing images,leading to limitations in extracting change features.To address these issues,we propose a novel Mamba⁃based CD method termed difference feature fusion Mamba model(DFFMamba)by mitigating the loss of feature locality caused by traditional Mamba⁃style scanning.Specifically,two distinct difference feature extraction modules are designed:Difference Mamba(DMamba)and local difference Mamba(LDMamba),where DMamba extracts difference features by calculating the difference in coefficient matrices between the state⁃space equations of the bi⁃temporal features.Building upon DMamba,LDMamba combines a locally adaptive state⁃space scanning(LASS)strategy to enhance feature locality so as to accurately extract difference features.Additionally,a fusion Mamba(FMamba)module is proposed,which employs a spatial⁃channel token modeling SSM(SCTMS)unit to integrate multi⁃dimensional spatio⁃temporal interactions of change features,thereby capturing their dependencies across both spatial and channel dimensions.To verify the effectiveness of the proposed DFFMamba,extensive experiments are conducted on three datasets of WHU⁃CD,LEVIR⁃CD,and CLCD.The results demonstrate that DFFMamba significantly outperforms state⁃of⁃the⁃art CD methods,achieving intersection over union(IoU)scores of 90.67%,85.04%,and 66.56%on the three datasets,respectively. 展开更多
关键词 change detection state space model(SSM)change feature fusion deep learning difference mamba(Dmamba) local difference mamba(LDmamba) spatial⁃channel token modeling SSM(SCTMS)
在线阅读 下载PDF
卷积Mamba模型驱动的地震随机噪声压制方法 被引量:1
15
作者 韦秀娟 刘兴业 周怀来 《煤田地质与勘探》 北大核心 2025年第5期196-206,共11页
【背景】地震随机噪声压制是提升地震资料质量的关键环节之一,数据驱动的深度学习方法提供了一种智能解决方案。然而,主流的基于卷积神经网络的随机噪声智能压制方法受限于局部感受野特性,导致去噪过程中局部细节与宏观结构的协同优化不... 【背景】地震随机噪声压制是提升地震资料质量的关键环节之一,数据驱动的深度学习方法提供了一种智能解决方案。然而,主流的基于卷积神经网络的随机噪声智能压制方法受限于局部感受野特性,导致去噪过程中局部细节与宏观结构的协同优化不足,进而影响噪声压制精度。广泛应用于全局特征提取的Transformer模型通过自注意力机制能够有效捕获长距离依赖关系,理论上可弥补卷积神经网络在全局建模能力方面的局限性。但其计算慢,资源占用大,应用受限。【目的和方法】针对上述问题,提出了融合卷积Mamba的地震数据随机噪声压制网络(CMUNet)。基于二维选择性扫描技术(沿水平、垂直双方向遍历输入数据),通过状态空间方程构建全局动态系统,实现对地震数据时空特征的跨尺度特征提取,借助Mamba模型的硬件感知并行扫描算法降低计算资源消耗,保证去噪效果的同时提升计算效率。针对地震数据的特点,设计卷积-Mamba混合模块,在UNet编码器中构建层次化特征提取路径,即浅层CNN聚焦局部噪声模式识别,深层Mamba捕获大尺度地质结构关联性;进一步引入残差通道注意力门控,强化有效信号与噪声的特征可分性。【结果和结论】对于合成数据测试,提出的方法相较于UNet在信噪比、峰值信噪比和结构相似性上分别提高了2.4 dB、2.4 dB和0.0056,表现出对随机噪声的有效压制能力及对有效信号的保护能力。在野外实际地震数据应用中,局部相似性图像分析结果显示较低的局部相似值,进一步印证了该方法对有效信号的损伤程度低,展现出更优的保幅性,具有良好应用前景。 展开更多
关键词 地震随机噪声压制 深度学习 卷积神经网络 状态空间模型 mamba
在线阅读 下载PDF
基于Mamba-2的视频快照压缩成像重构方法
16
作者 石敦攀 徐伟 +3 位作者 朴永杰 方应红 籍浩林 李鹏飞 《液晶与显示》 北大核心 2025年第6期881-894,共14页
视频快照压缩成像(SCI)是一种新型的成像技术,通过在单个曝光时间内使用一个二维探测器捕获三维视频数据,然后采用合适的算法重建原始的视频数据。尽管目前的许多算法在视频SCI的重建任务中有着非常出色的表现,但它们重建质量的提升往... 视频快照压缩成像(SCI)是一种新型的成像技术,通过在单个曝光时间内使用一个二维探测器捕获三维视频数据,然后采用合适的算法重建原始的视频数据。尽管目前的许多算法在视频SCI的重建任务中有着非常出色的表现,但它们重建质量的提升往往需要以牺牲重建速度为代价,使算法的实时性大幅降低。为兼顾重建质量与重建速度,本文提出了一种基于Mamba-2的端到端深度视频SCI重构方法——M2BA-SCI。M2BA-SCI网络由预处理模块、token生成块、Mamba注意力块和视频重建块组成,其中Mamba注意力块主要由Mamba-2线性注意力块和前馈神经网络构成。在模拟和真实视频数据集上的大量实验表明,M2BA-SCI与先前算法相比取得了更为均衡的效果,在提高重建质量的同时仍保持较快的重建速度。在基准灰度视频数据集中,平均PSNR为34.85,平均SSIM为0.966,运行时间为0.23 s。在基准彩色视频数据集上的平均PSNR为36.21,平均SSIM为0.963,运行时间为1.03 s。M2BA-SCI为视频SCI重建带来了新的思路,为基于Mamba模型设计出更高重建质量的算法提供了参考。 展开更多
关键词 视频快照压缩成像 压缩感知 mamba-2 深度学习
在线阅读 下载PDF
基于Mamba结构的轻量级皮肤病变图像分割网络
17
作者 贺蒙蒙 张小艳 李洪安 《图学学报》 北大核心 2025年第6期1257-1266,共10页
皮肤病变分割是医学图像分析中的一项重要任务,对于皮肤类疾病的早期诊断和治疗具有重要意义。然而,在处理高分辨率皮肤图像和捕捉细微病变特征时,现有模型仍面临着计算复杂度高以及冗余信息处理不足等挑战。为此,提出一种基于Mamba结... 皮肤病变分割是医学图像分析中的一项重要任务,对于皮肤类疾病的早期诊断和治疗具有重要意义。然而,在处理高分辨率皮肤图像和捕捉细微病变特征时,现有模型仍面临着计算复杂度高以及冗余信息处理不足等挑战。为此,提出一种基于Mamba结构的轻量级皮肤病变图像分割网络ResMamba,采用六级U型结构,主要通过将Mamba嵌入到视觉状态空间中同时引入到编解码器中,ResVSS模块作为编码器的核心组成部分,通过删除冗余线性层可减少参数量,同时结合深度卷积块和可学习尺度参数对残差连接进行缩放,从而通过降低模型复杂度来提升分割精度。在跳跃连接模块使用多级、多尺度信息融合模块生成空间和通道注意力图,有效融合了多尺度信息。通过在公开皮肤数据集ISIC2017和ISIC2018上进行实验验证,结果表明,ResMamba模型在平衡参数数量和分割性能方面都具有较好的分割性能,验证了该模型的有效性。 展开更多
关键词 深度学习 皮肤病变分割 mamba结构 状态空间模型 轻量化
在线阅读 下载PDF
卷积增强Vision Mamba模型的构建及其应用 被引量:1
18
作者 俞焕友 范静 黄凡 《计算机技术与发展》 2025年第8期45-52,共8页
针对Vision Mamba(Vim)模型的局限性,该文提出了一种改进的模型——Convolutional Vision Mamba(CvM)。此模型通过摒弃Vim中的图形切割和位置编码机制,转而采用卷积操作进行替代,以实现对全局视觉信息的更高效处理。同时,此模型对Vim模... 针对Vision Mamba(Vim)模型的局限性,该文提出了一种改进的模型——Convolutional Vision Mamba(CvM)。此模型通过摒弃Vim中的图形切割和位置编码机制,转而采用卷积操作进行替代,以实现对全局视觉信息的更高效处理。同时,此模型对Vim模型中的位置嵌入模块进行了优化,以解决其固有的高计算量和内存消耗问题。进而,该文将CvM模型应用于医学图像分类领域,选用了血细胞图像、脑肿瘤图像、胸部CT扫描、病理性近视眼底图像以及肺炎X射线影像等数据集进行实验。实验结果表明,与Vim模型及其他5个神经网络模型相比,CvM模型在准确率上表现更为出色,在内存占用和参数数量方面也展现出明显的优势。消融实验表明,深度可分离卷积比标准卷积使用的参数和显存占用更少,而且在血细胞图像、脑肿瘤图像等医学图像分类上,准确率还有了显著提升。这些结果充分说明了CvM模型的优势和可行性。 展开更多
关键词 深度学习 Vision mamba 卷积神经网络 深度可分离卷积 医学图像分类
在线阅读 下载PDF
Mamba与Transformer混合图像补全技术的研究与实现
19
作者 刘海洋 胡永 田野 《实验室研究与探索》 北大核心 2025年第10期96-104,共9页
针对纹理复杂、色彩丰富的图像在修复过程中存在的细节缺失与色彩分布不均问题,提出一种基于Mamba与Transformer并行图像补全方法。该方法构建了由Mamba和Transformer的并行生成模型。Transformer通过全局自注意力机制捕捉图像块之间的... 针对纹理复杂、色彩丰富的图像在修复过程中存在的细节缺失与色彩分布不均问题,提出一种基于Mamba与Transformer并行图像补全方法。该方法构建了由Mamba和Transformer的并行生成模型。Transformer通过全局自注意力机制捕捉图像块之间的长距离依赖关系,Mamba则高效处理长序列数据,以弥补在细粒度细节处理不足,结合二者全局感知与高效远程依赖学习优势实现高质量重建。为强化全局与局部特征深度融合,设计上下文广播特征聚合网络,并采用谱归一化马尔科夫判别模型对抗训练。实验结果表明,该方法在多项指标上均优于对比方法,PSNR平均提升1.94 db, SSIM平均提升0.043 5,LPIPS平均下降0.624,能有效提升复杂图像的修复质量,为图像修复及相关领域中融合不同模型优势提供了新思路。 展开更多
关键词 图像修复 深度学习 曼巴 转换器 上下文广播特征聚合网络
在线阅读 下载PDF
基于Mamba-2编码的集装箱箱位分配模型
20
作者 向若愚 杨有 陈雁翎 《河南科学》 2025年第3期321-329,共9页
箱位分配在集装箱码头中至关重要,影响非生产性成本和作业效率。针对箱位分配问题,基于规则的策略求解速度快,但理论上无法获取最优解;数学规划模型理论上可以获得最优解,但计算时间随堆场规模增加而呈指数型增长,难以满足集装箱堆存的... 箱位分配在集装箱码头中至关重要,影响非生产性成本和作业效率。针对箱位分配问题,基于规则的策略求解速度快,但理论上无法获取最优解;数学规划模型理论上可以获得最优解,但计算时间随堆场规模增加而呈指数型增长,难以满足集装箱堆存的应用要求。使用深度强化学习方法设计模型,可以在短时间内获得高质量解。为此,针对贝位构造不能充分表达栈间语义的问题,定义5个栈位输入特征;设计基于端到端的编解码器模型,用于集装箱箱位分配。该模型采用Mamba-2进行编码,使用多头注意力进行解码,使用带基线的强化学习算法进行训练。仿真实验表明,所设计模型在中大规模问题上具有性能优势,能在较短时间内选择合理箱位,降低翻箱率,提高港口作业效率。 展开更多
关键词 集装箱 箱位分配 mamba-2 深度强化学习 状态空间模型
在线阅读 下载PDF
上一页 1 2 4 下一页 到第
使用帮助 返回顶部