期刊文献+
共找到1,507篇文章
< 1 2 76 >
每页显示 20 50 100
A Hybrid Deep Learning Approach Using Vision Transformer and U-Net for Flood Segmentation
1
作者 Cyreneo Dofitas Jr Yong-Woon Kim Yung-Cheol Byun 《Computers, Materials & Continua》 2026年第2期1209-1227,共19页
Recent advances in deep learning have significantly improved flood detection and segmentation from aerial and satellite imagery.However,conventional convolutional neural networks(CNNs)often struggle in complex flood s... Recent advances in deep learning have significantly improved flood detection and segmentation from aerial and satellite imagery.However,conventional convolutional neural networks(CNNs)often struggle in complex flood scenarios involving reflections,occlusions,or indistinct boundaries due to limited contextual modeling.To address these challenges,we propose a hybrid flood segmentation framework that integrates a Vision Transformer(ViT)encoder with a U-Net decoder,enhanced by a novel Flood-Aware Refinement Block(FARB).The FARB module improves boundary delineation and suppresses noise by combining residual smoothing with spatial-channel attention mechanisms.We evaluate our model on a UAV-acquired flood imagery dataset,demonstrating that the proposed ViTUNet+FARB architecture outperforms existing CNN and Transformer-based models in terms of accuracy and mean Intersection over Union(mIoU).Detailed ablation studies further validate the contribution of each component,confirming that the FARB design significantly enhances segmentation quality.To its better performance and computational efficiency,the proposed framework is well-suited for flood monitoring and disaster response applications,particularly in resource-constrained environments. 展开更多
关键词 Flood detection vision transformer(ViT) U-Net segmentation image processing deep learning artificial intelligence
在线阅读 下载PDF
Deep Learning for Brain Tumor Segmentation and Classification: A Systematic Review of Methods and Trends
2
作者 Ameer Hamza Robertas Damaševicius 《Computers, Materials & Continua》 2026年第1期132-172,共41页
This systematic review aims to comprehensively examine and compare deep learning methods for brain tumor segmentation and classification using MRI and other imaging modalities,focusing on recent trends from 2022 to 20... This systematic review aims to comprehensively examine and compare deep learning methods for brain tumor segmentation and classification using MRI and other imaging modalities,focusing on recent trends from 2022 to 2025.The primary objective is to evaluate methodological advancements,model performance,dataset usage,and existing challenges in developing clinically robust AI systems.We included peer-reviewed journal articles and highimpact conference papers published between 2022 and 2025,written in English,that proposed or evaluated deep learning methods for brain tumor segmentation and/or classification.Excluded were non-open-access publications,books,and non-English articles.A structured search was conducted across Scopus,Google Scholar,Wiley,and Taylor&Francis,with the last search performed in August 2025.Risk of bias was not formally quantified but considered during full-text screening based on dataset diversity,validation methods,and availability of performance metrics.We used narrative synthesis and tabular benchmarking to compare performance metrics(e.g.,accuracy,Dice score)across model types(CNN,Transformer,Hybrid),imaging modalities,and datasets.A total of 49 studies were included(43 journal articles and 6 conference papers).These studies spanned over 9 public datasets(e.g.,BraTS,Figshare,REMBRANDT,MOLAB)and utilized a range of imaging modalities,predominantly MRI.Hybrid models,especially ResViT and UNetFormer,consistently achieved high performance,with classification accuracy exceeding 98%and segmentation Dice scores above 0.90 across multiple studies.Transformers and hybrid architectures showed increasing adoption post2023.Many studies lacked external validation and were evaluated only on a few benchmark datasets,raising concerns about generalizability and dataset bias.Few studies addressed clinical interpretability or uncertainty quantification.Despite promising results,particularly for hybrid deep learning models,widespread clinical adoption remains limited due to lack of validation,interpretability concerns,and real-world deployment barriers. 展开更多
关键词 Brain tumor segmentation brain tumor classification deep learning vision transformers hybrid models
在线阅读 下载PDF
Harnessing deep learning for the discovery of latent patterns in multi-omics medical data
3
作者 Okechukwu Paul-Chima Ugwu Fabian COgenyi +8 位作者 Chinyere Nkemjika Anyanwu Melvin Nnaemeka Ugwu Esther Ugo Alum Mariam Basajja Joseph Obiezu Chukwujekwu Ezeonwumelu Daniel Ejim Uti Ibe Michael Usman Chukwuebuka Gabriel Eze Simeon Ikechukwu Egba 《Medical Data Mining》 2026年第1期32-45,共14页
The rapid growth of biomedical data,particularly multi-omics data including genomes,transcriptomics,proteomics,metabolomics,and epigenomics,medical research and clinical decision-making confront both new opportunities... The rapid growth of biomedical data,particularly multi-omics data including genomes,transcriptomics,proteomics,metabolomics,and epigenomics,medical research and clinical decision-making confront both new opportunities and obstacles.The huge and diversified nature of these datasets cannot always be managed using traditional data analysis methods.As a consequence,deep learning has emerged as a strong tool for analysing numerous omics data due to its ability to handle complex and non-linear relationships.This paper explores the fundamental concepts of deep learning and how they are used in multi-omics medical data mining.We demonstrate how autoencoders,variational autoencoders,multimodal models,attention mechanisms,transformers,and graph neural networks enable pattern analysis and recognition across all omics data.Deep learning has been found to be effective in illness classification,biomarker identification,gene network learning,and therapeutic efficacy prediction.We also consider critical problems like as data quality,model explainability,whether findings can be repeated,and computational power requirements.We now consider future elements of combining omics with clinical and imaging data,explainable AI,federated learning,and real-time diagnostics.Overall,this study emphasises the need of collaborating across disciplines to advance deep learning-based multi-omics research for precision medicine and comprehending complicated disorders. 展开更多
关键词 deep learning multi-omics integration biomedical data mining precision medicine graph neural networks autoencoders and transformers
在线阅读 下载PDF
Application of deep learning models in the pathological classification and staging of esophageal cancer:A focus on Wave-Vision Transformer
4
作者 Wei Wei Xiao-Lei Zhang +4 位作者 Hong-Zhen Wang Lin-Lin Wang Jing-Li Wen Xin Han Qian Liu 《World Journal of Gastroenterology》 2025年第19期68-85,共18页
BACKGROUND Esophageal cancer is the sixth most common cancer worldwide,with a high mortality rate.Early prognosis of esophageal abnormalities can improve patient survival rates.The progression of esophageal cancer fol... BACKGROUND Esophageal cancer is the sixth most common cancer worldwide,with a high mortality rate.Early prognosis of esophageal abnormalities can improve patient survival rates.The progression of esophageal cancer follows a sequence from esophagitis to non-dysplastic Barrett’s esophagus,dysplastic Barrett’s esophagus,and eventually esophageal adenocarcinoma(EAC).This study explored the application of deep learning technology in the precise diagnosis of pathological classification and staging of EAC to enhance diagnostic accuracy and efficiency.AIM To explore the application of deep learning models,particularly Wave-Vision Transformer(Wave-ViT),in the pathological classification and staging of esophageal cancer to enhance diagnostic accuracy and efficiency.METHODS We applied several deep learning models,including multi-layer perceptron,residual network,transformer,and Wave-ViT,to a dataset of clinically validated esophageal pathology images.The models were trained to identify pathological features and assist in the classification and staging of different stages of esophageal cancer.The models were compared based on accuracy,computational complexity,and efficiency.RESULTS The Wave-ViT model demonstrated the highest accuracy at 88.97%,surpassing the transformer(87.65%),residual network(85.44%),and multi-layer perceptron(81.17%).Additionally,Wave-ViT exhibited low computational complexity with significantly reduced parameter size,making it highly efficient for real-time clinical applications.CONCLUSION Deep learning technology,particularly the Frequency-Domain Transformer model,shows promise in improving the precision of pathological classification and staging of EAC.The application of the Frequency-Domain Transformer model enhances the automation of the diagnostic process and may support early detection and treatment of EAC.Future research may further explore the potential of this model in broader medical image analysis applications,particularly in the field of precision medicine. 展开更多
关键词 Esophageal cancer deep learning Wave-Vision transformer Pathological classification STAGING Early detection
暂未订购
A deep learning model for ocean surface latent heat flux based on transformer and data assimilation
5
作者 Yahui Liu Hengxiao Li Jichao Wang 《Acta Oceanologica Sinica》 2025年第5期115-130,共16页
Efficient and accurate prediction of ocean surface latent heat fluxes is essential for understanding and modeling climate dynamics.Conventional estimation methods have low resolution and lack accuracy.The transformer ... Efficient and accurate prediction of ocean surface latent heat fluxes is essential for understanding and modeling climate dynamics.Conventional estimation methods have low resolution and lack accuracy.The transformer model,with its self-attention mechanism,effectively captures long-range dependencies,leading to a degradation of accuracy over time.Due to the non-linearity and uncertainty of physical processes,the transformer model encounters the problem of error accumulation,leading to a degradation of accuracy over time.To solve this problem,we combine the Data Assimilation(DA)technique with the transformer model and continuously modify the model state to make it closer to the actual observations.In this paper,we propose a deep learning model called TransNetDA,which integrates transformer,convolutional neural network and DA methods.By combining data-driven and DA methods for spatiotemporal prediction,TransNetDA effectively extracts multi-scale spatial features and significantly improves prediction accuracy.The experimental results indicate that the TransNetDA method surpasses traditional techniques in terms of root mean square error and R2 metrics,showcasing its superior performance in predicting latent heat fluxes at the ocean surface. 展开更多
关键词 climate dynamics deep learning(DL) Data Assimilation(DA) transformER ensemble Kalman filter ocean surface latent heat flux
在线阅读 下载PDF
Transformer架构驱动下的综采工作面矿压时序特征智能预测
6
作者 杜锋 陈博 +7 位作者 王文强 浦海 杜雪明 李国栋 乔瑞 李鑫磊 徐杰 曹煜 《煤田地质与勘探》 北大核心 2026年第2期1-13,共13页
【背景】矿压预测是顶板灾害预警和管理的重要手段,是智能化矿井安全生产的前提和基础。开采过程中综采工作面环境复杂多变,导致基于电液控制系统采集的支架压力数据分布差异较大,预测困难。【方法】基于Transformer的矿压预测模型,使... 【背景】矿压预测是顶板灾害预警和管理的重要手段,是智能化矿井安全生产的前提和基础。开采过程中综采工作面环境复杂多变,导致基于电液控制系统采集的支架压力数据分布差异较大,预测困难。【方法】基于Transformer的矿压预测模型,使用线性插值填补缺失的矿压值,并使用滑动窗口算法调整训练时的矿压数据结构;针对矿压数据的时序特性,构建融合时序特征的输入序列,利用多头注意力(multi-head-attention)机制动态计算权重,根据数据本身自适应地聚焦关键时间步,从而有效捕捉复杂的非线性时序依赖,显著提升特征表征与预测能力,最后使用迁移学习方法,完成对上、中、下工作面支架工作阻力预测,并搭建基于矿压大数据的智能分析及预测平台。【结果和结论】使用多头注意力机制代替神经网络捕捉全局矿压数据特征,比循环神经网络(recurrent neural network,RNN)和长短期记忆网络(long short-term memory,LSTM)具有更强长序列依赖能力和特征学习能力,能有效降低模型损失,更加适用于预测矿压,Transformer模型在测试集上的均方误差和平均绝对误差损失精度分别达到0.34%和2.57%。Transformer模型也具有较强的泛化能力,使用迁移学习方法微调后,能够有效降低模型损失,在迁移同工作面其他支架时具有更好的泛化效果,Transformer预测模型进一步验证在矿压预测问题的适用性和可行性。平台可视化显示系统可精准分析预测前后的来压次数、推进距离、来压判据和工作面矿压云图等关键参数,为顶板灾害预警乃至其他灾害预警提供新思路,也为矿井安全高效开采与智能化建设奠定了坚实基础。 展开更多
关键词 综采工作面 深度学习 transformer模型 时间序列 矿压显现 矿压预测
在线阅读 下载PDF
基于Transformer多尺度融合网络的暖通空调能耗预测模型
7
作者 于水 韩府宏 +1 位作者 罗宇晨 孙圣坤 《太阳能学报》 北大核心 2026年第2期300-309,共10页
提出一种基于Transformer的多尺度融合网络模型,用于预测建筑暖通空调的能耗。通过引入多尺度金字塔模块与时间卷积网络结构,该模型能够有效捕捉时序特征的局部与整体信息,从而提高预测的准确性。实验结果表明,该模型在预测性能上优于... 提出一种基于Transformer的多尺度融合网络模型,用于预测建筑暖通空调的能耗。通过引入多尺度金字塔模块与时间卷积网络结构,该模型能够有效捕捉时序特征的局部与整体信息,从而提高预测的准确性。实验结果表明,该模型在预测性能上优于传统的单一模型,均方根误差(RMSE)与平均绝对误差(MAE)均显著降低,决定系数(R2)达到0.9826。该模型可为建筑能耗管理提供一种高效且准确的预测工具,有助于实现更高效的建筑能源管理与节能策略。 展开更多
关键词 HVAC 特征提取 深度学习 负荷预测 多尺度特征 transformer模型
原文传递
多尺度非对称注意力遥感去雾Transformer
8
作者 王旭阳 梁宇航 《广西师范大学学报(自然科学版)》 北大核心 2026年第2期77-89,共13页
雾霾干扰会导致遥感图像结构模糊、细节丢失,严重影响下游视觉任务的准确性。为此,本文提出一种异构增强的遥感图像去雾网络,从空间结构建模与频率信息整合2个层面提升特征恢复能力。具体而言,设计多尺度非对称注意力Transformer模块,... 雾霾干扰会导致遥感图像结构模糊、细节丢失,严重影响下游视觉任务的准确性。为此,本文提出一种异构增强的遥感图像去雾网络,从空间结构建模与频率信息整合2个层面提升特征恢复能力。具体而言,设计多尺度非对称注意力Transformer模块,引入方向感知机制以增强模糊边缘与纹理细节的建模;同时构建基于小波变换高低频自适应增强模块,使用Haar小波分解分离频域信息,分别通过高频与低频子模块强化边缘轮廓与结构表达。2个模块分别嵌入特征提取与融合阶段,协同缓解传统方法方向性建模不足与高频特征易丢失等问题。在保持低计算开销的前提下,本文方法在HAZE1K与RICE数据集上的平均PSNR/SSIM性能分别达到24.9936/0.9099与33.1802/0.8942,在细节恢复方面表现出显著优势。 展开更多
关键词 遥感图像去雾 transformER 非对称注意力 高低频特征增强 小波变换 方向感知建模 深度学习
在线阅读 下载PDF
基于可学习小波变换和Transformer融合的调制识别方法
9
作者 田明浩 杨盼云 姚沐汐 《通信技术》 2026年第1期31-37,共7页
针对复杂电磁环境下无线电信号调制识别精度低的问题,提出了一种基于可学习小波变换和Transformer融合的调制识别方法。首先,通过可学习小波变换模块将信号进行奇偶分解,利用强化的预测、更新算子和注意力机制自适应提取多分辨率特征,... 针对复杂电磁环境下无线电信号调制识别精度低的问题,提出了一种基于可学习小波变换和Transformer融合的调制识别方法。首先,通过可学习小波变换模块将信号进行奇偶分解,利用强化的预测、更新算子和注意力机制自适应提取多分辨率特征,同时引入正则化约束确保小波分解的稳定性;其次,构建双分支特征增强架构,通过挤压和激励(SE)注意力对小波特征进行自适应加权,利用Transformer捕获全局依赖关系;最后,将两个分支输出的特征在特征维度拼接后输入到全连接分类器中,以进行调制类型识别。实验结果表明,所提出的模型具有优异的调制识别精度。相较于其他深度学习方法,所提方法的整体识别精度提升了3%~10%,在不同信噪比的条件下均具有更强的特征学习能力和更好的鲁棒性。 展开更多
关键词 调制识别 深度学习 小波变换 transformER
在线阅读 下载PDF
Forecasting landslide deformation by integrating domain knowledge into interpretable deep learning considering spatiotemporal correlations 被引量:2
10
作者 Zhengjing Ma Gang Mei 《Journal of Rock Mechanics and Geotechnical Engineering》 2025年第2期960-982,共23页
Forecasting landslide deformation is challenging due to influence of various internal and external factors on the occurrence of systemic and localized heterogeneities.Despite the potential to improve landslide predict... Forecasting landslide deformation is challenging due to influence of various internal and external factors on the occurrence of systemic and localized heterogeneities.Despite the potential to improve landslide predictability,deep learning has yet to be sufficiently explored for complex deformation patterns associated with landslides and is inherently opaque.Herein,we developed a holistic landslide deformation forecasting method that considers spatiotemporal correlations of landslide deformation by integrating domain knowledge into interpretable deep learning.By spatially capturing the interconnections between multiple deformations from different observation points,our method contributes to the understanding and forecasting of landslide systematic behavior.By integrating specific domain knowledge relevant to each observation point and merging internal properties with external variables,the local heterogeneity is considered in our method,identifying deformation temporal patterns in different landslide zones.Case studies involving reservoir-induced landslides and creeping landslides demonstrated that our approach(1)enhances the accuracy of landslide deformation forecasting,(2)identifies significant contributing factors and their influence on spatiotemporal deformation characteristics,and(3)demonstrates how identifying these factors and patterns facilitates landslide forecasting.Our research offers a promising and pragmatic pathway toward a deeper understanding and forecasting of complex landslide behaviors. 展开更多
关键词 GEOHAZARDS Landslide deformation forecasting Landslide predictability Knowledge infused deep learning interpretable machine learning Attention mechanism transformer
在线阅读 下载PDF
A Survey of Deep Learning for Time Series Forecasting:Theories,Datasets,and State-of-the-Art Techniques 被引量:1
11
作者 Gaoyong Lu Yang Ou +5 位作者 Zhihong Wang Yingnan Qu Yingsheng Xia Dibin Tang Igor Kotenko Wei Li 《Computers, Materials & Continua》 2025年第11期2403-2441,共39页
Deep learning(DL)has revolutionized time series forecasting(TSF),surpassing traditional statistical methods(e.g.,ARIMA)and machine learning techniques in modeling complex nonlinear dynamics and long-term dependencies ... Deep learning(DL)has revolutionized time series forecasting(TSF),surpassing traditional statistical methods(e.g.,ARIMA)and machine learning techniques in modeling complex nonlinear dynamics and long-term dependencies prevalent in real-world temporal data.This comprehensive survey reviews state-of-the-art DL architectures forTSF,focusing on four core paradigms:(1)ConvolutionalNeuralNetworks(CNNs),adept at extracting localized temporal features;(2)Recurrent Neural Networks(RNNs)and their advanced variants(LSTM,GRU),designed for sequential dependency modeling;(3)Graph Neural Networks(GNNs),specialized for forecasting structured relational data with spatial-temporal dependencies;and(4)Transformer-based models,leveraging self-attention mechanisms to capture global temporal patterns efficiently.We provide a rigorous analysis of the theoretical underpinnings,recent algorithmic advancements(e.g.,TCNs,attention mechanisms,hybrid architectures),and practical applications of each framework,supported by extensive benchmark datasets(e.g.,ETT,traffic flow,financial indicators)and standardized evaluation metrics(MAE,MSE,RMSE).Critical challenges,including handling irregular sampling intervals,integrating domain knowledge for robustness,and managing computational complexity,are thoroughly discussed.Emerging research directions highlighted include diffusion models for uncertainty quantification,hybrid pipelines combining classical statistical and DL techniques for enhanced interpretability,quantile regression with Transformers for riskaware forecasting,and optimizations for real-time deployment.This work serves as an essential reference,consolidating methodological innovations,empirical resources,and future trends to bridge the gap between theoretical research and practical implementation needs for researchers and practitioners in the field. 展开更多
关键词 Time series forecasting deep learning transformER neural network
在线阅读 下载PDF
ISAR autofocus imaging algorithm for maneuvering targets based on deep learning and keystone transform 被引量:5
12
作者 SHI Hongyin LIU Yue +1 位作者 GUO Jianwen LIU Mingxin 《Journal of Systems Engineering and Electronics》 SCIE EI CSCD 2020年第6期1178-1185,共8页
The issue of small-angle maneuvering targets inverse synthetic aperture radar(ISAR)imaging has been successfully addressed by popular motion compensation algorithms.However,when the target’s rotational velocity is su... The issue of small-angle maneuvering targets inverse synthetic aperture radar(ISAR)imaging has been successfully addressed by popular motion compensation algorithms.However,when the target’s rotational velocity is sufficiently high during the dwell time of the radar,such compensation algorithms cannot obtain a high quality image.This paper proposes an ISAR imaging algorithm based on keystone transform and deep learning algorithm.The keystone transform is used to coarsely compensate for the target’s rotational motion and translational motion,and the deep learning algorithm is used to achieve a super-resolution image.The uniformly distributed point target data are used as the data set of the training u-net network.In addition,this method does not require estimating the motion parameters of the target,which simplifies the algorithm steps.Finally,several experiments are performed to demonstrate the effectiveness of the proposed algorithm. 展开更多
关键词 inverse synthetic aperture radar(ISAR) maneuvering target keystone transform deep learning u-net network
在线阅读 下载PDF
基于Retinex理论的低光照图像重建Transformer模型
13
作者 周秉萱 何家峰 程昱 《激光杂志》 北大核心 2026年第2期92-99,共8页
针对现有低光照图像重建算法普遍存在的颜色失真、细节模糊等问题,提出了一种基于Retinex理论的低光照图像重建Transformer模型。在Retinex物理模型的基础上,创新性地引入光照增益调整模块,借助像素级照度补偿系数估计,精准实现图像亮... 针对现有低光照图像重建算法普遍存在的颜色失真、细节模糊等问题,提出了一种基于Retinex理论的低光照图像重建Transformer模型。在Retinex物理模型的基础上,创新性地引入光照增益调整模块,借助像素级照度补偿系数估计,精准实现图像亮度的增强。为进一步提升光照特征对重建过程的指导效能,精心设计了自适应光照特征增强模块,提升光照特征的可判别性与鲁棒性。同时,有机融合小波变换的多尺度分解能力以及Transformer在长程依赖关系建模方面的独特优势,实现了在低光照条件下对图像亮度增强、噪声抑制与结构保留的协同优化。实验结果表明,该方法在多个公开低光照数据集上表现优异,能够有效提升低光照图像重建质量。 展开更多
关键词 深度学习 RETINEX 低光照图像 transformER
原文传递
基于多尺度特征增强和时序Transformer的SiC外延生长浓度预测模型
14
作者 张忠义 王朗 +6 位作者 芦伟立 李帅 杨建业 高楠 王波 潘国平 房玉龙 《硅酸盐学报》 北大核心 2026年第1期35-48,共14页
碳化硅(SiC)外延层掺杂浓度直接决定功率器件性能,现有调控依赖人工经验与离线测试,存在成本高、滞后性强的问题。为此,提出一种多尺度特征增强时序Transformer(Multi-scale Feature-enhanced Temporal Transformer Network,MFT-Net)模... 碳化硅(SiC)外延层掺杂浓度直接决定功率器件性能,现有调控依赖人工经验与离线测试,存在成本高、滞后性强的问题。为此,提出一种多尺度特征增强时序Transformer(Multi-scale Feature-enhanced Temporal Transformer Network,MFT-Net)模型,整合多尺度卷积、压缩和激励(SE)模块、Transformer及门控循环单元(GRU)模块,构建“当炉-下炉”双场景浓度预测体系。多尺度卷积捕捉毫秒至小时级参数动态,SE强化核心特征,Transformer建模全参数耦合,GRU传递跨炉状态。基于1200炉数据实验表明,模型当炉预测相对误差低至1.35%、决定系数R_(2)达0.89,下炉预测相对误差为1.66%,R_(2)达到0.87,显著优于传统模型。经统计,该模型预计可降低离线测试成本约30%、提升工艺稳定性约15%,为SiC外延智能化提供支撑。 展开更多
关键词 碳化硅外延生长 核心层掺杂浓度 多尺度卷积 transformER 工艺参数预测 深度学习
原文传递
Suppression of seismic random noise by deep learning combined with stationary wavelet packet transform 被引量:1
15
作者 Fan Hua Wang Dong-Bo +2 位作者 Zhang Yang Wang Wen-Xu Li Tao 《Applied Geophysics》 SCIE CSCD 2024年第4期740-751,880,共13页
Many traditional denoising methods,such as Gaussian filtering,tend to blur and lose details or edge information while reducing noise.The stationary wavelet packet transform is a multi-scale and multi-band analysis too... Many traditional denoising methods,such as Gaussian filtering,tend to blur and lose details or edge information while reducing noise.The stationary wavelet packet transform is a multi-scale and multi-band analysis tool.Compared with the stationary wavelet transform,it can suppress high-frequency noise while preserving more edge details.Deep learning has significantly progressed in denoising applications.DnCNN,a residual network;FFDNet,an efficient,fl exible network;U-NET,a codec network;and GAN,a generative adversative network,have better denoising effects than BM3D,the most popular conventional denoising method.Therefore,SWP_hFFDNet,a random noise attenuation network based on the stationary wavelet packet transform(SWPT)and modified FFDNet,is proposed.This network combines the advantages of SWPT,Huber norm,and FFDNet.In addition,it has three characteristics:First,SWPT is an eff ective featureextraction tool that can obtain low-and high-frequency features of different scales and frequency bands.Second,because the noise level map is the input of the network,the noise removal performance of diff erent noise levels can be improved.Third,the Huber norm can reduce the sensitivity of the network to abnormal data and enhance its robustness.The network is trained using the Adam algorithm and the BSD500 dataset,which is augmented,noised,and decomposed by SWPT.Experimental and actual data processing results show that the denoising eff ect of the proposed method is almost the same as those of BM3D,DnCNN,and FFDNet networks for low noise.However,for high noise,the proposed method is superior to the aforementioned networks. 展开更多
关键词 random noise stationary wavelet packet transform deep learning noise level map Huber norm
在线阅读 下载PDF
基于Transformer-卷积神经网络模型实现单节点腰部康复训练动作识别任务
16
作者 余圣涵 成贤锴 +1 位作者 郑跃 杨颖 《中国组织工程研究》 北大核心 2026年第16期4125-4136,共12页
背景:惯性测量单元被广泛用于人体姿态感知与动态捕捉。深度学习已逐步替代传统规则与特征工程,广泛应用于动作识别任务。卷积神经网络在提取局部动态特征方面表现良好,Transformer则在建模长时序依赖方面展现出强大能力。目的:通过基于... 背景:惯性测量单元被广泛用于人体姿态感知与动态捕捉。深度学习已逐步替代传统规则与特征工程,广泛应用于动作识别任务。卷积神经网络在提取局部动态特征方面表现良好,Transformer则在建模长时序依赖方面展现出强大能力。目的:通过基于Transformer-卷积神经网络融合模型识别方法,实现在单惯性传感器条件下的腰部康复训练动作识别任务。方法:采集6名健康受试者佩戴单个惯性传感器条件下执行腰部康复动作的加速度与角速度数据,以动作类型为数据进行标注,制作腰部康复动作数据集。通过腰部康复动作数据集对Transformer-卷积神经网络融合模型进行训练,构建动作分类模型。通过留一交叉验证评估模型准确性,并与线性判别分析、支持向量机、多层感知、经典Transformer等模型进行性能对比。结果与结论:在5类动作识别任务中,Transformer-卷积神经网络模型准确率达96.67%,F1-score为0.9669。在单传感器输入的条件下,相较于传统模型,在识别精度与泛化能力方面具有明显优势。验证了基于单惯性测量单元数据的深度模型在腰部康复动作分类任务中的实用性,为轻量化、高部署性的居家腰部康复训练系统提供基础。 展开更多
关键词 慢性腰痛 康复训练 深度学习 transformER 单节点惯性传感器 动作分类
暂未订购
基于改进实时检测Transformer的轻量级番茄果实检测算法
17
作者 卢承方 崔艳荣 +2 位作者 胡蓉华 王浩宇 陈鹏翔 《山东农业科学》 北大核心 2026年第1期150-163,共14页
针对复杂环境下番茄果实目标多、外形各异、分布密集等导致智能采摘识别难度大的问题,同时为解决Transformer架构巨大运算量导致的模型边缘设备部署困难的问题,本研究提出一种改进实时检测Transformer(Real-Time Detection Transformer,... 针对复杂环境下番茄果实目标多、外形各异、分布密集等导致智能采摘识别难度大的问题,同时为解决Transformer架构巨大运算量导致的模型边缘设备部署困难的问题,本研究提出一种改进实时检测Transformer(Real-Time Detection Transformer,RT-DETR)的番茄果实检测模型SPC-DETR。首先,实地拍摄采集多尺度的复杂环境下番茄果实图片,并从公开数据集中筛选符合复杂环境的番茄果实图片作为补充,进行标注及数据增强后构建了一个包含3398张图片的番茄果实数据集。其次,以RT-DETR-R18为基线模型,进行如下改进:选用计算复杂度低且检测精度较高的StarNet作为主干特征提取基准网络,并引入倒置残差移动块iRMB,构建iR-StarNet作为主干网络,在轻量化主干的同时进一步提高模型对番茄果实的特征提取能力;设计一种改进的并行空洞卷积结构块EMA-PDC,该模块通过融合跨空间学习的高效多尺度注意力机制,可在减少计算量的同时扩大模型感受野,提升模型多尺度表示的准确性;使用DEA-Net提出的内容引导注意力融合方案CGA-Fusion改进颈部网络,进一步提高模型表征能力。实验结果表明,SPC-DETR的精确率、召回率和平均精度均值(mAP50)分别为88.0%、83.7%和90.2%,分别比基线模型RT-DETR-R18提高2.0、0.3、1.7个百分点,模型权重大小、参数量、浮点运算量分别为25.2 MB、12.9 M、34.3 GFLOPs,分别比RT-DETR-R18降低34.72%、36.14%、41.47%,综合表现优于目前大部分主流的YOLO系列模型和DETR系列模型。本研究结果可为提升智能采摘机器人识别检测复杂环境下番茄果实的准确率提供技术参考。 展开更多
关键词 番茄果实识别 transformER RT-DETR 轻量化 目标实时检测 深度学习
在线阅读 下载PDF
基于Transformer的深部矿山微震信号自动分类技术研究
18
作者 骆贞江 雷入 +4 位作者 马少维 谭丽龙 于德宁 贺艳军 彭平安 《金属矿山》 北大核心 2026年第2期166-176,共11页
针对深井矿山微震监测中人工判别效率低下、传统机器学习方法对复杂非平稳信号分类精度不足的问题,提出一种基于Transformer架构的微震信号自动分类模型。基于真实矿山数据构建了包含微震事件、爆破振动及噪声3类信号的标注数据集。通... 针对深井矿山微震监测中人工判别效率低下、传统机器学习方法对复杂非平稳信号分类精度不足的问题,提出一种基于Transformer架构的微震信号自动分类模型。基于真实矿山数据构建了包含微震事件、爆破振动及噪声3类信号的标注数据集。通过提取信号时频特征并结合Transformer的自注意力机制,该模型有效捕捉了波形中的长程依赖关系与全局特征,显著提升了分类精度,同时增强了对波形混淆现象的辨识能力,展现出良好的泛化性与工程适用性。试验结果表明:模型在测试集上达到96.3%的整体分类准确率,微震事件与爆破信号的识别率均超过97%;在多项性能指标上均显著优于SVM、KNN、CNN-BiLSTM及VGG16等对比模型。本模型通过有效解决波形混淆问题,为矿山微震监测系统的智能化升级与地压灾害实时精准预警提供了技术支撑。 展开更多
关键词 微震监测 信号分类 transformer模型 深度学习 深部矿山
在线阅读 下载PDF
基于Transformer的壁画裂缝自动化数字修复方法
19
作者 刘晨玲 李炳焱 +3 位作者 米艳华 薛峰 王建华 戴震 《太原理工大学学报》 北大核心 2026年第1期186-194,共9页
【目的】随着现代科技的发展,文物保护领域正经历深刻变革。传统的壁画修复手段虽能维护文物的部分完整性,但受限于技术和操作的复杂性。人工智能等数字化文物修复技术能够实现对壁画文物的高精度记录与还原,然而,现有图像修复方法大多... 【目的】随着现代科技的发展,文物保护领域正经历深刻变革。传统的壁画修复手段虽能维护文物的部分完整性,但受限于技术和操作的复杂性。人工智能等数字化文物修复技术能够实现对壁画文物的高精度记录与还原,然而,现有图像修复方法大多依赖专家知识或人工辅助,无法实现壁画数字修复的自动化。【方法】本文结合北方地区长期缺水干燥的环境特征,针对壁画文物可能存在的撕扯断裂或裂缝错位问题,实现自动化数字修复。具体而言,本研究提出基于Transformer的壁画裂缝自动化数字修复方法;基于VGG16-Unet模型,创新性提出基于膨胀核的裂缝掩码优化方法,通过形态学膨胀操作显著提升裂缝覆盖率。此外,利用Transformer的全局结构学习能力与CNN上采样实现结构修复,并引入傅立叶卷积与大核注意力模块完成纹理细节还原和高保真修复。【结果】实验基于山西多地真实壁画数据验证了该方法的有效性,结果表明,该方法在裂缝识别覆盖度、结构还原度及纹理协调性方面均优于传统方法,尤其对北方干燥地区特有的断裂错位型裂缝修复效果显著。 展开更多
关键词 壁画修复 深度学习 壁画图像分割 transformER
在线阅读 下载PDF
基于WGAN-GP-Transformer的地表沉陷SHAP可解释预测模型
20
作者 刘超 夏大平 《金属矿山》 北大核心 2026年第2期203-217,共15页
为实现地表沉陷的精准预测,以及对预测结果进行深度解释来指导实际工程,提出了基于改进生成对抗网络(WGAN-GP)与Transformer的地表沉陷SHAP可解释模型。利用该模型对地表下沉量、影响角正切和拐点偏移距进行预测,从而将预测参数结合概... 为实现地表沉陷的精准预测,以及对预测结果进行深度解释来指导实际工程,提出了基于改进生成对抗网络(WGAN-GP)与Transformer的地表沉陷SHAP可解释模型。利用该模型对地表下沉量、影响角正切和拐点偏移距进行预测,从而将预测参数结合概率积分法来建立地表沉陷公式。首先,利用Wasserstein距离、梯度惩罚策略对传统生成对抗网络进行改进,以增强地表沉陷数据,丰富训练集。然后,采用基于多头自注意力机制的Transformer架构对增强数据进行深度学习,并通过贝叶斯优化寻优超参数。最后,基于SHAP法对预测过程与结果进行全面剖析解释,以揭示不同特征对预测参数的影响规律。结果表明:WGAN-GP-Transformer对下沉量、影响角正切与拐点偏移距在测试集上表现出优异的预测能力,表明模型能有效捕捉预测地表沉陷的复杂非线性特征,以及可有效应对数据稀缺的场景。揭示了影响3个预测参数的特征贡献强度与作用方向存在显著差异;松散层厚度对预测下沉量影响最大,采深对预测影响角正切和拐点偏移距的影响最大。模型在鲁西南某矿3301工作面的实际应用表明,其预测沉陷曲线与实际情况高度吻合,验证了其在实际工程中的可靠性与泛化性能。 展开更多
关键词 地表沉陷预测 transformer WGAN-GP SHAP 深度学习
在线阅读 下载PDF
上一页 1 2 76 下一页 到第
使用帮助 返回顶部