1,505 articles found
Deep Learning for Brain Tumor Segmentation and Classification: A Systematic Review of Methods and Trends
1
Authors: Ameer Hamza, Robertas Damaševicius. Source: Computers, Materials & Continua, 2026, Issue 1, pp. 132-172 (41 pages)
This systematic review aims to comprehensively examine and compare deep learning methods for brain tumor segmentation and classification using MRI and other imaging modalities, focusing on recent trends from 2022 to 2025. The primary objective is to evaluate methodological advancements, model performance, dataset usage, and existing challenges in developing clinically robust AI systems. We included peer-reviewed journal articles and high-impact conference papers published between 2022 and 2025, written in English, that proposed or evaluated deep learning methods for brain tumor segmentation and/or classification. Excluded were non-open-access publications, books, and non-English articles. A structured search was conducted across Scopus, Google Scholar, Wiley, and Taylor & Francis, with the last search performed in August 2025. Risk of bias was not formally quantified but was considered during full-text screening based on dataset diversity, validation methods, and availability of performance metrics. We used narrative synthesis and tabular benchmarking to compare performance metrics (e.g., accuracy, Dice score) across model types (CNN, Transformer, Hybrid), imaging modalities, and datasets. A total of 49 studies were included (43 journal articles and 6 conference papers). These studies spanned over 9 public datasets (e.g., BraTS, Figshare, REMBRANDT, MOLAB) and utilized a range of imaging modalities, predominantly MRI. Hybrid models, especially ResViT and UNetFormer, consistently achieved high performance, with classification accuracy exceeding 98% and segmentation Dice scores above 0.90 across multiple studies. Transformers and hybrid architectures showed increasing adoption post-2023. Many studies lacked external validation and were evaluated only on a few benchmark datasets, raising concerns about generalizability and dataset bias. Few studies addressed clinical interpretability or uncertainty quantification. Despite promising results, particularly for hybrid deep learning models, widespread clinical adoption remains limited due to lack of validation, interpretability concerns, and real-world deployment barriers.
Keywords: brain tumor segmentation; brain tumor classification; deep learning; vision transformers; hybrid models
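The Dice score that the review above uses to compare segmentation models can be illustrated with a minimal sketch. The function name `dice_score`, the flat 0/1 mask representation, and the empty-mask convention are assumptions made for illustration; they are not taken from the reviewed studies.

```python
def dice_score(pred, target):
    """Dice coefficient between two binary masks given as flat sequences of 0/1."""
    if len(pred) != len(target):
        raise ValueError("masks must have the same length")
    # Voxels labelled as tumor in both masks.
    intersection = sum(1 for p, t in zip(pred, target) if p and t)
    # Total number of tumor voxels across the two masks.
    total = sum(1 for p in pred if p) + sum(1 for t in target if t)
    # Assumed convention: two empty masks count as a perfect match.
    if total == 0:
        return 1.0
    return 2.0 * intersection / total


if __name__ == "__main__":
    predicted = [0, 1, 1, 1, 0, 0]
    reference = [0, 1, 1, 0, 0, 1]
    print(dice_score(predicted, reference))
```

A score of 1.0 means the predicted and reference masks coincide, and 0.0 means they do not overlap at all.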
Harnessing deep learning for the discovery of latent patterns in multi-omics medical data
2
Authors: Okechukwu Paul-Chima Ugwu, Fabian C Ogenyi, Chinyere Nkemjika Anyanwu, Melvin Nnaemeka Ugwu, Esther Ugo Alum, Mariam Basajja, Joseph Obiezu Chukwujekwu Ezeonwumelu, Daniel Ejim Uti, Ibe Michael Usman, Chukwuebuka Gabriel Eze, Simeon Ikechukwu Egba. Source: Medical Data Mining, 2026, Issue 1, pp. 32-45 (14 pages)
With the rapid growth of biomedical data, particularly multi-omics data including genomics, transcriptomics, proteomics, metabolomics, and epigenomics, medical research and clinical decision-making confront both new opportunities and obstacles. The huge and diversified nature of these datasets cannot always be managed using traditional data analysis methods. As a consequence, deep learning has emerged as a strong tool for analysing numerous omics data due to its ability to handle complex and non-linear relationships. This paper explores the fundamental concepts of deep learning and how they are used in multi-omics medical data mining. We demonstrate how autoencoders, variational autoencoders, multimodal models, attention mechanisms, transformers, and graph neural networks enable pattern analysis and recognition across all omics data. Deep learning has been found to be effective in illness classification, biomarker identification, gene network learning, and therapeutic efficacy prediction. We also consider critical problems such as data quality, model explainability, whether findings can be repeated, and computational power requirements. We then consider future directions: combining omics with clinical and imaging data, explainable AI, federated learning, and real-time diagnostics. Overall, this study emphasises the need to collaborate across disciplines to advance deep learning-based multi-omics research for precision medicine and for understanding complicated disorders.
Keywords: deep learning; multi-omics integration; biomedical data mining; precision medicine; graph neural networks; autoencoders and transformers
Application of deep learning models in the pathological classification and staging of esophageal cancer: A focus on Wave-Vision Transformer
3
Authors: Wei Wei, Xiao-Lei Zhang, Hong-Zhen Wang, Lin-Lin Wang, Jing-Li Wen, Xin Han, Qian Liu. Source: World Journal of Gastroenterology, 2025, Issue 19, pp. 68-85 (18 pages)
BACKGROUND: Esophageal cancer is the sixth most common cancer worldwide, with a high mortality rate. Early prognosis of esophageal abnormalities can improve patient survival rates. The progression of esophageal cancer follows a sequence from esophagitis to non-dysplastic Barrett’s esophagus, dysplastic Barrett’s esophagus, and eventually esophageal adenocarcinoma (EAC). This study explored the application of deep learning technology in the precise diagnosis of pathological classification and staging of EAC to enhance diagnostic accuracy and efficiency. AIM: To explore the application of deep learning models, particularly Wave-Vision Transformer (Wave-ViT), in the pathological classification and staging of esophageal cancer to enhance diagnostic accuracy and efficiency. METHODS: We applied several deep learning models, including multi-layer perceptron, residual network, transformer, and Wave-ViT, to a dataset of clinically validated esophageal pathology images. The models were trained to identify pathological features and assist in the classification and staging of esophageal cancer. The models were compared based on accuracy, computational complexity, and efficiency. RESULTS: The Wave-ViT model demonstrated the highest accuracy at 88.97%, surpassing the transformer (87.65%), residual network (85.44%), and multi-layer perceptron (81.17%). Additionally, Wave-ViT exhibited low computational complexity with significantly reduced parameter size, making it highly efficient for real-time clinical applications. CONCLUSION: Deep learning technology, particularly the Frequency-Domain Transformer model, shows promise in improving the precision of pathological classification and staging of EAC. The application of the Frequency-Domain Transformer model enhances the automation of the diagnostic process and may support early detection and treatment of EAC. Future research may further explore the potential of this model in broader medical image analysis applications, particularly in the field of precision medicine.
Keywords: esophageal cancer; deep learning; Wave-Vision Transformer; pathological classification; staging; early detection
Integrating explainable deep learning with multi-omics for screening progressive diagnostic biomarkers of hepatocellular carcinoma covering the “inflammation-cancer” transformation
4
Authors: Saiyu Li, Yiwen Zhang, Lifang Guan, Yijing Dong, Mingzhe Zhang, Qian Zhang, Huarong Xu, Wei Xiao, Zhenzhong Wang, Yan Cui, Qing Li. Source: Journal of Pharmaceutical Analysis, 2025, Issue 9, pp. 2199-2202 (4 pages)
Chronic uncontrolled inflammation is a major risk factor driving the occurrence of hepatocellular carcinoma (HCC), with over half of global cases attributed to hepatitis B virus (HBV) infection. Persistent inflammation frequently progresses to cirrhosis and, ultimately, malignancy [1]. Monitoring the key risk factors involved in the inflammatory-to-cancerous transformation in HCC is crucial for enabling timely intervention and improving patient survival rates. To address this challenge, we analyzed plasma samples collected from healthy volunteers and patients at various stages of HCC progression.
Keywords: plasma samples; chronic uncontrolled inflammation; multi-omics; explainable deep learning; hepatocellular carcinoma (HCC); key risk factors; inflammation-cancer transformation
A deep learning model for ocean surface latent heat flux based on transformer and data assimilation
5
Authors: Yahui Liu, Hengxiao Li, Jichao Wang. Source: Acta Oceanologica Sinica, 2025, Issue 5, pp. 115-130 (16 pages)
Efficient and accurate prediction of ocean surface latent heat fluxes is essential for understanding and modeling climate dynamics. Conventional estimation methods have low resolution and lack accuracy. The transformer model, with its self-attention mechanism, effectively captures long-range dependencies. However, due to the non-linearity and uncertainty of physical processes, the transformer model encounters the problem of error accumulation, leading to a degradation of accuracy over time. To solve this problem, we combine the Data Assimilation (DA) technique with the transformer model and continuously modify the model state to make it closer to the actual observations. In this paper, we propose a deep learning model called TransNetDA, which integrates transformer, convolutional neural network, and DA methods. By combining data-driven and DA methods for spatiotemporal prediction, TransNetDA effectively extracts multi-scale spatial features and significantly improves prediction accuracy. The experimental results indicate that the TransNetDA method surpasses traditional techniques in terms of root mean square error and R² metrics, showcasing its superior performance in predicting latent heat fluxes at the ocean surface.
Keywords: climate dynamics; deep learning (DL); Data Assimilation (DA); transformer; ensemble Kalman filter; ocean surface latent heat flux
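Intelligent prediction of time-series characteristics of mine pressure at fully mechanized mining faces driven by the Transformer architecture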
6
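Authors: 杜锋, 陈博, 王文强, 浦海, 杜雪明, 李国栋, 乔瑞, 李鑫磊, 徐杰, 曹煜. Source: 《煤田地质与勘探》 (PKU Core), 2026, Issue 2, pp. 1-13 (13 pages)
[Background] Mine pressure prediction is an important means of roof disaster early warning and management, and it is the prerequisite and foundation for safe production in intelligent mines. During mining, the environment of a fully mechanized mining face is complex and changeable, so the support pressure data collected by the electro-hydraulic control system vary widely in distribution and are difficult to predict. [Methods] A Transformer-based mine pressure prediction model is built. Missing pressure values are filled by linear interpolation, and a sliding-window algorithm restructures the pressure data for training. To account for the temporal nature of the data, an input sequence that incorporates temporal features is constructed, and the multi-head attention mechanism computes weights dynamically, adaptively focusing on key time steps according to the data themselves. This effectively captures complex nonlinear temporal dependencies and markedly improves feature representation and prediction capability. Finally, transfer learning is used to predict the working resistance of supports in the upper, middle, and lower parts of the working face, and an intelligent analysis and prediction platform based on mine pressure big data is built. [Results and Conclusions] Using multi-head attention instead of recurrent structures to capture global mine pressure features gives stronger long-sequence dependency modeling and feature learning than recurrent neural networks (RNN) and long short-term memory networks (LSTM), effectively reduces model loss, and is better suited to mine pressure prediction. On the test set, the mean squared error and mean absolute error losses of the Transformer model reach 0.34% and 2.57%, respectively. The Transformer model also generalizes well: after fine-tuning with transfer learning, model loss is effectively reduced and generalization improves when transferring to other supports on the same working face, which further verifies the applicability and feasibility of the Transformer model for mine pressure prediction. The platform's visualization system can accurately analyze key parameters before and after prediction, such as the number of weighting events, advance distance, weighting criteria, and working-face pressure contour maps, offering a new approach to early warning of roof disasters and other hazards and laying a solid foundation for safe, efficient mining and intelligent mine construction.
Keywords: fully mechanized mining face; deep learning; Transformer model; time series; mine pressure behavior; mine pressure prediction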
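The two preprocessing steps named in the abstract above, linear interpolation of missing pressure values and sliding-window restructuring, might look like the sketch below. The function names, the use of `None` for missing readings, the edge-gap handling, and the one-step forecast horizon are all assumptions for illustration; the abstract does not specify them.

```python
def fill_missing_linear(values):
    """Fill None entries by linear interpolation between the nearest known neighbours.

    Leading or trailing gaps are filled with the nearest known value
    (an assumption; the abstract does not say how edge gaps are handled).
    """
    known = [i for i, v in enumerate(values) if v is not None]
    if not known:
        raise ValueError("at least one known value is required")
    filled = list(values)
    for i, v in enumerate(values):
        if v is not None:
            continue
        left = max((k for k in known if k < i), default=None)
        right = min((k for k in known if k > i), default=None)
        if left is None:
            filled[i] = values[right]
        elif right is None:
            filled[i] = values[left]
        else:
            # Position of the gap between its two known neighbours, in [0, 1].
            weight = (i - left) / (right - left)
            filled[i] = values[left] + weight * (values[right] - values[left])
    return filled


def sliding_windows(series, window, horizon=1):
    """Return (input_window, target) pairs for supervised sequence training."""
    pairs = []
    for start in range(len(series) - window - horizon + 1):
        inputs = series[start:start + window]
        target = series[start + window + horizon - 1]
        pairs.append((inputs, target))
    return pairs


if __name__ == "__main__":
    pressure = [1.0, None, 3.0, None, None, 6.0]  # hypothetical support pressure readings
    clean = fill_missing_linear(pressure)
    for inputs, target in sliding_windows(clean, window=3):
        print(inputs, "->", target)
```

Each pair gives a model a fixed-length history and the value it should predict next, which is the usual way to turn a single time series into training samples.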
An HVAC energy consumption prediction model based on a Transformer multi-scale fusion network
7
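Authors: 于水, 韩府宏, 罗宇晨, 孙圣坤. Source: 《太阳能学报》 (PKU Core), 2026, Issue 2, pp. 300-309 (10 pages)
A Transformer-based multi-scale fusion network model is proposed for predicting the energy consumption of building HVAC systems. By introducing a multi-scale pyramid module and a temporal convolutional network structure, the model effectively captures both local and global information in time-series features, improving prediction accuracy. Experimental results show that the model outperforms traditional single models: the root mean square error (RMSE) and mean absolute error (MAE) are both markedly reduced, and the coefficient of determination (R²) reaches 0.9826. The model offers an efficient and accurate prediction tool for building energy management and helps achieve more efficient energy management and energy-saving strategies.
Keywords: HVAC; feature extraction; deep learning; load forecasting; multi-scale features; Transformer model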
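The three metrics reported in the abstract above (RMSE, MAE, and R²) follow standard definitions, sketched below. The function name `regression_metrics` and the sample numbers are hypothetical and unrelated to the paper's data.

```python
import math


def regression_metrics(y_true, y_pred):
    """Return (RMSE, MAE, R^2) for two equal-length sequences of numbers."""
    if len(y_true) != len(y_pred) or not y_true:
        raise ValueError("inputs must be non-empty and of equal length")
    n = len(y_true)
    errors = [p - t for t, p in zip(y_true, y_pred)]
    ss_res = sum(e * e for e in errors)          # residual sum of squares
    mean_true = sum(y_true) / n
    ss_tot = sum((t - mean_true) ** 2 for t in y_true)  # total sum of squares
    if ss_tot == 0:
        raise ValueError("R^2 is undefined when y_true is constant")
    rmse = math.sqrt(ss_res / n)
    mae = sum(abs(e) for e in errors) / n
    r2 = 1.0 - ss_res / ss_tot
    return rmse, mae, r2


if __name__ == "__main__":
    measured = [1.0, 2.0, 3.0, 4.0]   # hypothetical measured energy use
    predicted = [1.0, 2.0, 3.0, 5.0]  # hypothetical model output
    print(regression_metrics(measured, predicted))
```

Lower RMSE and MAE and an R² closer to 1 indicate a better fit, which is how a value such as 0.9826 should be read.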
A multi-scale asymmetric attention Transformer for remote sensing image dehazing
8
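Authors: 王旭阳, 梁宇航. Source: 《广西师范大学学报(自然科学版)》 (PKU Core), 2026, Issue 2, pp. 77-89 (13 pages)
Haze interference blurs the structure of remote sensing images and causes loss of detail, seriously affecting the accuracy of downstream vision tasks. To address this, this paper proposes a heterogeneously enhanced remote sensing image dehazing network that improves feature recovery at two levels: spatial structure modeling and frequency information integration. Specifically, a multi-scale asymmetric attention Transformer module is designed, introducing a direction-aware mechanism to strengthen the modeling of blurred edges and texture details. In addition, a wavelet-transform-based adaptive high- and low-frequency enhancement module is constructed; it uses Haar wavelet decomposition to separate frequency-domain information and strengthens edge contours and structural representation through high-frequency and low-frequency sub-modules, respectively. The two modules are embedded in the feature extraction and fusion stages, respectively, and jointly alleviate the insufficient directional modeling and easy loss of high-frequency features in traditional methods. While keeping computational overhead low, the proposed method achieves average PSNR/SSIM of 24.9936/0.9099 on the HAZE1K dataset and 33.1802/0.8942 on the RICE dataset, showing a clear advantage in detail recovery.
Keywords: remote sensing image dehazing; Transformer; asymmetric attention; high- and low-frequency feature enhancement; wavelet transform; direction-aware modeling; deep learning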
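The Haar wavelet decomposition mentioned in the abstract above splits a signal into a low-frequency (average) part and a high-frequency (difference) part. The sketch below shows a single-level, one-dimensional version only; the paper works on images, so applying it along one axis, as well as the function names, are simplifying assumptions.

```python
import math


def haar_decompose(signal):
    """One level of the orthonormal Haar transform on an even-length sequence."""
    if len(signal) % 2 != 0:
        raise ValueError("signal length must be even")
    scale = 1.0 / math.sqrt(2.0)
    # Low-frequency band: scaled pairwise sums (smooth structure).
    low = [(signal[i] + signal[i + 1]) * scale for i in range(0, len(signal), 2)]
    # High-frequency band: scaled pairwise differences (edges, fine detail).
    high = [(signal[i] - signal[i + 1]) * scale for i in range(0, len(signal), 2)]
    return low, high


def haar_reconstruct(low, high):
    """Invert haar_decompose, recovering the original samples."""
    scale = 1.0 / math.sqrt(2.0)
    signal = []
    for a, d in zip(low, high):
        signal.append((a + d) * scale)
        signal.append((a - d) * scale)
    return signal


if __name__ == "__main__":
    row = [3.0, 3.0, 5.0, 1.0]  # hypothetical pixel row: flat region, then an edge
    low, high = haar_decompose(row)
    print(low, high)
    print(haar_reconstruct(low, high))
```

The flat pair contributes nothing to the high-frequency band while the edge does, which is why the two bands can be enhanced by separate sub-modules.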
A modulation recognition method based on the fusion of a learnable wavelet transform and Transformer
9
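Authors: 田明浩, 杨盼云, 姚沐汐. Source: 《通信技术》, 2026, Issue 1, pp. 31-37 (7 pages)
To address the low accuracy of radio signal modulation recognition in complex electromagnetic environments, a modulation recognition method based on the fusion of a learnable wavelet transform and Transformer is proposed. First, a learnable wavelet transform module splits the signal into even and odd components and adaptively extracts multi-resolution features using enhanced predict and update operators together with an attention mechanism, while a regularization constraint is introduced to keep the wavelet decomposition stable. Second, a dual-branch feature enhancement architecture is built: squeeze-and-excitation (SE) attention adaptively weights the wavelet features, and a Transformer captures global dependencies. Finally, the features output by the two branches are concatenated along the feature dimension and fed into a fully connected classifier for modulation type recognition. Experimental results show that the proposed model achieves excellent modulation recognition accuracy. Compared with other deep learning methods, its overall recognition accuracy is 3% to 10% higher, and it shows stronger feature learning ability and better robustness across different signal-to-noise ratios.
Keywords: modulation recognition; deep learning; wavelet transform; Transformer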
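The even/odd split with predict and update operators described above matches the general shape of a wavelet lifting step, sketched below. In the paper these operators are learnable; here they are fixed placeholder functions (`predict_identity` and `update_half` are hypothetical names) chosen so that the step reduces to a pairwise average and difference.

```python
def lifting_forward(signal, predict, update):
    """One lifting step: split into even/odd samples, then predict and update."""
    if len(signal) % 2 != 0:
        raise ValueError("signal length must be even")
    even, odd = signal[0::2], signal[1::2]
    # Detail: what the predictor fails to explain about the odd samples.
    detail = [o - p for o, p in zip(odd, predict(even))]
    # Approximation: even samples corrected using the detail.
    approx = [e + u for e, u in zip(even, update(detail))]
    return approx, detail


def lifting_inverse(approx, detail, predict, update):
    """Undo lifting_forward exactly by reversing the two steps."""
    even = [a - u for a, u in zip(approx, update(detail))]
    odd = [d + p for d, p in zip(detail, predict(even))]
    signal = [0] * (2 * len(even))
    signal[0::2] = even
    signal[1::2] = odd
    return signal


def predict_identity(even):
    """Placeholder predictor: guess each odd sample equals its even neighbour."""
    return list(even)


def update_half(detail):
    """Placeholder updater: add half the detail so approx is the pair mean."""
    return [0.5 * d for d in detail]


if __name__ == "__main__":
    samples = [4, 6, 2, 0]  # hypothetical signal samples
    approx, detail = lifting_forward(samples, predict_identity, update_half)
    print(approx, detail)
    print(lifting_inverse(approx, detail, predict_identity, update_half))
```

Because the inverse simply reverses the two steps, reconstruction stays exact whatever predict and update functions are substituted, which is one reason the structure suits learned operators.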
Forecasting landslide deformation by integrating domain knowledge into interpretable deep learning considering spatiotemporal correlations (Cited by: 2)
10
Authors: Zhengjing Ma, Gang Mei. Source: Journal of Rock Mechanics and Geotechnical Engineering, 2025, Issue 2, pp. 960-982 (23 pages)
Forecasting landslide deformation is challenging due to the influence of various internal and external factors on the occurrence of systemic and localized heterogeneities. Despite the potential to improve landslide predictability, deep learning has yet to be sufficiently explored for complex deformation patterns associated with landslides and is inherently opaque. Herein, we developed a holistic landslide deformation forecasting method that considers spatiotemporal correlations of landslide deformation by integrating domain knowledge into interpretable deep learning. By spatially capturing the interconnections between multiple deformations from different observation points, our method contributes to the understanding and forecasting of landslide systematic behavior. By integrating specific domain knowledge relevant to each observation point and merging internal properties with external variables, the local heterogeneity is considered in our method, identifying deformation temporal patterns in different landslide zones. Case studies involving reservoir-induced landslides and creeping landslides demonstrated that our approach (1) enhances the accuracy of landslide deformation forecasting, (2) identifies significant contributing factors and their influence on spatiotemporal deformation characteristics, and (3) demonstrates how identifying these factors and patterns facilitates landslide forecasting. Our research offers a promising and pragmatic pathway toward a deeper understanding and forecasting of complex landslide behaviors.
Keywords: geohazards; landslide deformation forecasting; landslide predictability; knowledge-infused deep learning; interpretable machine learning; attention mechanism; transformer
ISAR autofocus imaging algorithm for maneuvering targets based on deep learning and keystone transform (Cited by: 5)
11
Authors: SHI Hongyin, LIU Yue, GUO Jianwen, LIU Mingxin. Source: Journal of Systems Engineering and Electronics (SCIE, EI, CSCD), 2020, Issue 6, pp. 1178-1185 (8 pages)
Inverse synthetic aperture radar (ISAR) imaging of small-angle maneuvering targets has been successfully addressed by popular motion compensation algorithms. However, when the target’s rotational velocity is sufficiently high during the dwell time of the radar, such compensation algorithms cannot obtain a high-quality image. This paper proposes an ISAR imaging algorithm based on the keystone transform and a deep learning algorithm. The keystone transform is used to coarsely compensate for the target’s rotational motion and translational motion, and the deep learning algorithm is used to achieve a super-resolution image. Uniformly distributed point target data are used as the data set for training the U-Net network. In addition, this method does not require estimating the motion parameters of the target, which simplifies the algorithm steps. Finally, several experiments are performed to demonstrate the effectiveness of the proposed algorithm.
Keywords: inverse synthetic aperture radar (ISAR); maneuvering target; keystone transform; deep learning; U-Net network
A concentration prediction model for SiC epitaxial growth based on multi-scale feature enhancement and a temporal Transformer
12
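Authors: 张忠义, 王朗, 芦伟立, 李帅, 杨建业, 高楠, 王波, 潘国平, 房玉龙. Source: 《硅酸盐学报》 (PKU Core), 2026, Issue 1, pp. 35-48 (14 pages)
The doping concentration of the silicon carbide (SiC) epitaxial layer directly determines power device performance. Current control relies on manual experience and offline testing, which is costly and suffers from long lag. To address this, a Multi-scale Feature-enhanced Temporal Transformer Network (MFT-Net) is proposed. It integrates multi-scale convolution, squeeze-and-excitation (SE) modules, a Transformer, and gated recurrent unit (GRU) modules to build a dual-scenario concentration prediction system covering the current furnace run and the next furnace run. Multi-scale convolution captures parameter dynamics from the millisecond to the hour scale, SE strengthens core features, the Transformer models coupling across all parameters, and the GRU passes state across runs. Experiments on data from 1200 furnace runs show that current-run prediction has a relative error as low as 1.35% and a coefficient of determination R² of 0.89, while next-run prediction has a relative error of 1.66% and an R² of 0.87, markedly better than traditional models. According to the reported statistics, the model is expected to reduce offline testing costs by about 30% and improve process stability by about 15%, supporting intelligent SiC epitaxy.
Keywords: SiC epitaxial growth; core-layer doping concentration; multi-scale convolution; Transformer; process parameter prediction; deep learning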
Suppression of seismic random noise by deep learning combined with stationary wavelet packet transform (Cited by: 1)
13
Authors: Fan Hua, Wang Dong-Bo, Zhang Yang, Wang Wen-Xu, Li Tao. Source: Applied Geophysics (SCIE, CSCD), 2024, Issue 4, pp. 740-751, 880 (13 pages)
Many traditional denoising methods, such as Gaussian filtering, tend to blur and lose details or edge information while reducing noise. The stationary wavelet packet transform is a multi-scale and multi-band analysis tool. Compared with the stationary wavelet transform, it can suppress high-frequency noise while preserving more edge details. Deep learning has progressed significantly in denoising applications. DnCNN, a residual network; FFDNet, an efficient, flexible network; U-Net, an encoder-decoder network; and GAN, a generative adversarial network, have better denoising effects than BM3D, the most popular conventional denoising method. Therefore, SWP_hFFDNet, a random noise attenuation network based on the stationary wavelet packet transform (SWPT) and a modified FFDNet, is proposed. This network combines the advantages of SWPT, the Huber norm, and FFDNet. In addition, it has three characteristics. First, SWPT is an effective feature-extraction tool that can obtain low- and high-frequency features of different scales and frequency bands. Second, because the noise level map is an input of the network, the noise removal performance at different noise levels can be improved. Third, the Huber norm can reduce the sensitivity of the network to abnormal data and enhance its robustness. The network is trained using the Adam algorithm and the BSD500 dataset, which is augmented, noised, and decomposed by SWPT. Experimental and actual data processing results show that the denoising effect of the proposed method is almost the same as those of the BM3D, DnCNN, and FFDNet networks for low noise. However, for high noise, the proposed method is superior to the aforementioned networks.
Keywords: random noise; stationary wavelet packet transform; deep learning; noise level map; Huber norm
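The robustness that the abstract above attributes to the Huber norm comes from how it treats large residuals. The sketch below uses the standard Huber loss; the threshold `delta=1.0` and the way the loss is averaged are assumptions, since the abstract does not state how the norm enters the network's objective.

```python
def huber_loss(residual, delta=1.0):
    """Standard Huber loss: quadratic near zero, linear for large residuals."""
    abs_r = abs(residual)
    if abs_r <= delta:
        return 0.5 * residual * residual
    # The linear branch grows more slowly than a squared error would.
    return delta * (abs_r - 0.5 * delta)


def mean_huber(y_true, y_pred, delta=1.0):
    """Average Huber loss over paired samples."""
    if len(y_true) != len(y_pred) or not y_true:
        raise ValueError("inputs must be non-empty and of equal length")
    return sum(huber_loss(p - t, delta) for t, p in zip(y_true, y_pred)) / len(y_true)


if __name__ == "__main__":
    # A small residual is penalised like a squared error; an outlier is not.
    print(huber_loss(0.5), huber_loss(3.0))
    print(0.5 * 3.0 ** 2)  # squared-error penalty for the same outlier
```

For a residual of 3.0 the squared-error penalty is 4.5 while the Huber penalty is 2.5, so abnormal samples pull on the fit less strongly.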
Single-node lumbar rehabilitation training action recognition based on a Transformer-convolutional neural network model
14
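Authors: 余圣涵, 成贤锴, 郑跃, 杨颖. Source: 《中国组织工程研究》 (PKU Core), 2026, Issue 16, pp. 4125-4136 (12 pages)
Background: Inertial measurement units are widely used for human posture sensing and motion capture. Deep learning has gradually replaced traditional rules and feature engineering and is widely applied to action recognition. Convolutional neural networks perform well at extracting local dynamic features, while the Transformer shows strong capability in modeling long-range temporal dependencies. Objective: To recognize lumbar rehabilitation training actions with a single inertial sensor using a recognition method based on a fused Transformer-convolutional neural network model. Methods: Acceleration and angular velocity data were collected from six healthy subjects wearing a single inertial sensor while performing lumbar rehabilitation actions. The data were labeled by action type to create a lumbar rehabilitation action dataset. The fused Transformer-convolutional neural network model was trained on this dataset to build an action classification model. Model accuracy was evaluated by leave-one-out cross-validation and compared with linear discriminant analysis, support vector machine, multilayer perceptron, and the classic Transformer. Results and Conclusion: In a five-class action recognition task, the Transformer-convolutional neural network model achieved an accuracy of 96.67% and an F1-score of 0.9669. With single-sensor input, it showed clear advantages over traditional models in recognition accuracy and generalization. The study verifies the practicality of deep models based on data from a single inertial measurement unit for classifying lumbar rehabilitation actions and provides a basis for lightweight, easily deployable home lumbar rehabilitation training systems.
Keywords: chronic low back pain; rehabilitation training; deep learning; Transformer; single-node inertial sensor; action classification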
A lightweight tomato fruit detection algorithm based on an improved Real-Time Detection Transformer
15
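Authors: 卢承方, 崔艳荣, 胡蓉华, 王浩宇, 陈鹏翔. Source: 《山东农业科学》 (PKU Core), 2026, Issue 1, pp. 150-163 (14 pages)
Intelligent picking must recognize tomato fruits that are numerous, varied in shape, and densely distributed in complex environments, which makes detection difficult; in addition, the heavy computation of the Transformer architecture makes models hard to deploy on edge devices. To address both problems, this study proposes SPC-DETR, a tomato fruit detection model based on an improved Real-Time Detection Transformer (RT-DETR). First, multi-scale images of tomato fruits in complex environments were photographed in the field and supplemented with suitable images selected from public datasets; after annotation and data augmentation, a tomato fruit dataset of 3398 images was built. Second, with RT-DETR-R18 as the baseline, the following improvements were made. StarNet, which has low computational complexity and relatively high detection accuracy, was chosen as the reference backbone for feature extraction, and the inverted residual mobile block (iRMB) was introduced to build iR-StarNet as the backbone, which lightens the backbone while further improving feature extraction for tomato fruits. An improved parallel dilated convolution block, EMA-PDC, was designed; by incorporating an efficient multi-scale attention mechanism with cross-spatial learning, it enlarges the receptive field while reducing computation and improves the accuracy of multi-scale representation. The neck network was improved with CGA-Fusion, the content-guided attention fusion scheme proposed in DEA-Net, to further improve representational capacity. Experimental results show that SPC-DETR achieves precision, recall, and mean average precision (mAP50) of 88.0%, 83.7%, and 90.2%, which are 2.0, 0.3, and 1.7 percentage points higher than the RT-DETR-R18 baseline, respectively. Its weight size, parameter count, and floating-point operations are 25.2 MB, 12.9 M, and 34.3 GFLOPs, which are 34.72%, 36.14%, and 41.47% lower than RT-DETR-R18, respectively. Overall, it outperforms most current mainstream YOLO-series and DETR-series models. The results provide a technical reference for improving the accuracy with which intelligent picking robots detect tomato fruits in complex environments.
Keywords: tomato fruit recognition; Transformer; RT-DETR; lightweight; real-time object detection; deep learning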
Research on automatic classification of microseismic signals in deep mines based on Transformer
16
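Authors: 骆贞江, 雷入, 马少维, 谭丽龙, 于德宁, 贺艳军, 彭平安. Source: 《金属矿山》 (PKU Core), 2026, Issue 2, pp. 166-176 (11 pages)
To address the low efficiency of manual discrimination in microseismic monitoring of deep mines and the insufficient accuracy of traditional machine learning methods in classifying complex non-stationary signals, an automatic microseismic signal classification model based on the Transformer architecture is proposed. A labeled dataset containing three signal classes (microseismic events, blasting vibrations, and noise) was built from real mine data. By extracting time-frequency features of the signals and combining them with the Transformer's self-attention mechanism, the model effectively captures long-range dependencies and global features in the waveforms, markedly improving classification accuracy and strengthening its ability to distinguish easily confused waveforms, and it shows good generalization and engineering applicability. Test results show that the model reaches an overall classification accuracy of 96.3% on the test set, with recognition rates for both microseismic events and blasting signals above 97%, and that it clearly outperforms comparison models such as SVM, KNN, CNN-BiLSTM, and VGG16 on multiple performance metrics. By effectively resolving waveform confusion, the model provides technical support for the intelligent upgrading of mine microseismic monitoring systems and for real-time, accurate early warning of ground pressure disasters.
Keywords: microseismic monitoring; signal classification; Transformer model; deep learning; deep mine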
A Transformer-based method for automated digital restoration of mural cracks
17
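Authors: 刘晨玲, 李炳焱, 米艳华, 薛峰, 王建华, 戴震. Source: 《太原理工大学学报》 (PKU Core), 2026, Issue 1, pp. 186-194 (9 pages)
[Purpose] With the development of modern technology, cultural heritage conservation is undergoing profound change. Traditional mural restoration can preserve part of an artifact's integrity but is limited by technical and operational complexity. Digital restoration technologies such as artificial intelligence can record and reproduce murals with high precision; however, most existing image restoration methods depend on expert knowledge or manual assistance and cannot automate the digital restoration of murals. [Methods] Considering the long-term dry, water-scarce environment of northern China, this paper targets the tearing, fracture, and crack-dislocation damage that murals may suffer and realizes automated digital restoration. Specifically, a Transformer-based method for automated digital restoration of mural cracks is proposed. Building on the VGG16-Unet model, a crack mask optimization method based on a dilation kernel is introduced, in which morphological dilation markedly improves crack coverage. In addition, the global structure learning ability of the Transformer is combined with CNN upsampling to restore structure, and Fourier convolution and a large-kernel attention module are introduced to restore texture details and achieve high-fidelity restoration. [Results] Experiments on real mural data from several sites in Shanxi verified the effectiveness of the method. The results show that it outperforms traditional methods in crack detection coverage, structural restoration, and texture consistency, and is particularly effective for the fracture-dislocation cracks characteristic of dry northern regions.
Keywords: mural restoration; deep learning; mural image segmentation; Transformer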
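The morphological dilation that the abstract above uses to widen crack masks can be sketched on a small binary grid. The square structuring element, the `radius` parameter, and the nested-list mask format are assumptions for illustration; the paper's actual kernel shape and size are not given in the abstract.

```python
def dilate(mask, radius=1):
    """Binary dilation of a 2D 0/1 mask with a (2*radius+1)-wide square kernel."""
    rows, cols = len(mask), len(mask[0])
    out = [[0] * cols for _ in range(rows)]
    for r in range(rows):
        for c in range(cols):
            if not mask[r][c]:
                continue
            # Mark every in-bounds neighbour of a crack pixel as crack.
            for dr in range(-radius, radius + 1):
                for dc in range(-radius, radius + 1):
                    rr, cc = r + dr, c + dc
                    if 0 <= rr < rows and 0 <= cc < cols:
                        out[rr][cc] = 1
    return out


if __name__ == "__main__":
    # Hypothetical one-pixel-wide vertical crack in a 5x5 patch.
    crack = [
        [0, 0, 1, 0, 0],
        [0, 0, 1, 0, 0],
        [0, 0, 1, 0, 0],
        [0, 0, 1, 0, 0],
        [0, 0, 1, 0, 0],
    ]
    for row in dilate(crack):
        print(row)
```

Widening the mask this way helps it cover crack pixels that the segmentation step missed at the edges, at the cost of also marking some intact pixels for repainting.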
A SHAP-interpretable prediction model for surface subsidence based on WGAN-GP-Transformer
18
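Authors: 刘超, 夏大平. Source: 《金属矿山》 (PKU Core), 2026, Issue 2, pp. 203-217 (15 pages)
To predict surface subsidence accurately and to interpret the predictions in depth so as to guide engineering practice, a SHAP-interpretable surface subsidence model based on an improved generative adversarial network (WGAN-GP) and Transformer is proposed. The model predicts the surface subsidence value, the tangent of the influence angle, and the inflection point offset, and the predicted parameters are combined with the probability integral method to establish a surface subsidence formula. First, the traditional generative adversarial network is improved with the Wasserstein distance and a gradient penalty strategy to augment the surface subsidence data and enrich the training set. Then, a Transformer architecture based on multi-head self-attention performs deep learning on the augmented data, with hyperparameters tuned by Bayesian optimization. Finally, the prediction process and results are analyzed and explained with SHAP to reveal how different features influence the predicted parameters. Results show that WGAN-GP-Transformer predicts the subsidence value, tangent of the influence angle, and inflection point offset well on the test set, indicating that the model effectively captures the complex nonlinear features of surface subsidence and can cope with data-scarce scenarios. The analysis also reveals marked differences in the strength and direction of feature contributions across the three predicted parameters: loose layer thickness has the greatest influence on the predicted subsidence value, while mining depth has the greatest influence on the predicted tangent of the influence angle and inflection point offset. Application to the 3301 working face of a mine in southwestern Shandong shows that the predicted subsidence curve closely matches the actual situation, verifying the model's reliability and generalization in engineering practice.
Keywords: surface subsidence prediction; Transformer; WGAN-GP; SHAP; deep learning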
Flow field prediction for supercritical airfoils: combining Transformer and convolutional neural networks
19
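Authors: 贺子舟, 唐维劭, 王龑, 杨韫加, 张宇飞. Source: 《国防科技大学学报》 (PKU Core), 2026, Issue 1, pp. 16-27 (12 pages)
To enable fast flow field prediction for supercritical airfoils, a deep learning model with a combined structure, called TransCNN-FoilNet, is proposed, building on the two main approaches in current deep learning flow field prediction models: convolutional neural networks and the Transformer. The model can predict the flow fields of a series of supercritical airfoils of different thicknesses at different angles of attack, and reduces the mean absolute error by up to 79.5% relative to the baseline model. A new combined loss function for training supercritical airfoil flow field prediction models, called the weighted L1SSIM loss, is also proposed. Results show that this loss function improves the prediction of lift and drag coefficients, reducing the relative error of the drag coefficient by up to 17.8%. The proposed model improves prediction accuracy and generalization while reducing complexity, providing strong support for fast and reliable prediction of supercritical airfoil flow fields.
Keywords: supercritical airfoil; deep learning; flow field prediction; Transformer model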
Research progress of convolutional neural networks and Vision Transformer in glioma
20
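Authors: 杨浩辉, 徐涛, 王伟, 安良良, 敖用芳, 朱家宝. Source: 《磁共振成像》 (PKU Core), 2026, Issue 1, pp. 168-174 (7 pages)
Glioma is highly heterogeneous and strongly invasive and has a poor prognosis, so traditional diagnosis and treatment face great challenges. Deep learning offers a new path toward precise diagnosis and treatment, with the convolutional neural network (CNN) and the Vision Transformer (ViT) as core tools. Through hierarchical convolution, CNNs have a natural advantage in extracting local features (such as tumor margins and texture details), while ViT, based on self-attention, excels at modeling global context (such as cross-region tumor heterogeneity and multimodal associations). Fusion strategies that integrate fine local features with global contextual information show clear advantages in handling clinical difficulties such as blurred glioma boundaries and heterogeneity across modalities. This article reviews research progress on both architectures in key clinical tasks for glioma, including detection and segmentation, pathological grading, molecular subtyping, and prognosis assessment, and describes their principles, standalone applications, and fusion strategies. It also discusses current challenges, such as strong dependence on data annotation and insufficient model interpretability, and looks ahead to future directions, such as building lightweight architectures, developing self-supervised learning, and advancing multi-omics fusion, with the aim of providing a systematic reference for intelligent diagnosis of glioma.
Keywords: glioma; deep learning; convolutional neural network; Vision Transformer; magnetic resonance imaging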