15,257 articles found
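Object Detection Algorithm Based on a Multi-Directional Perception Deep Fusion Detection Head
1
Authors: 包晓安, 彭书友, 张娜, 涂小妹, 张庆琪, 吴彪. 《浙江大学学报(工学版)》, PKU Core, 2026, No. 1, pp. 32-42 (11 pages)
To address the difficulty that conventional object detection heads have in capturing global information, an object detection algorithm based on a multi-directional perception deep fusion detection head is proposed. An efficient dual-axis window attention encoder (EDWE) module is designed in the detection head so that the network can deeply fuse the global and local information it captures. A re-parameterized large-kernel convolution (RLK) module is placed after the feature pyramid structure to reduce the spatial discrepancy of features coming from the backbone network and to improve the network's adaptability to small and medium-sized datasets. An encoder selective-retention module (ESM) is introduced to selectively accumulate the outputs of the EDWE modules and optimize backpropagation. Experiments show that on the larger MS-COCO2017 dataset, applying the proposed algorithm to the common models RetinaNet, FCOS, and ATSS raises AP by 2.9, 2.6, and 3.4 percentage points, respectively; on the smaller PASCAL VOC2007 dataset, it raises the AP of the three models by 1.3, 1.0, and 1.1 percentage points, respectively. Through the combined action of the EDWE, RLK, and ESM modules, the proposed algorithm improves detection accuracy on datasets of different scales.
Keywords: detection head; object detection; Transformer encoder; deep fusion; large-kernel convolution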
MewCDNet: A Wavelet-Based Multi-Scale Interaction Network for Efficient Remote Sensing Building Change Detection
2
Authors: Jia Liu, Hao Chen, Hang Gu, Yushan Pan, Haoran Chen, Erlin Tian, Min Huang, Zuhe Li. 《Computers, Materials & Continua》, 2026, No. 1, pp. 687-710 (24 pages)
Accurate and efficient detection of building changes in remote sensing imagery is crucial for urban planning, disaster emergency response, and resource management. However, existing methods face challenges such as spectral similarity between buildings and backgrounds, sensor variations, and insufficient computational efficiency. To address these challenges, this paper proposes a novel Multi-scale Efficient Wavelet-based Change Detection Network (MewCDNet), which integrates the advantages of Convolutional Neural Networks and Transformers, balances computational costs, and achieves high-performance building change detection. The network employs EfficientNet-B4 as the backbone for hierarchical feature extraction, integrates multi-level feature maps through a multi-scale fusion strategy, and incorporates two key modules: Cross-temporal Difference Detection (CTDD) and Cross-scale Wavelet Refinement (CSWR). CTDD adopts a dual-branch architecture that combines pixel-wise differencing with semantic-aware Euclidean distance weighting to enhance the distinction between true changes and background noise. CSWR integrates a Haar-based Discrete Wavelet Transform with multi-head cross-attention mechanisms, enabling cross-scale feature fusion while significantly improving edge localization and suppressing spurious changes. Extensive experiments on four benchmark datasets demonstrate MewCDNet's superiority over comparison methods: it achieves F1 scores of 91.54% on LEVIR, 93.70% on WHUCD, and 64.96% on S2Looking for building change detection. Furthermore, MewCDNet achieves the best performance on the multi-class SYSU dataset (F1: 82.71%), highlighting its generalization capability.
Keywords: remote sensing; change detection; deep learning; wavelet transform; multi-scale
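The MewCDNet abstract names a Haar-based Discrete Wavelet Transform but gives no implementation detail. As a rough illustration of the transform itself only, here is a minimal single-level, one-dimensional orthonormal Haar DWT in plain Python. The function names `haar_dwt_1d` and `haar_idwt_1d` are hypothetical, and nothing here is taken from the paper's CSWR module, which operates on feature maps.

```python
import math


def haar_dwt_1d(signal):
    """Single-level orthonormal Haar DWT of an even-length sequence.

    Returns (approximation, detail) coefficient lists, each half the
    length of the input. Illustrative sketch only.
    """
    if len(signal) % 2 != 0:
        raise ValueError("signal length must be even")
    scale = 1.0 / math.sqrt(2.0)
    # Low-pass branch: scaled pairwise sums (coarse structure).
    approx = [(signal[i] + signal[i + 1]) * scale for i in range(0, len(signal), 2)]
    # High-pass branch: scaled pairwise differences (edges, fine detail).
    detail = [(signal[i] - signal[i + 1]) * scale for i in range(0, len(signal), 2)]
    return approx, detail


def haar_idwt_1d(approx, detail):
    """Inverse of haar_dwt_1d: rebuilds the original sequence."""
    scale = 1.0 / math.sqrt(2.0)
    out = []
    for a, d in zip(approx, detail):
        out.append((a + d) * scale)
        out.append((a - d) * scale)
    return out
```

A flat pair such as `[4.0, 4.0]` yields a zero detail coefficient, while a pair with a jump such as `[2.0, 6.0]` yields a nonzero one, which is why the detail band is often associated with edge information.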
A Synthetic Speech Detection Model Combining Local-Global Dependency
3
Authors: Jiahui Song, Yuepeng Zhang, Wenhao Yuan. 《Computers, Materials & Continua》, 2026, No. 1, pp. 1312-1326 (15 pages)
Synthetic speech detection is an essential task in the field of voice security, aimed at identifying deceptive voice attacks generated by text-to-speech (TTS) systems or voice conversion (VC) systems. In this paper, we propose a synthetic speech detection model called TFTransformer, which integrates both local and global features to enhance detection capabilities by effectively modeling local and global dependencies. Structurally, the model is divided into two main components: a front-end and a back-end. The front-end uses a combination of SincLayer and two-dimensional (2D) convolution to extract high-level feature maps (HFM) containing the local dependencies of the input speech signals. The back-end uses a time-frequency Transformer module to process these feature maps and further capture global dependencies. Furthermore, we propose TFTransformer-SE, which incorporates a channel attention mechanism within the 2D convolutional blocks. This enhancement aims to capture local dependencies more effectively, thereby improving the model's performance. The experiments were conducted on the ASVspoof 2021 LA dataset, and the results showed that the model achieved an equal error rate (EER) of 3.37% without data augmentation. Additionally, we evaluated the model using the ASVspoof 2019 LA dataset, achieving an EER of 0.84%, also without data augmentation. This demonstrates that combining local and global dependencies in the time-frequency domain can significantly improve detection accuracy.
Keywords: synthetic speech detection; Transformer; local-global; time-frequency domain
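The abstract above reports results as an equal error rate (EER). As a reading aid, the following is a minimal sketch of how an EER can be estimated from two lists of detector scores, assuming that a higher score means "more likely bona fide". The function name `equal_error_rate` and the simple threshold sweep are illustrative assumptions; this is not the official ASVspoof scoring tool, which interpolates the error curves.

```python
def equal_error_rate(bonafide_scores, spoof_scores):
    """Estimate the EER by sweeping thresholds over all observed scores.

    A trial is accepted when score >= threshold. Returns the mean of the
    false acceptance and false rejection rates at the threshold where
    the two rates are closest. Illustrative sketch only.
    """
    thresholds = sorted(set(bonafide_scores) | set(spoof_scores))
    best_gap = None
    eer = None
    for th in thresholds:
        # False rejection: bona fide speech scored below the threshold.
        frr = sum(s < th for s in bonafide_scores) / len(bonafide_scores)
        # False acceptance: spoofed speech scored at or above the threshold.
        far = sum(s >= th for s in spoof_scores) / len(spoof_scores)
        gap = abs(far - frr)
        if best_gap is None or gap < best_gap:
            best_gap = gap
            eer = (far + frr) / 2.0
    return eer
```

With perfectly separated scores the estimate is 0; when half of each class falls on the wrong side of every threshold, it is 0.5.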
Cell type-dependent role of transforming growth factor-β signaling on postnatal neural stem cell proliferation and migration
4
Authors: Kierra Ware, Joshua Peter, Lucas McClain, Yu Luo. 《Neural Regeneration Research》, 2026, No. 3, pp. 1151-1161 (11 pages)
Adult neurogenesis continuously produces new neurons critical for cognitive plasticity in adult rodents. While it is known that transforming growth factor-β signaling is important in embryonic neurogenesis, its role in postnatal neurogenesis remains unclear. In this study, to define the precise role of transforming growth factor-β signaling in postnatal neurogenesis at distinct stages of the neurogenic cascade both in vitro and in vivo, we developed two novel inducible and cell type-specific mouse models to specifically silence transforming growth factor-β signaling in neural stem cells (mGFAPcre-ALK5fl/fl-Ai9) or in immature neuroblasts (DCXcreERT2-ALK5fl/fl-Ai9). Our data showed that exogenous transforming growth factor-β treatment inhibited the proliferation of primary neural stem cells while stimulating their migration. These effects were abolished in activin-like kinase 5 (ALK5) knockout primary neural stem cells. Consistent with this, inhibition of transforming growth factor-β signaling with SB-431542 in wild-type neural stem cells stimulated proliferation while inhibiting migration. Interestingly, deletion of the transforming growth factor-β receptor in neural stem cells in vivo inhibited the migration of postnatally born neurons in mGFAPcre-ALK5fl/fl-Ai9 mice, while abolishment of transforming growth factor-β signaling in immature neuroblasts in DCXcreERT2-ALK5fl/fl-Ai9 mice did not affect the migration of these cells in the hippocampus. In summary, our data support a dual role of transforming growth factor-β signaling in the proliferation and migration of neural stem cells in vitro. Moreover, our data provide novel insights into the cell type-dependent requirements of transforming growth factor-β signaling for neural stem cell proliferation and migration in vivo.
Keywords: adult neurogenesis; doublecortin; hippocampus; migration; neural stem cells; proliferation; transforming growth factor-β
SwinHCAD: A Robust Multi-Modality Segmentation Model for Brain Tumors Using Transformer and Channel-Wise Attention
5
Authors: Seyong Jin, Muhammad Fayaz, L. Minh Dang, Hyoung-Kyu Song, Hyeonjoon Moon. 《Computers, Materials & Continua》, 2026, No. 1, pp. 511-533 (23 pages)
Brain tumors require precise segmentation for diagnosis and treatment planning because of their complex morphology and heterogeneous characteristics. While MRI-based automatic brain tumor segmentation reduces the burden on medical staff and provides quantitative information, existing methodologies and recent models still struggle to accurately capture and classify the fine boundaries and diverse morphologies of tumors. To address these challenges and maximize the performance of brain tumor segmentation, this research introduces a novel SwinUNETR-based model that integrates a new decoder block, the Hierarchical Channel-wise Attention Decoder (HCAD), with a powerful SwinUNETR encoder. The HCAD decoder block uses hierarchical features and channel-specific attention mechanisms to further fuse information at different scales transmitted from the encoder and to preserve spatial details throughout the reconstruction phase. Rigorous evaluations on the recent BraTS GLI datasets demonstrate that the proposed SwinHCAD model achieved better segmentation accuracy than baseline models on both the Dice score and HD95 metrics across all tumor subregions (WT, TC, and ET). In particular, ablation studies clarified the rationale and contribution of the model design and verified the effectiveness of the proposed HCAD decoder block. The results of this study are expected to contribute to the efficiency of clinical diagnosis and treatment planning by increasing the precision of automated brain tumor segmentation.
Keywords: attention mechanism; brain tumor segmentation; channel-wise attention decoder; deep learning; medical imaging; MRI; Transformer; U-Net
M2ATNet: Multi-Scale Multi-Attention Denoising and Feature Fusion Transformer for Low-Light Image Enhancement
6
Authors: Zhongliang Wei, Jianlong An, Chang Su. 《Computers, Materials & Continua》, 2026, No. 1, pp. 1819-1838 (20 pages)
Images taken in dim environments frequently exhibit issues like insufficient brightness, noise, color shifts, and loss of detail. These problems pose significant challenges to dark image enhancement tasks. Current approaches, while effective in global illumination modeling, often struggle to simultaneously suppress noise and preserve structural details, especially under heterogeneous lighting. Furthermore, misalignment between luminance and color channels introduces additional challenges to accurate enhancement. In response to these difficulties, we introduce a single-stage framework, M2ATNet, built on multi-scale multi-attention and the Transformer architecture. First, to address texture blurring and residual noise, we design a multi-scale multi-attention denoising module (MMAD), which is applied separately to the luminance and color channels to enhance structural and texture modeling. Second, to solve the misalignment of the luminance and color channels, we introduce the multi-channel feature fusion Transformer (CFFT) module, which recovers dark details and corrects color shifts through cross-channel alignment and deep feature interaction. To guide the model to learn more stably and efficiently, we also fuse multiple types of loss functions into a hybrid loss term. We extensively evaluate the proposed method on various standard datasets, including LOL-v1, LOL-v2, DICM, LIME, and NPE. Evaluations in terms of numerical metrics and visual quality demonstrate that M2ATNet consistently outperforms existing advanced approaches. Ablation studies further confirm the critical roles played by the MMAD and CFFT modules in detail preservation and visual fidelity under challenging illumination-deficient environments.
Keywords: low-light image enhancement; multi-scale; multi-attention; Transformer
Deep Learning for Brain Tumor Segmentation and Classification: A Systematic Review of Methods and Trends
7
Authors: Ameer Hamza, Robertas Damaševicius. 《Computers, Materials & Continua》, 2026, No. 1, pp. 132-172 (41 pages)
This systematic review aims to comprehensively examine and compare deep learning methods for brain tumor segmentation and classification using MRI and other imaging modalities, focusing on recent trends from 2022 to 2025. The primary objective is to evaluate methodological advancements, model performance, dataset usage, and existing challenges in developing clinically robust AI systems. We included peer-reviewed journal articles and high-impact conference papers published between 2022 and 2025, written in English, that proposed or evaluated deep learning methods for brain tumor segmentation and/or classification. Excluded were non-open-access publications, books, and non-English articles. A structured search was conducted across Scopus, Google Scholar, Wiley, and Taylor & Francis, with the last search performed in August 2025. Risk of bias was not formally quantified but was considered during full-text screening based on dataset diversity, validation methods, and availability of performance metrics. We used narrative synthesis and tabular benchmarking to compare performance metrics (e.g., accuracy, Dice score) across model types (CNN, Transformer, Hybrid), imaging modalities, and datasets. A total of 49 studies were included (43 journal articles and 6 conference papers). These studies spanned over 9 public datasets (e.g., BraTS, Figshare, REMBRANDT, MOLAB) and used a range of imaging modalities, predominantly MRI. Hybrid models, especially ResViT and UNetFormer, consistently achieved high performance, with classification accuracy exceeding 98% and segmentation Dice scores above 0.90 across multiple studies. Transformers and hybrid architectures showed increasing adoption after 2023. Many studies lacked external validation and were evaluated only on a few benchmark datasets, raising concerns about generalizability and dataset bias. Few studies addressed clinical interpretability or uncertainty quantification. Despite promising results, particularly for hybrid deep learning models, widespread clinical adoption remains limited due to lack of validation, interpretability concerns, and real-world deployment barriers.
Keywords: brain tumor segmentation; brain tumor classification; deep learning; vision transformers; hybrid models
A Transformer-Based Deep Learning Framework with Semantic Encoding and Syntax-Aware LSTM for Fake Electronic News Detection
8
Authors: Hamza Murad Khan, Shakila Basheer, Mohammad Tabrez Quasim, Raja'a Al-Naimi, Vijaykumar Varadarajan, Anwar Khan. 《Computers, Materials & Continua》, 2026, No. 1, pp. 1024-1048 (25 pages)
With the increasing growth of online news, fake electronic news detection has become one of the most important paradigms of modern research. Traditional electronic news detection techniques are generally based on contextual understanding, sequential dependencies, and/or data imbalance. This makes distinguishing between genuine and fabricated news a challenging task. To address this problem, we propose a novel hybrid architecture, T5-SA-LSTM, which integrates the T5 Transformer for semantically rich contextual embedding with a Self-Attention-enhanced (SA) Long Short-Term Memory (LSTM). The LSTM is trained using the Adam optimizer, which provides faster and more stable convergence than Stochastic Gradient Descent (SGD) and Root Mean Square Propagation (RMSProp). The WELFake and FakeNewsPrediction datasets are used, which consist of labeled news articles with fake and real news samples. Tokenization and the Synthetic Minority Over-sampling Technique (SMOTE) are used for data preprocessing to ensure linguistic normalization and address class imbalance. The Self-Attention (SA) mechanism enables the model to highlight critical words and phrases, thereby enhancing predictive accuracy. The proposed model is evaluated using accuracy, precision, recall (sensitivity), and F1-score as performance metrics. The model achieved 99% accuracy on the WELFake dataset and 96.5% accuracy on the FakeNewsPrediction dataset. It outperformed competing schemes such as T5-SA-LSTM (RMSProp), T5-SA-LSTM (SGD), and some other models.
Keywords: fake news detection; tokenization; SMOTE; text-to-text transfer transformer (T5); long short-term memory (LSTM); self-attention mechanism (SA); T5-SA-LSTM; WELFake dataset; FakeNewsPrediction dataset
Small extracellular vesicles derived from hair follicle neural crest stem cells enhance perineurial cell proliferation and migration via the TGF-β/SMAD/HAS2 pathway
9
Authors: Yiming Huo, Bing Xiao, Haojie Yu, Yang Xu, Jiachen Zheng, Chao Huang, Ling Wang, Haiyan Lin, Jiajun Xu, Pengfei Yang, Fang Liu. 《Neural Regeneration Research》, 2026, No. 5, pp. 2060-2072 (13 pages)
Peripheral nerve defect repair is a complex process that involves multiple cell types; perineurial cells play a pivotal role. Hair follicle neural crest stem cells promote perineurial cell proliferation and migration via paracrine signaling; however, their clinical applications are limited by potential risks such as tumorigenesis and xenogeneic immune rejection, which are similar to the risks associated with other stem cell transplantations. The present study therefore focuses on small extracellular vesicles derived from hair follicle neural crest stem cells, which preserve the bioactive properties of the parent cells while avoiding the transplantation-associated risks. In vitro, small extracellular vesicles derived from hair follicle neural crest stem cells significantly enhanced the proliferation, migration, tube formation, and barrier function of perineurial cells, and subsequently upregulated the expression of tight junction proteins. Furthermore, in a rat model of sciatic nerve defects bridged with silicon tubes, treatment with small extracellular vesicles derived from hair follicle neural crest stem cells resulted in higher tight junction protein expression in perineurial cells, thus facilitating neural tissue regeneration. At 10 weeks post-surgery, rats treated with small extracellular vesicles derived from hair follicle neural crest stem cells exhibited improved nerve function recovery and reduced muscle atrophy. Transcriptomic and microRNA analyses revealed that small extracellular vesicles derived from hair follicle neural crest stem cells deliver miR-21-5p, which inhibits mothers against decapentaplegic homolog 7 expression, thereby activating the transforming growth factor-β/mothers against decapentaplegic homolog signaling pathway and upregulating hyaluronan synthase 2 expression, and further enhancing tight junction protein expression. Together, our findings indicate that small extracellular vesicles derived from hair follicle neural crest stem cells promote the proliferation, migration, and tight junction protein formation of perineurial cells. These results provide new insights into peripheral nerve regeneration from the perspective of perineurial cells, and present a novel approach for the clinical treatment of peripheral nerve defects.
Keywords: hair follicle neural crest stem cells; HAS2; migration; miR-21-5p; perineurial cells; proliferation; peripheral nerve injury; SMAD7; small extracellular vesicles; transforming growth factor-β/SMAD signaling pathway
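A Survey of Transformer-Based Time Series Forecasting Methods (cited by: 4)
10
Authors: 陈嘉俊, 刘波, 林伟伟, 郑剑文, 谢家晨. 《计算机科学》, PKU Core, 2025, No. 6, pp. 96-105 (10 pages)
Time series forecasting, a key technique for analyzing historical data to predict future trends, is widely used in finance, meteorology, and other fields. However, traditional methods such as the autoregressive moving average model and exponential smoothing are limited in handling nonlinear patterns and capturing long-term dependencies. Recently, Transformer-based methods, whose self-attention mechanism has driven breakthroughs in natural language processing and computer vision, have been extended to time series forecasting with notable results. Exploring how to apply the Transformer efficiently to time series forecasting has therefore become key to advancing the field. The survey first introduces the characteristics of time series and describes the common task categories and evaluation metrics of time series forecasting. It then analyzes the basic Transformer architecture in depth, selects Transformer-derived models that have attracted wide attention in time series forecasting in recent years, classifies them at the module and architecture levels, and compares them along three dimensions: the problems they solve, their innovations, and their limitations. Finally, it discusses possible future research directions for Transformers in time series forecasting.
Keywords: time series; Transformer model; deep learning; attention mechanism; forecasting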
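A Survey of Time Series Forecasting Methods Based on the Transformer Model (cited by: 15)
11
Authors: 孟祥福, 石皓源. 《计算机科学与探索》, PKU Core, 2025, No. 1, pp. 45-64 (20 pages)
Time series forecasting (TSF) predicts values and trends at future time points or periods by analyzing latent information in historical data, such as trend and seasonality. Time series data are generated by sensors and play an important role in finance, healthcare, energy, transportation, meteorology, and many other fields. With the development of Internet of Things sensors, massive time series data have become difficult to handle with traditional machine learning, whereas the Transformer performs well on many tasks in natural language processing and computer vision. Researchers have used the Transformer's ability to capture long-term dependencies, and time series forecasting has advanced rapidly as a result. This paper surveys Transformer-based time series forecasting methods, traces the development of time series forecasting chronologically, systematically introduces time series preprocessing procedures and methods, and introduces commonly used evaluation metrics and datasets. Taking algorithmic frameworks as the object of study, it explains how the various Transformer-based models are applied to TSF tasks and how they work. It compares the performance, advantages, and limitations of the models experimentally and discusses the results. Finally, based on the challenges in existing work on Transformer models for time series forecasting, it outlines future development trends.
Keywords: deep learning; time series forecasting; data preprocessing; Transformer model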
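MobileViT-Based Fault Diagnosis for Relay Protection Secondary Circuits in Smart Substations (cited by: 3)
12
Authors: 郑茂然, 余江, 史泽兵, 高宏慧, 姜健琳, 沈亚东. 《电网与清洁能源》, PKU Core, 2025, No. 6, pp. 31-38 (8 pages)
With the rapid development of smart substation technology, the number of secondary devices has grown sharply and communication network configurations have become increasingly complex; efficient and accurate fault diagnosis of relay protection secondary circuits is essential to the safety and stability of smart substations. To reduce the cost of automated operation and maintenance in smart substations, a MobileViT-based fault diagnosis technique for relay protection secondary circuits is studied. The message reception status of secondary devices and switch port traffic information are taken as fault features of the protection circuit, and matrix encoding is used to obtain a two-dimensional image representation of these features. The MobileViT algorithm is then used to build a fault diagnosis model for relay protection secondary circuits, and an online operation method is proposed. The secondary system of a typical 110 kV smart substation is used as an example to verify the feasibility and effectiveness of the proposed method. The case results show that the MobileViT-based fault diagnosis model accurately identifies faults in secondary devices and communication links in the protection circuit. Comparison with other methods shows that the proposed method has advantages in feature construction, identification error, and diagnostic accuracy.
Keywords: smart substation; relay protection; secondary circuit; fault diagnosis; Transformer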
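Seismic Fault Identification Method Based on CBAM-TransUNet (cited by: 2)
13
Authors: 王新, 张薇, 陈同俊, 张傲, 赵砀. 《煤炭学报》, PKU Core, 2025, No. 2, pp. 1192-1202 (11 pages)
Fault detection and identification are critical in coal exploration and mining. Traditional manual fault interpretation can no longer meet the needs of actual production, and deep-learning-based seismic fault interpretation performs well in fault segmentation. Conventional convolutional neural networks (CNNs) have a limited receptive field and cannot make good use of global information, which leaves some predicted faults with insufficient continuity or missing segments. The Transformer has the advantage of extracting global information. The TransUNet network, which fuses a CNN and a Transformer, is adopted to build a CBAM-TransUNet-based seismic fault identification method for two-dimensional seismic fault images. First, the CBAM-Block attention module is integrated into TransUNet: it is added to the CNN fault encoder and to the three skip connections that link the fault encoder and the fault decoder, strengthening the identification of seismic fault images along both the channel and spatial dimensions. Second, a loss function that jointly optimizes the Dice loss and the cross-entropy loss is chosen to make fault image segmentation more accurate. On a synthetic seismic dataset, the Dice and IoU values obtained by the CBAM-TransUNet fault identification network rise to 0.84 and 0.75, respectively, and the experiments show stronger fault continuity and clearly better results than other classical segmentation methods. Finally, the model is used to interpret faults in a real seismic dataset from the F3 block in the Dutch offshore North Sea. The results show that the CBAM-TransUNet-based method identifies faults effectively while removing redundant fault information, performs well in both identification accuracy and fault continuity, recovers richer fault detail, and can be applied to fault identification in real seismic data.
Keywords: seismic images; fault identification; machine learning; attention mechanism; Transformer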
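The CBAM-TransUNet abstract describes a loss that jointly optimizes the Dice loss and the cross-entropy loss but does not state how the two terms are combined. The sketch below shows one common form for a binary mask, a weighted sum, in plain Python. The function names and the `dice_weight=0.5` default are assumptions for illustration, not values from the paper.

```python
import math


def dice_loss(probs, targets, eps=1e-6):
    """Soft Dice loss for a binary mask given as flat lists of floats in [0, 1]."""
    intersection = sum(p * t for p, t in zip(probs, targets))
    denom = sum(probs) + sum(targets)
    return 1.0 - (2.0 * intersection + eps) / (denom + eps)


def bce_loss(probs, targets, eps=1e-7):
    """Mean binary cross-entropy; probabilities are clamped to avoid log(0)."""
    total = 0.0
    for p, t in zip(probs, targets):
        p = min(max(p, eps), 1.0 - eps)
        total += -(t * math.log(p) + (1.0 - t) * math.log(1.0 - p))
    return total / len(probs)


def joint_loss(probs, targets, dice_weight=0.5):
    """Weighted sum of Dice and cross-entropy terms (the weighting is assumed)."""
    return (dice_weight * dice_loss(probs, targets)
            + (1.0 - dice_weight) * bce_loss(probs, targets))
```

The Dice term responds to region overlap, which helps when fault pixels are a small fraction of the image, while the cross-entropy term penalizes each pixel independently; combining the two is a frequent choice for thin-structure segmentation.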
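A Multivariate Time Series Tokenization Transformer and Its Application to Electric Submersible Pump Fault Diagnosis (cited by: 3)
14
Authors: 李康, 李爽, 高小永, 李强, 张来斌. 《控制与决策》, PKU Core, 2025, No. 4, pp. 1145-1153 (9 pages)
Fault diagnosis of electric submersible pumps (ESPs) is essential for safe and reliable oil production, but the multivariate, nonlinear, and dynamically changing nature of ESP data makes the task very challenging. In recent years, the strong ability of deep learning to extract features from complex data has given rise to a series of neural-network-based ESP fault diagnosis methods. However, most of them ignore the dynamic characteristics of ESP data and the difficulty of extracting long-term dependency features. To address these problems, a multivariate time series tokenization Transformer neural network is proposed for ESP fault diagnosis. The model designs a new multivariate time series tokenization strategy, inherits the long-term dependency feature extraction strengths of the conventional Transformer encoder with its multi-head attention mechanism and residual connections, and replaces the conventional Transformer decoder with a feedforward neural network to reduce model complexity. Analysis of fault data from an oilfield site verifies the effectiveness of the proposed method. The experimental results show that the proposed method accurately diagnoses 10 classes of ESP faults and outperforms popular deep learning methods in diagnostic performance.
Keywords: electric submersible pump; Transformer neural network; deep learning; feature extraction; fault diagnosis; multivariate time series tokenization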
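Wind Turbine Fault Sample Generation Method Based on an Improved Denoising Diffusion Probabilistic Model (cited by: 2)
15
Authors: 孟昱煜, 张沣琦, 火久元, 常琛, 陈峰. 《振动与冲击》, PKU Core, 2025, No. 4, pp. 286-297 (12 pages)
To address the low model accuracy caused by insufficient fault samples in wind turbine fault diagnosis, the denoising diffusion probabilistic model (DDPM), a data augmentation method that is currently attracting much attention, is introduced into fault diagnosis to generate large, high-quality fault sample datasets. Combined with the Transformer network, a DDPM-Transformer method for generating wind turbine fault samples is proposed. First, the DDPM, originally used for image generation in computer vision, is applied to wind turbine fault diagnosis: a forward noising process gradually turns the data into noise, and a reverse denoising process gradually restores the noise to the original data, so that fault data are generated from noise and the data imbalance problem is addressed. Second, the U-Net module used in the original DDPM is replaced with a Transformer model, which is trained on the diffused data and the added noise to predict the noise, improving the quality of the generated fault data. Finally, several generative-model evaluation metrics are used to assess the generated fault data, and the performance of the improved DDPM-Transformer model is demonstrated on supervisory control and data acquisition (SCADA) fault data generation. Experiments show that, compared with existing generative models, the proposed DDPM-Transformer improves the maximum mean discrepancy (MMD) by up to 0.13 and the peak signal-to-noise ratio (PSNR) by up to 7.8. The proposed model generates higher-quality wind turbine fault samples, and using this sample set to help train deep-learning-based fault diagnosis models gives them higher accuracy and good stability.
Keywords: DDPM; Transformer; wind turbine; fault diagnosis; sample generation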
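The DDPM abstract describes a forward process that gradually turns data into noise. In the standard DDPM formulation this process has the closed form x_t = sqrt(ᾱ_t)·x_0 + sqrt(1 − ᾱ_t)·ε, where ᾱ_t is the cumulative product of (1 − β_s). The sketch below illustrates that step in plain Python. The linear β schedule from 1e-4 to 0.02 is the common default from the original DDPM work and is an assumption here, since the abstract does not state the schedule; the function names are hypothetical.

```python
import math
import random


def linear_beta_schedule(num_steps, beta_start=1e-4, beta_end=0.02):
    """Linearly spaced noise variances beta_1..beta_T (assumed schedule)."""
    if num_steps == 1:
        return [beta_start]
    step = (beta_end - beta_start) / (num_steps - 1)
    return [beta_start + i * step for i in range(num_steps)]


def cumulative_alphas(betas):
    """alpha_bar_t = product over s <= t of (1 - beta_s)."""
    out, prod = [], 1.0
    for beta in betas:
        prod *= 1.0 - beta
        out.append(prod)
    return out


def q_sample(x0, t, alpha_bars, noise=None):
    """Draw x_t from q(x_t | x_0) in one shot.

    x0 is a flat list of floats and t is a 0-based step index. Returns
    (x_t, noise); the noise is what a denoising network would be trained
    to predict from x_t and t.
    """
    if noise is None:
        noise = [random.gauss(0.0, 1.0) for _ in x0]
    signal_scale = math.sqrt(alpha_bars[t])
    noise_scale = math.sqrt(1.0 - alpha_bars[t])
    x_t = [signal_scale * x + noise_scale * e for x, e in zip(x0, noise)]
    return x_t, noise
```

Because ᾱ_t shrinks toward zero as t grows, late steps are almost pure Gaussian noise, which is what lets the reverse process start from random noise when generating new samples.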
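A Survey of Large Language Models in the Medical Field (cited by: 1)
16
Authors: 肖建力, 许东舟, 王浩, 刘敏, 周雷, 朱林, 顾松. 《智能系统学报》, PKU Core, 2025, No. 3, pp. 530-547 (18 pages)
Deep learning is one of the most active research directions in artificial intelligence; it builds multi-layer artificial neural networks that imitate how the human brain processes data. Large language models (LLMs) are built on deep learning architectures and, without explicit programmed instructions, acquire the ability to understand and generate human language by analyzing large amounts of data. They are widely used in natural language processing, computer vision, smart healthcare, intelligent transportation, and many other fields. This article summarizes the applications of LLMs in medicine, covering the basic training pipeline for medical tasks, special strategies, and applications in specific medical scenarios. It also discusses the challenges LLMs face in application, including the lack of transparency in decision-making, output accuracy, and privacy and ethical issues, and then lists corresponding improvement strategies. Finally, it looks ahead to future trends of LLMs in medicine and their potential impact on human health.
Keywords: artificial intelligence; deep learning; Transformer; large language model; smart healthcare; data analysis; image processing; computer vision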
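DPRT-YOLO: A Real-Time Object Detector for Intelligent Connected Vehicles in Complex Driving Environments (cited by: 1)
17
Authors: 董一兵, 曾辉, 李建科, 侯少杰, 石磊. 《计算机工程与应用》, PKU Core, 2025, No. 14, pp. 148-162 (15 pages)
Object detection is a basic task of the visual perception system of intelligent connected vehicles and provides basic data and a basis for decisions to advanced driver assistance systems. However, in complex environments such as low light and adverse weather, on-board object detection algorithms suffer from poor small-object detection and high missed-detection and false-detection rates. To address this, a real-time object detector for intelligent connected vehicles (DPRT-YOLO) is developed by modifying the popular YOLOv10 model to make it better suited to object detection in complex driving environments; ablation and comparison experiments on an NVIDIA edge computing platform verify the algorithm's effectiveness. An enhanced weighted multi-branch feature fusion network (EWMFFN) is designed, which introduces shallow weighted fusion and multi-branch weighted fusion modules to eliminate inter-layer interference during feature fusion, and a star-topology feature interaction structure is designed to improve the detection of small-scale objects while keeping the network lightweight. Convolutional gated linear units (CGLU) are combined with the convolutional additive token mixer (CATM), and a local-global dual-path mechanism establishes long-range contextual relationships for small-object scale information while keeping the model lightweight. To evaluate detection performance under realistic compute constraints, the model is deployed on the NVIDIA Jetson Xavier NX platform with NVIDIA TensorRT FP16 quantization for acceleration, inference experiments are run on the BDD100K and TT100K test sets, and the results are compared with baseline models. (1) Detection accuracy: compared with YOLOv10n and YOLO11n, the improved model raises mAP@0.5 by 6.1 and 7.4 percentage points and mAP@0.5:0.95 by 3.6 and 4.2 percentage points, respectively, while reducing the parameter count by 26.1% and 34.9%, respectively. (2) Detection speed: the Small and Nano versions of the improved model reach inference speeds of 29 FPS and 35 FPS, respectively. The results show that, compared with the reference models, the improved algorithm performs better in complex driving environments, achieves a better balance between detection accuracy and speed, and is suitable for deployment in the environment perception systems of intelligent connected vehicles.
Keywords: real-time object detection; complex driving environment; DPRT-YOLO; multi-scale feature fusion; Transformer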
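Motion-Blurred Image Restoration Network for Power Insulators Based on an Improved Transformer Structure (cited by: 1)
18
Authors: 李鹏, 常乐, 覃发富, 孟庆伟, 陈继明. 《电网技术》, PKU Core, 2025, No. 6, pp. 2623-2631, I0143-I0146 (13 pages)
Motion blur in power insulator images captured during aerial inspection of high-voltage transmission lines degrades subsequent insulator localization and defect detection. To address this, a motion-blur restoration method for power insulator images based on an improved Transformer structure is proposed. To meet the need to restore both global and local blur in aerial insulator images, a strip attention module is introduced into the Transformer network structure and combined with a convolutional neural network, achieving efficient restoration of blurred insulator images while reducing memory requirements and without relying on large amounts of training data. In addition, a contrastive learning loss is introduced into the network's objective function to fully mine and exploit the correlation between sharp and blurred insulator images. A motion-blurred insulator image dataset is constructed for image restoration and defect detection experiments. The results show that the proposed method scores higher than mainstream algorithms such as DeblurGAN-v2 and MIMO-UNet on both peak signal-to-noise ratio (PSNR) and structural similarity index measure (SSIM). Insulator localization and self-explosion defect detection with the YOLOv5 and YOLOv7 object detectors before and after deblurring show that the method is of practical value for improving the accuracy of insulator localization and defect detection in high-voltage transmission line inspection.
Keywords: motion-blurred image restoration; Transformer; contrastive learning; insulator and defect detection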
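The insulator deblurring abstract evaluates restoration quality with PSNR. For readers unfamiliar with the metric, here is a minimal sketch of the standard definition, PSNR = 10·log10(MAX² / MSE), on flat lists of pixel values. The function name `psnr` and the 8-bit default `max_value=255.0` are illustrative assumptions; the paper's own evaluation code is not shown in the abstract.

```python
import math


def psnr(reference, restored, max_value=255.0):
    """Peak signal-to-noise ratio in dB between two equal-length pixel lists.

    Returns math.inf when the two images are identical (zero error).
    """
    if len(reference) != len(restored) or not reference:
        raise ValueError("inputs must be non-empty and of equal length")
    mse = sum((r - s) ** 2 for r, s in zip(reference, restored)) / len(reference)
    if mse == 0:
        return math.inf
    return 10.0 * math.log10(max_value ** 2 / mse)
```

Higher values mean the restored image is closer to the sharp reference; because the scale is logarithmic, a gain of a few dB corresponds to a large reduction in mean squared error.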
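An Efficient Seismic Denoising Transformer Fusing Gradient Prediction and Parameter-Free Attention (cited by: 1)
19
Authors: 高磊, 乔昊炜, 梁东升, 闵帆, 杨梅. 《计算机科学与探索》, PKU Core, 2025, No. 5, pp. 1342-1352 (11 pages)
Suppressing random noise effectively improves the signal-to-noise ratio (SNR) of seismic data. In recent years, deep learning methods based on convolutional neural networks (CNNs) have performed well in seismic data denoising. However, because of its limited receptive field, convolution in a CNN usually captures only local information and cannot establish long-range connections across global information, which may cause loss of detail. For seismic data denoising, an efficient Transformer model fusing gradient prediction and parameter-free attention (ETGP) is proposed. Multi-head "transposed" attention replaces conventional multi-head attention; it computes attention across channels to represent global information, mitigating the excessive complexity of conventional multi-head attention. A parameter-free attention feedforward network is proposed that computes attention weights over both the spatial and channel dimensions without adding parameters to the network. A gradient prediction network is designed to extract edge information and adaptively add it to the input of the parallel Transformer, yielding high-quality seismic data. Experiments on synthetic and field data compare the method with classical and state-of-the-art denoising methods. The results show that ETGP not only suppresses random noise more effectively but also has clear advantages in preserving weak signals and the continuity of seismic events.
Keywords: seismic data denoising; convolutional neural network; Transformer; attention module; gradient fusion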
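BMTA: Inpainting of Images with Large Damaged Areas in Diverse Scenes (cited by: 1)
20
Authors: 曹岩, 辛子昊, 邬开俊, 单宏全, 郭炳森. 《计算机科学与探索》, PKU Core, 2025, No. 6, pp. 1553-1563 (11 pages)
To address incoherent semantic relationships between pixels during image inpainting and poor restoration of local texture detail in images with large damaged regions, a single-stage image inpainting network named BMTA is proposed for repairing images with large damaged areas in diverse scenes, so that the repaired images perform well both in subjective visual perception and on objective evaluation metrics. The generator interleaves dual unidirectional attention modules among the convolutional layers to compress and reconstruct the features of the input image and reinforce important feature information. The compressed features are split by channel for local and global feature extraction: split stripe windows establish global information links, residual dense blocks extract local detail in depth, and the extracted features are then fused. In the decoder, to prevent loss of local information during decoding and inaccurate understanding of context during restoration, a gated linear self-attention module is used to preserve information at multiple levels in the network, producing results closer to the original image. A discriminator evaluates the inpainting results, pushing the repaired images toward better structure and texture. The model outperforms current state-of-the-art image inpainting algorithms on the CelebA, StreetView, and Places2 datasets.
Keywords: image inpainting; attention mechanism; Transformer; feature extraction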