期刊文献+
共找到268,813篇文章
< 1 2 250 >
每页显示 20 50 100
CAPTLDA:基于胶囊网络和Transformer预测LncRNA-疾病关联
1
作者 张嘉辉 谭建军 《生物医学》 2026年第1期11-24,共14页
长链非编码RNA (lncRNA)是一类长度超过200个核苷酸的转录物,在多种疾病的发病机制中发挥关键作用。因此,阐明lncRNA与疾病之间的关联对于理解潜在的发病机制和开发新的疾病预防、诊断和治疗策略至关重要。虽然传统的生物学实验对于预... 长链非编码RNA (lncRNA)是一类长度超过200个核苷酸的转录物,在多种疾病的发病机制中发挥关键作用。因此,阐明lncRNA与疾病之间的关联对于理解潜在的发病机制和开发新的疾病预防、诊断和治疗策略至关重要。虽然传统的生物学实验对于预测长链非编码RNA-疾病关联(LDA)是有价值的,但往往费用高昂且耗时。开发有效的LDA预测计算模型是有必要的。当前的计算方法在有效整合多源数据和捕获异质生物网络中的复杂高阶关系模式方面经常遇到限制。这项研究提出了一种新的计算框架命名为CAPTLDA,将lncRNA、疾病和miRNA的相似性和关联整合到一个加权的异构网络邻接矩阵中,引入了胶囊网络,以增强特征学习。此外,还采用Transformer编码器,它结合了全局多头代理注意力机制和并行的多头局部注意力机制,以全面捕获全局依赖关系和局部上下文信息,最终实现准确的LDA预测。在两个基准数据集上进行的综合计算实验表明,模型在性能上优于先进的现有方法。案例研究进一步验证了它在识别潜在疾病相关lncRNA方面的有效性。 展开更多
关键词 LncRNA-疾病关联 胶囊网络 transformER 代理注意力机制 深度学习
在线阅读 下载PDF
Prediction of ferrite transformation start temperature in hot-rolled aluminum bearing dual-phase steel based on combination of metallurgical principles and experimental data
2
作者 Xiao-Guang Zhou Si-Qiao Wang +5 位作者 Xu-Yuan Zhang Hong-Bing Wang Chun-Yang Shu Xin-Yao Zhang Si-Wei Wu Zhen-Yu Liu 《Journal of Iron and Steel Research International》 2026年第1期149-158,共10页
The accurate establishment of a ferrite transformation start temperature model is crucial to design a reasonable controlled rolling process and ensure uniform microstructure in aluminum bearing dual-phase steel.The me... The accurate establishment of a ferrite transformation start temperature model is crucial to design a reasonable controlled rolling process and ensure uniform microstructure in aluminum bearing dual-phase steel.The measurements of the expansion-temperature curves of aluminum bearing dual-phase steel under continuous cooling and isothermal conditions are presented,utilizing a dynamic transformation dilatometer experiment.Based on these expansion-temperature curves,the start temperature and incubation time of ferrite transformation were determined,elucidating the influence of process parameters on both the incubation time and the start temperature of ferrite transformation.By integrating metallurgical principles with measured incubation time of ferrite transformation,and considering the effects of temperature and strain,a fitting model for the variation in volume free energy during ferrite nucleation was derived.Building upon this foundation,a high-precision incubation time of ferrite transformation mathematical model for the experimental steel was established.To more accurately calculate the start temperature of ferrite transformation under continuous cooling conditions,the Scheil’s additivity rule was modified to account for the effects of deformation and cooling rate.The results indicate that the modification coefficient decreases with increasing the cooling rate and strain,thereby significantly improving the accuracy of calculating the starting temperature of ferrite transformation using the modified additivity rule. 展开更多
关键词 Dual-phase steel Ferrite transformation Incubation time Ferrite transformation start temperature Model
原文传递
Photoacoustic-computed tomography 3D data compression method and system based on Wavelet-Transformer
3
作者 Jialin Li Tingting Li +2 位作者 Yiming Ma Yi Shen Mingjian Sun 《Journal of Innovative Optical Health Sciences》 2026年第1期110-125,共16页
Photoacoustic-computed tomography is a novel imaging technique that combines high absorption contrast and deep tissue penetration capability,enabling comprehensive three-dimensional imaging of biological targets.Howev... Photoacoustic-computed tomography is a novel imaging technique that combines high absorption contrast and deep tissue penetration capability,enabling comprehensive three-dimensional imaging of biological targets.However,the increasing demand for higher resolution and real-time imaging results in significant data volume,limiting data storage,transmission and processing efficiency of system.Therefore,there is an urgent need for an effective method to compress the raw data without compromising image quality.This paper presents a photoacoustic-computed tomography 3D data compression method and system based on Wavelet-Transformer.This method is based on the cooperative compression framework that integrates wavelet hard coding with deep learning-based soft decoding.It combines the multiscale analysis capability of wavelet transforms with the global feature modeling advantage of Transformers,achieving high-quality data compression and reconstruction.Experimental results using k-wave simulation suggest that the proposed compression system has advantages under extreme compression conditions,achieving a raw data compression ratio of up to 1:40.Furthermore,three-dimensional data compression experiment using in vivo mouse demonstrated that the maximum peak signal-to-noise ratio(PSNR)and structural similarity index(SSIM)values of reconstructed images reached 38.60 and 0.9583,effectively overcoming detail loss and artifacts introduced by raw data compression.All the results suggest that the proposed system can significantly reduce storage requirements and hardware cost,enhancing computational efficiency and image quality.These advantages support the development of photoacoustic-computed tomography toward higher efficiency,real-time performance and intelligent functionality. 展开更多
关键词 Photoacoustic-computed tomography data compression transformER
原文传递
Effect of fluoride roasting on copper species transformation on chrysocolla surfaces and its role in enhanced sulfidation flotation
4
作者 Yingqiang Ma Xin Huang +5 位作者 Yafeng Fu Zhenguo Song Sen Luo Shuanglin Zheng Feng Rao Wanzhong Yin 《International Journal of Minerals,Metallurgy and Materials》 2026年第1期165-176,共12页
It is difficult to recover chrysocolla from sulfidation flotation which is closely related to the mineral surface composition.In this study,the effects of fluoride roasting on the surface composition of chrysocolla we... It is difficult to recover chrysocolla from sulfidation flotation which is closely related to the mineral surface composition.In this study,the effects of fluoride roasting on the surface composition of chrysocolla were investigated,its impact on sulfidation flotation was explored,and the mechanisms involved in both fluoride roasting and sulfidation flotation were discussed.With CaF_(2)as the roasting reagent,Na_(2)S·9H_(2)O as the sulfidation reagent,and sodium butyl xanthate(NaBX)as the collector,the results of the flotation experiments showed that fluoride roasting improved the floatability of chrysocolla,and the recovery rate increased from 16.87%to 82.74%.X-ray diffraction analysis revealed that after fluoride roasting,approximately all the Cu on the chrysocolla surface was exposed in the form of CuO,which could provide a basis for subsequent sulfidation flotation.The microscopy and elemental analyses revealed that large quantities of"pagoda-like"grains were observed on the sulfidation surface of the fluoride-roasted chrysocolla,indicating high crystallinity particles of copper sulfide.This suggests that the effect of sulfide formation on the chrysocolla surface was more pronounced.X-ray photoelectron spectroscopy revealed that fluoride roasting increased the relative contents of sulfur and copper on the surface and that both the Cu~+and polysulfide fractions on the surface of the minerals increased.This enhances the effect of sulfidation,which is conducive to flotation recovery.Therefore,fluoride roasting improved the effect of copper species transformation and sulfidation on the surface of chysocolla,promoted the adsorption of collectors,and improved the recovery of chrysocolla from sulfidation flotation. 展开更多
关键词 sulfidation flotation CHRYSOCOLLA fluoride roasting copper species transformation enhanced sulfidation
在线阅读 下载PDF
Bi-STAT+:An Enhanced Bidirectional Spatio-Temporal Adaptive Transformer for Urban Traffic Flow Forecasting
5
作者 Yali Cao Weijian Hu +3 位作者 Lingfang Li Minchao Li Meng Xu Ke Han 《Computers, Materials & Continua》 2026年第2期963-985,共23页
Traffic flow prediction constitutes a fundamental component of Intelligent Transportation Systems(ITS),playing a pivotal role in mitigating congestion,enhancing route optimization,and improving the utilization efficie... Traffic flow prediction constitutes a fundamental component of Intelligent Transportation Systems(ITS),playing a pivotal role in mitigating congestion,enhancing route optimization,and improving the utilization efficiency of roadway infrastructure.However,existingmethods struggle in complex traffic scenarios due to static spatio-temporal embedding,restricted multi-scale temporal modeling,and weak representation of local spatial interactions.This study proposes Bi-STAT+,an enhanced bidirectional spatio-temporal attention framework to address existing limitations through three principal contributions:(1)an adaptive spatio-temporal embedding module that dynamically adjusts embeddings to capture complex traffic variations;(2)frequency-domain analysis in the temporal dimension for simultaneous high-frequency details and low-frequency trend extraction;and(3)an agent attention mechanism in the spatial dimension that enhances local feature extraction through dynamic weight allocation.Extensive experiments were performed on four distinct datasets,including two publicly benchmark datasets(PEMS04 and PEMS08)and two private datasets collected from Baotou and Chengdu,China.The results demonstrate that Bi-STAT+consistently outperforms existing methods in terms of MAE,RMSE,and MAPE,while maintaining strong robustness against missing data and noise.Furthermore,the results highlight that prediction accuracy improves significantly with higher sampling rates,providing crucial insights for optimizing real-world deployment scenarios. 展开更多
关键词 Traffic flow prediction spatio-temporal feature modeling transformER intelligent transportation deep learning
在线阅读 下载PDF
Superpixel-Aware Transformer with Attention-Guided Boundary Refinement for Salient Object Detection
6
作者 Burhan Baraklı Can Yüzkollar +1 位作者 Tugrul Ta¸sçı Ibrahim Yıldırım 《Computer Modeling in Engineering & Sciences》 2026年第1期1092-1129,共38页
Salient object detection(SOD)models struggle to simultaneously preserve global structure,maintain sharp object boundaries,and sustain computational efficiency in complex scenes.In this study,we propose SPSALNet,a task... Salient object detection(SOD)models struggle to simultaneously preserve global structure,maintain sharp object boundaries,and sustain computational efficiency in complex scenes.In this study,we propose SPSALNet,a task-driven two-stage(macro–micro)architecture that restructures the SOD process around superpixel representations.In the proposed approach,a“split-and-enhance”principle,introduced to our knowledge for the first time in the SOD literature,hierarchically classifies superpixels and then applies targeted refinement only to ambiguous or error-prone regions.At the macro stage,the image is partitioned into content-adaptive superpixel regions,and each superpixel is represented by a high-dimensional region-level feature vector.These representations define a regional decomposition problem in which superpixels are assigned to three classes:background,object interior,and transition regions.Superpixel tokens interact with a global feature vector from a deep network backbone through a cross-attention module and are projected into an enriched embedding space that jointly encodes local topology and global context.At the micro stage,the model employs a U-Net-based refinement process that allocates computational resources only to ambiguous transition regions.The image and distance–similarity maps derived from superpixels are processed through a dual-encoder pathway.Subsequently,channel-aware fusion blocks adaptively combine information from these two sources,producing sharper and more stable object boundaries.Experimental results show that SPSALNet achieves high accuracy with lower computational cost compared to recent competing methods.On the PASCAL-S and DUT-OMRON datasets,SPSALNet exhibits a clear performance advantage across all key metrics,and it ranks first on accuracy-oriented measures on HKU-IS.On the challenging DUT-OMRON benchmark,SPSALNet reaches a MAE of 0.034.Across all datasets,it preserves object boundaries and regional structure in a stable and competitive manner. 展开更多
关键词 Salient object detection superpixel segmentation transformERS attention mechanism multi-level fusion edge-preserving refinement model-driven
在线阅读 下载PDF
SwinHCAD: A Robust Multi-Modality Segmentation Model for Brain Tumors Using Transformer and Channel-Wise Attention
7
作者 Seyong Jin Muhammad Fayaz +2 位作者 L.Minh Dang Hyoung-Kyu Song Hyeonjoon Moon 《Computers, Materials & Continua》 2026年第1期511-533,共23页
Brain tumors require precise segmentation for diagnosis and treatment plans due to their complex morphology and heterogeneous characteristics.While MRI-based automatic brain tumor segmentation technology reduces the b... Brain tumors require precise segmentation for diagnosis and treatment plans due to their complex morphology and heterogeneous characteristics.While MRI-based automatic brain tumor segmentation technology reduces the burden on medical staff and provides quantitative information,existing methodologies and recent models still struggle to accurately capture and classify the fine boundaries and diverse morphologies of tumors.In order to address these challenges and maximize the performance of brain tumor segmentation,this research introduces a novel SwinUNETR-based model by integrating a new decoder block,the Hierarchical Channel-wise Attention Decoder(HCAD),into a powerful SwinUNETR encoder.The HCAD decoder block utilizes hierarchical features and channelspecific attention mechanisms to further fuse information at different scales transmitted from the encoder and preserve spatial details throughout the reconstruction phase.Rigorous evaluations on the recent BraTS GLI datasets demonstrate that the proposed SwinHCAD model achieved superior and improved segmentation accuracy on both the Dice score and HD95 metrics across all tumor subregions(WT,TC,and ET)compared to baseline models.In particular,the rationale and contribution of the model design were clarified through ablation studies to verify the effectiveness of the proposed HCAD decoder block.The results of this study are expected to greatly contribute to enhancing the efficiency of clinical diagnosis and treatment planning by increasing the precision of automated brain tumor segmentation. 展开更多
关键词 Attention mechanism brain tumor segmentation channel-wise attention decoder deep learning medical imaging MRI transformER U-Net
在线阅读 下载PDF
Multimodal Trajectory Generation for Robotic Motion Planning Using Transformer-Based Fusion and Adversarial Learning
8
作者 Shtwai Alsubai Ahmad Almadhor +3 位作者 Abdullah Al Hejaili Najib Ben Aoun Tahani Alsubait Vincent Karovic 《Computer Modeling in Engineering & Sciences》 2026年第2期848-869,共22页
In Human–Robot Interaction(HRI),generating robot trajectories that accurately reflect user intentions while ensuring physical realism remains challenging,especially in unstructured environments.In this study,we devel... In Human–Robot Interaction(HRI),generating robot trajectories that accurately reflect user intentions while ensuring physical realism remains challenging,especially in unstructured environments.In this study,we develop a multimodal framework that integrates symbolic task reasoning with continuous trajectory generation.The approach employs transformer models and adversarial training to map high-level intent to robotic motion.Information from multiple data sources,such as voice traits,hand and body keypoints,visual observations,and recorded paths,is integrated simultaneously.These signals are mapped into a shared representation that supports interpretable reasoning while enabling smooth and realistic motion generation.Based on this design,two different learning strategies are investigated.In the first step,grammar-constrained Linear Temporal Logic(LTL)expressions are created from multimodal human inputs.These expressions are subsequently decoded into robot trajectories.The second method generates trajectories directly from symbolic intent and linguistic data,bypassing an intermediate logical representation.Transformer encoders combine multiple types of information,and autoregressive transformer decoders generate motion sequences.Adding smoothness and speed limits during training increases the likelihood of physical feasibility.To improve the realism and stability of the generated trajectories during training,an adversarial discriminator is also included to guide them toward the distribution of actual robot motion.Tests on the NATSGLD dataset indicate that the complete system exhibits stable training behaviour and performance.In normalised coordinates,the logic-based pipeline has an Average Displacement Error(ADE)of 0.040 and a Final Displacement Error(FDE)of 0.036.The adversarial generator makes substantially more progress,reducing ADE to 0.021 and FDE to 0.018.Visual examination confirms that the generated trajectories closely align with observed motion patterns while preserving smooth temporal dynamics. 展开更多
关键词 Multimodal trajectory generation robotic motion planning transformer networks sensor fusion reinforcement learning generative adversarial networks
在线阅读 下载PDF
EdgeST-Fusion:A Cross-Modal Federated Learning and Graph Transformer Framework for Multimodal Spatiotemporal Data Analytics in Smart City Consumer Electronics
9
作者 Mohammed M.Alenazi 《Computers, Materials & Continua》 2026年第5期1376-1408,共33页
Multimodal spatiotemporal data from smart city consumer electronics present critical challenges including cross-modal temporal misalignment,unreliable data quality,limited joint modeling of spatial and temporal depend... Multimodal spatiotemporal data from smart city consumer electronics present critical challenges including cross-modal temporal misalignment,unreliable data quality,limited joint modeling of spatial and temporal dependencies,and weak resilience to adversarial updates.To address these limitations,EdgeST-Fusion is introduced as a cross-modal federated graph transformer framework for context-aware smart city analytics.The architecture integrates cross-modal embedding networks for modality alignment,graph transformer encoders for spatial dependency modeling,temporal self-attention for dynamic pattern learning,and adaptive anomaly detection to ensure data quality and security during aggregation.A privacy-preserving federated learning protocol with differential privacy guarantees enables collaborative model training without centralizing sensitive data.The framework employs data-quality-aware weighted aggregation to enhance robustness against noisy and malicious client updates.Experimental evaluation on the GeoLife,PeMS-Bay,and SmartHome+datasets demonstrates that EdgeST-Fusion achieves 21.8%improvement in prediction accuracy,35.7%reduction in communication overhead,and 29.4%enhancement in security resilience compared to recent baselines.Real-world deployment across three smart city testbeds validates practical viability with 90.0%average accuracy and sub-250 ms inference latency.The proposed framework remains feasible for deployment on heterogeneous and resource-constrained consumer electronics devices whilemaintaining strong privacy guarantees and scalability for large-scale urban environments. 展开更多
关键词 Federated learning graph transformer spatiotemporal analytics consumer electronics smart cities cross-modal fusion edge computing privacy preservation
在线阅读 下载PDF
A Novel Evolutionary Optimized Transformer-Deep Reinforcement Learning Framework for False Data Injection Detection in Industry 4.0 Smart Water Infrastructures
10
作者 Ahmad Salehiyan Nuria Serrano +2 位作者 Francisco Hernando-Gallego Diego Martín José Vicenteálvarez-Bravo 《Computers, Materials & Continua》 2026年第5期1588-1624,共37页
The increasing integration of cyber-physical components in Industry 4.0 water infrastructures has heightened the risk of false data injection(FDI)attacks,posing critical threats to operational integrity,resource manag... The increasing integration of cyber-physical components in Industry 4.0 water infrastructures has heightened the risk of false data injection(FDI)attacks,posing critical threats to operational integrity,resource management,and public safety.Traditional detection mechanisms often struggle to generalize across heterogeneous environments or adapt to sophisticated,stealthy threats.To address these challenges,we propose a novel evolutionary optimized transformer-based deep reinforcement learning framework(Evo-Transformer-DRL)designed for robust and adaptive FDI detection in smart water infrastructures.The proposed architecture integrates three powerful paradigms:a transformer encoder for modeling complex temporal dependencies in multivariate time series,a DRL agent for learning optimal decision policies in dynamic environments,and an evolutionary optimizer to fine-tune model hyper-parameters.This synergy enhances detection performance while maintaining adaptability across varying data distributions.Specifically,hyper-parameters of both the transformer and DRL modules are optimized using an improved grey wolf optimizer(IGWO),ensuring a balanced trade-off between detection accuracy and computational efficiency.The model is trained and evaluated on three realistic Industry 4.0 water datasets:secure water treatment(SWaT),water distribution(WADI),and battle of the attack detection algorithms(BATADAL),which capture diverse attack scenarios in smart treatment and distribution systems.Comparative analysis against state-of-the-art baselines including Transformer,DRL,bidirectional encoder representations from transformers(BERT),convolutional neural network(CNN),long short-term memory(LSTM),and support vector machines(SVM)demonstrates that our proposed Evo-Transformer-DRL framework consistently outperforms others in key metrics such as accuracy,recall,area under the curve(AUC),and execution time.Notably,it achieves a maximum detection accuracy of 99.19%,highlighting its strong generalization capability across different testbeds.These results confirm the suitability of our hybrid framework for real-world Industry 4.0 deployment,where rapid adaptation,scalability,and reliability are paramount for securing critical infrastructure systems. 展开更多
关键词 Industry 4.0 smart water systems false data injection detection cyber-physical security transformER deep reinforcement learning grey wolf optimizer
在线阅读 下载PDF
CNN结合Transformer的高光谱图像和LiDAR数据协同地物分类方法 被引量:2
11
作者 吴海滨 左云逸 +2 位作者 王爱丽 吕浩然 王敏慧 《仪器仪表学报》 北大核心 2025年第8期286-301,共16页
在高光谱图像与LiDAR数据协同分类的研究领域中,尽管CNN和Transformer在图像处理和数据分析中分别展现出对局部特征和全局依赖关系的敏锐洞察力,但二者的协同机制尚未充分挖掘,跨模态特征互补潜力未被有效释放。故提出了一种CNN结合Tran... 在高光谱图像与LiDAR数据协同分类的研究领域中,尽管CNN和Transformer在图像处理和数据分析中分别展现出对局部特征和全局依赖关系的敏锐洞察力,但二者的协同机制尚未充分挖掘,跨模态特征互补潜力未被有效释放。故提出了一种CNN结合Transformer的高光谱图像和LiDAR数据的多模态遥感数据协同地物分类方法。首先,该模型通过主成分分析对高光谱图像进行降维处理以去除光谱的冗余信息,继而利用CNN分层捕获局部纹理特征,同时借助Transformer自注意力机制构建全局光谱-空间表征。然后通过双向特征交互机制,将Transformer输出的全局上下文信息注入CNN特征通道,同时将CNN通道提取的局部细节反馈至Transformer支路,经特征耦合单元实现跨尺度特征对齐,强化模型对高光谱图像全局结构与局部细节的联合提取能力。对于LiDAR数据,采用动态卷积级联模块有效捕获高程信息和上下文关系,最终通过跨模态特征融合模块实现双源数据特征的深度交互与融合,在双模态语义互补中提升复杂地物的分类精度。在Houston2013、Trento和Augsburg这3个公开数据集上的实验表明,该方法总体分类精度分别达到99.85%、99.68%和97.34%,平均准确率分别达到99.87%、99.34%和90.60%,较GLT、HCT等主流方法的分类精度有所提高,充分证明所提方法进行多模态数据协同分类的优势和有效性。 展开更多
关键词 高光谱图像 LIdaR数据 transformER 卷积神经网络 多模态数据
原文传递
基于CEEMDAN与Transformer注意力机制的非平稳涌水量智能预测方法
12
作者 李大鹏 李军卫 +3 位作者 白志新 马彪 王晓东 康志伟 《山西煤炭》 2026年第2期38-46,96,共10页
矿井涌水量预测对安全生产至关重要,但其序列具有非平稳、非线性及多尺度耦合等复杂特性,传统方法难以准确预测。为此,提出一种融合互补集合经验模态分解(CEEMDAN)与多头注意力(Transformer注意力)机制的智能预测模型,核心突破在于通过... 矿井涌水量预测对安全生产至关重要,但其序列具有非平稳、非线性及多尺度耦合等复杂特性,传统方法难以准确预测。为此,提出一种融合互补集合经验模态分解(CEEMDAN)与多头注意力(Transformer注意力)机制的智能预测模型,核心突破在于通过“多尺度分解-全局依赖捕捉-关键特征强化”的三级协同架构,解决非平稳时序预测中模态混叠、长时依赖丢失及特征贡献失衡问题。具体而言,模型首先利用CEEMDAN将原始涌水量序列自适应分解为具有明确物理意义的固有模态函数(IMF)与残差分量,从根源上抑制模态混叠、降低数据复杂度;随后通过基于多头自注意力的Transformer编码器,结合位置编码精准捕捉不同频率IMF分量的全局时序依赖,突破传统循环神经网络长时依赖捕捉能力不足的局限;最终引入通道注意力模块,自适应加权多尺度特征,显著强化关键分量对预测结果的贡献度。基于山西省大同市塔山矿二盘区某工作面实测数据的验证结果表明,该模型预测性能全面优于ARIMA、LSTM、VMD-DBN等11类主流模型:平均绝对误差(EMA)降至27.4 m^(3)/d,均方根误差(ERMS)低至31.5 m^(3)/d,决定系数(R2)高达0.96,其中R2较最优对比模型(Informer)提升2.1%,为非平稳矿井涌水量预测提供了高精度、强适应性的新方法,对矿井水害防控与排水系统优化具有重要工程应用价值。 展开更多
关键词 多尺度解析 多头注意力 注意力机制 智能预测模型 互补集合经验模态分解自适应噪声
在线阅读 下载PDF
一种面向地图综合建筑多边形化简的Transformer模型
13
作者 刘鹏程 成晓强 +2 位作者 肖天元 杨敏 艾廷华 《测绘学报》 北大核心 2026年第1期124-137,共14页
针对地图综合中建筑多边形化简方法依赖人工规则、自动化程度低且难以利用已有化简成果的问题,本文提出了一种基于Transformer机制的建筑多边形化简模型。该模型首先把建筑多边形映射至一定范围的网格空间,将建筑多边形的坐标串表达为... 针对地图综合中建筑多边形化简方法依赖人工规则、自动化程度低且难以利用已有化简成果的问题,本文提出了一种基于Transformer机制的建筑多边形化简模型。该模型首先把建筑多边形映射至一定范围的网格空间,将建筑多边形的坐标串表达为网格序列,从而获取建筑多边形化简前后的Token序列,构建出建筑多边形化简样本对数据;随后采用Transformer架构建立模型,基于样本数据利用模型的掩码自注意力机制学习点序列之间的依赖关系,最终逐点生成新的简化多边形,从而实现建筑多边形的化简。在训练过程中,模型使用结构化的样本数据,设计了忽略特定索引的交叉熵损失函数以提升化简质量。试验设计包括主试验与泛化验证两部分。主试验基于洛杉矶1∶2000建筑数据集,分别采用0.2、0.3和0.5 mm 3种网格尺寸对多边形进行编码,实现了目标比例尺为1∶5000与1∶10000的化简。试验结果表明,在0.3 mm的网格尺寸下模型性能最优,验证集上的化简结果与人工标注的一致率超过92.0%,且针对北京部分区域的建筑多边形数据的泛化试验验证了模型的迁移能力;与LSTM模型的对比分析显示,在参数规模相近的条件下,LSTM模型无法形成有效收敛,并生成可用结果。本文证实了Transformer在处理空间几何序列任务中的潜力,且能够有效复用已有化简样本,为智能建筑多边形化简提供了具有工程实用价值的途径。 展开更多
关键词 地图综合 建筑多边形化简 TOKENIZATION transformer模型 上下文工程
在线阅读 下载PDF
融合视觉测量和Transformer的桥梁裂缝智能检测与三维映射
14
作者 余加勇 杨睿韬 +2 位作者 王昱东 彭志豪 周劲 《湖南大学学报(自然科学版)》 北大核心 2026年第3期108-118,共11页
无人机巡检方法已成为桥梁表观病害检测的重要手段.针对无人机(unmanned aerial vehicle,UVA)易受振动干扰、成像质量不稳定、卷积神经网络识别裂缝小目标效果差及裂缝定位困难等问题,提出了融合视觉测量与Transformer的桥梁裂缝智能检... 无人机巡检方法已成为桥梁表观病害检测的重要手段.针对无人机(unmanned aerial vehicle,UVA)易受振动干扰、成像质量不稳定、卷积神经网络识别裂缝小目标效果差及裂缝定位困难等问题,提出了融合视觉测量与Transformer的桥梁裂缝智能检测与三维映射方法,建立基于Real ESRGan图像超分辨率技术的航拍图像采集方法,复原运动模糊图像,实现对高质量桥梁图像的高效采集;在桥梁复杂背景下,通过具有全局注意力机制的Detection Transformer算法和数字图像处理技术,实现对裂缝病害的识别提取与高亮标识;利用多视角影像匹配法处理裂缝高亮标识图片,生成密集点云数据,构建桥梁病害面的精细模型,实现对裂缝病害的三维映射定位.以长沙市靳江河大桥为实验桥梁,使用无人机拍摄2316张桥梁表观图像用于生成三维实景模型,拍摄479张精细图像用于病害检测,使用Real ESRgan超分辨率重建算法对运动模糊图像进行复原,所建立的桥梁病害部位精细模型像素点分辨率达0.25 mm/pixel,裂缝宽度测量最小相对误差为1.37%,最大相对误差为9.90%.结果表明,融合视觉测量和Transformer的桥梁裂缝智能检测与三维映射方法,能够有效提升检测效率,保障人员安全,实现对桥梁裂缝的数字化、智能化、可视化检测,具备重要的研究价值与广阔的应用前景. 展开更多
关键词 桥梁 裂缝检测 无人机 三维建模 transformER
在线阅读 下载PDF
基于CNN-Transformer架构的电磁传播损耗预测算法
15
作者 万勇 李骏杰 +1 位作者 孙伟峰 戴永寿 《现代电子技术》 北大核心 2026年第6期43-48,共6页
为了解决传统经验传播损耗模型预测精度不足的问题,提出一种基于CNN-Transformer架构的电磁传播损耗预测算法,通过构建回归模型进行精准的传播损耗预测。通过斯皮尔曼系数法提取有效特征,利用CNN提取与传播损耗预测高度相关的浅层特征,... 为了解决传统经验传播损耗模型预测精度不足的问题,提出一种基于CNN-Transformer架构的电磁传播损耗预测算法,通过构建回归模型进行精准的传播损耗预测。通过斯皮尔曼系数法提取有效特征,利用CNN提取与传播损耗预测高度相关的浅层特征,将从卫星图像中获取的传播路径上地物特征序列进行位置编码,增强对传播路径中不同地物特征顺序对传播损耗影响的理解。最后将CNN提取的浅层特征与位置编码后的地物特征输入到Transformer模型,通过多头自注意力机制捕捉特征间的全局关联性,从而有效校正传播损耗的预测结果。实验结果表明,所提出的CNN-Transformer方法显著降低了传播损耗预测的均方根误差(RMSE),达到了3.3745 dB,同时保持了0.8956的较高确定性系数(R^(2))。所提的电磁传播损耗预测算法为无线通信传播特性研究领域提供了参考,具有一定的应用价值。 展开更多
关键词 电磁传播 损耗预测 transformER CNN 斯皮尔曼系数法 地物类型 位置编码
在线阅读 下载PDF
基于行列栅格感知Transformer的车道线检测方法
16
作者 陈广秋 刘枫铭 +1 位作者 段锦 黄丹丹 《光学精密工程》 北大核心 2026年第6期953-972,共20页
低光照或光线不均的夜间环境下,道路成像存在车道线可见性低、局部过曝和阴影,现有车道线检测算法多聚焦于提升正常光照环境下的检测能力,忽略了夜间光照环境下的道路特征退化问题,导致算法的精确性和鲁棒性差。针对上述问题,本文在编... 低光照或光线不均的夜间环境下,道路成像存在车道线可见性低、局部过曝和阴影,现有车道线检测算法多聚焦于提升正常光照环境下的检测能力,忽略了夜间光照环境下的道路特征退化问题,导致算法的精确性和鲁棒性差。针对上述问题,本文在编解码语义分割框架下,提出了一种基于行列栅格感知Transformer的车道线检测方法。该方法首先采用光增强曲线模块对输入图像进行光照归一化,通过生成对抗网络实现光照失衡图像到光照分布合理图像的映射,有效抑制噪声与过曝;编码器采用ResNet34网络提取多尺度特征;行列栅格感知Transformer模块通过行、列双向令牌编码显式建模车道线的空间结构与上下文关系,增强模型对几何形变与局部遮挡的鲁棒性;解码器由双边上采样模块与置信度评估模块构成,分别完成特征重建与车道线存在性预测。实验结果表明,本文方法在TuSimple数据集下准确率为96.86%;在CULane数据集下整体场景F1分数为77.5%,其中夜间场景下F1值达到76.7%。本文方法的检测精度优于当前主流车道线检测模型,能有效实现复杂夜间环境下的车道线精准检测。 展开更多
关键词 交通工程 车道线检测 语义分割 transformER 栅格感知
在线阅读 下载PDF
基于BSimilar优化PTransformer的光伏功率短期预测
17
作者 张文广 蔡浩 +1 位作者 刘科 孙盼荣 《动力工程学报》 北大核心 2026年第1期77-84,102,共9页
为提高光伏功率短期预测的精度,提出了考虑光伏设备性能退化因素的相似日算法优化的分时段多通道独立光伏功率短期预测方法。首先,在PTransformer模型中用分时段与通道独立的方法来处理光伏输入数据,以降低空间复杂度及提高长时间数据... 为提高光伏功率短期预测的精度,提出了考虑光伏设备性能退化因素的相似日算法优化的分时段多通道独立光伏功率短期预测方法。首先,在PTransformer模型中用分时段与通道独立的方法来处理光伏输入数据,以降低空间复杂度及提高长时间数据序列的关注度。其次,运用Transformer的编码器模型,通过自身注意力机制捕捉光伏序列特征之间的依赖关系,进行光伏功率的短期预测。最后,运用夹角余弦距离计算相似度并考虑光伏设备性能退化因素确定相似日,利用其功率数据优化PTransformer模型,以改善功率数据的滞后性。结果表明:相比典型的光伏功率短期预测方法,所提方法训练速度更快,预测精准度更高,并且对复杂天气状况下的光伏功率也有较好的预测结果。 展开更多
关键词 光伏功率 短期预测 性能退化 贝叶斯分析 transformER 相似日
在线阅读 下载PDF
Transformer架构驱动下的综采工作面矿压时序特征智能预测
18
作者 杜锋 陈博 +7 位作者 王文强 浦海 杜雪明 李国栋 乔瑞 李鑫磊 徐杰 曹煜 《煤田地质与勘探》 北大核心 2026年第2期1-13,共13页
【背景】矿压预测是顶板灾害预警和管理的重要手段,是智能化矿井安全生产的前提和基础。开采过程中综采工作面环境复杂多变,导致基于电液控制系统采集的支架压力数据分布差异较大,预测困难。【方法】基于Transformer的矿压预测模型,使... 【背景】矿压预测是顶板灾害预警和管理的重要手段,是智能化矿井安全生产的前提和基础。开采过程中综采工作面环境复杂多变,导致基于电液控制系统采集的支架压力数据分布差异较大,预测困难。【方法】基于Transformer的矿压预测模型,使用线性插值填补缺失的矿压值,并使用滑动窗口算法调整训练时的矿压数据结构;针对矿压数据的时序特性,构建融合时序特征的输入序列,利用多头注意力(multi-head-attention)机制动态计算权重,根据数据本身自适应地聚焦关键时间步,从而有效捕捉复杂的非线性时序依赖,显著提升特征表征与预测能力,最后使用迁移学习方法,完成对上、中、下工作面支架工作阻力预测,并搭建基于矿压大数据的智能分析及预测平台。【结果和结论】使用多头注意力机制代替神经网络捕捉全局矿压数据特征,比循环神经网络(recurrent neural network,RNN)和长短期记忆网络(long short-term memory,LSTM)具有更强长序列依赖能力和特征学习能力,能有效降低模型损失,更加适用于预测矿压,Transformer模型在测试集上的均方误差和平均绝对误差损失精度分别达到0.34%和2.57%。Transformer模型也具有较强的泛化能力,使用迁移学习方法微调后,能够有效降低模型损失,在迁移同工作面其他支架时具有更好的泛化效果,Transformer预测模型进一步验证在矿压预测问题的适用性和可行性。平台可视化显示系统可精准分析预测前后的来压次数、推进距离、来压判据和工作面矿压云图等关键参数,为顶板灾害预警乃至其他灾害预警提供新思路,也为矿井安全高效开采与智能化建设奠定了坚实基础。 展开更多
关键词 综采工作面 深度学习 transformer模型 时间序列 矿压显现 矿压预测
在线阅读 下载PDF
基于长短期记忆网络-Transformer模型参数优化的锂离子电池剩余使用寿命预测 被引量:1
19
作者 高建树 郝世宇 党一诺 《汽车工程师》 2026年第1期32-39,共8页
为提高锂离子电池剩余使用寿命(RUL)预测的准确性,提出了一种基于长短期记忆(LSTM)网络-Transformer模型参数优化的RUL预测方法,采用网格搜索法选取模型的超参数,利用LSTM网络提取锂离子电池时间序列中的长短期依赖关系,使用Transforme... 为提高锂离子电池剩余使用寿命(RUL)预测的准确性,提出了一种基于长短期记忆(LSTM)网络-Transformer模型参数优化的RUL预测方法,采用网格搜索法选取模型的超参数,利用LSTM网络提取锂离子电池时间序列中的长短期依赖关系,使用Transformer的自注意力机制处理全局信息并对超参数进行优化,通过全连接层进行最终的寿命预测。基于美国国家航空航天局(NASA)数据集和先进生命周期工程中心(CALCE)数据集的试验验证结果表明,模型在更短的序列长度、更少的隐藏层数量和训练次数等条件下,在多种评价指标上均优于LSTM网络模型、Transformer模型及其他神经网络模型,具有更高的预测精度和鲁棒性。最后,通过不同电池的对比试验进一步验证了模型在不同电池数据上的泛化能力。 展开更多
关键词 锂离子电池 剩余使用寿命预测 参数优化 长短期记忆神经网络 transformER 混合模型
在线阅读 下载PDF
融合群分解与Transformer-KAN的短期风速预测
20
作者 史加荣 张思怡 《南京信息工程大学学报》 北大核心 2026年第1期60-68,共9页
针对风速固有的不稳定性,通过融合群分解(Swarm Decomposition,SWD)、Transformer和Kolmogorov-Arnold网络(KAN),提出一种SWD-Transformer-KAN预测模型.首先,利用SWD对原始风速数据进行分解,以提取关键特征.其次,针对每个被分解的子序列... 针对风速固有的不稳定性,通过融合群分解(Swarm Decomposition,SWD)、Transformer和Kolmogorov-Arnold网络(KAN),提出一种SWD-Transformer-KAN预测模型.首先,利用SWD对原始风速数据进行分解,以提取关键特征.其次,针对每个被分解的子序列,建立Transformer-KAN模型,所建模型充分利用了Transformer的时序处理能力和KAN的非线性逼近能力.最后,对所有子序列的预测结果进行叠加,得到最终的风速预测值.为了验证所提出模型的有效性,将其与其他模型进行实验对比,结果表明,SWD-Transformer-KAN模型具有最优的预测性能,其决定系数(R^(2))高达99.91%. 展开更多
关键词 风速预测 群分解 transformER Kolmogorov-Arnold网络
在线阅读 下载PDF
上一页 1 2 250 下一页 到第
使用帮助 返回顶部