期刊文献+
共找到1,766篇文章
< 1 2 89 >
每页显示 20 50 100
基于Enhanced Transformer的铁路客运站节假日客流预测研究
1
作者 朱友蓉 李得伟 +2 位作者 李涛 吴迪 李华 《铁道经济研究》 2026年第1期97-108,共12页
节假日作为居民集中出行的高峰期,其客流特征直接关系到铁路运营的安全、运力配置效率和服务质量。节假日期间的铁路客流呈现出与日常显著不同的特殊性,主要表现为长距离出行需求剧增、旅游流与探亲流高度叠加,以及客流分布的时空不均衡... 节假日作为居民集中出行的高峰期,其客流特征直接关系到铁路运营的安全、运力配置效率和服务质量。节假日期间的铁路客流呈现出与日常显著不同的特殊性,主要表现为长距离出行需求剧增、旅游流与探亲流高度叠加,以及客流分布的时空不均衡性,为铁路运营管理带来了挑战。一是客流需求的突增,热门线路和高峰时段的运输能力趋于饱和,传统时间序列模型难以捕捉这种剧烈的非平稳波动;二是预售数据不完整性,旅客购票行为贯穿整个预售期,不同时间点获取的预售数据反映的未来客流信息是动态变化的;三是客流受时间、节假日效应、列车运行安排等多种因素共同影响,这些特征之间存在复杂的非线性耦合关系。为解决上述问题,提出一种基于Enhanced Transformer的铁路客运站节假日客流预测模型。在特征工程方面,主要从时间特征、节假日特征和运营特征3个维度构建了多源特征体系:时间特征包括预售提前量和小时周期编码,用于捕捉旅客出行决策行为和一天内客流的规律性波动;节假日特征涵盖周末指示、节假日标记、节前高峰和节假日周末叠加效应,用于精确捕捉节假日期间客流模式的突变特征;运营特征则提取了每小时上下行列车班次数,反映车站的实时运力供给情况。通过多头自注意力机制,模型能够在不同的表示子空间中并行学习这些多源特征间的复杂交互模式,实现对客流驱动因素的深度理解。创新性地将动态变化的预售数据作为关键输入特征,结合模型的时序信息处理能力,实现对未来客流的滚动预测,突破传统方法在处理预售期动态性上的局限,通过选取苏州地区4个核心铁路客站(苏州北站、苏州站、苏州新区站、苏州园区站)在2025年春节期间的客流数据进行案例分析。实验结果表明,Enhanced Transformer模型对于苏州北站和苏州站等客流规模大的枢纽站,预测准确率可达84.06%,证明了模型在处理高流量、高波动性时间序列数据时的有效性。与Transformer,XGBoost,LSTM,Bi-LSTM的4种基准模型的对比实验显示,Enhanced Transformer在MSE,RMSE,MAE和准确率等所有评估指标上均全面优于其他模型。相较于标准Transformer模型,其预测准确率提升了约6.29%~6.89%;相较于LSTM,准确率提升约3.4%。这些性能提升归因于模型在长序列依赖捕捉、非平稳数据适应和多源特征交互方面的结构优势,为铁路管理部门提供了有力的技术支持,有助于实现节假日期间运力的精准配置、提升旅客服务质量和保障运营安全。 展开更多
关键词 铁路客流预测 节假日 enhanced transformer 动态预售数据获取时间 时间序列预测 多源特征 注意力机制 铁路运营
在线阅读 下载PDF
M2ATNet: Multi-Scale Multi-Attention Denoising and Feature Fusion Transformer for Low-Light Image Enhancement
2
作者 Zhongliang Wei Jianlong An Chang Su 《Computers, Materials & Continua》 2026年第1期1819-1838,共20页
Images taken in dim environments frequently exhibit issues like insufficient brightness,noise,color shifts,and loss of detail.These problems pose significant challenges to dark image enhancement tasks.Current approach... Images taken in dim environments frequently exhibit issues like insufficient brightness,noise,color shifts,and loss of detail.These problems pose significant challenges to dark image enhancement tasks.Current approaches,while effective in global illumination modeling,often struggle to simultaneously suppress noise and preserve structural details,especially under heterogeneous lighting.Furthermore,misalignment between luminance and color channels introduces additional challenges to accurate enhancement.In response to the aforementioned difficulties,we introduce a single-stage framework,M2ATNet,using the multi-scale multi-attention and Transformer architecture.First,to address the problems of texture blurring and residual noise,we design a multi-scale multi-attention denoising module(MMAD),which is applied separately to the luminance and color channels to enhance the structural and texture modeling capabilities.Secondly,to solve the non-alignment problem of the luminance and color channels,we introduce the multi-channel feature fusion Transformer(CFFT)module,which effectively recovers the dark details and corrects the color shifts through cross-channel alignment and deep feature interaction.To guide the model to learn more stably and efficiently,we also fuse multiple types of loss functions to form a hybrid loss term.We extensively evaluate the proposed method on various standard datasets,including LOL-v1,LOL-v2,DICM,LIME,and NPE.Evaluation in terms of numerical metrics and visual quality demonstrate that M2ATNet consistently outperforms existing advanced approaches.Ablation studies further confirm the critical roles played by the MMAD and CFFT modules to detail preservation and visual fidelity under challenging illumination-deficient environments. 展开更多
关键词 Low-light image enhancement multi-scale multi-attention transformer
在线阅读 下载PDF
Effect of fluoride roasting on copper species transformation on chrysocolla surfaces and its role in enhanced sulfidation flotation
3
作者 Yingqiang Ma Xin Huang +5 位作者 Yafeng Fu Zhenguo Song Sen Luo Shuanglin Zheng Feng Rao Wanzhong Yin 《International Journal of Minerals,Metallurgy and Materials》 2026年第1期165-176,共12页
It is difficult to recover chrysocolla from sulfidation flotation which is closely related to the mineral surface composition.In this study,the effects of fluoride roasting on the surface composition of chrysocolla we... It is difficult to recover chrysocolla from sulfidation flotation which is closely related to the mineral surface composition.In this study,the effects of fluoride roasting on the surface composition of chrysocolla were investigated,its impact on sulfidation flotation was explored,and the mechanisms involved in both fluoride roasting and sulfidation flotation were discussed.With CaF_(2)as the roasting reagent,Na_(2)S·9H_(2)O as the sulfidation reagent,and sodium butyl xanthate(NaBX)as the collector,the results of the flotation experiments showed that fluoride roasting improved the floatability of chrysocolla,and the recovery rate increased from 16.87%to 82.74%.X-ray diffraction analysis revealed that after fluoride roasting,approximately all the Cu on the chrysocolla surface was exposed in the form of CuO,which could provide a basis for subsequent sulfidation flotation.The microscopy and elemental analyses revealed that large quantities of"pagoda-like"grains were observed on the sulfidation surface of the fluoride-roasted chrysocolla,indicating high crystallinity particles of copper sulfide.This suggests that the effect of sulfide formation on the chrysocolla surface was more pronounced.X-ray photoelectron spectroscopy revealed that fluoride roasting increased the relative contents of sulfur and copper on the surface and that both the Cu~+and polysulfide fractions on the surface of the minerals increased.This enhances the effect of sulfidation,which is conducive to flotation recovery.Therefore,fluoride roasting improved the effect of copper species transformation and sulfidation on the surface of chysocolla,promoted the adsorption of collectors,and improved the recovery of chrysocolla from sulfidation flotation. 展开更多
关键词 sulfidation flotation CHRYSOCOLLA fluoride roasting copper species transformation enhanced sulfidation
在线阅读 下载PDF
多尺度非对称注意力遥感去雾Transformer
4
作者 王旭阳 梁宇航 《广西师范大学学报(自然科学版)》 北大核心 2026年第2期77-89,共13页
雾霾干扰会导致遥感图像结构模糊、细节丢失,严重影响下游视觉任务的准确性。为此,本文提出一种异构增强的遥感图像去雾网络,从空间结构建模与频率信息整合2个层面提升特征恢复能力。具体而言,设计多尺度非对称注意力Transformer模块,... 雾霾干扰会导致遥感图像结构模糊、细节丢失,严重影响下游视觉任务的准确性。为此,本文提出一种异构增强的遥感图像去雾网络,从空间结构建模与频率信息整合2个层面提升特征恢复能力。具体而言,设计多尺度非对称注意力Transformer模块,引入方向感知机制以增强模糊边缘与纹理细节的建模;同时构建基于小波变换高低频自适应增强模块,使用Haar小波分解分离频域信息,分别通过高频与低频子模块强化边缘轮廓与结构表达。2个模块分别嵌入特征提取与融合阶段,协同缓解传统方法方向性建模不足与高频特征易丢失等问题。在保持低计算开销的前提下,本文方法在HAZE1K与RICE数据集上的平均PSNR/SSIM性能分别达到24.9936/0.9099与33.1802/0.8942,在细节恢复方面表现出显著优势。 展开更多
关键词 遥感图像去雾 transformer 非对称注意力 高低频特征增强 小波变换 方向感知建模 深度学习
在线阅读 下载PDF
结合密集多尺度特征融合和特征知识增强Transformer的遥感图像描述模型
5
作者 刘汉卿 桑国明 张益嘉 《计算机应用》 北大核心 2026年第3期741-749,共9页
针对遥感图像描述任务中多尺度特征利用不足、纹理重复区域细节关联度低及多目标特征协同建模困难等问题,提出一种结合密集多尺度特征融合和特征知识增强Transformer的遥感图像描述模型DMFKF-T(Dense Multiscale Feature and Knowledge ... 针对遥感图像描述任务中多尺度特征利用不足、纹理重复区域细节关联度低及多目标特征协同建模困难等问题,提出一种结合密集多尺度特征融合和特征知识增强Transformer的遥感图像描述模型DMFKF-T(Dense Multiscale Feature and Knowledge Fusion Transformer)。设计密集多尺度特征融合模块(DMFFM),通过跨层级跳跃连接动态聚合不同尺度的特征图,同步捕获全局场景特征与局部细节信息;在解码阶段,引入语义融合增强(SFA)模块增强模型捕捉长距离依赖关系与理解上下文信息的能力,并结合离散余弦变换(DCT)的频率增强通道注意力机制分析频域特征的相关性,从而强化对复杂空间拓扑和非线性关系的建模能力。实验结果表明,在RSICD(Remote Sensing Image Captioning Dataset)上,与SD-RSIC(Summarization-driven Deep Remote Sensing Image Captioning)模型相比,DMFKF-T的BLEU-4(BiLingual Evaluation Understudy with 4-grams)和CIDEr(Consensus-based Image Description Evaluation)指标分别提升了8.6%和14.4%。可见,DMFKF-T可以准确地生成语义丰富的遥感图像描述语句。 展开更多
关键词 密集多尺度特征融合 语义融合增强 频率增强通道注意力 特征知识增强transformer 遥感图像描述
在线阅读 下载PDF
结合多尺度边缘增强和轻量化Transformer的图像超分辨率重建网络
6
作者 王文佩 赵宇峰 +1 位作者 徐飞 赵莎莎 《计算机系统应用》 2026年第1期117-128,共12页
现有超分辨率网络(super-resolution,SR)通过简单堆叠多分支结构来捕捉多尺度特征,导致网络推理速度缓慢,且无法有效建模全局像素关联.部分研究引入Transformer的自注意力机制来提升重建质量,却导致计算复杂度大幅上升.针对这些问题,本... 现有超分辨率网络(super-resolution,SR)通过简单堆叠多分支结构来捕捉多尺度特征,导致网络推理速度缓慢,且无法有效建模全局像素关联.部分研究引入Transformer的自注意力机制来提升重建质量,却导致计算复杂度大幅上升.针对这些问题,本文提出了一种结合多尺度边缘增强和轻量化Transformer的SR网络(ECTL-SR).具体而言,提出一种轻量边缘导向卷积块有效捕捉并融合不同感受野下边缘细粒度特征,同时引入结构重参数化技术来减少多分支冗余计算和内存开销.此外,将轻量型位置感知环形卷积嵌入改进的Transformer架构中来增强网络捕捉图像长距离依赖能力,在低成本下实现感受野高效扩展.实验结果表明,该网络在性能与效率之间取得了良好平衡,并在Urban100等多个基准数据集上优于现有SR方法,展现出更优的重建效果. 展开更多
关键词 图像超分辨率 边缘增强 轻量化transformer 结构重参数化 位置感知环形卷积
在线阅读 下载PDF
RetinexWT: Retinex-Based Low-Light Enhancement Method Combining Wavelet Transform
7
作者 Hongji Chen Jianxun Zhang +2 位作者 Tianze Yu Yingzhu Zeng Huan Zeng 《Computers, Materials & Continua》 2026年第2期2113-2132,共20页
Low-light image enhancement aims to improve the visibility of severely degraded images captured under insufficient illumination,alleviating the adverse effects of illumination degradation on image quality.Traditional ... Low-light image enhancement aims to improve the visibility of severely degraded images captured under insufficient illumination,alleviating the adverse effects of illumination degradation on image quality.Traditional Retinex-based approaches,inspired by human visual perception of brightness and color,decompose an image into illumination and reflectance components to restore fine details.However,their limited capacity for handling noise and complex lighting conditions often leads to distortions and artifacts in the enhanced results,particularly under extreme low-light scenarios.Although deep learning methods built upon Retinex theory have recently advanced the field,most still suffer frominsufficient interpretability and sub-optimal enhancement performance.This paper presents RetinexWT,a novel framework that tightly integrates classical Retinex theory with modern deep learning.Following Retinex principles,RetinexWT employs wavelet transforms to estimate illumination maps for brightness adjustment.A detail-recovery module that synergistically combines Vision Transformer(ViT)and wavelet transforms is then introduced to guide the restoration of lost details,thereby improving overall image quality.Within the framework,wavelet decomposition splits input features into high-frequency and low-frequency components,enabling scale-specific processing of global illumination/color cues and fine textures.Furthermore,a gating mechanism selectively fuses down-sampled and up-sampled features,while an attention-based fusion strategy enhances model interpretability.Extensive experiments on the LOL dataset demonstrate that RetinexWT surpasses existing Retinex-oriented deeplearning methods,achieving an average Peak Signal-to-Noise Ratio(PSNR)improvement of 0.22 dB over the current StateOfTheArt(SOTA),thereby confirming its superiority in low-light image enhancement.Code is available at https://github.com/CHEN-hJ516/RetinexWT(accessed on 14 October 2025). 展开更多
关键词 Low-light image enhancement retinex algorithm wavelet transform vision transformer
在线阅读 下载PDF
Enhanced flotation separation of bastnaesite and monazite by suspension roasting:A study of flotation performance and mechanisms
8
作者 Linghui Zhang Wenbo Li +4 位作者 Shaokai Cheng Xiaolong Zhang Junyan Sun Rui Qu Maoyuan Wang 《Journal of Rare Earths》 2026年第3期957-968,I0007,共13页
Bayan Obo rare earth mine is the largest light rare earth resource worldwide,primarily extracts rare earth elements(REEs)from mixed RE concentrates with bastnaesite and monazite.Nevertheless,the adoption of the concen... Bayan Obo rare earth mine is the largest light rare earth resource worldwide,primarily extracts rare earth elements(REEs)from mixed RE concentrates with bastnaesite and monazite.Nevertheless,the adoption of the concentrated sulfuric acid roasting metallurgical process has resulted in damage to the environment.Therefore,this paper adopted the method of selective mineral phase transformation(MPT)followed by enhanced micro-flotation.By determining the optimal MPT co nditions,the flotation recovery of bastnaesite-roasted products by the collector(phthalic acid,PA)is improved,and the enhanced separation of bastnaesite with monazite is realized.The results show that with the increase of roasting temperature and time,the bastnaesite decomposition product is CeOF and monazite does not change significantly.Subsequent micro-flotation exhibits a gradual decline in the PA consumption of bastnaesiteroasted products,while the flotation recovery of monazite-roasted products remains poor.The artificial mixed ore experiments result in a CeOF foam product with a content of 94.14%and a recovery of 85.80%,and a monazite tank product with a content of 73.53%and a recovery of 87.87%.Compared with the preroasting ore,the surface and interior of bastnaesite-roasted products develop numerous cracks and porosities,and no obvious structural damage is observed in monazite-roasted particles.As the roasting temperature increases,the mineral particles undergo recrystallization or closure,reducing the specific surface area of bastnaesite-roasted products and enhancing hydrophobicity,leading to diminished PA consumption.Fourier transform infrared and other flotation-relation tests show that PA is chemisorbed on the surface of CeOF.The MPT conditions are optimized in this study,which provides a reference for further advancing the efficient separation of bastnaesite and monazite. 展开更多
关键词 Rare earths BASTNAESITE MONAZITE Mineral phase transformation enhanced flotation separation MECHANISMS
原文传递
基于可变形卷积与Transformer的遥感影像变化检测方法
9
作者 苏志鹏 王欢 潘清莹 《遥感信息》 北大核心 2026年第1期155-162,共8页
针对当前遥感影像变化检测方法在复杂场景下全局建模能力不足、小目标检测效果差等问题,提出一种基于可变形卷积与Transformer的混合网络模型。首先,设计差异增强模块,通过通道注意力机制突出双时相影像的特征差异;其次,采用可变形卷积... 针对当前遥感影像变化检测方法在复杂场景下全局建模能力不足、小目标检测效果差等问题,提出一种基于可变形卷积与Transformer的混合网络模型。首先,设计差异增强模块,通过通道注意力机制突出双时相影像的特征差异;其次,采用可变形卷积改进ResNet18主干网络,增强对多尺度目标的特征提取能力;最后,引入Transformer编码-解码结构,建立全局上下文依赖关系。在LEVIR-CD和CDD数据集上的实验结果表明,该方法在F1分数和IoU指标上分别达到90.55%、80.98%和94.25%、89.57%,显著优于对比方法。消融实验证实各模块的有效性,其中可变形卷积使IoU提升1.86个百分点,差异增强模块使IoU提升0.71个百分点。该方法为提升复杂场景下的变化检测精度提供了有效解决方案。 展开更多
关键词 变化检测 可变形卷积 差异增强模块 transformer
在线阅读 下载PDF
Extreme Attitude Prediction of Amphibious Vehicles Based on Improved Transformer Model and Extreme Loss Function
10
作者 Qinghuai Zhang Boru Jia +3 位作者 Zhengdao Zhu Jianhua Xiang Yue Liu Mengwei Li 《哈尔滨工程大学学报(英文版)》 2026年第1期228-238,共11页
Amphibious vehicles are more prone to attitude instability compared to ships,making it crucial to develop effective methods for monitoring instability risks.However,large inclination events,which can lead to instabili... Amphibious vehicles are more prone to attitude instability compared to ships,making it crucial to develop effective methods for monitoring instability risks.However,large inclination events,which can lead to instability,occur frequently in both experimental and operational data.This infrequency causes events to be overlooked by existing prediction models,which lack the precision to accurately predict inclination attitudes in amphibious vehicles.To address this gap in predicting attitudes near extreme inclination points,this study introduces a novel loss function,termed generalized extreme value loss.Subsequently,a deep learning model for improved waterborne attitude prediction,termed iInformer,was developed using a Transformer-based approach.During the embedding phase,a text prototype is created based on the vehicle’s operation log data is constructed to help the model better understand the vehicle’s operating environment.Data segmentation techniques are used to highlight local data variation features.Furthermore,to mitigate issues related to poor convergence and slow training speeds caused by the extreme value loss function,a teacher forcing mechanism is integrated into the model,enhancing its convergence capabilities.Experimental results validate the effectiveness of the proposed method,demonstrating its ability to handle data imbalance challenges.Specifically,the model achieves over a 60%improvement in root mean square error under extreme value conditions,with significant improvements observed across additional metrics. 展开更多
关键词 Amphibious vehicle Attitude prediction Extreme value loss function enhanced transformer architecture External information embedding
在线阅读 下载PDF
Knowledge graph-enhanced framework for electric power engineering report generation
11
作者 Chen Qian Yu-Yan Chen +3 位作者 Jia-Ying Yang Xiao-Wen Le Xiao-Yang Shen Yi-Heng Zeng 《Journal of Electronic Science and Technology》 2026年第1期46-64,共19页
Due to the complex structural hierarchy,with deeply nested associative relations between entities such as equipment,specifications,and business processes,intelligent power grid engineering is challenging.Meanwhile,lim... Due to the complex structural hierarchy,with deeply nested associative relations between entities such as equipment,specifications,and business processes,intelligent power grid engineering is challenging.Meanwhile,limited by the fragmented data and loss of contextual information,the generated reports are prone to the problems such as content redundancy and omission of critical information,failing to meet the demands of efficient decision-making and accurate management in modern power systems.To address these issues,this paper proposes a knowledge graph(KG)-enhanced framework to automatically generate electric power engineering reports.In the KG construction phase,a feature-fused entity recognition model named BERT-BiLSTM-CRF is adopted to improve the accuracy of entity recognition in scenarios involving power engineering professional terminology,thereby solving the problem of ambiguous entity boundaries in traditional models;then a BERT-attention relation extraction model is proposed to enhance the completeness of extracting complex hierarchical and implicit relations in power grid data.In the report generation phase,an improved Transformer architecture is adopted to accurately transform structured knowledge into natural language reports that comply with engineering specifications,addressing the issue of semantic inconsistency caused by the loss of structural information in existing models.By validating with real-world projects,the results show that the proposed framework significantly outperforms existing baseline models in entity recognition,confirming its superiority and applicability in practical engineering. 展开更多
关键词 Entity recognition Improved transformer model Knowledge graph enhancement Power grid engineering report generation Relation extraction
在线阅读 下载PDF
A Transformer Network Combing CBAM for Low-Light Image Enhancement 被引量:1
12
作者 Zhefeng Sun Chen Wang 《Computers, Materials & Continua》 2025年第3期5205-5220,共16页
Recently,a multitude of techniques that fuse deep learning with Retinex theory have been utilized in the field of low-light image enhancement,yielding remarkable outcomes.Due to the intricate nature of imaging scenari... Recently,a multitude of techniques that fuse deep learning with Retinex theory have been utilized in the field of low-light image enhancement,yielding remarkable outcomes.Due to the intricate nature of imaging scenarios,including fluctuating noise levels and unpredictable environmental elements,these techniques do not fully resolve these challenges.We introduce an innovative strategy that builds upon Retinex theory and integrates a novel deep network architecture,merging the Convolutional Block Attention Module(CBAM)with the Transformer.Our model is capable of detecting more prominent features across both channel and spatial domains.We have conducted extensive experiments across several datasets,namely LOLv1,LOLv2-real,and LOLv2-sync.The results show that our approach surpasses other methods when evaluated against critical metrics such as Peak Signal-to-Noise Ratio(PSNR)and Structural Similarity Index(SSIM).Moreover,we have visually assessed images enhanced by various techniques and utilized visual metrics like LPIPS for comparison,and the experimental data clearly demonstrate that our approach excels visually over other methods as well. 展开更多
关键词 Low-light image enhancement CBAM transformer
在线阅读 下载PDF
Retinexformer+:Retinex-Based Dual-Channel Transformer for Low-Light Image Enhancement
13
作者 Song Liu Hongying Zhang +1 位作者 Xue Li Xi Yang 《Computers, Materials & Continua》 2025年第2期1969-1984,共16页
Enhancing low-light images with color distortion and uneven multi-light source distribution presents challenges. Most advanced methods for low-light image enhancement are based on the Retinex model using deep learning... Enhancing low-light images with color distortion and uneven multi-light source distribution presents challenges. Most advanced methods for low-light image enhancement are based on the Retinex model using deep learning. Retinexformer introduces channel self-attention mechanisms in the IG-MSA. However, it fails to effectively capture long-range spatial dependencies, leaving room for improvement. Based on the Retinexformer deep learning framework, we designed the Retinexformer+ network. The “+” signifies our advancements in extracting long-range spatial dependencies. We introduced multi-scale dilated convolutions in illumination estimation to expand the receptive field. These convolutions effectively capture the weakening semantic dependency between pixels as distance increases. In illumination restoration, we used Unet++ with multi-level skip connections to better integrate semantic information at different scales. The designed Illumination Fusion Dual Self-Attention (IF-DSA) module embeds multi-scale dilated convolutions to achieve spatial self-attention. This module captures long-range spatial semantic relationships within acceptable computational complexity. Experimental results on the Low-Light (LOL) dataset show that Retexformer+ outperforms other State-Of-The-Art (SOTA) methods in both quantitative and qualitative evaluations, with the computational complexity increased to an acceptable 51.63 G FLOPS. On the LOL_v1 dataset, RetinexFormer+ shows an increase of 1.15 in Peak Signal-to-Noise Ratio (PSNR) and a decrease of 0.39 in Root Mean Square Error (RMSE). On the LOL_v2_real dataset, the PSNR increases by 0.42 and the RMSE decreases by 0.18. Experimental results on the Exdark dataset show that Retexformer+ can effectively enhance real-scene images and maintain their semantic information. 展开更多
关键词 Low-light image enhancement RETINEX transformer model
在线阅读 下载PDF
基于小波变换增强位置编码Transformer的空域流量预测 被引量:3
14
作者 唐卫贞 刘波 +1 位作者 黄洲升 田齐齐 《现代电子技术》 北大核心 2025年第8期127-132,共6页
随着全球化进程的加快和航空技术的发展,对空中交通流量预测的精度要求也越来越高。为提高空中交通流量预测精度,减轻空中交通管制员的压力,提出一种增强位置编码的Transformer模型。利用小波变换对原始空域流量数据进行分析,通过信噪... 随着全球化进程的加快和航空技术的发展,对空中交通流量预测的精度要求也越来越高。为提高空中交通流量预测精度,减轻空中交通管制员的压力,提出一种增强位置编码的Transformer模型。利用小波变换对原始空域流量数据进行分析,通过信噪比选出性能最优的小波基函数,再进一步计算出小波系数并将其融入位置编码,以增强模型对时间序列数据的理解能力。实验结果表明,所提模型能够准确捕捉空中交通流量数据中的非平稳性和突变特征,其RMSE和MAPE评估指标较原始Transformer模型分别降低了29.9与2.9%,较LSTM模型分别降低了34.5与3.4%。该模型不仅提升了空域流量预测的准确性,也证实了小波变换在增强模型时间序列数据理解中的有效性,且为交通流量管理提供了一种新的技术方案。 展开更多
关键词 空域流量预测 增强位置编码 transformer模型 小波变换 LSTM模型 小波基函数
在线阅读 下载PDF
双向自回归Transformer与快速傅里叶卷积增强的壁画修复 被引量:3
15
作者 陈永 张世龙 杜婉君 《湖南大学学报(自然科学版)》 北大核心 2025年第4期1-15,共15页
针对现有深度学习算法在壁画修复时,存在全局语义一致性约束不足及局部特征提取不充分,导致修复后的壁画易出现边界效应和细节模糊等问题,提出一种双向自回归Transformer与快速傅里叶卷积增强的壁画修复方法.首先,设计基于Transformer... 针对现有深度学习算法在壁画修复时,存在全局语义一致性约束不足及局部特征提取不充分,导致修复后的壁画易出现边界效应和细节模糊等问题,提出一种双向自回归Transformer与快速傅里叶卷积增强的壁画修复方法.首先,设计基于Transformer结构的全局语义特征修复模块,利用双向自回归机制与掩码语言模型(masked language modeling,MLM),提出改进的多头注意力全局语义壁画修复模块,提高对全局语义特征的修复能力.然后,构建了由门控卷积和残差模块组成的全局语义增强模块,增强全局语义特征一致性约束.最后,设计局部细节修复模块,采用大核注意力机制(large kernel attention,LKA)与快速傅里叶卷积提高细节特征的捕获能力,同时减少局部细节信息的丢失,提升修复壁画局部和整体特征的一致性.通过对敦煌壁画数字化修复实验,结果表明,所提算法修复性能更优,客观评价指标均优于比较算法. 展开更多
关键词 壁画修复 双向自回归transformer 掩码语言模型 快速傅里叶卷积 语义增强
在线阅读 下载PDF
结合并联Transformer和残差U-Net网络的水下图像增强 被引量:1
16
作者 陈清江 李宗莹 《电子科技》 2025年第8期57-65,共9页
针对光在水中传播时被吸收,水下图像存在颜色失真、对比度低和细节模糊等问题,文中设计了一个基于并联Transformer和残差卷积的U-Net网络进行水下图像增强。在新U-Net结构中,在编码和解码部分分别置入混合卷积Transformer块(Hybrid Conv... 针对光在水中传播时被吸收,水下图像存在颜色失真、对比度低和细节模糊等问题,文中设计了一个基于并联Transformer和残差卷积的U-Net网络进行水下图像增强。在新U-Net结构中,在编码和解码部分分别置入混合卷积Transformer块(Hybrid Convolution Transformer Block,HCTB)。综合了Transformer的捕获全局信息能力和卷积块捕获局部信息能力,并且在跳跃连接部分搭建了若干平行注意模块(Parallel Attention Module,PAM)来提取更重要的像素和通道信息。采用现有UIEB(Underwater Image Enhancement Benchmark dataset)配对数据集对网络进行训练。为验证所提算法的有效性,选取不同偏色程度的水下图像进行实验与测试。实验结果表明,所提模型较其他先进模型的峰值信噪比PSNR(Peak Single-to-Ratio)值提升了4.3%,获得了较好的主观和客观评价结果,有效提升了水下图像的增强水平。 展开更多
关键词 水下图像增强 transformer 残差卷积 U-Net网络 平行注意模块 通道注意 像素注意 卷积神经网络 深度学习
在线阅读 下载PDF
基于Transformer的道路场景语义分割综述
17
作者 黄天云 向明建 邵世霖 《西南民族大学学报(自然科学版)》 2025年第2期193-205,共13页
在自动驾驶领域,通过对道路场景进行高质量的语义分割,可以为自动驾驶汽车的安全行驶提供重要保障.近年来,随着自动驾驶技术的不断进步,人们对语义分割模型在尺寸、计算成本和分割精度等方面的要求也日益提高,这促使研究者们探索更为先... 在自动驾驶领域,通过对道路场景进行高质量的语义分割,可以为自动驾驶汽车的安全行驶提供重要保障.近年来,随着自动驾驶技术的不断进步,人们对语义分割模型在尺寸、计算成本和分割精度等方面的要求也日益提高,这促使研究者们探索更为先进的算法.首先介绍了语义分割技术在深度学习快速发展下取得的显著进展与不足,从而引出基于Transformer的道路场景语义分割方法.相较于传统的深度学习算法,Transformer具备全面理解复杂场景中上下文关系的能力,尤其在处理多对象和复杂环境时表现出显著优势.接着,根据不同的特征处理策略和模型架构,将基于Transformer的道路场景语义分割方法分为四类:基于全局特征提取的方法、基于局部特征增强的方法、基于混合架构的方法以及基于自监督学习的方法.最后,分析和对比了每类方法的代表性算法,概括总结了各类方法的技术特点和优缺点. 展开更多
关键词 语义分割 transformer 全局特征提取 局部特征增强 混合架构 自监督学习
在线阅读 下载PDF
亮度信噪比引导Transformer的低照度图像增强 被引量:2
18
作者 杜晓刚 路文杰 +1 位作者 雷涛 王营博 《计算机工程与应用》 北大核心 2025年第6期263-272,共10页
一些低照度图像增强方法生成的增强图像存在亮度不均、去噪效果较差和缺少细节的问题。为了解决上述问题,提出了基于亮度和信噪比引导Transformer的低照度图像增强网络BSGFormer。该网络具有三个优势:设计了亮度信噪比生成子网络,旨在... 一些低照度图像增强方法生成的增强图像存在亮度不均、去噪效果较差和缺少细节的问题。为了解决上述问题,提出了基于亮度和信噪比引导Transformer的低照度图像增强网络BSGFormer。该网络具有三个优势:设计了亮度信噪比生成子网络,旨在提取全局光照信息和定位信息量缺失的暗区域;通过亮度和信噪比特征引导Transformer,仅对信息量缺失的暗区提取长距离特征以减少计算量,同时引导后续特征融合模块,借助亮区信息来丰富暗区细节并实现信息共享;设计了交叉融合注意力模块并将其引入到编解码器之间,改善网络对图像细节信息的保留能力。在四个公开数据集上进行实验表明,与主流方法相比,BSGFormer在主观视觉和客观评价两方面均得到了更好的增强效果。 展开更多
关键词 低照度图像 图像增强 transformer 卷积残差
在线阅读 下载PDF
结合多尺度特征增强与记忆引导Transformer的遥感图像描述算法 被引量:2
19
作者 姚志远 桑国明 张益嘉 《小型微型计算机系统》 北大核心 2025年第8期1978-1985,共8页
为解决传统的遥感图像描述算法对图像多尺度信息利用不充分的问题,本文提出了结合多尺度特征增强与记忆引导Transformer的遥感图像描述生成算法(MFE-MGT).首先,利用预训练的视觉特征提取器提取图像特征,并将卷积神经网络中浅层与深层的... 为解决传统的遥感图像描述算法对图像多尺度信息利用不充分的问题,本文提出了结合多尺度特征增强与记忆引导Transformer的遥感图像描述生成算法(MFE-MGT).首先,利用预训练的视觉特征提取器提取图像特征,并将卷积神经网络中浅层与深层的特征进行拼接;其次,通过多尺度特征增强模块获得融合增强后的图像特征,以更好地捕捉多尺度特征;接着,将融合增强后的视觉特征输入记忆引导Transformer的编码器进行编码聚合;最后,通过Transformer记忆解码器生成图像描述.模型采用RSICD数据集进行训练,实验结果表明,MFE-MGT在多个评价指标上的表现均优于当前主流的遥感图像描述生成算法,能够准确的描述图像内容. 展开更多
关键词 多尺度特征增强 深度神经网络 transformer 遥感图像描述
在线阅读 下载PDF
基于CNN-transformer轻量级网络的低光图像增强方法 被引量:2
20
作者 贺晨亚 崔学英 《灯与照明》 2025年第1期118-120,共3页
由于设备、光照、拍摄条件等因素影响,导致拍摄的图片曝光不足出现暗区、褪色区域,影响图像中物体的识别。本文提出了一种轻量级的基于CNN-transformer的低曝光图像增强方法,用于图像的实时增强。该方法借助过曝光图像提供的信息,利用... 由于设备、光照、拍摄条件等因素影响,导致拍摄的图片曝光不足出现暗区、褪色区域,影响图像中物体的识别。本文提出了一种轻量级的基于CNN-transformer的低曝光图像增强方法,用于图像的实时增强。该方法借助过曝光图像提供的信息,利用卷积模块和Transformer模块分别提取低曝光图像与过曝光图像的局部和全局信息,并将提取的局部信息与全局信息融合,以估计像素级的高阶曲线的动态范围来调整给定的低曝光图像,得到增强后的图像,实验结果说明了提出的方法的优越性。该技术可嵌入到手机和数码相机等设备中用于照片曝光强度的校正,满足实时性需求。 展开更多
关键词 transformer 图像增强 轻量级 双分支
在线阅读 下载PDF
上一页 1 2 89 下一页 到第
使用帮助 返回顶部