2,133 articles found
M2ATNet: Multi-Scale Multi-Attention Denoising and Feature Fusion Transformer for Low-Light Image Enhancement
1
Authors: Zhongliang Wei, Jianlong An, Chang Su. 《Computers, Materials & Continua》, 2026, No. 1, pp. 1819-1838 (20 pages)
Images taken in dim environments frequently exhibit issues such as insufficient brightness, noise, color shifts, and loss of detail. These problems pose significant challenges to dark image enhancement tasks. Current approaches, while effective in global illumination modeling, often struggle to simultaneously suppress noise and preserve structural details, especially under heterogeneous lighting. Furthermore, misalignment between luminance and color channels introduces additional challenges to accurate enhancement. In response to these difficulties, we introduce a single-stage framework, M2ATNet, built on multi-scale multi-attention and the Transformer architecture. First, to address texture blurring and residual noise, we design a multi-scale multi-attention denoising module (MMAD), which is applied separately to the luminance and color channels to enhance structural and texture modeling. Second, to solve the misalignment of the luminance and color channels, we introduce the multi-channel feature fusion Transformer (CFFT) module, which recovers dark details and corrects color shifts through cross-channel alignment and deep feature interaction. To guide the model to learn more stably and efficiently, we also fuse multiple types of loss functions into a hybrid loss term. We extensively evaluate the proposed method on various standard datasets, including LOL-v1, LOL-v2, DICM, LIME, and NPE. Evaluation in terms of numerical metrics and visual quality demonstrates that M2ATNet consistently outperforms existing advanced approaches. Ablation studies further confirm the critical roles that the MMAD and CFFT modules play in detail preservation and visual fidelity in challenging illumination-deficient environments.
Keywords: low-light image enhancement; multi-scale; multi-attention; Transformer
Multi-Scale Vision Transformer with Dynamic Multi-Loss Function for Medical Image Retrieval and Classification
2
Authors: Omar Alqahtani, Mohamed Ghouse, Asfia Sabahath, Omer Bin Hussain, Arshiya Begum. 《Computers, Materials & Continua》, 2025, No. 5, pp. 2221-2244 (24 pages)
This paper introduces a novel method for medical image retrieval and classification by integrating a multi-scale encoding mechanism with Vision Transformer (ViT) architectures and a dynamic multi-loss function. The multi-scale encoding significantly enhances the model's ability to capture both fine-grained and global features, while the dynamic loss function adapts during training to optimize classification accuracy and retrieval performance. Our approach was evaluated on the ISIC-2018 and ChestX-ray14 datasets, yielding notable improvements. Specifically, on the ISIC-2018 dataset, our method achieves an F1-Score improvement of +4.84% compared to the standard ViT, with a precision increase of +5.46% for melanoma (MEL). On the ChestX-ray14 dataset, the method delivers an F1-Score improvement of 5.3% over the conventional ViT, with precision gains of +5.0% for pneumonia (PNEU) and +5.4% for fibrosis (FIB). Experimental results demonstrate that our approach outperforms traditional CNN-based models and existing ViT variants, particularly in retrieving relevant medical cases and enhancing diagnostic accuracy. These findings highlight the potential of the proposed method for large-scale medical image analysis, offering improved tools for clinical decision-making through superior classification and case comparison.
Keywords: medical image retrieval; vision transformer; multi-scale encoding; multi-loss function; ISIC-2018; ChestX-ray14
CT-MFENet: Context Transformer and Multi-Scale Feature Extraction Network via Global-Local Features Fusion for Retinal Vessels Segmentation
3
Authors: SHAO Dangguo, YANG Yuanbiao, MA Lei, YI Sanli. 《Journal of Shanghai Jiaotong University (Science)》, 2025, No. 4, pp. 668-682 (15 pages)
Segmentation of the retinal vessels in the fundus is crucial for diagnosing ocular diseases. Retinal vessel images often suffer from category imbalance and large scale variations, which ultimately result in incomplete vessel segmentation and poor continuity. In this study, we propose CT-MFENet to address these issues. First, the context transformer (CT) integrates contextual feature information, which helps establish connections between pixels and solves the problem of incomplete vessel continuity. Second, multi-scale dense residual networks are used instead of a traditional CNN to address inadequate local feature extraction when the model encounters vessels at multiple scales. In the decoding stage, we introduce a local-global fusion module, which enhances the localization of vascular information and reduces the semantic gap between high- and low-level features. To address the class imbalance in retinal images, we propose a hybrid loss function that enhances the model's ability to segment topological structures. We conducted experiments on the publicly available DRIVE, CHASEDB1, STARE, and IOSTAR datasets. The experimental results show that CT-MFENet performs better than most existing methods, including the baseline U-Net.
Keywords: retinal vessel segmentation; context transformer (CT); multi-scale dense residual; hybrid loss function; global-local fusion
A Distracted Driving Recognition Method Combining Spatial Multi-Layer Graph Convolution and a Temporal Segmented Transformer
4
Authors: 葛慧敏, 欧阳宁, 吴沛桐. 《计算机工程与应用》 (PKU Core), 2026, No. 4, pp. 152-167 (16 pages)
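Recognizing distracted driving behavior is an important means of improving driving safety. Current graph-convolution-based skeleton action recognition methods use a single skeleton graph structure and ignore the multiple interaction relationships among joints, and they extract the local and global temporal features of skeleton sequences inadequately. To address these problems, a distracted driving recognition model combining spatial multi-layer graph convolution and a temporal segmented Transformer is proposed. For spatial modeling, a multi-layer graph structure of the driver's joints that contains multiple spatial relationships is built through several indexing schemes, a graph attention mechanism is introduced to dynamically adjust the connection strength of edges in the graph, and intra-layer and inter-layer graph convolutions are used to extract and fuse spatial features. For temporal modeling, the time series is processed in segments, and a Transformer is used to capture local features within each segment and global features across segments. The model was validated on the Drive&Act and DAD datasets, and the results show that it further improves the accuracy of distracted driving recognition compared with existing methods.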
Keywords: intelligent transportation; distracted driving; skeleton-based action recognition; temporal Transformer; spatial multi-layer graph
Hyperspectral Image Classification Fusing Multiple Convolutions and a Dense Transformer
5
Authors: 魏林, 杨霄, 尹玉萍. 《红外技术》 (PKU Core), 2026, No. 2, pp. 193-203 (11 pages)
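Hyperspectral images contain rich spectral and spatial information. How to fully exploit this spatial-spectral information for classification is a key research question. In hyperspectral image classification, convolution excels at extracting local features, while the Transformer can capture long-range feature dependencies and learn global feature information. Drawing on the strengths of both, a hyperspectral image classification method combining three-dimensional convolution, spatial and channel reconstruction convolution, and the Transformer is proposed. First, three-dimensional convolution is applied to the dimension-reduced image patches for joint spatial-spectral feature extraction; next, spatial and channel reconstruction convolution filters out redundant information; finally, a densely connected Transformer builds long-range dependencies over the spatial-spectral features extracted by convolution, and a multilayer perceptron performs the classification. Experiments show that the method achieves overall classification accuracies of 99.51%, 99.85%, and 97.57% on the Pavia University, Salinas, and Botswana datasets, respectively, performing well on all three.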
Keywords: hyperspectral image; feature extraction; three-dimensional convolution; spatial and channel reconstruction convolution; Transformer
A CNN-Transformer Hybrid Algorithm for Water-Area Garbage Detection Based on Improved YOLOv11
6
Authors: 赵建永, 李瑞东, 姚浩, 魏秀蓉. 《无线互联科技》, 2026, No. 4, pp. 21-25, 50 (6 pages)
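Detection of floating objects on river surfaces is limited by complex environmental conditions (such as illumination changes and ripple interference) and by the small scale of the targets, so traditional methods struggle to achieve high-precision detection. This article proposes YOLOv11n-SPT, a single-stage detection model for complex water scenes, which introduces a new Spatial Pyramid Transformer (SPT) module and a channel attention mechanism on top of YOLOv11n. The SPT module adopts a multi-branch spatial pyramid structure that combines retention of high-resolution detail with global modeling over a very large receptive field. On the FloW-Img dataset, YOLOv11n-SPT reaches an mAP@0.5 of 81.2%, 2.9 percentage points higher than the YOLOv11n baseline; ablation experiments show that introducing the SPT module alone raises mAP@0.5 by 2.0% and recall by 2.1%, and that adding channel attention further raises precision to 85.4%. YOLOv11n-SPT shows stronger robustness and localization accuracy for tiny targets and under strong interference, providing an efficient and reliable perception solution for practical water-environment management tasks such as unmanned debris-cleaning boats and UAV river patrols.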
Keywords: water-surface floating object detection; Spatial Pyramid Transformer; YOLOv11
MewCDNet: A Wavelet-Based Multi-Scale Interaction Network for Efficient Remote Sensing Building Change Detection
7
Authors: Jia Liu, Hao Chen, Hang Gu, Yushan Pan, Haoran Chen, Erlin Tian, Min Huang, Zuhe Li. 《Computers, Materials & Continua》, 2026, No. 1, pp. 687-710 (24 pages)
Accurate and efficient detection of building changes in remote sensing imagery is crucial for urban planning, disaster emergency response, and resource management. However, existing methods face challenges such as spectral similarity between buildings and backgrounds, sensor variations, and insufficient computational efficiency. To address these challenges, this paper proposes a novel Multi-scale Efficient Wavelet-based Change Detection Network (MewCDNet), which integrates the advantages of Convolutional Neural Networks and Transformers, balances computational costs, and achieves high-performance building change detection. The network employs EfficientNet-B4 as the backbone for hierarchical feature extraction, integrates multi-level feature maps through a multi-scale fusion strategy, and incorporates two key modules: Cross-temporal Difference Detection (CTDD) and Cross-scale Wavelet Refinement (CSWR). CTDD adopts a dual-branch architecture that combines pixel-wise differencing with semantic-aware Euclidean distance weighting to enhance the distinction between true changes and background noise. CSWR integrates the Haar-based Discrete Wavelet Transform with multi-head cross-attention mechanisms, enabling cross-scale feature fusion while significantly improving edge localization and suppressing spurious changes. Extensive experiments on four benchmark datasets demonstrate MewCDNet's superiority over comparison methods: it achieves F1 scores of 91.54% on LEVIR, 93.70% on WHUCD, and 64.96% on S2Looking for building change detection. Furthermore, MewCDNet exhibits the best performance on the multi-class SYSU dataset (F1: 82.71%), highlighting its generalization capability.
Keywords: remote sensing; change detection; deep learning; wavelet transform; multi-scale
Multi-scale analysis of spatiotemporal evolution and driving factors of eco-environmental quality in a Ningxia irrigation district,China
8
Authors: LI Zequan, CHAI Mingtang, ZHU Lei, HE Junjie, DING Yimin, XU Fengkun, XU Xiyuan. 《Journal of Geographical Sciences》, 2026, No. 2, pp. 471-493 (23 pages)
The Qingtongxia Irrigation District in Ningxia is an important hydrological and ecological region. To assess its ecological environment quality from 2001 to 2021 across multiple scales and identify driving factors, a modified remote sensing ecological index (MRSEI) was developed by incorporating evapotranspiration. Spatial and temporal patterns were analyzed using the coefficient of variation, spatial autocorrelation, and semi-variogram methods, while influencing factors were explored via the optimal parameter geographical detector model. The MRSEI's first principal component loadings and rankings aligned with those of RSEI (average contribution: 81.31%), effectively reflecting spatiotemporal variations. At sub-irrigation district and landscape scales, ecological quality was slightly lower than at the district level but remained stable. Moderate and good ecological grades accounted for 36.28% and 33.38% of the area, respectively, at the district scale, and the moderate grade reached 70.48% on smaller scales. Spatial heterogeneity intensified with decreasing scale, and human activity lost explanatory power below a 5 km range. Human factors mainly drove ecological differentiation at the district scale, while natural factors dominated at finer scales. The MRSEI offers a novel tool for ecological assessment in arid/semi-arid areas and supports scale-adapted ecological protection strategies.
Keywords: ecological environment quality; multi-scales; remote sensing ecological index; spatial heterogeneity; semi-variance function
Multi-Scale Transformer Face Forgery Detection Fusing Temporal and Spatial Features
9
Authors: 杜利莎, 杨高明. 《兰州工业学院学报》, 2026, No. 1, pp. 15-20 (6 pages)
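To address the problems that current face forgery detection cannot fully extract temporal features and has low detection efficiency, a multi-scale face forgery detection method that fuses temporal and spatial features, MST-ViT, is proposed. MST-ViT uses a two-stream structure to extract multi-scale features containing both global and detail information, an inter-frame difference capture module to strengthen the extraction of temporal artifacts, and a spatiotemporal Transformer to extract temporal and spatial features. Experimental results show that the proposed model improves in-dataset AUC on FF++ by 1.71% and improves AUC by 2.06% in the challenging cross-dataset experiment on DFDC.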
Keywords: face forgery detection; spatial features; temporal features; multi-scale features; Transformer
Multi-Scale Fusion Network Using Time-Division Fourier Transform for Rolling Bearing Fault Diagnosis
10
Authors: Ronghua Wang, Shibao Sun, Pengcheng Zhao, Xianglan Yang, Xingjia Wei, Changyang Hu. 《Computers, Materials & Continua》, 2025, No. 8, pp. 3519-3539 (21 pages)
The capacity to diagnose faults in rolling bearings is of significant practical importance for ensuring the normal operation of equipment. Frequency-domain features can effectively enhance the identification of fault modes. However, existing methods often suffer from insufficient frequency-domain representation in practical applications, which greatly affects diagnostic performance. Therefore, this paper proposes a rolling bearing fault diagnosis method based on a Multi-Scale Fusion Network (MSFN) using the Time-Division Fourier Transform (TDFT). The method constructs multi-scale channels to extract time-domain and frequency-domain features of the signal in parallel. A multi-level, multi-scale filter-based approach is designed to extract frequency-domain features in a segmented manner. A cross-attention mechanism is introduced to facilitate the fusion of the extracted time-frequency domain features. The performance of the proposed method is validated using the CWRU and Ottawa datasets. The results show that the average accuracy of MSFN under complex noisy signals is 97.75% and 94.41%, and the average accuracy under variable load conditions is 98.68%. This demonstrates its significant application potential compared to existing methods.
Keywords: rolling bearing fault diagnosis; time-division Fourier transform; cross-attention; multi-scale feature fusion
CATrans: A Cross-Scale Attention Transformer Framework for Land-Cover Semantic Segmentation of High-Resolution Remote Sensing Imagery (Cited by: 1)
11
Authors: 陈丽佳, 陈宏辉, 谢艳秋, 何天友, 叶菁, 吴林煌. 《地球信息科学学报》 (PKU Core), 2025, No. 7, pp. 1624-1637 (14 pages)
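[Objective] Semantic segmentation of high-resolution remote sensing imagery accurately extracts ground-object information and provides important data support for urban planning and land-use analysis. Current segmentation methods usually divide a remote sensing image into standard patches and perform multi-scale local segmentation and hierarchical inference, without fully considering the contextual prior knowledge in the image or the interaction among local features, which degrades segmentation quality. [Methods] To solve this problem, this paper proposes CATrans (Cross-scale Attention Transformer), a remote sensing image segmentation framework that fuses a cross-scale attention module with a semantic vision Transformer to extract contextual prior knowledge and strengthen local feature representation and segmentation performance. First, the cross-scale attention module processes features in parallel along the spatial and channel dimensions and analyzes the dependencies between shallow and deep features and between local and global features, improving attention to objects of different granularity in remote sensing images. Second, the semantic vision Transformer captures contextual semantic information through a spatial attention mechanism and models the dependencies among semantic information. [Results] Comparative experiments on the DeepGlobe, Inria Aerial, and LoveDA datasets show that CATrans outperforms existing segmentation algorithms such as WSDNet (Discrete Wavelet Smooth Network) and ISDNet (Integrating Shallow and Deep Network), achieving mean Intersection over Union (mIoU) of 76.2%, 79.2%, and 54.2% and mean F1 score (mF1) of 86.5%, 87.8%, and 66.8%, with inference speeds of 38.1 FPS, 13.2 FPS, and 95.22 FPS, respectively. Compared with WSDNet, the best of the compared methods, mIoU and mF1 improve by 2.1%, 4.0%, and 5.3% and by 1.3%, 1.8%, and 5.6% on the three datasets, respectively, with a clear advantage in segmenting every land-cover class. [Conclusions] The method achieves efficient, high-accuracy semantic segmentation of high-resolution remote sensing imagery.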
Keywords: high resolution; semantic segmentation; cross-scale attention; vision Transformer; contextual prior; spatial attention; semantic information
Fundamental Theories of Spatial Similarity Relations in Multi-scale Map Spaces (Cited by: 22)
12
Author: YAN Haowen. 《Chinese Geographical Science》 (SCIE, CSCD), 2010, No. 1, pp. 18-22 (5 pages)
Similarity relation is one of the spatial relations studied in the community of geographic information science and cartography. It is widely used in the retrieval of spatial databases, the recognition of spatial objects from images, and the description of spatial features on maps. However, few results have been achieved on it so far. In this paper, the spatial similarity relation is put forward with the introduction of automated map generalization in the construction of multi-scale map databases; the definition of spatial similarity relations is then presented based on set theory, the concept of spatial similarity degree is given, and the characteristics of spatial similarity are discussed in detail, including reflexivity, symmetry, non-transitivity, self-similarity in multi-scale spaces, and scale-dependence. Finally, a classification system for spatial similarity relations in multi-scale map spaces is addressed. This research may be useful for automated map generalization, spatial similarity retrieval, and spatial reasoning.
Keywords: similarity relation; spatial relation; multi-scale map spaces
Multi-scale analysis of the spatial structure of China's major function zoning (Cited by: 10)
13
Authors: WANG Yafei, FAN Jie. 《Journal of Geographical Sciences》 (SCIE, CSCD), 2020, No. 2, pp. 197-211 (15 pages)
The spatial structures of China's Major Function Zoning are important constraining indicators in all types of spatial planning and key parameters for accurately downscaling major functions. Taking the proportion of urbanization zones, agricultural development zones, and ecological security zones as the basic parameter, this paper explores the spatial structures of major function zoning at different scales using spatial statistics, spatial modeling, and landscape metrics methods. The results show the following. First, major function zones have spatial gradient structures, which are prominently represented by latitudinal and longitudinal gradients, a coastal distance gradient, and an eastern-central-western gradient. Second, the pole-axis system structure and core-periphery structure exist at provincial scales. The general principle of the pole-axis structure is that as one moves along the distance axis, the proportion of urbanization zones decreases and the proportion of ecological security zones increases. This also means that the proportion of different function zones follows a ring-shaped spatial differentiation principle with distance from the core. Third, there is a spatial mosaic structure at the city and county scale. This spatial mosaic structure has features of spatial heterogeneity, such as agglomeration and dispersion, as well as of mutual, adjacent topological correlation and spatial proximity. The results of this study contribute to scientific knowledge on major function zones and the principles of spatial organization, and they serve as an important reference for China's integrated geographical zoning.
Keywords: China major function zoning; multi-scale; spatial gradient; pole-axis; core-periphery; spatial mosaic
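Hyperspectral Image Classification with Joint Spatial-Spectral Features Supported by Three-Dimensional Convolution and Transformer (Cited by: 1)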
14
Authors: 何光, 吴田军. 《计算机工程与应用》 (PKU Core), 2025, No. 2, pp. 259-272 (14 pages)
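Because CNNs are strong at local feature extraction, they remain the mainstream deep models for hyperspectral image processing and analysis; however, their receptive field is limited, so they cannot build long-range dependencies and are restricted in learning global semantic information. The Transformer's self-attention mechanism computes attention for every position in the input sequence and can therefore capture global context effectively. How to couple CNN and Transformer techniques and make full use of spatial and spectral information for hyperspectral remote sensing image classification is an important open question. To this end, a new hyperspectral remote sensing image classification method based on three-dimensional convolution and the Transformer is proposed, which attempts to improve interpretation by jointly using spatial and spectral features. Principal component analysis reduces the dimensionality of the hyperspectral image along the vertical direction; a non-negative matrix factorization algorithm then extracts spatial features from the reduced image along the horizontal direction, and the outputs of the two tools are concatenated to retain as much information as possible; three-dimensional convolution kernels then jointly extract spatial and spectral features from the concatenated image; finally, the Transformer's attention mechanism builds long-range dependencies over the resulting sequence, and a multilayer perceptron completes the classification. Experiments show that the proposed method outperforms the compared methods on the WHU-Hi LongKou, HanChuan, and HongHu datasets and on the Matiwan Village dataset in Xiong'an New Area, indicating a degree of generalization and robustness.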
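Keywords: non-negative matrix factorization; feature fusion; three-dimensional convolution; joint spatial-spectral; Transformer; hyperspectral remote sensing image classification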
Formula for calculating spatial similarity degrees between point clouds on multi-scale maps taking map scale change as the only independent variable (Cited by: 6)
15
Authors: Yang Weifang, Yan Haowen, Li Jonathan. 《Geodesy and Geodynamics》, 2015, No. 2, pp. 113-125 (13 pages)
The degree of spatial similarity plays an important role in map generalization, yet there has been no quantitative research into it. To fill this gap, this study first defines map scale change and spatial similarity degree/relation in multi-scale map spaces and then proposes a model for calculating the degree of spatial similarity between a point cloud at one scale and its generalized counterpart at another scale. After validation, the new model features 16 points with map scale change as the x coordinate and the degree of spatial similarity as the y coordinate. Finally, using an application for curve fitting, the model achieves an empirical formula that can calculate the degree of spatial similarity using map scale change as the sole independent variable, and vice versa. This formula can be used to automate algorithms for point feature generalization and to determine when to terminate them during the generalization.
Keywords: spatial similarity degree; map generalization; map scale change; point clouds; quantitative description; spatial similarity relations; multi-scale map spaces; curve fitting method
Denoising of seismic data via multi-scale ridgelet transform (Cited by: 4)
16
Authors: Henglei Zhang, Tianyou Liu, Yuncui Zhang. 《Earthquake Science》 (CSCD), 2009, No. 5, pp. 493-498 (6 pages)
Noise has traditionally been suppressed or eliminated in seismic data sets by the use of Fourier filters and, to a lesser degree, nonlinear statistical filters. Although these methods are quite useful under specific conditions, they may produce undesirable effects for data with a low signal-to-noise ratio. In this paper, a new method, the multi-scale ridgelet transform, is applied in the light of the theory of the ridgelet transform. We employ the wavelet transform to perform sub-band decomposition of the signals and then apply non-linear thresholding in the ridgelet domain to every block. In other words, the method is based on the idea of partitioning: at a sufficiently fine scale, a curved singularity looks straight, so the ridgelet transform can work well in such cases. Applications to both synthetic data and actual seismic data from the Sichuan basin, South China, show that the new method eliminates the noise portion of the signal more efficiently and retains a greater amount of geologic data than other methods; the quality and continuity of seismic events and the quality of the section are also clearly improved.
Keywords: ridgelet transform; multi-scale; random noise; sub-band decomposition; complex Morlet wavelet
Coupling the Power of YOLOv9 with Transformer for Small Object Detection in Remote-Sensing Images (Cited by: 1)
17
Author: Mohammad Barr. 《Computer Modeling in Engineering & Sciences》, 2025, No. 4, pp. 593-616 (24 pages)
Recent years have seen a surge in interest in object detection on remote sensing images for applications such as surveillance and management. However, challenges like small object detection, scale variation, and the presence of closely packed objects in these images hinder accurate detection. Additionally, the motion blur effect further complicates the identification of such objects. To address these issues, we propose an enhanced YOLOv9 with a transformer head (YOLOv9-TH). The model introduces an additional prediction head for detecting objects of varying sizes and swaps the original prediction heads for transformer heads to leverage self-attention mechanisms. We further improve YOLOv9-TH using several strategies, including data augmentation, multi-scale testing, multi-model integration, and the introduction of an additional classifier. The cross-stage partial (CSP) method and the ghost convolution hierarchical graph (GCHG) are combined to improve detection accuracy by better utilizing feature maps, widening the receptive field, and precisely extracting multi-scale objects. Additionally, we incorporate the E-SimAM attention mechanism to address low-resolution feature loss. Extensive experiments on the VisDrone2021 and DIOR datasets demonstrate the effectiveness of YOLOv9-TH, showing good improvement in mAP compared to the best existing methods. YOLOv9-TH-e achieved 54.2% mAP50 on the VisDrone2021 dataset and 92.3% mAP on the DIOR dataset. The results confirm the model's robustness and suitability for real-world applications, particularly for small object detection in remote sensing images.
Keywords: remote sensing images; YOLOv9-TH; multi-scale object detection; transformer heads; VisDrone2021 dataset
A Quality Assessment Method for Smartphone-Captured Images Based on Swin-AK Transformer
18
Authors: 侯国鹏, 董武, 陆利坤, 周子镱, 马倩, 柏振, 郑晟辉. 《光电工程》 (PKU Core), 2025, No. 1, pp. 116-130 (15 pages)
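This paper proposes a quality assessment method for smartphone-captured images that combines a Swin-AK Transformer (Swin Transformer based on alterable kernel convolution) with handcrafted features through dual cross-attention fusion. First, handcrafted features that affect image quality are extracted; these features capture subtle visual changes in an image. Second, the Swin-AK Transformer is proposed, which strengthens the model's ability to extract and process local information. In addition, a dual cross-attention fusion module is designed that combines spatial attention and channel attention to fuse the handcrafted and deep features, yielding more accurate image quality prediction. Experimental results show that on the SPAQ and LIVE-C datasets the Pearson linear correlation coefficient reaches 0.932 and 0.885, and the Spearman rank-order correlation coefficient reaches 0.929 and 0.858, respectively. These results demonstrate that the proposed method can effectively predict the quality of smartphone-captured images.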
Keywords: image quality assessment; smartphone-captured images; Swin Transformer; handcrafted features; spatial attention; channel attention
Sub-Regional Infrared-Visible Image Fusion Using Multi-Scale Transformation (Cited by: 2)
19
Authors: Yexin Liu, Ben Xu, Mengmeng Zhang, Wei Li, Ran Tao. 《Journal of Beijing Institute of Technology》 (EI, CAS), 2022, No. 6, pp. 535-550 (16 pages)
Infrared-visible image fusion plays an important role in multi-source data fusion, with the advantage of integrating useful information from multi-source sensors. However, there are still challenges in target enhancement and visual improvement. To deal with these problems, a sub-regional infrared-visible image fusion method (SRF) is proposed. First, morphology and threshold segmentation are applied to extract targets of interest in infrared images. Second, the infrared background is reconstructed based on the extracted targets and the visible image. Finally, target and background regions are fused using a multi-scale transform. Experimental results are obtained using public data for comparison and evaluation, which demonstrate that the proposed SRF has potential benefits over other methods.
Keywords: image fusion; infrared image; visible image; multi-scale transform
3D Human Pose Estimation Based on Transformer and Graph Convolutional Network
20
Authors: 王宇晶, 包明明, 刘星. 《传感技术学报》 (PKU Core), 2025, No. 9, pp. 1624-1630 (7 pages)
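A network model combining a Transformer with a graph network is proposed for 3D human pose estimation from video images captured by vision sensors. The Transformer effectively extracts highly correlated spatiotemporal features from 2D key joints, while the graph network perceives fine-grained correlation features; fusing the two network structures improves the accuracy of 3D pose estimation. Simulation experiments on the public Human3.6M dataset verify the performance of the fused Transformer and graph convolution algorithm. The results show that the Mean Per Joint Position Error (MPJPE) of the estimated 3D human joints is 38.4 mm, a moderate improvement over existing methods, indicating that the method has strong application value and can be applied to many related downstream tasks.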
Keywords: 3D human pose estimation; Transformer; graph convolution; spatiotemporal correlation