期刊文献+
共找到1,398篇文章
< 1 2 70 >
每页显示 20 50 100
Spectral matching algorithm based on nonsubsampled contourlet transform and scale-invariant feature transform 被引量:4
1
作者 Dong Liang Pu Yan +2 位作者 Ming Zhu Yizheng Fan Kui Wang 《Journal of Systems Engineering and Electronics》 SCIE EI CSCD 2012年第3期453-459,共7页
A new spectral matching algorithm is proposed by us- ing nonsubsampled contourlet transform and scale-invariant fea- ture transform. The nonsubsampled contourlet transform is used to decompose an image into a low freq... A new spectral matching algorithm is proposed by us- ing nonsubsampled contourlet transform and scale-invariant fea- ture transform. The nonsubsampled contourlet transform is used to decompose an image into a low frequency image and several high frequency images, and the scale-invariant feature transform is employed to extract feature points from the low frequency im- age. A proximity matrix is constructed for the feature points of two related images. By singular value decomposition of the proximity matrix, a matching matrix (or matching result) reflecting the match- ing degree among feature points is obtained. Experimental results indicate that the proposed algorithm can reduce time complexity and possess a higher accuracy. 展开更多
关键词 point pattern matching nonsubsampled contourlet transform scale-invariant feature transform spectral algorithm.
在线阅读 下载PDF
Active Shape Models Using Scale Invariant Feature Transform
2
作者 史勇红 戚飞虎 +1 位作者 栾红霞 吴国荣 《Journal of Shanghai Jiaotong university(Science)》 EI 2007年第6期713-718,共6页
A new active shape models (ASMs) was presented, which is driven by scale invariant feature transform (SIFT) local descriptor instead of normalizing first order derivative profiles in the original formulation, to segme... A new active shape models (ASMs) was presented, which is driven by scale invariant feature transform (SIFT) local descriptor instead of normalizing first order derivative profiles in the original formulation, to segment lung fields from chest radiographs. The modified SIFT local descriptor, more distinctive than the general intensity and gradient features, is used to characterize the image features in the vicinity of each pixel at each resolution level during the segmentation optimization procedure. Experimental results show that the proposed method is more robust and accurate than the original ASMs in terms of an average overlap percentage and average contour distance in segmenting the lung fields from an available public database. 展开更多
关键词 active shape model (ASM) deformable segmentation CHEST RADIOGRAPH scale invariant feature transform (SIFT) local DESCRIPTOR
在线阅读 下载PDF
Robust Wide Baseline Point Matching Based on Scale Invariant Feature Descriptor 被引量:6
3
作者 岳思聪 王庆 赵荣椿 《Chinese Journal of Aeronautics》 SCIE EI CAS CSCD 2009年第1期70-74,共5页
In order to obtain a large number of correct matches with high accuracy,this article proposes a robust wide baseline point matching method,which is based on Scott s proximity matrix and uses the scale invariant featur... In order to obtain a large number of correct matches with high accuracy,this article proposes a robust wide baseline point matching method,which is based on Scott s proximity matrix and uses the scale invariant feature transform (SIFT). First,the distance between SIFT features is included in the equations of the proximity matrix to measure the similarity between two feature points; then the normalized cross correlation (NCC) used in Scott s method,which has been modified with adaptive scale and orientation,... 展开更多
关键词 computer vision image analysis image match scale invariant feature descriptor
原文传递
Target classification using SIFT sequence scale invariants 被引量:5
4
作者 Xufeng Zhu Caiwen Ma +1 位作者 Bo Liu Xiaoqian Cao 《Journal of Systems Engineering and Electronics》 SCIE EI CSCD 2012年第5期633-639,共7页
On the basis of scale invariant feature transform(SIFT) descriptors,a novel kind of local invariants based on SIFT sequence scale(SIFT-SS) is proposed and applied to target classification.First of all,the merits o... On the basis of scale invariant feature transform(SIFT) descriptors,a novel kind of local invariants based on SIFT sequence scale(SIFT-SS) is proposed and applied to target classification.First of all,the merits of using an SIFT algorithm for target classification are discussed.Secondly,the scales of SIFT descriptors are sorted by descending as SIFT-SS,which is sent to a support vector machine(SVM) with radial based function(RBF) kernel in order to train SVM classifier,which will be used for achieving target classification.Experimental results indicate that the SIFT-SS algorithm is efficient for target classification and can obtain a higher recognition rate than affine moment invariants(AMI) and multi-scale auto-convolution(MSA) in some complex situations,such as the situation with the existence of noises and occlusions.Moreover,the computational time of SIFT-SS is shorter than MSA and longer than AMI. 展开更多
关键词 target classification scale invariant feature transform descriptors sequence scale support vector machine
在线阅读 下载PDF
TransSSA: Invariant Cue Perceptual Feature Focused Learning for Dynamic Fruit Target Detection
5
作者 Jianyin Tang Zhenglin Yu Changshun Shao 《Computers, Materials & Continua》 2025年第5期2829-2850,共22页
In the field of automated fruit harvesting,precise and efficient fruit target recognition and localization play a pivotal role in enhancing the efficiency of harvesting robots.However,this domain faces two core challe... In the field of automated fruit harvesting,precise and efficient fruit target recognition and localization play a pivotal role in enhancing the efficiency of harvesting robots.However,this domain faces two core challenges:firstly,the dynamic nature of the automatic picking process requires fruit target detection algorithms to adapt to multi-view characteristics,ensuring effective recognition of the same fruit from different perspectives.Secondly,fruits in natural environments often suffer from interference factors such as overlapping,occlusion,and illumination fluctuations,which increase the difficulty of image capture and recognition.To address these challenges,this study conducted an in-depth analysis of the key features in fruit recognition and discovered that the stem,body,and base serve as constant and core information in fruit identification,exhibiting long-term dependent semantic relationships during the recognition process.These invariant features provide a stable foundation for dynamic fruit recognition,contributing to improved recognition accuracy and robustness.Specifically,the morphology and position of the stem,body,and base are relatively fixed,and the effective extraction of these features plays a crucial role in fruit recognition.This paper proposes a novel model,TransSSA,and designs two innovative modules to effectively extract fruit image features.The Self-Attention Core Feature Extraction(SAF)module integrates YOLOV8 and Swin Transformer as backbone networks and introduces the Shuffle Attention self-attention mechanism,significantly enhancing the ability to extract core features.This module focuses on constant features such as the stem,body,and base,ensuring accurate fruit recognition in different environments.On the other hand,the Squeeze and Excitation Aggregation(SAE)module combines the network’s ability to capture channel patterns with global knowledge,further optimizing the extraction of effective features.Additionally,to improve detection accuracy,this studymodifies the regression loss function to EIOU.To validate the effectiveness of the TransSSA model,this study conducted extensive visualization analysis to support the interpretability of the SAF and SAE modules.Experimental results demonstrate that TransSSA achieves a performance of 91.3%on a tomato dataset,fully proving its innovative capabilities.Through this research,we provide amore effective solution for using fruit harvesting robots in complex environments. 展开更多
关键词 Fruit recognition invariant features TransSSA model swin transformer self-attention mechanism
在线阅读 下载PDF
Feature Extraction by Multi-Scale Principal Component Analysis and Classification in Spectral Domain 被引量:2
6
作者 Shengkun Xie Anna T. Lawnizak +1 位作者 Pietro Lio Sridhar Krishnan 《Engineering(科研)》 2013年第10期268-271,共4页
Feature extraction of signals plays an important role in classification problems because of data dimension reduction property and potential improvement of a classification accuracy rate. Principal component analysis (... Feature extraction of signals plays an important role in classification problems because of data dimension reduction property and potential improvement of a classification accuracy rate. Principal component analysis (PCA), wavelets transform or Fourier transform methods are often used for feature extraction. In this paper, we propose a multi-scale PCA, which combines discrete wavelet transform, and PCA for feature extraction of signals in both the spatial and temporal domains. Our study shows that the multi-scale PCA combined with the proposed new classification methods leads to high classification accuracy for the considered signals. 展开更多
关键词 MULTI-scale Principal Component Analysis Discrete WAVELET transform feature Extraction Signal CLASsifiCATION Empirical CLASsifiCATION
在线阅读 下载PDF
多尺度特征提取的Transformer短期风电功率预测 被引量:5
7
作者 徐武 范鑫豪 +1 位作者 沈智方 刘洋 《太阳能学报》 北大核心 2025年第2期640-648,共9页
针对短期风电功率预测特征提取尺度单一问题,设计一种基于多尺度特征提取的Transformer短期风电功率预测模型(MTPNet)。首先,在Transformer构架的基础上,利用维数不变嵌入,设计多尺度特征提取网络挖掘风电功率序列本身时序特征,保证了... 针对短期风电功率预测特征提取尺度单一问题,设计一种基于多尺度特征提取的Transformer短期风电功率预测模型(MTPNet)。首先,在Transformer构架的基础上,利用维数不变嵌入,设计多尺度特征提取网络挖掘风电功率序列本身时序特征,保证了特征提取时维数不被破坏;其次,利用融合自注意力机制的长短期记忆网络挖掘气象条件与功率之间的全局依赖关系;最后,融合风电功率序列本身时序特征和气象条件依赖关系,实现短期风电功率预测。实例仿真结果表明,MTPNet模型预测精度得到提升;消融实验证明了模型各模块的可靠性和有效性,具有一定的实用价值。 展开更多
关键词 风电功率预测 transformER 注意力机制 特征提取 长短期记忆网络 维数不变嵌入层
原文传递
基于时序二维变换和多尺度Transformer的电能质量扰动分类方法 被引量:2
8
作者 王守相 李慧强 +3 位作者 赵倩宇 郭陆阳 王同勋 王洋 《电力系统自动化》 北大核心 2025年第7期198-207,共10页
随着新能源渗透率的不断提高,电网面临的电能质量扰动(PQD)问题变得更加复杂,基于一维PQD信号的传统分类方法难以同时提取并辨识周期性与趋势性扰动。针对此问题,提出了一种基于时序二维变换和多尺度Transformer的PQD分类方法。首先,利... 随着新能源渗透率的不断提高,电网面临的电能质量扰动(PQD)问题变得更加复杂,基于一维PQD信号的传统分类方法难以同时提取并辨识周期性与趋势性扰动。针对此问题,提出了一种基于时序二维变换和多尺度Transformer的PQD分类方法。首先,利用时序二维变换将一维PQD时间序列转换为一组基于多个周期的二维张量,以实现在二维空间中深入挖掘PQD信号中所包含的特征信息。然后,通过多尺度Transformer编码器模块提取PQD信号的多尺度特征图,利用多尺度Transformer解码器模块对多尺度特征图进行拼接和特征融合,有效合并在不同尺度上提取的特征图。最后,通过全连接层和Softmax分类器完成PQD分类任务。为验证所提方法的有效性,建立了含24种PQD的数据集对模型进行测试,结果表明所提方法对PQD信号具有较高的分类准确率和噪声鲁棒性。 展开更多
关键词 电能质量 扰动 分类 时序二维变换 多尺度transformer 特征提取 特征融合
在线阅读 下载PDF
多维度聚合Transformer的图像超分辨率重建 被引量:1
9
作者 陈清江 陈鹏民 《光学精密工程》 北大核心 2025年第12期1955-1970,共16页
针对现有基于Transformer的图像超分辨率网络中感受野尺度单一以及未充分挖掘额外维度信息等问题,本文提出了一种多维度聚合Transformer网络。首先,通过构建多尺度交互调制模块,从低分辨率图像中提取多尺度特征,以增强信息流的丰富性。... 针对现有基于Transformer的图像超分辨率网络中感受野尺度单一以及未充分挖掘额外维度信息等问题,本文提出了一种多维度聚合Transformer网络。首先,通过构建多尺度交互调制模块,从低分辨率图像中提取多尺度特征,以增强信息流的丰富性。其次,设计了空间-通道交互模块,并将其集成于Transformer层中,利用四种形式的注意力机制充分提取关键特征并实现特征融合,从而提升模型性能。最后,提出了特征重用Transformer模块,深入挖掘各层特征之间的关联,精准提取并高效重用重要特征,进一步加强模型表现。实验结果表明,在五个基准测试集上,所提方法优于其他先进算法。在不同放大倍数的超分辨率任务中,相较于基于Swin Transformer的图像恢复方法,峰值信噪比和结构相似度分别平均提升了约0.26 dB和0.0024,且重建效果更加清晰。该方法有效克服了现有方法的不足,在超分辨率任务中展现出显著的性能提升和应用潜力。 展开更多
关键词 图像超分辨率 transformER 注意力机制 特征交互 特征重用 多尺度
在线阅读 下载PDF
基于Transformer和门控融合机制的图像去雾算法 被引量:1
10
作者 王燕 陈燕燕 +1 位作者 刘晶晶 胡津源 《计算机系统应用》 2025年第2期1-10,共10页
针对现有的图像去雾算法仍然存在去雾不彻底、去雾后的图像边缘模糊、细节信息丢失等问题,本文提出了一种基于Transformer和门控融合机制的图像去雾算法.通过改进的通道自注意力机制提取图像的全局特征,提高模型处理图像的效率,设计多... 针对现有的图像去雾算法仍然存在去雾不彻底、去雾后的图像边缘模糊、细节信息丢失等问题,本文提出了一种基于Transformer和门控融合机制的图像去雾算法.通过改进的通道自注意力机制提取图像的全局特征,提高模型处理图像的效率,设计多尺度门控融合块捕获不同尺度的特征,门控融合机制通过动态调整权重,提高模型对不同雾化程度的适应能力,同时更好地保留图像边缘及细节信息,并使用残差连接增强特征的重用性,提高模型泛化能力.经实验验证,所提出的去雾算法可以有效恢复真实有雾图像中的内容信息,在合成的有雾图像数据集SOTS上的峰值信噪比达到了34.841 dB,结构相似性达到了0.984,去雾后的图像内容信息完整且没有出现细节信息模糊和去雾不彻底等现象. 展开更多
关键词 图像去雾 transformER 自注意力机制 门控融合机制 多尺度特征融合
在线阅读 下载PDF
Spatiotemporal features of farmland scaling and the mechanisms that underlie these changes within the Three Gorges Reservoir Area 被引量:10
11
作者 LIANG Xinyuan LI Yangbing 《Journal of Geographical Sciences》 SCIE CSCD 2019年第4期563-580,共18页
Discussions regarding the functional transformation of agricultural utilization and the mechanisms that underlie these changes within the Three Gorges Reservoir Area(TGRA)reflect variati ons in the relati on ship betw... Discussions regarding the functional transformation of agricultural utilization and the mechanisms that underlie these changes within the Three Gorges Reservoir Area(TGRA)reflect variati ons in the relati on ship betwee n people and their environme nt in China's central and wester ns part,an area of mountains and reservoirs.A clear understa nding of these changes also provides the scientific basis for the development of multi-functional agriculture in typical mountainous areas.Five counties were selected for analysis in this study from the hinterland of the TGRA;we analyzed changes in farmland scaling and corresponding under?lying mechanisms by defining the concepts of“Scaling Farmland”(SF)and by using the software packages ArcGIS10.2,SPSS,and Geographical Detectors.The results of this analysis show that sources of increased SF have mainly comprised cultivated and shrub land.In deed,with the excepti on of some alpine off-season vegetables,SF growth has mainly occurred in low altitude areas and in places where the slope is less than 30°.We also show that spatial changes in various SF types have also been substantially different,but in all cases are closely related to road and township administrative centers.Natural factors at the patch level,including elevation and slope,have contributed significantly to SF,while at the township level,underlying socioeconomic and humanistic factors have tended to include road traffic and agricultural population density.In contrast,at the region al level,underlying driving forces within each have tended to be more significant than overall study area scale.We show that while changes in,and the development of,SF have been driven by numerous factors,agri?cultural policies have always been amongst the most important.The results clearly elucidate general land use transformation patter ns within the mountain regi ons of western China. 展开更多
关键词 Three Gorges Reservoir Area functional transformation of agricultural land SCALING FARMLAND SPATIOTEMPORAL features UNDERLYING driving FORCES
原文传递
Multi-source Remote Sensing Image Registration Based on Contourlet Transform and Multiple Feature Fusion 被引量:6
12
作者 Huan Liu Gen-Fu Xiao +1 位作者 Yun-Lan Tan Chun-Juan Ouyang 《International Journal of Automation and computing》 EI CSCD 2019年第5期575-588,共14页
Image registration is an indispensable component in multi-source remote sensing image processing. In this paper, we put forward a remote sensing image registration method by including an improved multi-scale and multi... Image registration is an indispensable component in multi-source remote sensing image processing. In this paper, we put forward a remote sensing image registration method by including an improved multi-scale and multi-direction Harris algorithm and a novel compound feature. Multi-scale circle Gaussian combined invariant moments and multi-direction gray level co-occurrence matrix are extracted as features for image matching. The proposed algorithm is evaluated on numerous multi-source remote sensor images with noise and illumination changes. Extensive experimental studies prove that our proposed method is capable of receiving stable and even distribution of key points as well as obtaining robust and accurate correspondence matches. It is a promising scheme in multi-source remote sensing image registration. 展开更多
关键词 feature fusion multi-scale circle Gaussian combined invariant MOMENT multi-direction GRAY level CO-OCCURRENCE matrix MULTI-SOURCE remote sensing image registration CONTOURLET transform
原文传递
多尺度时频协同Transformer驱动的航空发动机故障诊断方法
13
作者 连帅 《电子测量技术》 北大核心 2025年第20期90-102,共13页
航空发动机作为飞行器的核心动力部件,其运行可靠性直接关系到飞行安全与运行效率,轴间轴承的故障诊断是保障其稳定工作的关键环节。本文针对航空发动机轴间轴承故障诊断问题展开研究,归纳总结现有1DCNN网络与1D-Transformer方法的局限... 航空发动机作为飞行器的核心动力部件,其运行可靠性直接关系到飞行安全与运行效率,轴间轴承的故障诊断是保障其稳定工作的关键环节。本文针对航空发动机轴间轴承故障诊断问题展开研究,归纳总结现有1DCNN网络与1D-Transformer方法的局限性:自注意力机制易受原始振动信号中高频噪声与冗余信息干扰,关键故障特征聚焦能力不足;纯Transformer架构对局部细微特征的捕捉能力较弱。为此,提出多尺度时频协同Transformer驱动的故障诊断方法,通过融合多尺度时频特征提取与Transformer全局建模能力,实现对振动信号局部细微特征与全局关联特征的协同捕捉。实验结果表明,该方法在航空发动机轴间轴承故障诊断中表现优异:在高斯白噪声环境下(信噪比-4~4 dB),诊断准确率与F1-Score均为最优,强噪声(-4 dB)时达96.04%,弱噪声(4 dB)时达99.84%,抗噪稳定性优于五种对比方法;在CWRU基准数据集的无噪声与噪声场景中,可稳定识别不同程度故障(如轻度内圈故障),强噪声(-4 dB)时准确率99.01%,弱噪声(4 dB)时达99.78%,验证了泛化能力,有效改善了噪声干扰下特征聚焦不足与局部特征捕捉薄弱的问题。综上,多尺度时频协同Transformer为航空发动机轴间轴承故障诊断提供了高效稳健的解决方案,其强抗噪性与精准识别能力满足实际工程复杂振动环境需求,为提升故障监测可靠性提供技术支撑。 展开更多
关键词 航空发动机 轴间轴承故障诊断 多尺度特征提取 双通道动态协同注意力 小波时频层级transformer
原文传递
Study of Human Action Recognition Based on Improved Spatio-temporal Features 被引量:7
14
作者 Xiao-Fei Ji Qian-Qian Wu +1 位作者 Zhao-Jie Ju Yang-Yang Wang 《International Journal of Automation and computing》 EI CSCD 2014年第5期500-509,共10页
Most of the exist action recognition methods mainly utilize spatio-temporal descriptors of single interest point while ignoring their potential integral information, such as spatial distribution information. By combin... Most of the exist action recognition methods mainly utilize spatio-temporal descriptors of single interest point while ignoring their potential integral information, such as spatial distribution information. By combining local spatio-temporal feature and global positional distribution information(PDI) of interest points, a novel motion descriptor is proposed in this paper. The proposed method detects interest points by using an improved interest point detection method. Then, 3-dimensional scale-invariant feature transform(3D SIFT) descriptors are extracted for every interest point. In order to obtain a compact description and efficient computation, the principal component analysis(PCA) method is utilized twice on the 3D SIFT descriptors of single frame and multiple frames. Simultaneously, the PDI of the interest points are computed and combined with the above features. The combined features are quantified and selected and finally tested by using the support vector machine(SVM) recognition algorithm on the public KTH dataset. The testing results have showed that the recognition rate has been significantly improved and the proposed features can more accurately describe human motion with high adaptability to scenarios. 展开更多
关键词 Action recognition spatio-temporal interest points 3-dimensional scale-invariant feature transform (3D SIFT) positional distribution information dimension reduction
原文传递
一种交互连接CNN和Transformer的肠道息肉图像分类网络 被引量:1
15
作者 曹博 叶淑芳 +3 位作者 饶钰君 汤晓恒 何熊熊 李胜 《小型微型计算机系统》 北大核心 2025年第4期932-939,共8页
利用内镜图像对结直肠息肉进行风险分类至关重要,能够提高临床诊断准确性并降低结直肠癌死亡率.然而,目前基于卷积神经网络(CNN)或视觉Transformer(ViT)的分类方法不能很好地区分类内尺度大和类间相似性高的息肉图像,针对息肉风险的分... 利用内镜图像对结直肠息肉进行风险分类至关重要,能够提高临床诊断准确性并降低结直肠癌死亡率.然而,目前基于卷积神经网络(CNN)或视觉Transformer(ViT)的分类方法不能很好地区分类内尺度大和类间相似性高的息肉图像,针对息肉风险的分类任务亟需改善.CNN中的卷积算子擅长提取局部特征.ViT通过级联自注意力模块可以捕获长距离依赖关系和全局特征.本文提出一个交互连接模块,以交互式的方式将CNN和ViT相连接,以整合多尺度特征;所设计的交互混合模型,能最大限度地保留局部特征和全局表示,显著缓解息肉多分类的类内差异性大、类间相似性高的问题;在大规模自然图像数据集中进行预训练;通过微调模型结构,使用预训练的交互混合模型参数初始化主干网络,并迁移至结直肠息肉数据集中再次训练,实现息肉多分类.在结直肠息肉私有数据集和Kvasir公共数据集上评估所提出模型,实验结果显示总体分类准确率分别达到了85.83%和96.84%,优于本文比较的其他算法;且引入迁移学习可以在降低训练成本的同时提升交互混合模型的分类性能和泛化性,在有限的训练数据集下有助于提高临床诊断效率. 展开更多
关键词 卷积神经网络(CNN) 视觉transformer(ViT) 结直肠息肉分类 多尺度特征 迁移学习
在线阅读 下载PDF
Face recognition using SIFT features under 3D meshes 被引量:1
16
作者 张诚 谷宇章 +1 位作者 胡珂立 王营冠 《Journal of Central South University》 SCIE EI CAS CSCD 2015年第5期1817-1825,共9页
Expression, occlusion, and pose variations are three main challenges for 3D face recognition. A novel method is presented to address 3D face recognition using scale-invariant feature transform(SIFT) features on 3D mes... Expression, occlusion, and pose variations are three main challenges for 3D face recognition. A novel method is presented to address 3D face recognition using scale-invariant feature transform(SIFT) features on 3D meshes. After preprocessing, shape index extrema on the 3D facial surface are selected as keypoints in the difference scale space and the unstable keypoints are removed after two screening steps. Then, a local coordinate system for each keypoint is established by principal component analysis(PCA).Next, two local geometric features are extracted around each keypoint through the local coordinate system. Additionally, the features are augmented by the symmetrization according to the approximate left-right symmetry in human face. The proposed method is evaluated on the Bosphorus, BU-3DFE, and Gavab databases, respectively. Good results are achieved on these three datasets. As a result, the proposed method proves robust to facial expression variations, partial external occlusions and large pose changes. 展开更多
关键词 3D face recognition seale-invariant feature transform (SIFT) expression OCCLUSION large pose changes 3D meshes
在线阅读 下载PDF
融合通道注意力的跨尺度Transformer图像超分辨率重建 被引量:1
17
作者 李焱 董仕豪 +2 位作者 张家伟 赵茹 郑钰辉 《中国图象图形学报》 北大核心 2025年第3期784-797,共14页
目的针对在超分辨率任务中,Transformer模型存在特征提取模式单一、重建图像高频细节丢失和结构失真的问题,提出了一种融合通道注意力的跨尺度Transformer图像超分辨率重建模型。方法模型由4个模块组成:浅层特征提取、跨尺度深层特征提... 目的针对在超分辨率任务中,Transformer模型存在特征提取模式单一、重建图像高频细节丢失和结构失真的问题,提出了一种融合通道注意力的跨尺度Transformer图像超分辨率重建模型。方法模型由4个模块组成:浅层特征提取、跨尺度深层特征提取、多级特征融合以及高质量重建模块。浅层特征提取利用卷积处理早期图像,获得更稳定的输出;跨尺度深层特征提取利用跨尺度Transformer和强化通道注意力机制,扩大感受野并通过加权筛选提取不同尺度特征以便融合;多级特征融合模块利用强化通道注意力机制,实现对不同尺度特征通道权重的动态调整,促进模型对丰富上下文信息的学习,增强模型在图像超分辨率重建任务中的能力。结果在Set5、Set14、BSD100(Berkeley segmentation dataset 100)、Urban100(urban scene 100)和Manga109标准数据集上的模型评估结果表明,相较于SwinIR超分辨率模型,所提模型在峰值信噪比上提高了0.06~0.25 dB,且重建图像视觉效果更好。结论提出的融合通道注意力的跨尺度Transformer图像超分辨率重建模型,通过融合卷积特征与Transformer特征,并利用强化通道注意力机制减少图像中噪声和冗余信息,降低模型产生图像模糊失真的可能性,图像超分辨率性能有效提升,在多个公共实验数据集的测试结果验证了所提模型的有效性。 展开更多
关键词 图像超分辨率 跨尺度transformer 通道注意力机制 特征融合 深度学习
原文传递
多尺度特征融合的双阶段Transformer去雨网络 被引量:2
18
作者 李世平 周冬明 《小型微型计算机系统》 北大核心 2025年第4期898-906,共9页
图像去雨研究旨在提升图像质量,强化视觉感知.现有去雨算法由于通常采用单阶段实现,在去除雨纹干扰的同时会造成无雨背景的信息缺失,导致无法兼顾去雨效果和图像清晰度.为此,本文提出了一种基于Transformer的多尺度、双阶段U型去雨网络... 图像去雨研究旨在提升图像质量,强化视觉感知.现有去雨算法由于通常采用单阶段实现,在去除雨纹干扰的同时会造成无雨背景的信息缺失,导致无法兼顾去雨效果和图像清晰度.为此,本文提出了一种基于Transformer的多尺度、双阶段U型去雨网络,将去雨任务通过两个分别侧重于雨纹提取和细节修复的子网络逐步完成.第1阶段,引入反投射技术提出了一种特征融合模块,通过迭代逐渐融合不同尺度下的特征信息以弥补U型结构造成的信息缺失.同时,基于Boosting算法提出了一种增强连接的特征提取模块,以增强细节特征,提高输出信噪比.第2阶段,提出了一种细节增强注意力模块对粗糙去雨图像进行细节修复以生成轮廓清晰的无雨图像.实验结果表明,本文提出的算法在合成和真实数据集上都取得了出色的去雨效果,在Rain100H、SPA-data等数据集上相比近期其他优秀去雨算法均有一定程度的指标提升. 展开更多
关键词 图像去雨 transformER 多阶段网络 多尺度特征融合
在线阅读 下载PDF
基于Bag of Features模型的害虫图像分类技术研究 被引量:1
19
作者 姜祖新 赵小军 +3 位作者 王复元 盛强 谢鹏 徐擎宇 《粮食储藏》 2015年第4期28-32,共5页
将Bag of Features模型结合OpenCV开源图像库提取害虫图像的特征,然后用Kmedoids算法对其进行聚类,生成关键字,最后用AdaBoosting算法构建分类器,实验采用Pascal Voc图像库中的数据进行训练和测试,实验表明,该算法分类精度高、特征提取... 将Bag of Features模型结合OpenCV开源图像库提取害虫图像的特征,然后用Kmedoids算法对其进行聚类,生成关键字,最后用AdaBoosting算法构建分类器,实验采用Pascal Voc图像库中的数据进行训练和测试,实验表明,该算法分类精度高、特征提取速度和分类速度也比较快。 展开更多
关键词 SIFT特征 聚类算法 图像分类性能
在线阅读 下载PDF
基于融合Swin Transformer网络的腰椎解剖区域自动分割方法
20
作者 张英迪 史泽林 +6 位作者 王欢 崔少千 张磊 刘嘉琛 单修祺 刘云鹏 赵恩波 《信息与控制》 北大核心 2025年第3期390-400,共11页
腰椎解剖区域自动分割在脊柱影像自动分析流程中发挥着重要作用。尽管经典的卷积神经网络能够捕捉影像全局特征,其局部先验和权重共享的特性限制了长距离建模的能力。为了解决以上问题,本文提出了一种用于腰椎解剖区域分割的Swin Transf... 腰椎解剖区域自动分割在脊柱影像自动分析流程中发挥着重要作用。尽管经典的卷积神经网络能够捕捉影像全局特征,其局部先验和权重共享的特性限制了长距离建模的能力。为了解决以上问题,本文提出了一种用于腰椎解剖区域分割的Swin Transformer融合网络,将Swin Transformer网络和多尺度空洞卷积融合作为编码器来得到全局和局部特征的层次化表达。设计了特征耦合模块,在通道和空间2个维度将来自Transformer模块和卷积模块的特征进行耦合,提高了模型的局部和长距离建模能力。为了解决开源数据缺乏的问题,提出了带有体素级标注的、包含663个腰椎椎骨计算断层成像的数据集。在此数据集上的实验表明提出的模型分割精度超过了典型医学图像分割方法,本文模型的骰子系数、Hausdorff距离和平均表面距离分别为88.24%、14.48和0.997。消融实验进一步验证了所提出模块的有效性。 展开更多
关键词 卷积神经网络 医学图像分割 transformER 多尺度特征提取
原文传递
上一页 1 2 70 下一页 到第
使用帮助 返回顶部