期刊文献+
共找到2,063篇文章
< 1 2 104 >
每页显示 20 50 100
结合空间多层图卷积和时序分段Transformer的分心驾驶识别方法
1
作者 葛慧敏 欧阳宁 吴沛桐 《计算机工程与应用》 北大核心 2026年第4期152-167,共16页
识别分心驾驶行为是提升驾驶安全性的重要手段之一。目前基于图卷积的骨架动作识别方法采用单一的骨架图结构而忽略了关节点间的多种交互关系,且对骨架序列局部及全局时间特征提取能力不足。针对上述问题,提出一种结合空间多层图卷积和... 识别分心驾驶行为是提升驾驶安全性的重要手段之一。目前基于图卷积的骨架动作识别方法采用单一的骨架图结构而忽略了关节点间的多种交互关系,且对骨架序列局部及全局时间特征提取能力不足。针对上述问题,提出一种结合空间多层图卷积和时序分段Transformer的分心驾驶识别模型。在空间建模方面,通过多种索引方式构建包含多种空间关系的驾驶员关节点的多层图结构,并引入图注意力机制动态调整图结构中边的连接强度,利用层内与层间图卷积操作提取与融合空间特征。在时间建模方面,对时间序列进行分段处理,并使用Transformer来有效捕捉分段时间的局部特征及跨时段的全局特征。最终在Drive&Act、DAD数据集上对模型进行了性能验证,结果表明,模型相较于现有方法进一步提高了分心驾驶行为识别的准确率。 展开更多
关键词 智能交通 分心驾驶 基于骨架的动作识别 时序transformer 空间多层图
在线阅读 下载PDF
The Decline and Reinvention of Marketplace Culture:Civic Social Networks in the Spatial Transformation of Chengdu Teahouses
2
作者 Ding Ding Wang Hao 《Contemporary Social Sciences》 2025年第3期18-33,共16页
Chengdu teahouses,as core public spaces in marketplace society,have undergone transformative reconstruction-from“containers of everyday life”to“containers of commercial traffic and digital flows”-during the proces... Chengdu teahouses,as core public spaces in marketplace society,have undergone transformative reconstruction-from“containers of everyday life”to“containers of commercial traffic and digital flows”-during the process of modernization.Employing spatial archaeology as a methodology,combined with fieldwork and analysis of historical documents,this study systematically examines the diachronic evolution of architectural forms,functional orientations,and social networks within Chengdu teahouses.The study reveals the logic of spatial reconstruction under the interplay of multiple forces,including cultural heritage preservation,capital-driven development,and technological intervention.The findings identify three paradigms of spatial transformation in teahouses.First,heritage specimenization,which reinforces the continuity of collective memory through symbolic extraction but risks diminishing the vitality of everyday social interactions.Second,consumption upgrading,which caters to the demands of emerging groups through iterative business models yet necessitates vigilance against spatial differentiation eroding marketplace inclusivity.Third,digital parasitism,which expands communicative dimensions through technological empowerment but confronts the risk of flattening localized knowledge.These paradigms reflect both adaptive responses of traditional spaces to contemporary pressure and the tension of reconstruction imposed by instrumental rationality on marketplace networks.The study demonstrates that spatial transformation in Chengdu teahouses is not unidirectional alienation but rather a multifaceted configuration where the continuity of tradition coexists with innovative practices amid functional diversification.This research advocates for striking a balance between the preservation of traditional spaces and modern renewal and explores organic integration approaches for traditional and modern elements,thereby providing a theoretical framework and practical insights for the transformation of traditional public spaces. 展开更多
关键词 teahouse culture marketplace culture spatial transformation capital-driven development cultural heritage preservation
在线阅读 下载PDF
融合多重卷积和Dense Transformer的高光谱图像分类
3
作者 魏林 杨霄 尹玉萍 《红外技术》 北大核心 2026年第2期193-203,共11页
高光谱图像蕴含丰富的光谱空间信息。如何充分挖掘空谱信息进行分类,是一个关键的研究问题。在处理高光谱图像分类时,卷积擅长提取局部特征,Transformer能够捕获长距离特征依赖性,学习全局特征信息。针对卷积和Transformer的优势,提出... 高光谱图像蕴含丰富的光谱空间信息。如何充分挖掘空谱信息进行分类,是一个关键的研究问题。在处理高光谱图像分类时,卷积擅长提取局部特征,Transformer能够捕获长距离特征依赖性,学习全局特征信息。针对卷积和Transformer的优势,提出了一种结合三维卷积、空间通道重建卷积和Transformer的高光谱图像分类方法。首先将降维后的图像块,利用三维卷积进行综合的空谱特征提取;随后用空间通道重建卷积过滤冗余信息;最后用具有密集连接的Transformer对卷积提取的空谱特征建立长距离依赖关系,并使用多层感知机进行分类。实验表明,该方法在Pavia University、Salinas和Botswana数据集上总体分类精度分别为99.51%、99.85%、97.57%,均表现优异。 展开更多
关键词 高光谱图像 特征提取 三维卷积 空间通道重建卷积 transformER
在线阅读 下载PDF
Spatial Heterogeneities of O2O Retail Space in Urban China Under Digital Transformation:Evidence from Guangzhou,China
4
作者 WEI Zongcai HUANG Weichao +1 位作者 TANG Qijing XIE Ruimin 《Chinese Geographical Science》 2025年第5期963-981,共19页
The rapid development of digital technologies has driven the emergence and popularization of online-to-offline(O2O)retail,reshaping the retail landscape in urban China.However,spatial distribution characteristics and ... The rapid development of digital technologies has driven the emergence and popularization of online-to-offline(O2O)retail,reshaping the retail landscape in urban China.However,spatial distribution characteristics and influencing mechanisms of emerging O2O retail have not been thoroughly investigated in extant studies.Taking the central urban area of Guangzhou as the case,this study utilized multi-source data and machine learning methods to explore the distribution characteristics of O2O retail space and to further identify the nonlinear effects of the built environment,sociodemographic,and economic factors on its distribution.The results revealed that O2O retail space exhibited a‘single-center’distribution pattern,in contrast to the‘multi-center’distribution pattern of traditional retail space.This finding supported the diffusion of innovation hypothesis,highlighting that the expansion of O2O retail modes first spread from traditional developed retail space.Furthermore,spatial heterogeneities were observed across different types of O2O retail space,with O2O in-store showing a‘core-periphery’spatial structure as described by Central Place Theory,whereas O2O delivery displaying a‘horizontal,non-hierarchical,and multi-centered’network structure following Central Flow Theory.Compared to traditional retail space,the distribution of O2O retail space was more influenced by sociodemographic factors such as the proportion of youth,education level,and income level,but less affected by the built environment factors like office and building density.Furthermore,nonlinear effects of these influencing factors on the distribution of O2O retail space were identified,which enriched the existing literature by highlighting effective ranges and threshold effects.These findings provided valuable insights into O2O retail space development in the context of digital transformation. 展开更多
关键词 online-to-offline(O2O)retail spatial distribution characteristics digital transformation nonlinear effect gradient boosting decision trees Guangzhou China
在线阅读 下载PDF
基于改进YOLOv11的CNN-Transformer混合水域垃圾检测算法
5
作者 赵建永 李瑞东 +1 位作者 姚浩 魏秀蓉 《无线互联科技》 2026年第4期21-25,50,共6页
河流水面漂浮物检测受限于复杂环境条件(如光照变化、波纹干扰)和检测目标尺度较小的特点,传统方法难以实现高精度检测。文章提出一种面向复杂水域场景的单阶段检测模型YOLOv11n-SPT,在YOLOv11n基础上引入新型Spatial Pyramid Transform... 河流水面漂浮物检测受限于复杂环境条件(如光照变化、波纹干扰)和检测目标尺度较小的特点,传统方法难以实现高精度检测。文章提出一种面向复杂水域场景的单阶段检测模型YOLOv11n-SPT,在YOLOv11n基础上引入新型Spatial Pyramid Transformer(SPT)模块与通道注意力机制。SPT模块采用多分支空间金字塔结构,实现高分辨率细节保留与超大感受野全局建模的协同。在FloW-Img数据集上,YOLOv11n-SPT的mAP@0.5达到81.2%,较基线YOLOv11n提升2.9个百分点;消融实验表明,单独引入SPT模块使mAP@0.5提升2.0%,召回率提升2.1%,进一步叠加通道注意力后精确率提升至85.4%。YOLOv11n-SPT在微小目标与强干扰场景下表现出更强的鲁棒性与定位精度,为无人清漂船、无人机巡河等实际水域环境治理任务提供了高效可靠的感知方案。 展开更多
关键词 水面漂浮物检测 spatial Pyramid transformer YOLOv11
在线阅读 下载PDF
融合时间空间的多尺度Transformer人脸伪造检测
6
作者 杜利莎 杨高明 《兰州工业学院学报》 2026年第1期15-20,共6页
针对目前人脸伪造检测无法充分提取时间特征、检测效率低等问题,提出一种融合时间特征和空间特征的多尺度人脸伪造检测方法MST-ViT。MST-ViT方法设计双流结构提取包含全局信息和细节信息的多尺度特征,设计帧间差异捕获模块增强对时间伪... 针对目前人脸伪造检测无法充分提取时间特征、检测效率低等问题,提出一种融合时间特征和空间特征的多尺度人脸伪造检测方法MST-ViT。MST-ViT方法设计双流结构提取包含全局信息和细节信息的多尺度特征,设计帧间差异捕获模块增强对时间伪影的提取,并通过时空Transformer提取时间特征和空间特征。实验结果表明:所提模型在FF++数据集内的AUC结果提升1.71%;在具有挑战性的DFDC跨数据集实验中AUC提升2.06%。 展开更多
关键词 人脸伪造检测 空间特征 时间特征 多尺度特征 transformER
在线阅读 下载PDF
Sound Source Localization Based on SRP-PHAT Spatial Spectrum and Deep Neural Network 被引量:3
7
作者 Xiaoyan Zhao Shuwen Chen +1 位作者 Lin Zhou Ying Chen 《Computers, Materials & Continua》 SCIE EI 2020年第7期253-271,共19页
Microphone array-based sound source localization(SSL)is a challenging task in adverse acoustic scenarios.To address this,a novel SSL algorithm based on deep neural network(DNN)using steered response power-phase transf... Microphone array-based sound source localization(SSL)is a challenging task in adverse acoustic scenarios.To address this,a novel SSL algorithm based on deep neural network(DNN)using steered response power-phase transform(SRP-PHAT)spatial spectrum as input feature is presented in this paper.Since the SRP-PHAT spatial power spectrum contains spatial location information,it is adopted as the input feature for sound source localization.DNN is exploited to extract the efficient location information from SRP-PHAT spatial power spectrum due to its advantage on extracting high-level features.SRP-PHAT at each steering position within a frame is arranged into a vector,which is treated as DNN input.A DNN model which can map the SRP-PHAT spatial spectrum to the azimuth of sound source is learned from the training signals.The azimuth of sound source is estimated through trained DNN model from the testing signals.Experiment results demonstrate that the proposed algorithm significantly improves localization performance whether the training and testing condition setup are the same or not,and is more robust to noise and reverberation. 展开更多
关键词 Sound source localization microphone array steered response power-phase transform(srp-phat)spatial spectrum deep neural network
在线阅读 下载PDF
Air pollution effects of industrial transformation in the Yangtze River Delta from the perspective of spatial spillover 被引量:4
8
作者 CHEN Yufan XU Yong WANG Fuyuan 《Journal of Geographical Sciences》 SCIE CSCD 2022年第1期156-176,共21页
The Yangtze River Delta(YRD) is a region in China with a serious contradiction between economic growth and environmental pollution. Exploring the spatiotemporal effects and influencing factors of air pollution in the ... The Yangtze River Delta(YRD) is a region in China with a serious contradiction between economic growth and environmental pollution. Exploring the spatiotemporal effects and influencing factors of air pollution in the region is highly important for formulating policies to promote the high-quality development of urban industries. This study uses the spatial Durbin model(SDM) to analyze the local direct and spatial spillover effects of industrial transformation on air pollution and quantifies the contribution of each factor. From 2008 to 2018, there was a significant spatial agglomeration of industrial sulfur dioxide emissions(ISDE) in the YRD, and every 1% increase in ISDE led to a synchronous increase of 0.603% in the ISDE in adjacent cities. The industrial scale index(ISCI) and industrial structure index(ISTI), as the core factors of industrial transformation, significantly affect the emissions of sulfur dioxide in the YRD, and the elastic coefficients are 0.677 and-0.368, respectively. The order of the direct effect of the explanatory variables on local ISDE is ISCI>ISTI>foreign direct investment(FDI)>enterprise technological innovation(ETI)>environmental regulation(ER)> per capita GDP(PGDP). Similarly, the order of the spatial spillover effect of all variables on ISDE in adjacent cities is ISCI>PGDP>FDI>ETI>ISTI>ER, and the coefficients of the ISCI and ISTI are 1.531 and 0.113, respectively. This study contributes to the existing research that verifies the environmental Kuznets curve in the YRD, denies the pollution heaven hypothesis, indicates the Porter hypothesis, and provides empirical evidence for the formation mechanism of regional environmental pollution from a spatial spillover perspective. 展开更多
关键词 industrial agglomeration industrial structure adjustment industrial transformation air pollution spatial spillover effect spatial Durbin model
原文传递
Spatial transformation of general sampling-aliasing frequency region for rotating-blade parameter identification with emphasis on single-probe blade tip-timing measurement 被引量:2
9
作者 Jiahui CAO Zhibo YANG +3 位作者 Guangrong TENG Shaohua TIAN Guoyong YE Xuefeng CHEN 《Chinese Journal of Aeronautics》 SCIE EI CAS CSCD 2023年第3期220-240,共21页
Blade-health monitoring is intensely required for turbomachinery because of the high failure risk of rotating blades.Blade-Tip Timing(BTT)is considered as the most promising technique for operational blade-vibration m... Blade-health monitoring is intensely required for turbomachinery because of the high failure risk of rotating blades.Blade-Tip Timing(BTT)is considered as the most promising technique for operational blade-vibration monitoring,which obtains the parameters that characterize the blade condition from recorded signals.However,its application is hindered by severe undersampling and stringent probe layouts.An inappropriate probe layout can make most of the existing methods invalid or inaccurate.Additionally,a general conflict arises between the allowed and required layouts because of arrangement restrictions.For the sake of economy and safety,parameter identification based on fewer probes has been preferred by users.In this work,a spatial-transformation-based method for parameter identification is proposed based on a single-probe BTT measurement.To present the general Sampling-Aliasing Frequency(SAFE)map definition,the traditional time-frequency analysis methods are extended to a time-sampling frequency.Then,a SAFE map is projected onto a parameter space using spatial transformation to extract the slope and intercept parameters,which can be physically interpreted as an engine order and a natural frequency using coordinate transformation.Finally,the effectiveness and robustness of the proposed method are verified by simulations and experiments under uniformly and nonuniformly variable speed conditions. 展开更多
关键词 Blade-tip timing(BTT) Extended time-frequency analysis Image-feature recognition Parameter extraction Sampling-aliasing frequency map spatial transformation Vibration analysis
原文传递
CATrans:基于跨尺度注意力Transformer的高分辨率遥感影像土地覆盖语义分割框架 被引量:1
10
作者 陈丽佳 陈宏辉 +3 位作者 谢艳秋 何天友 叶菁 吴林煌 《地球信息科学学报》 北大核心 2025年第7期1624-1637,共14页
【目的】高分辨率遥感影像语义分割通过精准提取地物信息,为城市规划、土地分析利用提供了重要的数据支持。当前分割方法通常将遥感影像划分为标准块,进行多尺度局部分割和层次推理,未充分考虑影像中的上下文先验知识和局部特征交互能力... 【目的】高分辨率遥感影像语义分割通过精准提取地物信息,为城市规划、土地分析利用提供了重要的数据支持。当前分割方法通常将遥感影像划分为标准块,进行多尺度局部分割和层次推理,未充分考虑影像中的上下文先验知识和局部特征交互能力,影响了推理分割质量。【方法】为了解决这一问题,本文提出了一种联合跨尺度注意力和语义视觉Transformer的遥感影像分割框架(Cross-scale Attention Transformer,CATrans),融合跨尺度注意力模块和语义视觉Transformer,提取上下文先验知识增强局部特征表示和分割性能。首先,跨尺度注意力模块通过空间和通道两个维度进行并行特征处理,分析浅层-深层和局部-全局特征之间的依赖关系,提升对遥感影像中不同粒度对象的注意力。其次,语义视觉Transformer通过空间注意力机制捕捉上下文语义信息,建模语义信息之间的依赖关系。【结果】本文在DeepGlobe、Inria Aerial和LoveDA数据集上进行对比实验,结果表明:CATrans的分割性能优于现有的WSDNet(Discrete Wavelet Smooth Network)和ISDNet(Integrating Shallow and Deep Network)等分割算法,分别取得了76.2%、79.2%、54.2%的平均交并比(Mean Intersection over Union,mIoU)和86.5%、87.8%、66.8%的平均F1得分(Mean F1 Score,mF1),推理速度分别达到38.1 FPS、13.2 FPS和95.22 FPS。相较于本文所对比的最佳方法WSDNet,mIoU和mF1在3个数据集中分别提升2.1%、4.0%、5.3%和1.3%、1.8%、5.6%,在每类地物的分割中都具有显著优势。【结论】本方法实现了高效率、高精度的高分辨率遥感影像语义分割。 展开更多
关键词 高分辨率 语义分割 跨尺度注意力 视觉transformer 上下文先验 空间注意力 语义信息
原文传递
Reflectionless spatial beam benders with arbitrary bending angle by introducing optic-null medium into transformation optics
11
作者 Fei Sun Yi-Chao Liu +3 位作者 Yi-Biao Yang Hong-Ming Fei Zhi-Hui Chen Sai-Ling He 《Chinese Physics B》 SCIE EI CAS CSCD 2021年第3期242-247,共6页
By introducing an optic-null medium into the finite embedded transformation,a reflectionless spatial beam bender is designed,which can steer the output beam by a fixed pre-designed angleβfor an arbitrary incident ang... By introducing an optic-null medium into the finite embedded transformation,a reflectionless spatial beam bender is designed,which can steer the output beam by a fixed pre-designed angleβfor an arbitrary incident angle.The bending angleβof the beam bender is determined by the geometrical angle of the device,which can be changed by simply choosing different geometrical angles.For various bending angles,the designed spatial beam bender can be realized by the same materials(i.e.,an optic-null medium),which is a homogenous anisotropic material.Numerical simulations verify the reflectionless bending effect and rotated imaging ability of the proposed beam bender.A reduction model of the optic-null medium is studied,which can also be used for a reflectionless spatial beam bender with a pre-designed bending angle. 展开更多
关键词 spatial beam benders optic-null medium transformation optics
原文传递
Predict the Future Hospitalized Patients Number Based on Patient’s Temporal and Spatial Fluctuations Using a Hybrid ARIMA and Wavelet Transform Model
12
作者 Shundong Lei 《Journal of Geographic Information System》 2017年第4期456-465,共10页
Relative to hospitalized patient information, outpatient admission information is relatively simple. It only includes the patient admission time, place of residence and other information. Traditionally, the excavation... Relative to hospitalized patient information, outpatient admission information is relatively simple. It only includes the patient admission time, place of residence and other information. Traditionally, the excavation of this information is not sufficient. However, when a large number of patients admitted time and residence information combined to consider, and add some data mining technology, some of the previously ignored regular information is likely to be found. Using 5 years of data mining research and admission data from a paediatric department at a large women’s and children’s hospital in China, we found important fluctuation rules regarding admissions using wavelet analysis on hospital admission data among different scales of cyclical fluctuations. Method: Seasonal distribution of patient number was analysed based on Haar wavelet transformation, and level 3 and level 2 of wavelets were extracted out to fit the data. The distribution function of hospitalized patients was visualized by kernel density estimation. Using linear regression and ARIMA (autoregressive integrated moving average model) predict the seasonally number of patients in the future. Results: The data analysis demonstrates the total surge of inpatients was decomposed into one mother wavelet and five small wavelets, each of which represents different time frequency. Besides, as distance from hospital increases, the number of patients decreased exponentially. The seasonal factors are the largest time factor influencing the number changes of patients. Conclusion: By wavelet analysis and the improved prediction model, we could make forecast on the future inpatient number trend and prove factors such as geographic position is influential on inpatient amount. Additionally, the concept of data mining based on spatial distribution and spectral analysis could be applied to other aspects of social management. 展开更多
关键词 Medical Resources Data Mining MULTI-SCALE ARIMA WAVELET transform spatial Distribution
暂未订购
Transient-Spatial Pattern Mining of Eddy Current Pulsed Thermography Using Wavelet Transform 被引量:2
13
作者 YANG Hailong GAO Bin +2 位作者 TIAN Guiyun REN Wenwei WOO Wai Lok 《Chinese Journal of Mechanical Engineering》 SCIE EI CAS CSCD 2014年第4期768-778,共11页
Eddy current pulsed thermography(ECPT) is an emerging Non-destructive testing and evaluation(NDT E) technique, which uses hybrid eddy current and thermography NDT E techniques that enhances the detectability fro... Eddy current pulsed thermography(ECPT) is an emerging Non-destructive testing and evaluation(NDT E) technique, which uses hybrid eddy current and thermography NDT E techniques that enhances the detectability from their compensation. Currently, this technique is limited by the manual selection of proper contrast frames and the issue of improving the efficiency of defect detection of complex structure samples remains a challenge. In order to select a specific frame from transient thermal image sequences to maximize the contrast of thermal variation and defect pattern from complex structure samples, an energy driven approach to compute the coefficient energy of wavelet transform is proposed which has the potential of automatically selecting both optimal transient frame and spatial scale for defect detection using ECPT. According to analysis of the variation of different frequency component and the comparison study of the detection performance of different scale and wavelets, the frame at the end of heating phase is automatically selected as an optimal transient frame for defect detection. In addition, the detection capabilities of the complex structure samples can be enhanced through proper spatial scale and wavelet selection. The proposed method has successfully been applied to low speed impact damage detection of carbon fibre reinforced polymer(CFRP) composite as well as providing the guidance to improve the detectability of ECPT technique. 展开更多
关键词 non-destructive testing and evaluation composite impact damage detection wavelet transform energy driven approach transient-spatial analysis
在线阅读 下载PDF
SPATIALLY SCALABLE RESOLUTION IMAGE CODING METHOD WITH MEMORY OPTIMIZATION BASED ON WAVELET TRANSFORM
14
作者 WangNa ZhangLi +2 位作者 ZhouXiao'an JiaChuanying LiXia 《Journal of Electronics(China)》 2005年第1期94-97,共4页
This letter exploits fundamental characteristics of a wavelet transform image to form a progressive octave-based spatial resolution. Each wavelet subband is coded based on zeroblock and quardtree partitioning ordering... This letter exploits fundamental characteristics of a wavelet transform image to form a progressive octave-based spatial resolution. Each wavelet subband is coded based on zeroblock and quardtree partitioning ordering scheme with memory optimization technique. The method proposed in this letter is of low complexity and efficient for Internet plug-in software. 展开更多
关键词 Memory optimization spatially resolution scalability Wavelet transform Quard-tree partitioning
在线阅读 下载PDF
Spatial Transformations and Urban Conservation of Religious-Historic Towns: A Case of Vrindavan, India
15
作者 Sunanda Kapoor Vandana Sehgal Mayank Mathur 《Journal of Geoscience and Environment Protection》 2022年第8期289-308,共20页
The spatial transformations can be observed at different religious-historic towns of India due to urbanization. Research is based upon fact that there is substantial change in the built environment because of spatial ... The spatial transformations can be observed at different religious-historic towns of India due to urbanization. Research is based upon fact that there is substantial change in the built environment because of spatial transformations at the religious-historic towns. The process of modernization in the functions and spatial layout is unavoidable at any historic town. The study attempts to focus on various urban historic conservation components, including the look of historic buildings, their earlier uses, and its immediate surroundings to improve the built environment of historic towns. A theoretical framework for the urban conservation of ancient towns is the main objective of study. How to modernize the historic conservation function while preserving the space’s texture and integrity. The research started with the investigation of the morphological growth of Mathura district, India through satellite images and in-depth study of the evolution process of street network in Vrindavan town, which is one of the main temple towns of Mathura district. There is a significant difference in the layout & architectural character of old part and the newly developed Vrindavan. Due to increased accessibility and movement, the spatial structure of traditional religious precincts, which were once local integration centres, has significantly changed. Increasing & changing mode of transportation and further increase in the religious tourism might be the cause or a big reason for the spatial transformations and correspondingly there is a challenge to conserve & preserve the religious precincts of historic towns. The study tries to analyze spatial transformations with the help of Historical GIS at different scales of urban form. Suggestive measures to conserve the environmental ambience of religious-historic towns are the outcome of the research. 展开更多
关键词 URBANIZATION spatial transformations Religious-Historic Town Environmental Ambience
在线阅读 下载PDF
三维卷积与Transformer支持下联合空谱特征的高光谱影像分类 被引量:1
16
作者 何光 吴田军 《计算机工程与应用》 北大核心 2025年第2期259-272,共14页
由于CNN对局部特征提取能力强,目前仍是高光谱影像处理和分析中的主流深度模型,但是CNN感受野有限,无法建立长距离依赖关系,学习全局语义信息受限。Transformer的自注意力机制可以对输入序列中的每个位置进行注意力计算,从而能有效获取... 由于CNN对局部特征提取能力强,目前仍是高光谱影像处理和分析中的主流深度模型,但是CNN感受野有限,无法建立长距离依赖关系,学习全局语义信息受限。Transformer的自注意力机制可以对输入序列中的每个位置进行注意力计算,从而能有效获取全局上下文信息。如何实现CNN和Transformer的技术耦合并充分利用空间信息和光谱信息进行高光谱遥感影像分类是一个重要的待研问题。鉴于此,提出一种新的基于三维卷积和Transformer的高光谱遥感影像分类方法,尝试联合空谱特征实现解译能力的提升。使用主成分分析方法对高光谱遥感影像沿垂直方向降维;用非负矩阵分解算法对降维后遥感影像沿水平方向进行空间特征提取,将两种工具处理后遥感影像进行拼接,以充分保留信息;再用三维卷积核对拼接后遥感影像进行空间特征和光谱特征的综合提取;用Transformer的注意力机制对提取空间信息和光谱信息的遥感影像序列建立长距离依赖关系并使用多层感知机完成分类任务。实验表明,所提方法在WHU-Hi龙口、汉川、洪湖以及雄安新区马蹄湾村数据集上均表现出比对比方法更优异的分类性能,表明该方法具有一定的泛化性和稳健性。 展开更多
关键词 非负矩阵分解 特征融合 三维卷积 空谱联合 transformER 高光谱遥感影像分类
在线阅读 下载PDF
基于Swin-AK Transformer的智能手机拍摄图像质量评价方法
17
作者 侯国鹏 董武 +4 位作者 陆利坤 周子镱 马倩 柏振 郑晟辉 《光电工程》 北大核心 2025年第1期116-130,共15页
本文提出了一种基于双交叉注意力融合的Swin-AK Transformer(Swin Transformer based on alterable kernel convolution)和手工特征相结合的智能手机拍摄图像质量评价方法。首先,提取了影响图像质量的手工特征,这些特征可以捕捉到图像... 本文提出了一种基于双交叉注意力融合的Swin-AK Transformer(Swin Transformer based on alterable kernel convolution)和手工特征相结合的智能手机拍摄图像质量评价方法。首先,提取了影响图像质量的手工特征,这些特征可以捕捉到图像中细微的视觉变化;其次,提出了Swin-AK Transformer,增强了模型对局部信息的提取和处理能力。此外,本文设计了双交叉注意力融合模块,结合空间注意力和通道注意力机制,融合了手工特征与深度特征,实现了更加精确的图像质量预测。实验结果表明,在SPAQ和LIVE-C数据集上,皮尔森线性相关系数分别达到0.932和0.885,斯皮尔曼等级排序相关系数分别达到0.929和0.858。上述结果证明了本文提出的方法能够有效地预测智能手机拍摄图像的质量。 展开更多
关键词 图像质量评价 智能手机拍摄图像 Swin transformer 手工特征 空间注意力 通道注意力
在线阅读 下载PDF
End-to-end spatial transform face detection and recognition
18
作者 Hongxin ZHANG Liying CHI 《Virtual Reality & Intelligent Hardware》 2020年第2期119-131,共13页
Background Several face detection and recogni tion methods have been proposed in the past decades that have excellent performance.The conventional face recognition pipeline comprises the following:(1)face detection,(2... Background Several face detection and recogni tion methods have been proposed in the past decades that have excellent performance.The conventional face recognition pipeline comprises the following:(1)face detection,(2)face alignment,(3)feature extraction,and(4)similarity,which are independent of each other.The separate facial analysis stages lead to redundant model calculations,and are difficult for use in end-to-end training.Methods In this paper,we propose a novel end-to-end trainable convolutional network framework for face detection and recognition,in which a geometric transformation matrix is directly learned to align the faces rather than predicting the facial landmarks.In the training stage,our single CNN model is supervised only by face bounding boxes and personal identities,which are publicly available from WIDER FACE and CASIA-WebFace datasets.Our model is tested on Face Detection Dataset and Benchmark(FDDB)and Labeled Face in the Wild(LFW)datasets.Results The results show 89.24%recall for face detection tasks and 98.63%accura cy for face recognition tasks. 展开更多
关键词 Face detection Face recognition spatial transform Feature fusion
在线阅读 下载PDF
基于Transformer与图卷积网络的三维人体姿态估计
19
作者 王宇晶 包明明 刘星 《传感技术学报》 北大核心 2025年第9期1624-1630,共7页
提出了一种Transformer与图网络相结合的网络模型,用于对视觉传感器采集到的视频图像进行三维人体姿态估计。Transformer能够有效地从二维关键关节点中提取时空维度高相关性特征,而图网络则能够感知细节相关性特征,通过融合这两种网络结... 提出了一种Transformer与图网络相结合的网络模型,用于对视觉传感器采集到的视频图像进行三维人体姿态估计。Transformer能够有效地从二维关键关节点中提取时空维度高相关性特征,而图网络则能够感知细节相关性特征,通过融合这两种网络结构,提高了三维姿态估计的精度。在公开数据集Human3.6M上进行了仿真实验,验证了Transformer与图卷积融合算法的性能。实验结果显示,最终估计得到的三维人体关节点的平均关节点位置偏差(Mean Per Joint Position Error,MPJPE)为38.4 mm,相较于现有方法有一定提升,表明该方法具有较强的应用价值,可应用于许多下游相关工作中。 展开更多
关键词 三维人体姿态估计 transformER 图卷积 时空相关性
在线阅读 下载PDF
基于改进Swin Transformer的遥感图像建筑物变化检测 被引量:1
20
作者 罗季 王晓红 +1 位作者 杨祎斐 肖剑 《现代电子技术》 北大核心 2025年第23期58-64,共7页
针对遥感图像建筑物变化检测中变化对象和背景之间边界模糊、特征损失严重以及错检、漏检等问题,文中提出基于改进Swin Transformer的遥感图像建筑物变化检测方法。该方法在4个Swin Transformer block之后嵌入全局通道空间注意力(GCSA)... 针对遥感图像建筑物变化检测中变化对象和背景之间边界模糊、特征损失严重以及错检、漏检等问题,文中提出基于改进Swin Transformer的遥感图像建筑物变化检测方法。该方法在4个Swin Transformer block之后嵌入全局通道空间注意力(GCSA)模块,来捕捉通道与空间维度的依赖关系,降低在特征提取中的信息损失,从而对全局特征的利用更加充分,提升边界划分的检测能力。将方法在公开数据集LEVIR⁃CD和WHU⁃CD上进行训练和测试,与原Swin Transformer相比,新网络在两个数据集上的总体精度(Acc)分别提高了0.51%和0.49%,交并比(IoU)分别提高了6.79%和6.40%,有效提高了建筑物变化区域的识别精度。 展开更多
关键词 遥感图像 变化检测 Swin transformer 通道注意力 空间注意力 特征提取 信息损失 边界划分
在线阅读 下载PDF
上一页 1 2 104 下一页 到第
使用帮助 返回顶部