期刊文献+
共找到275篇文章
< 1 2 14 >
每页显示 20 50 100
Feature Extraction by Multi-Scale Principal Component Analysis and Classification in Spectral Domain 被引量:2
1
作者 Shengkun Xie Anna T. Lawnizak +1 位作者 Pietro Lio Sridhar Krishnan 《Engineering(科研)》 2013年第10期268-271,共4页
Feature extraction of signals plays an important role in classification problems because of data dimension reduction property and potential improvement of a classification accuracy rate. Principal component analysis (... Feature extraction of signals plays an important role in classification problems because of data dimension reduction property and potential improvement of a classification accuracy rate. Principal component analysis (PCA), wavelets transform or Fourier transform methods are often used for feature extraction. In this paper, we propose a multi-scale PCA, which combines discrete wavelet transform, and PCA for feature extraction of signals in both the spatial and temporal domains. Our study shows that the multi-scale PCA combined with the proposed new classification methods leads to high classification accuracy for the considered signals. 展开更多
关键词 multi-scale Principal Component Analysis Discrete WAVELET TRANSFORM feature extraction Signal CLASSIFICATION Empirical CLASSIFICATION
在线阅读 下载PDF
A multi-scale convolutional auto-encoder and its application in fault diagnosis of rolling bearings 被引量:12
2
作者 Ding Yunhao Jia Minping 《Journal of Southeast University(English Edition)》 EI CAS 2019年第4期417-423,共7页
Aiming at the difficulty of fault identification caused by manual extraction of fault features of rotating machinery,a one-dimensional multi-scale convolutional auto-encoder fault diagnosis model is proposed,based on ... Aiming at the difficulty of fault identification caused by manual extraction of fault features of rotating machinery,a one-dimensional multi-scale convolutional auto-encoder fault diagnosis model is proposed,based on the standard convolutional auto-encoder.In this model,the parallel convolutional and deconvolutional kernels of different scales are used to extract the features from the input signal and reconstruct the input signal;then the feature map extracted by multi-scale convolutional kernels is used as the input of the classifier;and finally the parameters of the whole model are fine-tuned using labeled data.Experiments on one set of simulation fault data and two sets of rolling bearing fault data are conducted to validate the proposed method.The results show that the model can achieve 99.75%,99.3%and 100%diagnostic accuracy,respectively.In addition,the diagnostic accuracy and reconstruction error of the one-dimensional multi-scale convolutional auto-encoder are compared with traditional machine learning,convolutional neural networks and a traditional convolutional auto-encoder.The final results show that the proposed model has a better recognition effect for rolling bearing fault data. 展开更多
关键词 fault diagnosis deep learning convolutional auto-encoder multi-scale convolutional kernel feature extraction
在线阅读 下载PDF
Multi-Scale Mixed Attention Tea Shoot Instance Segmentation Model 被引量:1
3
作者 Dongmei Chen Peipei Cao +5 位作者 Lijie Yan Huidong Chen Jia Lin Xin Li Lin Yuan Kaihua Wu 《Phyton-International Journal of Experimental Botany》 SCIE 2024年第2期261-275,共15页
Tea leaf picking is a crucial stage in tea production that directly influences the quality and value of the tea.Traditional tea-picking machines may compromise the quality of the tea leaves.High-quality teas are often... Tea leaf picking is a crucial stage in tea production that directly influences the quality and value of the tea.Traditional tea-picking machines may compromise the quality of the tea leaves.High-quality teas are often handpicked and need more delicate operations in intelligent picking machines.Compared with traditional image processing techniques,deep learning models have stronger feature extraction capabilities,and better generalization and are more suitable for practical tea shoot harvesting.However,current research mostly focuses on shoot detection and cannot directly accomplish end-to-end shoot segmentation tasks.We propose a tea shoot instance segmentation model based on multi-scale mixed attention(Mask2FusionNet)using a dataset from the tea garden in Hangzhou.We further analyzed the characteristics of the tea shoot dataset,where the proportion of small to medium-sized targets is 89.9%.Our algorithm is compared with several mainstream object segmentation algorithms,and the results demonstrate that our model achieves an accuracy of 82%in recognizing the tea shoots,showing a better performance compared to other models.Through ablation experiments,we found that ResNet50,PointRend strategy,and the Feature Pyramid Network(FPN)architecture can improve performance by 1.6%,1.4%,and 2.4%,respectively.These experiments demonstrated that our proposed multi-scale and point selection strategy optimizes the feature extraction capability for overlapping small targets.The results indicate that the proposed Mask2FusionNet model can perform the shoot segmentation in unstructured environments,realizing the individual distinction of tea shoots,and complete extraction of the shoot edge contours with a segmentation accuracy of 82.0%.The research results can provide algorithmic support for the segmentation and intelligent harvesting of premium tea shoots at different scales. 展开更多
关键词 Tea shoots attention mechanism multi-scale feature extraction instance segmentation deep learning
在线阅读 下载PDF
Gender-Specific Multi-Task Micro-Expression Recognition Using Pyramid CGBP-TOP Feature
4
作者 Chunlong Hu Jianjun Chen +3 位作者 Xin Zuo Haitao Zou Xing Deng Yucheng Shu 《Computer Modeling in Engineering & Sciences》 SCIE EI 2019年第3期547-559,共13页
Micro-expression recognition has attracted growing research interests in the field of compute vision.However,micro-expression usually lasts a few seconds,thus it is difficult to detect.This paper presents a new framew... Micro-expression recognition has attracted growing research interests in the field of compute vision.However,micro-expression usually lasts a few seconds,thus it is difficult to detect.This paper presents a new framework to recognize micro-expression using pyramid histogram of Centralized Gabor Binary Pattern from Three Orthogonal Panels(CGBP-TOP)which is an extension of Local Gabor Binary Pattern from Three Orthogonal Panels feature.CGBP-TOP performs spatial and temporal analysis to capture the local facial characteristics of micro-expression image sequences.In order to keep more local information of the face,CGBP-TOP is extracted based on pyramid subregions of the micro-expression video frame.The combination of CGBP-TOP and spatial pyramid can represent well and truly the facial movements of the micro-expression image sequences.However,the dimension of our pyramid CGBP-TOP tends to be very high,which may lead to high data redundancy problem.In addition,it is clear that people of different genders usually have different ways of micro-expression.Therefore,in this paper,in order to select the relevant features of micro-expression,the gender-specific sparse multi-task learning method with adaptive regularization term is adopted to learn a compact subset of pyramid CGBP-TOP feature for micro-expression classification of different sexes.Finally,extensive experiments on widely used CASME II and SMIC databases demonstrate that our method can efficiently extract micro-expression motion features in the micro-expression video clip.Moreover,our proposed approach achieves comparable results with the state-of-the-art methods. 展开更多
关键词 Micro-expression recognition feature extraction spatial PYRAMID MULTI-TASK learning REGULARIZATION
在线阅读 下载PDF
Ship recognition based on HRRP via multi-scale sparse preserving method
5
作者 YANG Xueling ZHANG Gong SONG Hu 《Journal of Systems Engineering and Electronics》 SCIE CSCD 2024年第3期599-608,共10页
In order to extract the richer feature information of ship targets from sea clutter, and address the high dimensional data problem, a method termed as multi-scale fusion kernel sparse preserving projection(MSFKSPP) ba... In order to extract the richer feature information of ship targets from sea clutter, and address the high dimensional data problem, a method termed as multi-scale fusion kernel sparse preserving projection(MSFKSPP) based on the maximum margin criterion(MMC) is proposed for recognizing the class of ship targets utilizing the high-resolution range profile(HRRP). Multi-scale fusion is introduced to capture the local and detailed information in small-scale features, and the global and contour information in large-scale features, offering help to extract the edge information from sea clutter and further improving the target recognition accuracy. The proposed method can maximally preserve the multi-scale fusion sparse of data and maximize the class separability in the reduced dimensionality by reproducing kernel Hilbert space. Experimental results on the measured radar data show that the proposed method can effectively extract the features of ship target from sea clutter, further reduce the feature dimensionality, and improve target recognition performance. 展开更多
关键词 ship target recognition high-resolution range profile(HRRP) multi-scale fusion kernel sparse preserving projection(MSFKSPP) feature extraction dimensionality reduction
在线阅读 下载PDF
RealFuVSR:Feature enhanced real-world video super-resolution
6
作者 Zhi LI Xiongwen PANG +1 位作者 Yiyue JIANG Yujie WANG 《Virtual Reality & Intelligent Hardware》 EI 2023年第6期523-537,共15页
Background Recurrent recovery is a common method for video super-resolution(VSR)that models the correlation between frames via hidden states.However,the application of this structure in real-world scenarios can lead t... Background Recurrent recovery is a common method for video super-resolution(VSR)that models the correlation between frames via hidden states.However,the application of this structure in real-world scenarios can lead to unsatisfactory artifacts.We found that in real-world VSR training,the use of unknown and complex degradation can better simulate the degradation process in the real world.Methods Based on this,we propose the RealFuVSR model,which simulates real-world degradation and mitigates artifacts caused by the VSR.Specifically,we propose a multiscale feature extraction module(MSF)module that extracts and fuses features from multiple scales,thereby facilitating the elimination of hidden state artifacts.To improve the accuracy of the hidden state alignment information,RealFuVSR uses an advanced optical flow-guided deformable convolution.Moreover,a cascaded residual upsampling module was used to eliminate noise caused by the upsampling process.Results The experiment demonstrates that RealFuVSR model can not only recover high-quality videos but also outperforms the state-of-the-art RealBasicVSR and RealESRGAN models. 展开更多
关键词 Video super-resolution Deformable convolution Cascade residual upsampling Second-order degradation multi-scale feature extraction
在线阅读 下载PDF
Underwater Image Enhancement Based on Multi-scale Adversarial Network
7
作者 ZENG Jun-yang SI Zhan-jun 《印刷与数字媒体技术研究》 CAS 北大核心 2024年第5期70-77,共8页
In this study,an underwater image enhancement method based on multi-scale adversarial network was proposed to solve the problem of detail blur and color distortion in underwater images.Firstly,the local features of ea... In this study,an underwater image enhancement method based on multi-scale adversarial network was proposed to solve the problem of detail blur and color distortion in underwater images.Firstly,the local features of each layer were enhanced into the global features by the proposed residual dense block,which ensured that the generated images retain more details.Secondly,a multi-scale structure was adopted to extract multi-scale semantic features of the original images.Finally,the features obtained from the dual channels were fused by an adaptive fusion module to further optimize the features.The discriminant network adopted the structure of the Markov discriminator.In addition,by constructing mean square error,structural similarity,and perceived color loss function,the generated image is consistent with the reference image in structure,color,and content.The experimental results showed that the enhanced underwater image deblurring effect of the proposed algorithm was good and the problem of underwater image color bias was effectively improved.In both subjective and objective evaluation indexes,the experimental results of the proposed algorithm are better than those of the comparison algorithm. 展开更多
关键词 Underwater image enhancement Generative adversarial network multi-scale feature extraction Residual dense block
在线阅读 下载PDF
基于时空特征提取及深度集成网络的交通韧性预测
8
作者 夏溪蔓 孟学雷 +2 位作者 王莉 林立 韩正 《铁道科学与工程学报》 北大核心 2026年第1期63-76,共14页
目前,多制式交通跨线运营已成为提升城市交通网络性能的关键途径。精确量化多制式轨道交通网络的韧性,对于优化交通资源配置、增强交通系统抗风险能力具有重要意义。针对多制式交通跨线运营模式下时空特征关联性日渐复杂及多源异构数据... 目前,多制式交通跨线运营已成为提升城市交通网络性能的关键途径。精确量化多制式轨道交通网络的韧性,对于优化交通资源配置、增强交通系统抗风险能力具有重要意义。针对多制式交通跨线运营模式下时空特征关联性日渐复杂及多源异构数据处理困难的问题,提出一种基于时空特征提取及深度集成网络的交通韧性预测模型(spatial-temporal feature extraction and deep integrated network,STFEDIN)。该模型构建了时空特征融合网络(spatial-temporal feature fusion network,STNet),通过多尺度卷积与跨时间门控机制的协同实现对交通数据中非线性特征、时序依赖关系及空间异构性特征的有效提取。针对传统Transformer框架在时空特征建模中存在的长距离依赖捕获效率不足及空间结构信息利用不充分问题,引入混合头注意力机制(mixture-of-head,MoH)替代传统Transformer预测模型中的注意力结构,MoH模型可以通过动态路由策略实现注意力头间的协同优化,有效增强模型对多维度时空关联特征的动态解析能力与复杂场景适应性。以某城市的市域铁路与城市轨道交通系统跨线运营为例,验证模型的预测性能。实验结果表明,STFEDIN模型相对于传统的数理统计模型或单一机器学习模型有较好的预测性能,相较于时空演化建模图神经网络(spatial-temporal evolution modeling graph neural network,StemGNN)模型,平均绝对误差f_(mae)下降了0.01,均方根误差f_(rmse)下降了0.012,平均绝对百分比误差f_(mape)下降了1.701,决定系数f_(r2)上升了2.27%;与卷积长短时记忆网络(convolutional long short-term memory,ConvLSTM)模型相比,f_(mae)下降了0.045,f_(rmse)下降了0.057,f_(mape)下降了7.845,f_(r2)上升了26.60%。消融实验进一步证明了STFEDIN模型结构的合理性。研究成果为多制式交通跨线运营场景下的网络韧性评估提供了有效的解决途径。 展开更多
关键词 交通韧性 多制式交通跨线运营 时空特征提取 混合头注意力机制 TRANSFORMER
在线阅读 下载PDF
融合多重卷积和Dense Transformer的高光谱图像分类
9
作者 魏林 杨霄 尹玉萍 《红外技术》 北大核心 2026年第2期193-203,共11页
高光谱图像蕴含丰富的光谱空间信息。如何充分挖掘空谱信息进行分类,是一个关键的研究问题。在处理高光谱图像分类时,卷积擅长提取局部特征,Transformer能够捕获长距离特征依赖性,学习全局特征信息。针对卷积和Transformer的优势,提出... 高光谱图像蕴含丰富的光谱空间信息。如何充分挖掘空谱信息进行分类,是一个关键的研究问题。在处理高光谱图像分类时,卷积擅长提取局部特征,Transformer能够捕获长距离特征依赖性,学习全局特征信息。针对卷积和Transformer的优势,提出了一种结合三维卷积、空间通道重建卷积和Transformer的高光谱图像分类方法。首先将降维后的图像块,利用三维卷积进行综合的空谱特征提取;随后用空间通道重建卷积过滤冗余信息;最后用具有密集连接的Transformer对卷积提取的空谱特征建立长距离依赖关系,并使用多层感知机进行分类。实验表明,该方法在Pavia University、Salinas和Botswana数据集上总体分类精度分别为99.51%、99.85%、97.57%,均表现优异。 展开更多
关键词 高光谱图像 特征提取 三维卷积 空间通道重建卷积 TRANSFORMER
在线阅读 下载PDF
融合Transformer与BiLSTM的野外动态面部表情识别方法
10
作者 郭岱朋 徐飞 Nouman Hameed 《西安工业大学学报》 2026年第1期121-130,共10页
针对动态面部表情识别中时空特征提取与建模不足的问题,提出了一种结合Transformer与BiLSTM的动态面部表情识别方法。该方法通过Transformer进行空间特征提取,并利用BiLSTM对时序信息进行建模,从而提高动态面部表情的识别精度。实验结... 针对动态面部表情识别中时空特征提取与建模不足的问题,提出了一种结合Transformer与BiLSTM的动态面部表情识别方法。该方法通过Transformer进行空间特征提取,并利用BiLSTM对时序信息进行建模,从而提高动态面部表情的识别精度。实验结果表明,在DFEW数据集上,未加权平均召回率和加权平均召回率较现有方法分别提高了4.14%和2.52%;在FERV39k数据集上,提高了1.64%和1.80%。实验验证了该方法在动态面部表情识别中的有效性。 展开更多
关键词 动态面部表情识别 特征提取 空间特征 时序信息
在线阅读 下载PDF
基于多尺度混合注意力的遥感图像超分辨率重建
11
作者 邓峰良 钱育蓉 +3 位作者 范迎迎 白璐 王元旭 孔维泉 《微电子学与计算机》 2026年第3期98-110,共13页
现有基于Transformer的方法在处理复杂遥感场景时表现不佳,容易出现伪影和细节丢失,特别是在局部信息捕捉和空间关系建模方面存在明显局限。为解决上述问题,提出了一种多尺度混合注意力网络(Multi-scale Hybrid Attention Network,MsHAN... 现有基于Transformer的方法在处理复杂遥感场景时表现不佳,容易出现伪影和细节丢失,特别是在局部信息捕捉和空间关系建模方面存在明显局限。为解决上述问题,提出了一种多尺度混合注意力网络(Multi-scale Hybrid Attention Network,MsHAN)。该网络设计了大核多尺度注意力机制(Large Kernel Multi-scale Attention Mechanism,LKMSA)、多尺度动态窗口空洞注意力模块(Multi-scale Dynamic Window Hole Attention Module,MSDWDA)和空间前馈模块(Spatial Feedforward Module,SFM),全面提升了遥感图像超分辨率重建的性能。LKMSA结合大核卷积和多尺度机制,显著提高了对长距离依赖的建模能力和细节恢复效果。MSDWDA通过动态窗口划分和多尺度空洞卷积,有效增强了局部细节捕捉和全局一致性,并抑制了伪影累积。SFM通过优化前馈网络(Feed-Forward Network,FFN)结构,提升空间信息的建模能力,同时降低了计算复杂度。在AID、UCMerced与NWPU-RESISC45数据集上,MsHAN与现有常用、最新超分辨率重建方法(如EDSR、RCAN、MAN等)进行对比实验,结果显示:在各项评价指标上均取得了优异的表现。以PSNR指标为例,MsHAN相较最新的MAN方法在AID、UCMerced数据集上分别提升了0.05 dB与0.11 dB。这些结果表明,所提方法在细节恢复和整体图像质量方面具有显著优势。 展开更多
关键词 遥感图像 超分辨率重建 混合注意力 多尺度特征提取融合 空间前馈 深度学习
在线阅读 下载PDF
基于实景三维数据的复杂场景空间位置判定
12
作者 张宇蓉 钱彬 《电子设计工程》 2026年第3期27-30,35,共5页
复杂场景中的干扰因素较多,传统空间位置判定方法难以避免干扰因素的影响。为获取精准的空间位置信息,提出基于实景三维数据的复杂场景空间位置判定方法。应用三维扫描技术获取复杂场景的实景三维数据(点云数据);采用高斯滤波算法与曲... 复杂场景中的干扰因素较多,传统空间位置判定方法难以避免干扰因素的影响。为获取精准的空间位置信息,提出基于实景三维数据的复杂场景空间位置判定方法。应用三维扫描技术获取复杂场景的实景三维数据(点云数据);采用高斯滤波算法与曲率采样方法对数据进行去噪与精简,提取点云数据特征,确定物体中心及其法向量;构造空间距离与空间夹角计算公式,实现对复杂场景空间位置的精准判定。实验结果表明,在复杂场景中,该方法获得的物体空间位置判定结果(空间距离、空间夹角)与实际空间位置数值高度吻合,完成全部判定任务仅需2 min。 展开更多
关键词 空间位置判定 实景三维数据 点云数据去噪 复杂场景 关键特征点提取
在线阅读 下载PDF
Spectral-spatial target detection based on data field modeling for hyperspectral data 被引量:4
13
作者 Da LIU Jianxun LI 《Chinese Journal of Aeronautics》 SCIE EI CAS CSCD 2018年第4期795-805,共11页
Target detection is always an important application in hyperspectral image processing field. In this paper, a spectral-spatial target detection algorithm for hyperspectral data is proposed.The spatial feature and spec... Target detection is always an important application in hyperspectral image processing field. In this paper, a spectral-spatial target detection algorithm for hyperspectral data is proposed.The spatial feature and spectral feature were unified based on the data filed theory and extracted by weighted manifold embedding. The novelties of the proposed method lie in two aspects. One is the way in which the spatial features and spectral features were fused as a new feature based on the data field theory, and the other is that local information was introduced to describe the decision boundary and explore the discriminative features for target detection. The extracted features based on data field modeling and manifold embedding techniques were considered for a target detection task.Three standard hyperspectral datasets were considered in the analysis. The effectiveness of the proposed target detection algorithm based on data field theory was proved by the higher detection rates with lower False Alarm Rates(FARs) with respect to those achieved by conventional hyperspectral target detectors. 展开更多
关键词 Data field modeling feature extraction Hyperspectral data Spectral-spatial Target detection
原文传递
Fault diagnosis of rolling bearing based on two-dimensional composite multi-scale ensemble Gramian dispersion entropy
14
作者 Wenqing Ding Jinde Zheng +3 位作者 Jianghong Li Haiyang Pan Jian Cheng Jinyu Tong 《Chinese Journal of Mechanical Engineering》 2026年第1期125-144,共20页
One-dimensional ensemble dispersion entropy(EDE1D)is an effective nonlinear dynamic analysis method for complexity measurement of time series.However,it is only restricted to assessing the complexity of one-di-mension... One-dimensional ensemble dispersion entropy(EDE1D)is an effective nonlinear dynamic analysis method for complexity measurement of time series.However,it is only restricted to assessing the complexity of one-di-mensional time series(TS1d)with the extracted complexity features only at a single scale.Aiming at these problems,a new nonlinear dynamic analysis method termed two-dimensional composite multi-scale ensemble Gramian dispersion entropy(CMEGDE_(2D))is proposed in this paper.First,the TS_(1D) is transformed into a two-dimensional image(I_(2D))by using Gramian angular fields(GAF)with more internal data structures and geometri features,which preserve the global characteristics and time dependence of vibration signals.Second,the I2D is analyzed at multiple scales through the composite coarse-graining method,which overcomes the limitation of a single scale and provides greater stability compared to traditional coarse-graining methods.Subsequently,a new fault diagnosis method of rolling bearing is proposed based on the proposed CMEGDE_(2D) for fault feature ex-traction and the chicken swarm algorithm optimized support vector machine(CsO-SvM)for fault pattern identification.The simulation signals and two data sets of rolling bearings are utilized to verify the effectiveness of the proposed fault diagnosis method.The results demonstrate that the proposed method has stronger dis-crimination ability,higher fault diagnosis accuracy and better stability than the other compared methods. 展开更多
关键词 Composite multi-scale ensemble Gramian dispersion entropy Dispersion entropy Fault diagnosis Rolling bearing feature extraction
在线阅读 下载PDF
多阶段渐进处理的图像去雨方法
15
作者 廉继红 王平 +1 位作者 李英 李云红 《西北大学学报(自然科学版)》 北大核心 2025年第2期297-308,共12页
针对现有图像去雨方法中存在雨纹去除不彻底、纹理信息丢失等问题,提出一种多阶段渐进式处理的图像去雨算法,可以同时将上下阶段的特征融合,使去雨算法的性能有很大的提高。该去雨网络模型由3个阶段构成。前2个阶段采用改进后的U-Net编... 针对现有图像去雨方法中存在雨纹去除不彻底、纹理信息丢失等问题,提出一种多阶段渐进式处理的图像去雨算法,可以同时将上下阶段的特征融合,使去雨算法的性能有很大的提高。该去雨网络模型由3个阶段构成。前2个阶段采用改进后的U-Net编码器解码器结构学习多尺度上下文特征信息,特征提取部分采用有效通道注意力机制(efficient channel attention network,ECANet),使网络模型参数变小,更加轻量级;第3阶段加入并行注意力机制(parallel attention subnetwork,PASNet),在学习上下文信息和空间细节特征的同时还能生成高分辨率特征,更好地保留图像的输出细节。此外,还引入监督注意力模块(supervised attention module,SAM)以加强特征学习。实验结果表明,在数据集Rain100H上PSNR达到29.37 dB,SSIM为0.88;在Test1200上PSNR达到32.50 dB,SSIM为0.93,验证了所提方法在图像去雨任务上的有效性。 展开更多
关键词 图像去雨 特征提取 监督注意力 并行注意力机制 空间细节
在线阅读 下载PDF
空间关系增强与异构特征融合相结合的道路信息提取方法
16
作者 曹云刚 杨鹏 +2 位作者 龚江波 朱高 沈星宇 《测绘学报》 北大核心 2025年第12期2219-2232,共14页
针对高原特殊环境下遥感影像中道路材质与背景混淆、结构细长易断裂等问题,本文提出了一种结合空间关系增强器(spatial relationship enhancer,SRE)和连通性约束损失(connectivity loss,Cnt_Loss)的改进道路提取模型SRENet。核心贡献包... 针对高原特殊环境下遥感影像中道路材质与背景混淆、结构细长易断裂等问题,本文提出了一种结合空间关系增强器(spatial relationship enhancer,SRE)和连通性约束损失(connectivity loss,Cnt_Loss)的改进道路提取模型SRENet。核心贡献包括:①设计空间关系增强器,通过关键点图卷积显式建模道路拓扑结构,显著提升弯曲与遮挡区域的连通性检测能力;②构建双分支架构并设计异构特征融合模块,实现语义特征与空间细节的互补增强,增强对材质与环境类似的低对比度道路的提取能力;③提出连通性约束损失函数,通过几何驱动优化抑制狭窄断裂区域的误分割。本文方法以双分支深度神经网络为基础,通过异构特征融合模块实现多尺度特征互补,并结合连通性约束损失函数Cnt_Loss对道路几何特征进行优化。研究表明:SRENet在JL1与DGRD数据集上的IoU分别达到0.7002和0.6604,较现有模型分别提升了0.0116和0.0252;在道路连接性优化方面表现突出,显著减少了在弯曲路段与行道树遮挡区域的断裂数量;提出的Cnt_Loss函数通过几何约束机制,有效解决了弱边界道路的漏检问题。 展开更多
关键词 青藏高原 道路提取 异构特征融合 空间关系增强 图卷积
在线阅读 下载PDF
基于CoordEF−YOLOv9t的煤矿井下人员行为识别
17
作者 潘红光 卫泽尘 +3 位作者 雷心宇 姚超修 蒋泽 张立斌 《工矿自动化》 北大核心 2025年第8期59-66,共8页
基于深度学习的人员行为识别方法在煤矿井下应用存在对多类别行为识别缺乏系统性分类架构、光线昏暗和低清晰度图像导致细节丢失、矿工姿态和视角差异引发特征形变等问题。提出一种煤矿井下人员行为识别模型CoordEF−YOLOv9t。该模型分... 基于深度学习的人员行为识别方法在煤矿井下应用存在对多类别行为识别缺乏系统性分类架构、光线昏暗和低清晰度图像导致细节丢失、矿工姿态和视角差异引发特征形变等问题。提出一种煤矿井下人员行为识别模型CoordEF−YOLOv9t。该模型分别从边缘细节与空间位置特征提取2个方面对YOLOv9t进行改进:YOLOv9t中RepNCSPELAN4模块的卷积操作在捕捉细微或模糊边缘时易导致细节模糊,针对该问题,设计了融合Sobel算子的边缘特征提取模块(EFEM),在RepNCSPELAN4模块中嵌入EFEM,增强主干网络与颈部网络对人体边缘细节的感知能力。传统卷积神经网络难以感知位置信息并充分学习人员位置与动作的空间特征,针对该问题,在颈部网络末端引入坐标卷积,提升模型对人员行为位置信息的感知能力。实验结果表明,CoordEF−YOLOv9t精确率P为73.4%,召回率R为73.7%,mAP@0.5为74.8%,mAP@0.5:0.95为61.1%,相较于YOLOv9t分别提升1.2%,3.2%,1.0%,2.1%;与RT−DETR,YOLOv11,YOLOv12等主流模型相比,CoordEF−YOLOv9t综合性能更优,能更精准地识别煤矿井下人员行为。 展开更多
关键词 井下人员行为识别 YOLOv9t 边缘特征提取 空间位置特征提取 SOBEL算子 坐标卷积
在线阅读 下载PDF
结合倒残差自注意力机制的遥感图像目标检测
18
作者 赵文清 赵振寰 巩佳潇 《智能系统学报》 北大核心 2025年第1期64-72,共9页
针对遥感图像目标检测存在背景信息干扰严重、待检测目标尺寸差异大等问题,提出一种结合倒残差自注意力机制的目标检测方法。首先,使用具有强特征提取能力的倒残差自注意力机制骨干网络充分提取目标特征,降低复杂背景信息的干扰;其次,... 针对遥感图像目标检测存在背景信息干扰严重、待检测目标尺寸差异大等问题,提出一种结合倒残差自注意力机制的目标检测方法。首先,使用具有强特征提取能力的倒残差自注意力机制骨干网络充分提取目标特征,降低复杂背景信息的干扰;其次,构造多尺度空间金字塔池化模块,提供多尺度感受野,增强捕捉不同尺寸目标的能力;最后,提出轻量级特征融合模块,对骨干网络提取的特征图进行融合,充分结合低层与高层特征,提高网络对不同尺寸目标的检测能力。与传统网络及其他改进目标检测算法进行对比,实验发现该方法的检测精度明显优于其他算法。此外,在DIOR数据集和RSOD数据集上设计消融实验,结果表明,该方法在DIOR数据集与RSOD数据集上的平均精度均值比YOLOv8算法分别提升4.6和4.2百分点,明显提升遥感图像目标检测的精度。 展开更多
关键词 遥感图像 目标检测 倒残差 自注意力机制 多尺度 空间金字塔 特征提取 特征融合
在线阅读 下载PDF
基于航拍图像的自适应感知目标检测网络 被引量:3
19
作者 袁玲玲 陈春梅 +2 位作者 朱天鑫 邓豪 刘桂华 《电子测量技术》 北大核心 2025年第2期57-65,共9页
由于无人机拍摄高度和角度的多样性,其图像往往呈现背景复杂且小目标居多的特征,这导致了相关检测算法性能较差。针对此问题,本文提出了一种基于自适应感知网络的航拍图像车辆检测方法,旨在从提高车辆特征显著度和改善特征信息损失两个... 由于无人机拍摄高度和角度的多样性,其图像往往呈现背景复杂且小目标居多的特征,这导致了相关检测算法性能较差。针对此问题,本文提出了一种基于自适应感知网络的航拍图像车辆检测方法,旨在从提高车辆特征显著度和改善特征信息损失两个方面来提升小目标的检测性能。首先,为了提取更高效的特征表征,提出了自适应感知特征提取模块,该模块通过捕捉长程依赖关系和更强的几何特征表示,能够自适应地对物体的形状进行建模。其次,为了减少下采样和连续池化造成的信息损失,设计了双分支空间感知下采样模块,该模块混合不同通道的特征图,以最大限度地保留小目标特征信息。然后,在特征融合网络中,引入了具有丰富空间信息的浅层特征图,以增强小目标的检测能力。最后,设计了新的动态回归损失函数DEIoU,该函数引入惩罚项来度量真实框与检测框之间横纵比的相关性,从而进一步提高网络的预测精度。在Visdrone数据集上的实验结果表明,所提方法的平均精度均值mAP达到了70%,推理速度达到了99.26 fps,实现了较好的速度与精度的平衡,并且所提方法在UCAS-AOD数据集上取得了最佳的检测精度,具有较强的泛化能力。 展开更多
关键词 无人机 目标检测 自适应感知特征提取 特征融合网络 双分支空间感知下采样
原文传递
基于高阶空间特征提取的无人机航拍小目标检测算法 被引量:4
20
作者 张轩宇 周思航 +1 位作者 黄健 王冬 《计算机工程与应用》 北大核心 2025年第12期210-221,共12页
针对视觉算法在检测航拍图像中密集小目标时容易受到目标重叠、遮挡等情况干扰的现象,提出了一种基于高阶空间特征(目标形状、位置等信息的高级表示)提取的Transformer检测头HSF-TPH(Transformer prediction head with high-order spati... 针对视觉算法在检测航拍图像中密集小目标时容易受到目标重叠、遮挡等情况干扰的现象,提出了一种基于高阶空间特征(目标形状、位置等信息的高级表示)提取的Transformer检测头HSF-TPH(Transformer prediction head with high-order spatial feature extraction)。所提检测头中将自注意力机制中的二阶交互扩展到三阶以生成高阶空间特征,提取更有区分度的空间关系,突出每一个小目标在空间上的语义信息。同时,为了缓解骨干网络过度下采样对小目标信息的压缩,设计了一种高分辨率特征图生成机制,增加头部网络的输入特征分辨率,以提升HSFTPH检测密集小目标的效果。设计了新的损失函数USIoU,降低算法位置偏差敏感性。在VisDrone2019数据集上开展实验证明,所提算法在面积最小、密度最高的人类目标的检测任务中实现了mAP50指标10个百分点以上的性能提升。 展开更多
关键词 无人机航拍 小目标检测 高阶空间特征提取 注意力机制 损失函数
在线阅读 下载PDF
上一页 1 2 14 下一页 到第
使用帮助 返回顶部