期刊文献+
共找到281篇文章
< 1 2 15 >
每页显示 20 50 100
DL-YOLO:AMulti-Scale Feature Fusion Detection Algorithm for Low-Light Environments
1
作者 Yuanmeng Chang Hongmei Liu 《Computers, Materials & Continua》 2026年第5期1901-1915,共15页
Driven by rapid advances in deep learning,object detection has been widely adopted across diverse application scenarios.However,in low-light conditions,critical visual cues of target objects are severely degraded,posi... Driven by rapid advances in deep learning,object detection has been widely adopted across diverse application scenarios.However,in low-light conditions,critical visual cues of target objects are severely degraded,posing a significant challenge for accurate low-light object detection.Existing methods struggle to preserve discriminative features while maintaining semantic consistency between low-light and normal-light images.For this purpose,this study proposes a DL-YOLO model specially tailored for low-light detection.To mitigate target feature attenuation introduced by repeated downsampling,we design aMulti-Scale FeatureConvolution(MSF-Conv)module that captures rich,multi-level details via multi-scale feature learning,thereby reducing model complexity and computational cost.For feature fusion,we integrated the C3k2-DWRmodule by embedding the Dilation-wise Residual(DWR)mechanism into the 2-core optimized Cross Stage Partial(C3)framework,achieving efficient feature integration.In addition,we replace conventional localization losses with WIoU(Weighted Intersection over Union),which dynamically adjusts gradient gain according to sample quality,thereby improving localization robustness and precision.Experiments on the ExDark dataset demonstrate that DL-YOLO delivers strong low-light detection performance.The relevant code is published at https://github.com/cym0997/DL-YOLO. 展开更多
关键词 multi-scale feature extraction object detection low-light environments ExDark dataset
在线阅读 下载PDF
Feature Extraction by Multi-Scale Principal Component Analysis and Classification in Spectral Domain 被引量:2
2
作者 Shengkun Xie Anna T. Lawnizak +1 位作者 Pietro Lio Sridhar Krishnan 《Engineering(科研)》 2013年第10期268-271,共4页
Feature extraction of signals plays an important role in classification problems because of data dimension reduction property and potential improvement of a classification accuracy rate. Principal component analysis (... Feature extraction of signals plays an important role in classification problems because of data dimension reduction property and potential improvement of a classification accuracy rate. Principal component analysis (PCA), wavelets transform or Fourier transform methods are often used for feature extraction. In this paper, we propose a multi-scale PCA, which combines discrete wavelet transform, and PCA for feature extraction of signals in both the spatial and temporal domains. Our study shows that the multi-scale PCA combined with the proposed new classification methods leads to high classification accuracy for the considered signals. 展开更多
关键词 multi-scale Principal Component Analysis Discrete WAVELET TRANSFORM feature extraction Signal CLASSIFICATION Empirical CLASSIFICATION
在线阅读 下载PDF
A multi-scale convolutional auto-encoder and its application in fault diagnosis of rolling bearings 被引量:12
3
作者 Ding Yunhao Jia Minping 《Journal of Southeast University(English Edition)》 EI CAS 2019年第4期417-423,共7页
Aiming at the difficulty of fault identification caused by manual extraction of fault features of rotating machinery,a one-dimensional multi-scale convolutional auto-encoder fault diagnosis model is proposed,based on ... Aiming at the difficulty of fault identification caused by manual extraction of fault features of rotating machinery,a one-dimensional multi-scale convolutional auto-encoder fault diagnosis model is proposed,based on the standard convolutional auto-encoder.In this model,the parallel convolutional and deconvolutional kernels of different scales are used to extract the features from the input signal and reconstruct the input signal;then the feature map extracted by multi-scale convolutional kernels is used as the input of the classifier;and finally the parameters of the whole model are fine-tuned using labeled data.Experiments on one set of simulation fault data and two sets of rolling bearing fault data are conducted to validate the proposed method.The results show that the model can achieve 99.75%,99.3%and 100%diagnostic accuracy,respectively.In addition,the diagnostic accuracy and reconstruction error of the one-dimensional multi-scale convolutional auto-encoder are compared with traditional machine learning,convolutional neural networks and a traditional convolutional auto-encoder.The final results show that the proposed model has a better recognition effect for rolling bearing fault data. 展开更多
关键词 fault diagnosis deep learning convolutional auto-encoder multi-scale convolutional kernel feature extraction
在线阅读 下载PDF
Multi-Scale Mixed Attention Tea Shoot Instance Segmentation Model 被引量:1
4
作者 Dongmei Chen Peipei Cao +5 位作者 Lijie Yan Huidong Chen Jia Lin Xin Li Lin Yuan Kaihua Wu 《Phyton-International Journal of Experimental Botany》 SCIE 2024年第2期261-275,共15页
Tea leaf picking is a crucial stage in tea production that directly influences the quality and value of the tea.Traditional tea-picking machines may compromise the quality of the tea leaves.High-quality teas are often... Tea leaf picking is a crucial stage in tea production that directly influences the quality and value of the tea.Traditional tea-picking machines may compromise the quality of the tea leaves.High-quality teas are often handpicked and need more delicate operations in intelligent picking machines.Compared with traditional image processing techniques,deep learning models have stronger feature extraction capabilities,and better generalization and are more suitable for practical tea shoot harvesting.However,current research mostly focuses on shoot detection and cannot directly accomplish end-to-end shoot segmentation tasks.We propose a tea shoot instance segmentation model based on multi-scale mixed attention(Mask2FusionNet)using a dataset from the tea garden in Hangzhou.We further analyzed the characteristics of the tea shoot dataset,where the proportion of small to medium-sized targets is 89.9%.Our algorithm is compared with several mainstream object segmentation algorithms,and the results demonstrate that our model achieves an accuracy of 82%in recognizing the tea shoots,showing a better performance compared to other models.Through ablation experiments,we found that ResNet50,PointRend strategy,and the Feature Pyramid Network(FPN)architecture can improve performance by 1.6%,1.4%,and 2.4%,respectively.These experiments demonstrated that our proposed multi-scale and point selection strategy optimizes the feature extraction capability for overlapping small targets.The results indicate that the proposed Mask2FusionNet model can perform the shoot segmentation in unstructured environments,realizing the individual distinction of tea shoots,and complete extraction of the shoot edge contours with a segmentation accuracy of 82.0%.The research results can provide algorithmic support for the segmentation and intelligent harvesting of premium tea shoots at different scales. 展开更多
关键词 Tea shoots attention mechanism multi-scale feature extraction instance segmentation deep learning
在线阅读 下载PDF
Gender-Specific Multi-Task Micro-Expression Recognition Using Pyramid CGBP-TOP Feature
5
作者 Chunlong Hu Jianjun Chen +3 位作者 Xin Zuo Haitao Zou Xing Deng Yucheng Shu 《Computer Modeling in Engineering & Sciences》 SCIE EI 2019年第3期547-559,共13页
Micro-expression recognition has attracted growing research interests in the field of compute vision.However,micro-expression usually lasts a few seconds,thus it is difficult to detect.This paper presents a new framew... Micro-expression recognition has attracted growing research interests in the field of compute vision.However,micro-expression usually lasts a few seconds,thus it is difficult to detect.This paper presents a new framework to recognize micro-expression using pyramid histogram of Centralized Gabor Binary Pattern from Three Orthogonal Panels(CGBP-TOP)which is an extension of Local Gabor Binary Pattern from Three Orthogonal Panels feature.CGBP-TOP performs spatial and temporal analysis to capture the local facial characteristics of micro-expression image sequences.In order to keep more local information of the face,CGBP-TOP is extracted based on pyramid subregions of the micro-expression video frame.The combination of CGBP-TOP and spatial pyramid can represent well and truly the facial movements of the micro-expression image sequences.However,the dimension of our pyramid CGBP-TOP tends to be very high,which may lead to high data redundancy problem.In addition,it is clear that people of different genders usually have different ways of micro-expression.Therefore,in this paper,in order to select the relevant features of micro-expression,the gender-specific sparse multi-task learning method with adaptive regularization term is adopted to learn a compact subset of pyramid CGBP-TOP feature for micro-expression classification of different sexes.Finally,extensive experiments on widely used CASME II and SMIC databases demonstrate that our method can efficiently extract micro-expression motion features in the micro-expression video clip.Moreover,our proposed approach achieves comparable results with the state-of-the-art methods. 展开更多
关键词 Micro-expression recognition feature extraction spatial PYRAMID MULTI-TASK learning REGULARIZATION
在线阅读 下载PDF
Ship recognition based on HRRP via multi-scale sparse preserving method
6
作者 YANG Xueling ZHANG Gong SONG Hu 《Journal of Systems Engineering and Electronics》 SCIE CSCD 2024年第3期599-608,共10页
In order to extract the richer feature information of ship targets from sea clutter, and address the high dimensional data problem, a method termed as multi-scale fusion kernel sparse preserving projection(MSFKSPP) ba... In order to extract the richer feature information of ship targets from sea clutter, and address the high dimensional data problem, a method termed as multi-scale fusion kernel sparse preserving projection(MSFKSPP) based on the maximum margin criterion(MMC) is proposed for recognizing the class of ship targets utilizing the high-resolution range profile(HRRP). Multi-scale fusion is introduced to capture the local and detailed information in small-scale features, and the global and contour information in large-scale features, offering help to extract the edge information from sea clutter and further improving the target recognition accuracy. The proposed method can maximally preserve the multi-scale fusion sparse of data and maximize the class separability in the reduced dimensionality by reproducing kernel Hilbert space. Experimental results on the measured radar data show that the proposed method can effectively extract the features of ship target from sea clutter, further reduce the feature dimensionality, and improve target recognition performance. 展开更多
关键词 ship target recognition high-resolution range profile(HRRP) multi-scale fusion kernel sparse preserving projection(MSFKSPP) feature extraction dimensionality reduction
在线阅读 下载PDF
RealFuVSR:Feature enhanced real-world video super-resolution
7
作者 Zhi LI Xiongwen PANG +1 位作者 Yiyue JIANG Yujie WANG 《Virtual Reality & Intelligent Hardware》 EI 2023年第6期523-537,共15页
Background Recurrent recovery is a common method for video super-resolution(VSR)that models the correlation between frames via hidden states.However,the application of this structure in real-world scenarios can lead t... Background Recurrent recovery is a common method for video super-resolution(VSR)that models the correlation between frames via hidden states.However,the application of this structure in real-world scenarios can lead to unsatisfactory artifacts.We found that in real-world VSR training,the use of unknown and complex degradation can better simulate the degradation process in the real world.Methods Based on this,we propose the RealFuVSR model,which simulates real-world degradation and mitigates artifacts caused by the VSR.Specifically,we propose a multiscale feature extraction module(MSF)module that extracts and fuses features from multiple scales,thereby facilitating the elimination of hidden state artifacts.To improve the accuracy of the hidden state alignment information,RealFuVSR uses an advanced optical flow-guided deformable convolution.Moreover,a cascaded residual upsampling module was used to eliminate noise caused by the upsampling process.Results The experiment demonstrates that RealFuVSR model can not only recover high-quality videos but also outperforms the state-of-the-art RealBasicVSR and RealESRGAN models. 展开更多
关键词 Video super-resolution Deformable convolution Cascade residual upsampling Second-order degradation multi-scale feature extraction
在线阅读 下载PDF
Underwater Image Enhancement Based on Multi-scale Adversarial Network
8
作者 ZENG Jun-yang SI Zhan-jun 《印刷与数字媒体技术研究》 CAS 北大核心 2024年第5期70-77,共8页
In this study,an underwater image enhancement method based on multi-scale adversarial network was proposed to solve the problem of detail blur and color distortion in underwater images.Firstly,the local features of ea... In this study,an underwater image enhancement method based on multi-scale adversarial network was proposed to solve the problem of detail blur and color distortion in underwater images.Firstly,the local features of each layer were enhanced into the global features by the proposed residual dense block,which ensured that the generated images retain more details.Secondly,a multi-scale structure was adopted to extract multi-scale semantic features of the original images.Finally,the features obtained from the dual channels were fused by an adaptive fusion module to further optimize the features.The discriminant network adopted the structure of the Markov discriminator.In addition,by constructing mean square error,structural similarity,and perceived color loss function,the generated image is consistent with the reference image in structure,color,and content.The experimental results showed that the enhanced underwater image deblurring effect of the proposed algorithm was good and the problem of underwater image color bias was effectively improved.In both subjective and objective evaluation indexes,the experimental results of the proposed algorithm are better than those of the comparison algorithm. 展开更多
关键词 Underwater image enhancement Generative adversarial network multi-scale feature extraction Residual dense block
在线阅读 下载PDF
基于多尺度编码器融合的三维人体姿态估计算法
9
作者 包晓安 陈恩琳 +3 位作者 张娜 涂小妹 吴彪 张庆琪 《浙江大学学报(工学版)》 北大核心 2026年第3期565-573,584,共10页
针对冗余信息干扰与信息完整性需求之间的矛盾,提出基于多尺度编码器融合的三维人体姿态估计方法.该方法由关键帧时空编码器(KFSTE)和全局保留自注意力编码器(GRSAE)构成.KFSTE通过关键帧选择器对骨架特征序列进行筛选后,由时间编码器... 针对冗余信息干扰与信息完整性需求之间的矛盾,提出基于多尺度编码器融合的三维人体姿态估计方法.该方法由关键帧时空编码器(KFSTE)和全局保留自注意力编码器(GRSAE)构成.KFSTE通过关键帧选择器对骨架特征序列进行筛选后,由时间编码器获取局部时空建模.GRSAE通过保留编码器进行全局单阶段编码来获取全局骨架序列特征,避免因关键帧筛选偏差导致的信息损失.通过对双编码器的特征拼接及回归处理,预测得到三维人体姿态坐标.实验结果表明,在较大规模的Human3.6M数据集上,所提方法的平均关节位置误差(MPJPE)比MixSTE低3%,有11个动作获得最佳. 展开更多
关键词 三维人体姿态估计 时空编码器 关键帧提取 保留自注意力编码 多编码特征融合
在线阅读 下载PDF
差分RCSP运动想象脑电特征提取算法
10
作者 陈东毅 陈建国 《计算机仿真》 2026年第1期347-353,共7页
为了提高运动想象脑电信号的识别精度,提出DRCSP和ULDA对脑电信号的特征提取和分类进行优化。脑电信号通过改进EMD滤波后进行信号重构,DRCSP特征是在得到最大化类与类之间距离的空间投影矩阵后对投影后的新信号进行差分和归一化处理,再... 为了提高运动想象脑电信号的识别精度,提出DRCSP和ULDA对脑电信号的特征提取和分类进行优化。脑电信号通过改进EMD滤波后进行信号重构,DRCSP特征是在得到最大化类与类之间距离的空间投影矩阵后对投影后的新信号进行差分和归一化处理,再通过ULDA将特征投影到类间距离最大的低维空间而得到。分别在实验数据集上验证运动想象脑电信号的动作识别正确率,DRCSP特征的识别正确率均高于RCSP特征,相比CSP及其衍生算法具有更大的类间距离,平均识别正确率提高10%左右。相比较于其它研究中所提及算法的平均识别正确率提高了近15%,系统运行的平均损耗时间相比降低了近20%,DRCSP特征还具有良好的鲁棒性并且性能不依赖于分类器选型。 展开更多
关键词 脑电信号 共空间模式 特征提取 动作分类
暂未订购
基于关键点特征提取与融合的人体姿态检测模型
11
作者 刘启恒 胡永祥 +1 位作者 潘长宁 韩龙志 《现代电子技术》 北大核心 2026年第7期31-39,共9页
针对不同场景下人体关键点特征提取能力和融合能力不足的问题,文中提出一种改进的YOLOv11n-Pose模型。该模型使用PKI Block替换原有的Bottleneck结构,以增强关键点特征的提取能力。同时,设计多膨胀率的空间金字塔卷积模块,以提高特征提... 针对不同场景下人体关键点特征提取能力和融合能力不足的问题,文中提出一种改进的YOLOv11n-Pose模型。该模型使用PKI Block替换原有的Bottleneck结构,以增强关键点特征的提取能力。同时,设计多膨胀率的空间金字塔卷积模块,以提高特征提取时的灵活性和表达能力。最后,引入CAF Block网络,进一步提升多尺度特征的融合效果。实验结果表明,所提算法在COCO2017数据集上的精确率、召回率、mAP@0.5、mAP@0.5:0.9相较于原模型分别提高了3.1%、2.9%、3.5%和1.2%。在实际推理中关键点位置估计误差和漏检情况显著减少,具有较好的应用价值。 展开更多
关键词 姿态检测 注意力机制 特征提取 特征融合 空间金字塔卷积 CAF Block网络
在线阅读 下载PDF
基于时空特征提取及深度集成网络的交通韧性预测
12
作者 夏溪蔓 孟学雷 +2 位作者 王莉 林立 韩正 《铁道科学与工程学报》 北大核心 2026年第1期63-76,共14页
目前,多制式交通跨线运营已成为提升城市交通网络性能的关键途径。精确量化多制式轨道交通网络的韧性,对于优化交通资源配置、增强交通系统抗风险能力具有重要意义。针对多制式交通跨线运营模式下时空特征关联性日渐复杂及多源异构数据... 目前,多制式交通跨线运营已成为提升城市交通网络性能的关键途径。精确量化多制式轨道交通网络的韧性,对于优化交通资源配置、增强交通系统抗风险能力具有重要意义。针对多制式交通跨线运营模式下时空特征关联性日渐复杂及多源异构数据处理困难的问题,提出一种基于时空特征提取及深度集成网络的交通韧性预测模型(spatial-temporal feature extraction and deep integrated network,STFEDIN)。该模型构建了时空特征融合网络(spatial-temporal feature fusion network,STNet),通过多尺度卷积与跨时间门控机制的协同实现对交通数据中非线性特征、时序依赖关系及空间异构性特征的有效提取。针对传统Transformer框架在时空特征建模中存在的长距离依赖捕获效率不足及空间结构信息利用不充分问题,引入混合头注意力机制(mixture-of-head,MoH)替代传统Transformer预测模型中的注意力结构,MoH模型可以通过动态路由策略实现注意力头间的协同优化,有效增强模型对多维度时空关联特征的动态解析能力与复杂场景适应性。以某城市的市域铁路与城市轨道交通系统跨线运营为例,验证模型的预测性能。实验结果表明,STFEDIN模型相对于传统的数理统计模型或单一机器学习模型有较好的预测性能,相较于时空演化建模图神经网络(spatial-temporal evolution modeling graph neural network,StemGNN)模型,平均绝对误差f_(mae)下降了0.01,均方根误差f_(rmse)下降了0.012,平均绝对百分比误差f_(mape)下降了1.701,决定系数f_(r2)上升了2.27%;与卷积长短时记忆网络(convolutional long short-term memory,ConvLSTM)模型相比,f_(mae)下降了0.045,f_(rmse)下降了0.057,f_(mape)下降了7.845,f_(r2)上升了26.60%。消融实验进一步证明了STFEDIN模型结构的合理性。研究成果为多制式交通跨线运营场景下的网络韧性评估提供了有效的解决途径。 展开更多
关键词 交通韧性 多制式交通跨线运营 时空特征提取 混合头注意力机制 TRANSFORMER
在线阅读 下载PDF
融合多重卷积和Dense Transformer的高光谱图像分类
13
作者 魏林 杨霄 尹玉萍 《红外技术》 北大核心 2026年第2期193-203,共11页
高光谱图像蕴含丰富的光谱空间信息。如何充分挖掘空谱信息进行分类,是一个关键的研究问题。在处理高光谱图像分类时,卷积擅长提取局部特征,Transformer能够捕获长距离特征依赖性,学习全局特征信息。针对卷积和Transformer的优势,提出... 高光谱图像蕴含丰富的光谱空间信息。如何充分挖掘空谱信息进行分类,是一个关键的研究问题。在处理高光谱图像分类时,卷积擅长提取局部特征,Transformer能够捕获长距离特征依赖性,学习全局特征信息。针对卷积和Transformer的优势,提出了一种结合三维卷积、空间通道重建卷积和Transformer的高光谱图像分类方法。首先将降维后的图像块,利用三维卷积进行综合的空谱特征提取;随后用空间通道重建卷积过滤冗余信息;最后用具有密集连接的Transformer对卷积提取的空谱特征建立长距离依赖关系,并使用多层感知机进行分类。实验表明,该方法在Pavia University、Salinas和Botswana数据集上总体分类精度分别为99.51%、99.85%、97.57%,均表现优异。 展开更多
关键词 高光谱图像 特征提取 三维卷积 空间通道重建卷积 TRANSFORMER
在线阅读 下载PDF
融合Transformer与BiLSTM的野外动态面部表情识别方法
14
作者 郭岱朋 徐飞 Nouman Hameed 《西安工业大学学报》 2026年第1期121-130,共10页
针对动态面部表情识别中时空特征提取与建模不足的问题,提出了一种结合Transformer与BiLSTM的动态面部表情识别方法。该方法通过Transformer进行空间特征提取,并利用BiLSTM对时序信息进行建模,从而提高动态面部表情的识别精度。实验结... 针对动态面部表情识别中时空特征提取与建模不足的问题,提出了一种结合Transformer与BiLSTM的动态面部表情识别方法。该方法通过Transformer进行空间特征提取,并利用BiLSTM对时序信息进行建模,从而提高动态面部表情的识别精度。实验结果表明,在DFEW数据集上,未加权平均召回率和加权平均召回率较现有方法分别提高了4.14%和2.52%;在FERV39k数据集上,提高了1.64%和1.80%。实验验证了该方法在动态面部表情识别中的有效性。 展开更多
关键词 动态面部表情识别 特征提取 空间特征 时序信息
在线阅读 下载PDF
基于多尺度混合注意力的遥感图像超分辨率重建
15
作者 邓峰良 钱育蓉 +3 位作者 范迎迎 白璐 王元旭 孔维泉 《微电子学与计算机》 2026年第3期98-110,共13页
现有基于Transformer的方法在处理复杂遥感场景时表现不佳,容易出现伪影和细节丢失,特别是在局部信息捕捉和空间关系建模方面存在明显局限。为解决上述问题,提出了一种多尺度混合注意力网络(Multi-scale Hybrid Attention Network,MsHAN... 现有基于Transformer的方法在处理复杂遥感场景时表现不佳,容易出现伪影和细节丢失,特别是在局部信息捕捉和空间关系建模方面存在明显局限。为解决上述问题,提出了一种多尺度混合注意力网络(Multi-scale Hybrid Attention Network,MsHAN)。该网络设计了大核多尺度注意力机制(Large Kernel Multi-scale Attention Mechanism,LKMSA)、多尺度动态窗口空洞注意力模块(Multi-scale Dynamic Window Hole Attention Module,MSDWDA)和空间前馈模块(Spatial Feedforward Module,SFM),全面提升了遥感图像超分辨率重建的性能。LKMSA结合大核卷积和多尺度机制,显著提高了对长距离依赖的建模能力和细节恢复效果。MSDWDA通过动态窗口划分和多尺度空洞卷积,有效增强了局部细节捕捉和全局一致性,并抑制了伪影累积。SFM通过优化前馈网络(Feed-Forward Network,FFN)结构,提升空间信息的建模能力,同时降低了计算复杂度。在AID、UCMerced与NWPU-RESISC45数据集上,MsHAN与现有常用、最新超分辨率重建方法(如EDSR、RCAN、MAN等)进行对比实验,结果显示:在各项评价指标上均取得了优异的表现。以PSNR指标为例,MsHAN相较最新的MAN方法在AID、UCMerced数据集上分别提升了0.05 dB与0.11 dB。这些结果表明,所提方法在细节恢复和整体图像质量方面具有显著优势。 展开更多
关键词 遥感图像 超分辨率重建 混合注意力 多尺度特征提取融合 空间前馈 深度学习
在线阅读 下载PDF
MSC-Deep LabV3+:A Segmentation Model for Slender Fabric Roll Seam Detection
16
作者 Weimin Shi Kuntao Lv +1 位作者 Chang Xuan Ji Wu 《Computers, Materials & Continua》 2026年第5期480-498,共19页
The application of deep learning in fabric defect detection has become increasingly widespread.To address false positives and false negatives in fabric roll seam detection,and to improve automation efficiency and prod... The application of deep learning in fabric defect detection has become increasingly widespread.To address false positives and false negatives in fabric roll seam detection,and to improve automation efficiency and product quality,we propose the Multi-scale Context DeepLabV3+(MSC-DeepLabV3+),a semantic segmentation network designed for fabric roll seam detection,based on DeepLabV3+.The model improvements include enhancing the backbone performance through optimization of the UIB-MobileNetV2 network;designing the Dynamic Atrous and Sliding-window Fusion(DASF)module to improve adaptability to multi-scale seam structures with dynamic dilation rates and a sliding-window mechanism;and utilizing the Progressive Low-level Feature Fusion(PLFF)module to progressively restore seam boundary details via shallow feature fusion.Additionally,an enhanced 3-SE attention mechanism is employed,replacing the direct concatenation operation.Experimental results show thatMSCDeepLabV3+outperforms classical and recent segmentation models.Compared to DeepLabV3+with an Xception backbone,MSC-DeepLabV3+achieves a mean intersection over union(mIoU)of 92.30%and the boundary Fscore(BF)of 92.54%,representing improvements of 3.04%and 3.14%,respectively.Moreover,the model complexity is significantly reduced,with the model parameters(params)decreasing to 3.44M and Frames Per Second(FPS)increasing from 101 to 273,demonstrating its potential for deployment in resource-constrained industrial scenarios. 展开更多
关键词 Fabric roll seam detection semantic segmentation deep learning lightweight network multi-scale feature extraction improved attention mechanism
在线阅读 下载PDF
基于实景三维数据的复杂场景空间位置判定
17
作者 张宇蓉 钱彬 《电子设计工程》 2026年第3期27-30,35,共5页
复杂场景中的干扰因素较多,传统空间位置判定方法难以避免干扰因素的影响。为获取精准的空间位置信息,提出基于实景三维数据的复杂场景空间位置判定方法。应用三维扫描技术获取复杂场景的实景三维数据(点云数据);采用高斯滤波算法与曲... 复杂场景中的干扰因素较多,传统空间位置判定方法难以避免干扰因素的影响。为获取精准的空间位置信息,提出基于实景三维数据的复杂场景空间位置判定方法。应用三维扫描技术获取复杂场景的实景三维数据(点云数据);采用高斯滤波算法与曲率采样方法对数据进行去噪与精简,提取点云数据特征,确定物体中心及其法向量;构造空间距离与空间夹角计算公式,实现对复杂场景空间位置的精准判定。实验结果表明,在复杂场景中,该方法获得的物体空间位置判定结果(空间距离、空间夹角)与实际空间位置数值高度吻合,完成全部判定任务仅需2 min。 展开更多
关键词 空间位置判定 实景三维数据 点云数据去噪 复杂场景 关键特征点提取
在线阅读 下载PDF
Spectral-spatial target detection based on data field modeling for hyperspectral data 被引量:4
18
作者 Da LIU Jianxun LI 《Chinese Journal of Aeronautics》 SCIE EI CAS CSCD 2018年第4期795-805,共11页
Target detection is always an important application in hyperspectral image processing field. In this paper, a spectral-spatial target detection algorithm for hyperspectral data is proposed.The spatial feature and spec... Target detection is always an important application in hyperspectral image processing field. In this paper, a spectral-spatial target detection algorithm for hyperspectral data is proposed.The spatial feature and spectral feature were unified based on the data filed theory and extracted by weighted manifold embedding. The novelties of the proposed method lie in two aspects. One is the way in which the spatial features and spectral features were fused as a new feature based on the data field theory, and the other is that local information was introduced to describe the decision boundary and explore the discriminative features for target detection. The extracted features based on data field modeling and manifold embedding techniques were considered for a target detection task.Three standard hyperspectral datasets were considered in the analysis. The effectiveness of the proposed target detection algorithm based on data field theory was proved by the higher detection rates with lower False Alarm Rates(FARs) with respect to those achieved by conventional hyperspectral target detectors. 展开更多
关键词 Data field modeling feature extraction Hyperspectral data Spectral-spatial Target detection
原文传递
Fault diagnosis of rolling bearing based on two-dimensional composite multi-scale ensemble Gramian dispersion entropy
19
作者 Wenqing Ding Jinde Zheng +3 位作者 Jianghong Li Haiyang Pan Jian Cheng Jinyu Tong 《Chinese Journal of Mechanical Engineering》 2026年第1期125-144,共20页
One-dimensional ensemble dispersion entropy(EDE1D)is an effective nonlinear dynamic analysis method for complexity measurement of time series.However,it is only restricted to assessing the complexity of one-di-mension... One-dimensional ensemble dispersion entropy(EDE1D)is an effective nonlinear dynamic analysis method for complexity measurement of time series.However,it is only restricted to assessing the complexity of one-di-mensional time series(TS1d)with the extracted complexity features only at a single scale.Aiming at these problems,a new nonlinear dynamic analysis method termed two-dimensional composite multi-scale ensemble Gramian dispersion entropy(CMEGDE_(2D))is proposed in this paper.First,the TS_(1D) is transformed into a two-dimensional image(I_(2D))by using Gramian angular fields(GAF)with more internal data structures and geometri features,which preserve the global characteristics and time dependence of vibration signals.Second,the I2D is analyzed at multiple scales through the composite coarse-graining method,which overcomes the limitation of a single scale and provides greater stability compared to traditional coarse-graining methods.Subsequently,a new fault diagnosis method of rolling bearing is proposed based on the proposed CMEGDE_(2D) for fault feature ex-traction and the chicken swarm algorithm optimized support vector machine(CsO-SvM)for fault pattern identification.The simulation signals and two data sets of rolling bearings are utilized to verify the effectiveness of the proposed fault diagnosis method.The results demonstrate that the proposed method has stronger dis-crimination ability,higher fault diagnosis accuracy and better stability than the other compared methods. 展开更多
关键词 Composite multi-scale ensemble Gramian dispersion entropy Dispersion entropy Fault diagnosis Rolling bearing feature extraction
在线阅读 下载PDF
多阶段渐进处理的图像去雨方法
20
作者 廉继红 王平 +1 位作者 李英 李云红 《西北大学学报(自然科学版)》 北大核心 2025年第2期297-308,共12页
针对现有图像去雨方法中存在雨纹去除不彻底、纹理信息丢失等问题,提出一种多阶段渐进式处理的图像去雨算法,可以同时将上下阶段的特征融合,使去雨算法的性能有很大的提高。该去雨网络模型由3个阶段构成。前2个阶段采用改进后的U-Net编... 针对现有图像去雨方法中存在雨纹去除不彻底、纹理信息丢失等问题,提出一种多阶段渐进式处理的图像去雨算法,可以同时将上下阶段的特征融合,使去雨算法的性能有很大的提高。该去雨网络模型由3个阶段构成。前2个阶段采用改进后的U-Net编码器解码器结构学习多尺度上下文特征信息,特征提取部分采用有效通道注意力机制(efficient channel attention network,ECANet),使网络模型参数变小,更加轻量级;第3阶段加入并行注意力机制(parallel attention subnetwork,PASNet),在学习上下文信息和空间细节特征的同时还能生成高分辨率特征,更好地保留图像的输出细节。此外,还引入监督注意力模块(supervised attention module,SAM)以加强特征学习。实验结果表明,在数据集Rain100H上PSNR达到29.37 dB,SSIM为0.88;在Test1200上PSNR达到32.50 dB,SSIM为0.93,验证了所提方法在图像去雨任务上的有效性。 展开更多
关键词 图像去雨 特征提取 监督注意力 并行注意力机制 空间细节
在线阅读 下载PDF
上一页 1 2 15 下一页 到第
使用帮助 返回顶部