期刊文献+
共找到287篇文章
< 1 2 15 >
每页显示 20 50 100
SEFormer:A Lightweight CNN-Transformer Based on Separable Multiscale Depthwise Convolution and Efficient Self-Attention for Rotating Machinery Fault Diagnosis 被引量:3
1
作者 Hongxing Wang Xilai Ju +1 位作者 Hua Zhu Huafeng Li 《Computers, Materials & Continua》 SCIE EI 2025年第1期1417-1437,共21页
Traditional data-driven fault diagnosis methods depend on expert experience to manually extract effective fault features of signals,which has certain limitations.Conversely,deep learning techniques have gained promine... Traditional data-driven fault diagnosis methods depend on expert experience to manually extract effective fault features of signals,which has certain limitations.Conversely,deep learning techniques have gained prominence as a central focus of research in the field of fault diagnosis by strong fault feature extraction ability and end-to-end fault diagnosis efficiency.Recently,utilizing the respective advantages of convolution neural network(CNN)and Transformer in local and global feature extraction,research on cooperating the two have demonstrated promise in the field of fault diagnosis.However,the cross-channel convolution mechanism in CNN and the self-attention calculations in Transformer contribute to excessive complexity in the cooperative model.This complexity results in high computational costs and limited industrial applicability.To tackle the above challenges,this paper proposes a lightweight CNN-Transformer named as SEFormer for rotating machinery fault diagnosis.First,a separable multiscale depthwise convolution block is designed to extract and integrate multiscale feature information from different channel dimensions of vibration signals.Then,an efficient self-attention block is developed to capture critical fine-grained features of the signal from a global perspective.Finally,experimental results on the planetary gearbox dataset and themotor roller bearing dataset prove that the proposed framework can balance the advantages of robustness,generalization and lightweight compared to recent state-of-the-art fault diagnosis models based on CNN and Transformer.This study presents a feasible strategy for developing a lightweight rotating machinery fault diagnosis framework aimed at economical deployment. 展开更多
关键词 CNN-Transformer separable multiscale depthwise convolution efficient self-attention fault diagnosis
在线阅读 下载PDF
A multiscale adaptive framework based on convolutional neural network:Application to fluid catalytic cracking product yield prediction 被引量:3
2
作者 Nan Liu Chun-Meng Zhu +1 位作者 Meng-Xuan Zhang Xing-Ying Lan 《Petroleum Science》 SCIE EI CAS CSCD 2024年第4期2849-2869,共21页
Since chemical processes are highly non-linear and multiscale,it is vital to deeply mine the multiscale coupling relationships embedded in the massive process data for the prediction and anomaly tracing of crucial pro... Since chemical processes are highly non-linear and multiscale,it is vital to deeply mine the multiscale coupling relationships embedded in the massive process data for the prediction and anomaly tracing of crucial process parameters and production indicators.While the integrated method of adaptive signal decomposition combined with time series models could effectively predict process variables,it does have limitations in capturing the high-frequency detail of the operation state when applied to complex chemical processes.In light of this,a novel Multiscale Multi-radius Multi-step Convolutional Neural Network(Msrt Net)is proposed for mining spatiotemporal multiscale information.First,the industrial data from the Fluid Catalytic Cracking(FCC)process decomposition using Complete Ensemble Empirical Mode Decomposition with Adaptive Noise(CEEMDAN)extract the multi-energy scale information of the feature subset.Then,convolution kernels with varying stride and padding structures are established to decouple the long-period operation process information encapsulated within the multi-energy scale data.Finally,a reconciliation network is trained to reconstruct the multiscale prediction results and obtain the final output.Msrt Net is initially assessed for its capability to untangle the spatiotemporal multiscale relationships among variables in the Tennessee Eastman Process(TEP).Subsequently,the performance of Msrt Net is evaluated in predicting product yield for a 2.80×10^(6) t/a FCC unit,taking diesel and gasoline yield as examples.In conclusion,Msrt Net can decouple and effectively extract spatiotemporal multiscale information from chemical process data and achieve a approximately reduction of 30%in prediction error compared to other time-series models.Furthermore,its robustness and transferability underscore its promising potential for broader applications. 展开更多
关键词 Fluid catalytic cracking Product yield Data-driven modeling multiscale prediction Data decomposition convolution neural network
原文传递
Establishing and validating a spotted tongue recognition and extraction model based on multiscale convolutional neural network 被引量:10
3
作者 PENG Chengdong WANG Li +3 位作者 JIANG Dongmei YANG Nuo CHEN Renming DONG Changwu 《Digital Chinese Medicine》 2022年第1期49-58,共10页
Objective In tongue diagnosis,the location,color,and distribution of spots can be used to speculate on the viscera and severity of the heat evil.This work focuses on the image analysis method of artificial intelligenc... Objective In tongue diagnosis,the location,color,and distribution of spots can be used to speculate on the viscera and severity of the heat evil.This work focuses on the image analysis method of artificial intelligence(AI)to study the spotted tongue recognition of traditional Chinese medicine(TCM).Methods A model of spotted tongue recognition and extraction is designed,which is based on the principle of image deep learning and instance segmentation.This model includes multiscale feature map generation,region proposal searching,and target region recognition.Firstly,deep convolution network is used to build multiscale low-and high-abstraction feature maps after which,target candidate box generation algorithm and selection strategy are used to select high-quality target candidate regions.Finally,classification network is used for classifying target regions and calculating target region pixels.As a result,the region segmentation of spotted tongue is obtained.Under non-standard illumination conditions,various tongue images were taken by mobile phones,and experiments were conducted.Results The spotted tongue recognition achieved an area under curve(AUC)of 92.40%,an accuracy of 84.30%with a sensitivity of 88.20%,a specificity of 94.19%,a recall of 88.20%,a regional pixel accuracy index pixel accuracy(PA)of 73.00%,a mean pixel accuracy(m PA)of73.00%,an intersection over union(Io U)of 60.00%,and a mean intersection over union(mIo U)of 56.00%.Conclusion The results of the study verify that the model is suitable for the application of the TCM tongue diagnosis system.Spotted tongue recognition via multiscale convolutional neural network(CNN)would help to improve spot classification and the accurate extraction of pixels of spot area as well as provide a practical method for intelligent tongue diagnosis of TCM. 展开更多
关键词 Spotted tongue recognition and extraction The feature of tongue Instance segmentation multiscale convolutional neural network(CNN) Tongue diagnosis system Artificial intelligence(AI)
在线阅读 下载PDF
Research on traffic flow prediction with multiscale temporal awareness and graph diffusion attention networks
4
作者 CAO Jie ZHANG Pengcheng +2 位作者 ZHANG Hong HOU Liang CHEN Zuohan 《High Technology Letters》 2025年第4期383-396,共14页
Precise traffic flow forecasting is essential for mitigating urban traffic congestion.However,it is difficult for existing methods to adequately capture the dynamic spatio-temporal characteristics and multiscale tempo... Precise traffic flow forecasting is essential for mitigating urban traffic congestion.However,it is difficult for existing methods to adequately capture the dynamic spatio-temporal characteristics and multiscale temporal dependencies of traffic flow.A traffic flow prediction model with multiscale temporal awareness and graph diffusion attention networks(MT-GDAN)is proposed to address these issues.Specifically,a graph diffusion attention module is constructed,which dynamically adjusts and calculates the weights of neighboring nodes in the graph structure using a random graph attention network(GAT)and captures the spatial characteristics of hidden nodes through an adaptive adjacency matrix,thus better exploiting the dynamic spatio-temporal properties of traffic flow.Secondly,a multiscale isometric convolutional network and bi-level routing attention are used to construct a multiscale temporal awareness module.The former extracts local information of traffic flow segments by convolution with different sizes of convolution kernels and then introduces isometric convolution to obtain the global temporal relationship between local features of traffic flow segments;the latter filters irrelevant spatio-temporal features at a coarse regional level and focuses locally on key points to more accurately capture the multiscale temporal dependencies of traffic flows.Experimental results reveal that the MT-GDAN model surpasses the mainstream baseline model in terms of forecasting accuracy and exhibits good prediction performance. 展开更多
关键词 intelligent transportation traffic flow prediction graph attention network multiscale isometric convolution bi-level routing attention
在线阅读 下载PDF
Convolutional Neural Network Based on Spatial Pyramid for Image Classification 被引量:2
5
作者 Gaihua Wang Meng Lu +2 位作者 Tao Li Guoliang Yuan Wenzhou Liu 《Journal of Beijing Institute of Technology》 EI CAS 2018年第4期630-636,共7页
A novel convolutional neural network based on spatial pyramid for image classification is proposed.The network exploits image features with spatial pyramid representation.First,it extracts global features from an orig... A novel convolutional neural network based on spatial pyramid for image classification is proposed.The network exploits image features with spatial pyramid representation.First,it extracts global features from an original image,and then different layers of grids are utilized to extract feature maps from different convolutional layers.Inspired by the spatial pyramid,the new network contains two parts,one of which is just like a standard convolutional neural network,composing of alternating convolutions and subsampling layers.But those convolution layers would be averagely pooled by the grid way to obtain feature maps,and then concatenated into a feature vector individually.Finally,those vectors are sequentially concatenated into a total feature vector as the last feature to the fully connection layer.This generated feature vector derives benefits from the classic and previous convolution layer,while the size of the grid adjusting the weight of the feature maps improves the recognition efficiency of the network.Experimental results demonstrate that this model improves the accuracy and applicability compared with the traditional model. 展开更多
关键词 convolutional neural network multiscale feature extraction image classification
在线阅读 下载PDF
Feature Fusion Multi_XMNet Convolution Neural Network for Clothing Image Classification 被引量:2
6
作者 ZHOU Honglei PENG Zhifei +1 位作者 TAO Ran ZHANG Lu 《Journal of Donghua University(English Edition)》 CAS 2021年第6期519-526,共8页
Faced with the massive amount of online shopping clothing images,how to classify them quickly and accurately is a challenging task in image classification.In this paper,we propose a novel method,named Multi_XMNet,to s... Faced with the massive amount of online shopping clothing images,how to classify them quickly and accurately is a challenging task in image classification.In this paper,we propose a novel method,named Multi_XMNet,to solve the clothing images classification problem.The proposed method mainly consists of two convolution neural network(CNN)branches.One branch extracts multiscale features from the whole expressional image by Multi_X which is designed by improving the Xception network,while the other extracts attention mechanism features from the whole expressional image by MobileNetV3-small network.Both multiscale and attention mechanism features are aggregated before making classification.Additionally,in the training stage,global average pooling(GAP),convolutional layers,and softmax classifiers are used instead of the fully connected layer to classify the final features,which speed up model training and alleviate the problem of overfitting caused by too many parameters.Experimental comparisons are made in the public DeepFashion dataset.The experimental results show that the classification accuracy of this method is 95.38%,which is better than InceptionV3,Xception and InceptionV3_Xception by 5.58%,3.32%,and 2.22%,respectively.The proposed Multi_XMNet image classification model can help enterprises and researchers in the field of clothing e-commerce to automaticly,efficiently and accurately classify massive clothing images. 展开更多
关键词 feature extraction feature fusion multiscale feature convolution neural network(CNN) clothing image classification
在线阅读 下载PDF
Infrasound Event Classification Fusion Model Based on Multiscale SE-CNN and BiLSTM 被引量:1
7
作者 Hongru Li Xihai Li +3 位作者 Xiaofeng Tan Chao Niu Jihao Liu Tianyou Liu 《Applied Geophysics》 SCIE CSCD 2024年第3期579-592,620,共15页
The classification of infrasound events has considerable importance in improving the capability to identify the types of natural disasters.The traditional infrasound classification mainly relies on machine learning al... The classification of infrasound events has considerable importance in improving the capability to identify the types of natural disasters.The traditional infrasound classification mainly relies on machine learning algorithms after artificial feature extraction.However,guaranteeing the effectiveness of the extracted features is difficult.The current trend focuses on using a convolution neural network to automatically extract features for classification.This method can be used to extract signal spatial features automatically through a convolution kernel;however,infrasound signals contain not only spatial information but also temporal information when used as a time series.These extracted temporal features are also crucial.If only a convolution neural network is used,then the time dependence of the infrasound sequence will be missed.Using long short-term memory networks can compensate for the missing time-series features but induces spatial feature information loss of the infrasound signal.A multiscale squeeze excitation–convolution neural network–bidirectional long short-term memory network infrasound event classification fusion model is proposed in this study to address these problems.This model automatically extracted temporal and spatial features,adaptively selected features,and also realized the fusion of the two types of features.Experimental results showed that the classification accuracy of the model was more than 98%,thus verifying the effectiveness and superiority of the proposed model. 展开更多
关键词 infrasound classification channel attention convolution neural network bidirectional long short-term memory network multiscale feature fusion
在线阅读 下载PDF
Automatic Pavement Crack Detection Based on Octave Convolution Neural Network with Hierarchical Feature Learning 被引量:1
8
作者 Minggang Xu Chong Li +1 位作者 Ying Chen Wu Wei 《Journal of Beijing Institute of Technology》 EI CAS 2024年第5期422-435,共14页
Automatic pavement crack detection plays an important role in ensuring road safety.In images of cracks,information about the cracks can be conveyed through high-frequency and low-fre-quency signals that focus on fine ... Automatic pavement crack detection plays an important role in ensuring road safety.In images of cracks,information about the cracks can be conveyed through high-frequency and low-fre-quency signals that focus on fine details and global structures,respectively.The output features obtained from different convolutional layers can be combined to represent information about both high-frequency and low-frequency signals.In this paper,we propose an encoder-decoder framework called octave hierarchical network(Octave-H),which is based on the U-Network(U-Net)architec-ture and utilizes an octave convolutional neural network and a hierarchical feature learning module for performing crack detection.The proposed octave convolution is capable of extracting multi-fre-quency feature maps,capturing both fine details and global cracks.We propose a hierarchical feature learning module that merges multi-frequency-scale feature maps with different levels(high and low)of octave convolutional layers.To verify the superiority of the proposed Octave-H,we employed the CrackForest dataset(CFD)and AigleRN databases to evaluate this method.The experimental results demonstrate that Octave-H outperforms other algorithms with satisfactory performance. 展开更多
关键词 automated pavement crack detection octave convolutional network hierarchical feature multiscale MULTIFREQUENCY
在线阅读 下载PDF
Grid Side Distributed Energy Storage Cloud Group End Region Hierarchical Time-Sharing Configuration Algorithm Based onMulti-Scale and Multi Feature Convolution Neural Network 被引量:1
9
作者 Wen Long Bin Zhu +3 位作者 Huaizheng Li Yan Zhu Zhiqiang Chen Gang Cheng 《Energy Engineering》 EI 2023年第5期1253-1269,共17页
There is instability in the distributed energy storage cloud group end region on the power grid side.In order to avoid large-scale fluctuating charging and discharging in the power grid environment and make the capaci... There is instability in the distributed energy storage cloud group end region on the power grid side.In order to avoid large-scale fluctuating charging and discharging in the power grid environment and make the capacitor components showa continuous and stable charging and discharging state,a hierarchical time-sharing configuration algorithm of distributed energy storage cloud group end region on the power grid side based on multi-scale and multi feature convolution neural network is proposed.Firstly,a voltage stability analysis model based onmulti-scale and multi feature convolution neural network is constructed,and the multi-scale and multi feature convolution neural network is optimized based on Self-OrganizingMaps(SOM)algorithm to analyze the voltage stability of the cloud group end region of distributed energy storage on the grid side under the framework of credibility.According to the optimal scheduling objectives and network size,the distributed robust optimal configuration control model is solved under the framework of coordinated optimal scheduling at multiple time scales;Finally,the time series characteristics of regional power grid load and distributed generation are analyzed.According to the regional hierarchical time-sharing configuration model of“cloud”,“group”and“end”layer,the grid side distributed energy storage cloud group end regional hierarchical time-sharing configuration algorithm is realized.The experimental results show that after applying this algorithm,the best grid side distributed energy storage configuration scheme can be determined,and the stability of grid side distributed energy storage cloud group end region layered timesharing configuration can be improved. 展开更多
关键词 multiscale and multi feature convolution neural network distributed energy storage at grid side cloud group end region layered time-sharing configuration algorithm
在线阅读 下载PDF
Multidimensional attention and multiscale upsampling for semantic segmentation
10
作者 LU Zhongda ZHANG Chunda +1 位作者 WANG Lijing XU Fengxia 《Journal of Measurement Science and Instrumentation》 CAS CSCD 2022年第1期68-78,共11页
Semantic segmentation is for pixel-level classification tasks,and contextual information has an important impact on the performance of segmentation.In order to capture richer contextual information,we adopt ResNet as ... Semantic segmentation is for pixel-level classification tasks,and contextual information has an important impact on the performance of segmentation.In order to capture richer contextual information,we adopt ResNet as the backbone network and designs an encoder-decoder architecture based on multidimensional attention(MDA)module and multiscale upsampling(MSU)module.The MDA module calculates the attention matrices of the three dimensions to capture the dependency of each position,and adaptively captures the image features.The MSU module adopts parallel branches to capture the multiscale features of the images,and multiscale feature aggregation can enhance contextual information.A series of experiments demonstrate the validity of the model on Cityscapes and Camvid datasets. 展开更多
关键词 semantic segmentation attention mechanism multiscale feature convolutional neural network(CNN) residual network(ResNet)
在线阅读 下载PDF
基于多级卷积与形状增强的蒙皮缺陷检测方法
11
作者 王珏 卢震宇 +2 位作者 张晓巍 孙玉文 朱丽 《航空制造技术》 北大核心 2026年第6期22-29,共8页
飞机蒙皮作为飞机关键结构部件,表面缺陷直接影响飞机整体结构性能和隐身性能。本文提出一种基于RT-DETR模型的深度学习检测网络,以提升飞机蒙皮缺陷检测的准确性与鲁棒性。针对缺陷多尺度、形态多变,以及分布复杂的问题,设计多项创新... 飞机蒙皮作为飞机关键结构部件,表面缺陷直接影响飞机整体结构性能和隐身性能。本文提出一种基于RT-DETR模型的深度学习检测网络,以提升飞机蒙皮缺陷检测的准确性与鲁棒性。针对缺陷多尺度、形态多变,以及分布复杂的问题,设计多项创新技术予以优化。特征提取阶段引入多级卷积块(Multilevel convolution blocks,MCB),通过多层次卷积操作强化不同尺度特征的判别性,有效捕捉各层次细节信息;特征融合阶段采用多尺度特征增强(Multiscale feature enhancement,MSFE)模块,通过多尺寸深度卷积核构建上下文信息,提升网络对多尺度缺陷特征的鲁棒性与适应性;回归阶段引入形状感知(Shape-IoU)优化模块,通过优化边界框与缺陷轮廓的匹配度,显著提升检测结果的精确度。试验结果显示,所提出的检测网络在Aircraft数据集上的mAP@0.5达94.8%,较原RTDETR模型提升12.7%;在NEU-DET测试集上的mAP@0.5为92.5%。上述结果验证了该模型在提升飞机蒙皮缺陷检测精度与泛化能力方面的有效性。 展开更多
关键词 蒙皮缺陷检测 多级卷积块 多尺度特征增强 目标检测 损失函数
在线阅读 下载PDF
基于多尺度感知的多维空间融合水下图像增强算法
12
作者 郭伟 王曼婷 曲海成 《计算机应用》 北大核心 2026年第1期224-232,共9页
针对深海拍摄会导致水下图像色彩偏移、对比度过低和结构不清晰等问题,提出一种基于多尺度感知的多维空间融合水下图像增强算法,结合空间、通道和三维特征将图像信息并行传入多维特征提取网络和编码器中。首先,在多维特征提取网络中引... 针对深海拍摄会导致水下图像色彩偏移、对比度过低和结构不清晰等问题,提出一种基于多尺度感知的多维空间融合水下图像增强算法,结合空间、通道和三维特征将图像信息并行传入多维特征提取网络和编码器中。首先,在多维特征提取网络中引入多尺度特征精炼模块进一步处理提取到的特征信息,使网络更准确地学习不同尺度的信息;然后,在编码器中引入多维色彩增强模块,增强图像细节和色彩;最后,设计自适应增强网络来进一步处理特征信息并融合多级信息,再通过解码器得到最终的增强图像。在公开数据集上的实验结果表明,所提算法表现优异,它的峰值信噪比(PSNR)和结构相似性(SSIM)最高分别达到24.8651 dB和0.8954,比混合融合方法(HFM)分别提升了1.5806 dB和0.0398;水下色彩质量评价(UCIQE)和水下图像质量测量(UIQM)最高分别达到0.5931和3.1028,比HFM分别提升了0.0384和0.1514。可见,所提算法能有效提升水下视觉效果。 展开更多
关键词 图像处理 特征提取 多尺度特征 深层卷积 强化色彩
在线阅读 下载PDF
基于视图学习和通道特征拓扑融合的骨架行为识别
13
作者 谭台哲 张泽翰 +3 位作者 胡平川 朱辉果 战荫伟 杨卓 《计算机工程与设计》 北大核心 2026年第1期217-225,共9页
在人体骨架行为识别中,图卷积网络可提取人体骨架拓扑结构来聚合特征信息。但现有方法既未有效关联骨架特征与拓扑关系,也忽略了不同视图下拓扑关系的变化性。为此,提出基于视图学习和通道特征拓扑融合的行为识别方法(VLCTF-GCN)。依据... 在人体骨架行为识别中,图卷积网络可提取人体骨架拓扑结构来聚合特征信息。但现有方法既未有效关联骨架特征与拓扑关系,也忽略了不同视图下拓扑关系的变化性。为此,提出基于视图学习和通道特征拓扑融合的行为识别方法(VLCTF-GCN)。依据骨架的视图特征学习拓扑关系,为每个视图构建具有区分性的共享视图拓扑关系。在不同聚合程度上,结合视图与自适应拓扑关系,融合骨架通道特征与拓扑关系,使得拓扑结构能够自适应关联骨架特征,通过多尺度时间卷积提取不同时间长度的关节变化。在两个大型数据集的实验结果表明,所提方法性能优于现有方法。 展开更多
关键词 行为识别 人体骨架 图卷积 通道特征拓扑融合 视图学习 多尺度时间卷积 共享拓扑
在线阅读 下载PDF
基于DenseNet和多域特征融合的表面肌电手势识别研究
14
作者 金亚辉 刘鑫 +1 位作者 连大山 郭一娜 《传感器与微系统》 北大核心 2026年第1期30-34,共5页
基于表面肌电(sEMG)信号的手势识别在人机交互领域应用广泛,快速准确识别手势动作可以提供更好的用户体验。由于个体差异性导致在多任务中,整体识别准确率低。提出一种密集连接卷积网络(DenseNet)和多域特征融合的sEMG手势识别方法。首... 基于表面肌电(sEMG)信号的手势识别在人机交互领域应用广泛,快速准确识别手势动作可以提供更好的用户体验。由于个体差异性导致在多任务中,整体识别准确率低。提出一种密集连接卷积网络(DenseNet)和多域特征融合的sEMG手势识别方法。首先,从sEMG信号中提取时域和频域特征构成特征集,并与原始信号融合作为网络输入,增强网络输入数据的表达能力。其次,使用融合挤压—激励(SE)注意力和多尺度空洞卷积的DenseNet进行特征提取与分类识别。实验结果表明,在NinaPro DB2数据集中,手势识别整体准确率达到了88.06%,在整体和分类运动中识别性能都有所提升。 展开更多
关键词 表面肌电信号 手势识别 挤压—激励注意力 多尺度空洞卷积 密集连接卷积网络
在线阅读 下载PDF
一种融合多尺度动态注意力与1D-2D卷积的旋转机械声学故障轻量诊断方法
15
作者 何新荣 杜小泽 +2 位作者 谭锐 蒋国安 徐超 《现代制造工程》 北大核心 2026年第2期143-150,共8页
针对旋转机械声学信号中存在的非平稳性强与噪声干扰显著等问题,以及现有方法在时间-尺度建模能力不足、依赖手工时频变换且模型复杂不利于边缘部署的局限,提出了一种融合多尺度动态注意力与1D-2D卷积结构的轻量级端到端故障诊断模型(Mu... 针对旋转机械声学信号中存在的非平稳性强与噪声干扰显著等问题,以及现有方法在时间-尺度建模能力不足、依赖手工时频变换且模型复杂不利于边缘部署的局限,提出了一种融合多尺度动态注意力与1D-2D卷积结构的轻量级端到端故障诊断模型(Multiscale Dynamic Attention and 1D-2D convolutional Fusion Network,MDAF-Net)。该模型集成4项关键模块:首先,构建多尺度动态加权特征提取(Multiscale Dynamic Weighting Feature Extractor,MDW-FE)模块,结合多尺度卷积核与自适应加权机制,以增强对非平稳声学特征的感知能力;其次,设计多尺度映射层(Reshaped Multiscale Projection,RMP),实现一维序列向二维结构的转换,保留时间-尺度关联信息;然后,引入融合深度可分卷积的金字塔注意力机制(Pyramid Convolutional Block Attention Module integrated with Depthwise Separable Convolution,P-CBAM-DSC),提升模型对故障区域的聚焦能力与上下文表达能力;最终,通过全局特征聚合分类器(Global Feature Aggregation Classifier,GFA-C)实现高效的端到端故障识别。在DCASE2023公开声音数据集与自建滚动轴承声纹平台上的实验结果表明,所提方法在准确率、模型轻量化与推理效率方面均优于主流轻量模型,展现出良好的诊断性能、噪声鲁棒性与边缘部署适应性。 展开更多
关键词 旋转机械 故障诊断 声学信号 轻量化网络 1D-2D卷积建模 多尺度动态注意力
在线阅读 下载PDF
动态特征聚合与多层次协同的无人机红外目标实例分割 被引量:3
16
作者 何自芬 王启刚 +3 位作者 张印辉 黄滢 彭伟 陈光晨 《红外与激光工程》 北大核心 2025年第8期246-258,共13页
针对无人机红外成像中因距离较远导致的图像轮廓模糊及目标尺度变化致使分割精度下降的问题,文中提出动态特征聚合与多层次协同的无人机红外目标实例分割模型(Dynamic feature aggregation and multi-level collaboration,DFMCNet)。首... 针对无人机红外成像中因距离较远导致的图像轮廓模糊及目标尺度变化致使分割精度下降的问题,文中提出动态特征聚合与多层次协同的无人机红外目标实例分割模型(Dynamic feature aggregation and multi-level collaboration,DFMCNet)。首先,设计区域特征自适应卷积模块(Spatial attention dynamic convolution,SADConv),采用动态卷积核和注意力机制,有效缓解特征图降维引发的细节丢失,抑制背景噪声干扰;其次,构建特征感知重组上采样模块(Feature sensing recombination upsampling module,FRUM),利用并行化可学习权重实现特征重组,在恢复特征图分辨率时保留空间特征并增强空间结构信息关注;最后,引入多尺度上下文聚合模块(Multi-scale context aggregation feature extraction module,MSFE),通过跨层级特征融合捕获多尺度上下文信息,提升模型对尺寸差异目标的泛化性。在红外航拍交通数据集Aerial-Mancar上的实验表明,DFMCNet的mAP50精度为78.4%较基准模型提升9.7%,mAP50-95精度为51.1%提升5.6%,与YOLOv12n-seg相比mAP50提高7.2%,验证了其在无人机红外场景下实现红外目标精确分割的有效性。 展开更多
关键词 无人机红外 动态卷积核 特征重组 多尺度聚合
原文传递
基于双路多尺度卷积的近红外光谱羊绒羊毛纤维预测模型 被引量:2
17
作者 陈锦妮 田谷丰 +4 位作者 李云红 朱耀麟 陈鑫 门玉乐 魏小双 《光谱学与光谱分析》 北大核心 2025年第3期678-684,共7页
羊绒具有轻盈舒适、光滑柔软、稀释透气以及保暖好的特点,由于羊绒价格十分昂贵,因此市场上的羊绒产品质量良莠不齐。现有的显微镜法、DNA法、化学溶解法和基于图像的方法具有损坏样本、设备昂贵、主观性强等不足。近红外光谱技术是一... 羊绒具有轻盈舒适、光滑柔软、稀释透气以及保暖好的特点,由于羊绒价格十分昂贵,因此市场上的羊绒产品质量良莠不齐。现有的显微镜法、DNA法、化学溶解法和基于图像的方法具有损坏样本、设备昂贵、主观性强等不足。近红外光谱技术是一种非破坏性、可进行建模操作的快速测量方法。针对传统的建模方法通常无法学习出通用的近红外光谱波段特征,导致泛化能力弱,且羊绒羊毛纤维的近红外光谱波段特征相似,难以区分的问题,本文提出一种基于双路多尺度卷积的近红外光谱羊绒羊毛纤维预测模型。采集了羊绒羊毛样品的近红外光谱波段数据共1170个进行验证,近红外光谱波段数据范围是1300~2500 nm。利用两个并行卷积神经网络来提取近红外光谱波段的特征,采用原始近红外光谱波段数据和降维近红外光谱波段数据同时输入的方式,并利用多尺度特征提取模块进一步提取中间具有贡献力的近红外光谱波段特征,利用路径交流模块用于两路近红外光谱波段特征的信息交流,最后利用类级别融合得到羊绒羊毛纤维预测结果。在实验过程中,将采集的80%近红外光谱波段数据用于模型训练,20%近红外光谱波段数据用于模型测试。模型测试集的平均预测准确率为94.45%,与传统算法中的随机森林、SVM、1D-CNN等算法相比较分别提升了7.33%、5.22%、2.96%,并进行消融实验对所提模型的结构进一步验证。实验结果表明,本文提出的双路多尺度卷积的近红外光谱羊绒羊毛纤维预测模型可实现羊绒羊毛纤维的快速无损预测,为近红外光谱羊绒羊毛纤维预测提供了新的思路。 展开更多
关键词 羊绒羊毛 近红外光谱 深度学习 双路多尺度卷积神经网络
在线阅读 下载PDF
基于改进SSD算法的地铁场景小行人目标检测 被引量:3
18
作者 张秀再 邱野 沈涛 《计算机研究与发展》 北大核心 2025年第2期397-407,共11页
在地铁场景中,小行人目标由于分辨率低,包含特征信息较少,现阶段目标检测器对此类目标的检测仍具有挑战性.SSD目标检测算法利用金字塔网络的多尺度检测头,能一定程度提高行人目标检测性能,但将其应用于地铁等复杂环境中实现小行人目标... 在地铁场景中,小行人目标由于分辨率低,包含特征信息较少,现阶段目标检测器对此类目标的检测仍具有挑战性.SSD目标检测算法利用金字塔网络的多尺度检测头,能一定程度提高行人目标检测性能,但将其应用于地铁等复杂环境中实现小行人目标检测仍具有一定局限性.针对上述问题,提出一种改进SSD算法以加强地铁场景中小行人目标检测效果.通过构建地铁场景行人目标数据集,标注相应标签,同时进行数据预处理操作;在特征提取网络中加入金字塔特征加强模块,将多分支残差单元、亚像素卷积和特征金字塔相结合获得图像多尺度、多感受野融合特征;利用上下文信息融合模块将图像低层特征与上下文特征相融合,生成扩展特征层用于检测小行人目标;设计一种基于Anchor-free的动态正负样本分配策略,为小行人目标生成最优正样本.实验结果表明:提出的改进SSD算法能有效提高地铁场景小行人目标检测性能,对遮挡严重的小行人目标检测,效果提升更为明显. 展开更多
关键词 小行人目标检测 SSD算法 注意力机制 亚像素卷积 多尺度特征融合
在线阅读 下载PDF
融合多尺度特征的高效网片缺陷检测算法 被引量:1
19
作者 何钢 姚远 +2 位作者 韩征彤 邹华涛 王田田 《计算机工程与应用》 北大核心 2025年第11期316-324,共9页
针对人工目视排查大型旋转过滤设备网片缺陷时存在的效率低下,缺陷与背景之间边界模糊及网孔中的小水珠产生反光现象等问题,提出了一个基于多维特征融合的高效网片缺陷检测算法。引入了泊松图像增强技术,实现了缺陷目标与正常背景区域... 针对人工目视排查大型旋转过滤设备网片缺陷时存在的效率低下,缺陷与背景之间边界模糊及网孔中的小水珠产生反光现象等问题,提出了一个基于多维特征融合的高效网片缺陷检测算法。引入了泊松图像增强技术,实现了缺陷目标与正常背景区域的平滑融合,增加了小样本缺陷数量的同时解决了缺陷数量分布不均匀的问题。在YOLOv8中融入轻量多维卷积改进的C2fLWDC(C2flightweight multi-dimensional convolution)模块及加权多特征增强模块,既增强了网络对缺陷特征的提取又实现了各级特征的高效融合,提升了对多尺度缺陷样本的表征能力。采用EIOU(efficient intersection over union)定位损失函数,加速了对缺陷目标的准确定位。网片数据集检测结果表明,改进后的算法mAP(mean average precision)达到92%,相较于原始模型提升了16.8个百分点,能很好地完成缺陷目标的检测任务。 展开更多
关键词 网片缺陷 YOLOv8 轻量多维卷积 特征融合 多尺度
在线阅读 下载PDF
基于样本迭代优化策略的密集连接多尺度土地覆盖语义分割 被引量:1
20
作者 郑宗生 高萌 +3 位作者 周文睆 王政翰 霍志俊 张月维 《自然资源遥感》 北大核心 2025年第2期11-18,共8页
针对分割结果小尺度地物遗漏、连续地物缺乏完整性问题,提出密集连接多尺度语义分割模型(densely connected multi-scale semantic segmentation network, DMS-Net),实现土地覆盖分割。通过多尺度密集连接空洞空间卷积金字塔池化(multi-... 针对分割结果小尺度地物遗漏、连续地物缺乏完整性问题,提出密集连接多尺度语义分割模型(densely connected multi-scale semantic segmentation network, DMS-Net),实现土地覆盖分割。通过多尺度密集连接空洞空间卷积金字塔池化(multi-scale dense connected atrous spatial convolution pyramid pooling module, MDCA)和条形池化(spatial pyramid pooling, SP)提取多尺度和空间连续性地物;利用特征增强双注意力并联模块(position paralleling channel attention module, PPCA)衡量特征权重,实现高效表达;采用浅层特征级联模块(cascade low-level feature fusion, CLFF)捕捉被忽略的浅层特征,进一步补充细节。实验结果表明:DMS-Net模型在迭代扩充数据集上的总体精度(overall accuracy, OA)达到89.97%,平均交并比(mean intersection over union, mIoU)达到75.59%,高于传统机器学习方法及U-Net, PSPNet, Deeplabv3+等深度学习模型。分割结果显示,地物结构完整且边缘分割明晰,在实现多尺度的土地覆盖遥感信息提取分析中具有较好的实用价值。 展开更多
关键词 深度学习 全卷积神经网络 多尺度 语义分割 土地覆盖
在线阅读 下载PDF
上一页 1 2 15 下一页 到第
使用帮助 返回顶部