260 articles found
1. Remaining Useful Life Prediction of Aeroengine Based on Principal Component Analysis and One-Dimensional Convolutional Neural Network (cited by: 5)
Authors: LYU Defeng, HU Yuwen. Transactions of Nanjing University of Aeronautics and Astronautics (EI, CSCD), 2021, No. 5, pp. 867-875 (9 pages)
Abstract: To directly construct the mapping between multiple state parameters and remaining useful life (RUL), and to reduce the interference of random error with prediction accuracy, this paper proposes an aeroengine RUL prediction model based on principal component analysis (PCA) and a one-dimensional convolutional neural network (1D-CNN). First, multiple state parameters covering a large number of aeroengine cycles are collected and passed to PCA for dimensionality reduction, and the principal components are extracted for subsequent time series prediction. Second, the 1D-CNN model is constructed to learn the mapping between principal components and RUL directly. Multiple convolution and pooling operations are applied for deep feature extraction, enabling end-to-end RUL prediction. Experimental results show that PCA extracts the most effective principal components from the multiple state parameters and that the 1D-CNN maps long time series of state parameters directly to RUL, improving the efficiency and accuracy of RUL prediction. Compared with other traditional models, the proposed method also has lower prediction error and better robustness.
Keywords: aeroengine; remaining useful life (RUL); principal component analysis (PCA); one-dimensional convolutional neural network (1D-CNN); time series prediction; state parameters

2. Robust Damage Detection and Localization Under Complex Environmental Conditions Using Singular Value Decomposition-based Feature Extraction and One-dimensional Convolutional Neural Network (cited by: 1)
Authors: Shengkang Zong, Sheng Wang, Zhitao Luo, Xinkai Wu, Hui Zhang, Zhonghua Ni. Chinese Journal of Mechanical Engineering (SCIE, EI, CAS, CSCD), 2023, No. 3, pp. 252-261 (10 pages)
Abstract: Ultrasonic guided waves are an attractive monitoring technique for large-scale structures but are vulnerable to changes in environmental and operational conditions (EOCs), which are inevitable in the routine inspection of civil and mechanical structures. This paper presents a robust guided wave-based method for damage detection and localization under complex environmental conditions, using singular value decomposition-based feature extraction and a one-dimensional convolutional neural network (1D-CNN). After singular value decomposition-based feature extraction, a temporal robust damage index (TRDI) is obtained and the effect of EOCs is largely removed. As a result, even for signals with a very large temperature range and low signal-to-noise ratios (SNRs), the final damage detection and localization accuracy remains at 100%. The method is verified on two experimental datasets. The first consists of guided wave signals collected from a thin aluminum plate with artificial noise, and the second is a publicly available experimental dataset of guided wave signals acquired on a composite plate at temperatures ranging from 20 ℃ to 60 ℃. The results demonstrate that the proposed method detects and localizes damage accurately and rapidly, showing great potential for application under complex and unknown EOCs.
Keywords: ultrasonic guided waves; singular value decomposition; damage detection and localization; environmental and operational conditions; one-dimensional convolutional neural network

3. Research on Behaviour Recognition Method for Moving Target Based on Deep Convolutional Neural Network
Authors: Jianfang Liu, Hao Zheng, Mengyi Liao. Journal of Computer and Communications, 2020, No. 9, pp. 54-66 (13 pages)
Abstract: To address the low average recognition rate of traditional moving-target behaviour recognition methods, this paper proposes a recognition method based on a deep convolutional neural network. A target model of the deep convolutional neural network is constructed, and the basic unit of the network is designed using the model. By setting the unit, the returned unit is calculated into the standard density diagram, and the position of the moving target is determined by the local maximum method to realize behaviour recognition of the moving target. The experimental results show that the multi-parameter SICNN256 model is slightly better than the other model structures. The average recognition rate and recognition rate of the proposed method are higher than those of the traditional method, which demonstrates its effectiveness. Because single-target recognition occurs more frequently than multi-target recognition and the method performs no target similarity recognition, false detections of similar targets cannot be ruled out.
Keywords: convolutional neural network; moving target; recognition; depth

4. Fault Line Detection Using Waveform Fusion and One-dimensional Convolutional Neural Network in Resonant Grounding Distribution Systems (cited by: 10)
Authors: Jianhong Gao, Moufa Guo, Duan-Yu Chen. CSEE Journal of Power and Energy Systems (SCIE, CSCD), 2021, No. 2, pp. 250-260 (11 pages)
Abstract: Effective features are essential for fault diagnosis. Because a single line-to-ground (SLG) fault has only faint characteristics, fault line detection is a challenge in resonant grounding distribution systems. This paper proposes a novel fault line detection method using waveform fusion and a one-dimensional convolutional neural network (1-D CNN). After an SLG fault occurs, the first half-waves of the zero-sequence currents are collected and superimposed on each other to achieve waveform fusion. The 1-D CNN extracts features from each fused waveform to determine whether its source waveforms include the fault line. The 1-D CNN output is then used to update a counter value in order to identify the fault line. Given the lack of fault data in existing distribution systems, the proposed method needs only a small quantity of data for model training and fault line detection. The method is also fault-tolerant: even if a few samples are misjudged, the fault line can still be detected correctly from the full set of 1-D CNN outputs. Experimental results verify that the proposed method works effectively under various fault conditions.
Keywords: fault line detection; one-dimensional convolutional neural network; resonant grounding distribution systems; waveform fusion

5. Temporally Consistent Depth Map Prediction Using Deep Convolutional Neural Network and Spatial-Temporal Conditional Random Field (cited by: 2)
Authors: Xu-Ran Zhao, Xun Wang, Qi-Chao Chen. Journal of Computer Science & Technology (SCIE, EI, CSCD), 2017, No. 3, pp. 443-456 (14 pages)
Abstract: Methods based on deep convolutional neural networks (DCNNs) keep setting new records in predicting depth maps from monocular images. In video-based applications such as 2D-to-3D video conversion, however, these approaches tend to produce temporally inconsistent depth maps, because their CNN models are optimized over single frames. In this paper, we address this problem by introducing a novel spatial-temporal conditional random field (CRF) model into the DCNN architecture, which enforces temporal consistency between depth map estimates over consecutive video frames. In our approach, temporally consistent superpixels (TSP) are first computed over an image sequence to establish the correspondence of targets in consecutive frames. A DCNN then regresses the depth value of each temporal superpixel, followed by a spatial-temporal CRF layer that models the relationship of the estimated depths in both the spatial and temporal domains. The parameters of the DCNN and CRF models are jointly optimized with back propagation. Experimental results show that our approach not only significantly enhances the temporal consistency of estimated depth maps compared with existing single-frame-based approaches, but also improves depth estimation accuracy on various evaluation metrics.
Keywords: depth estimation; temporal consistency; convolutional neural network; conditional random fields

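6. Design of an Anti-Collision Safety Detection System for Aircraft Tow Tractors Based on a Convolutional Network and a Depth Camera
Authors: 孙丰源, 张军, 黄明辉, 向富尧, 王一旋, 刘宇新. 《科学技术与工程》 (PKU Core), 2026, No. 4, pp. 1728-1734 (7 pages)
Abstract: To address the large blind spots and resulting safety hazards during aircraft towing operations, an anti-collision detection method that fuses a depth camera with a convolutional neural network (CNN) model is proposed. The convolutional network automatically recognizes objects in the environment, the depth camera provides target distance information, and the two are combined to localize obstacles during towing. The trained convolutional network model and a ZED2i stereo camera were deployed on an aircraft tow tractor, communicating over a CAN bus, and obstacle-avoidance experiments were carried out at a test site. The results show that the model reaches a recognition accuracy of 0.911 and a recall of 0.803, and that within a 10 m ranging distance the ranging error is within 0.3 m. The work provides a technical reference for anti-collision safety detection during aircraft towing operations.
Keywords: aircraft tow tractor; anti-collision detection; depth camera; convolutional neural network (CNN); target localization
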
7. Fault Diagnosis for Wind Turbine Flange Bolts Based on One-Dimensional Depthwise Separable Convolutions
Authors: Yongchao Liu, Shuqing Dong, Qingfeng Wang, Wenhe Cai, Ruizhuo Song, Qinglai Wei. The International Journal of Intelligent Control and Systems, 2024, No. 1, pp. 42-47 (6 pages)
Abstract: In this paper, a new method based on one-dimensional depthwise separable convolutions is developed for fault diagnosis of wind turbine flange bolts. The main idea is to use a one-dimensional convolutional neural network model to classify the acoustic vibration signals of bolts, which represent different bolt damage states. Knock tests and modal simulation show that the damage state of a wind turbine flange bolt is related to the natural frequency distribution of its acoustic vibration signal: the damage state affects the modal shape of the structure, which in turn affects the natural frequency distribution of the bolt vibration signal. The damage state can therefore be obtained by identifying the natural frequency distribution of the bolt acoustic vibration signal. In the proposed one-dimensional depthwise separable convolutional neural network model, the one-dimensional input vector is first convolved into multiple channels, and each channel is then learned separately by depthwise separable convolution, which effectively improves feature quality and classification performance. In terms of how the convolution is computed, depthwise separable convolution has fewer parameters and runs faster, making it easier to build lightweight models and deploy them to mobile devices.
Keywords: wind turbine flange bolts; one-dimensional convolutional neural network (1DCNN) model; depthwise separable convolutions; damage identification

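The claim in entry 7 that depthwise separable convolution has fewer parameters can be checked with simple counting. The sketch below is not taken from the paper: the channel counts and kernel size are hypothetical, and the formulas are the standard ones for a 1D convolution versus a depthwise convolution followed by a 1x1 pointwise convolution.

```python
def standard_conv1d_params(c_in, c_out, kernel_size, bias=True):
    """Parameters of a standard 1D convolution: every output channel
    has one kernel of length kernel_size per input channel."""
    return c_in * c_out * kernel_size + (c_out if bias else 0)


def depthwise_separable_conv1d_params(c_in, c_out, kernel_size, bias=True):
    """Parameters of a depthwise convolution (one kernel per input channel)
    followed by a pointwise 1x1 convolution that mixes channels."""
    depthwise = c_in * kernel_size + (c_in if bias else 0)
    pointwise = c_in * c_out + (c_out if bias else 0)
    return depthwise + pointwise


# Hypothetical layer sizes, chosen only for illustration.
c_in, c_out, k = 32, 64, 9
standard = standard_conv1d_params(c_in, c_out, k)
separable = depthwise_separable_conv1d_params(c_in, c_out, k)
print(standard, separable, round(separable / standard, 3))
```

Ignoring biases, the ratio of the two counts is 1/c_out + 1/kernel_size, so the saving grows with both the number of output channels and the kernel length.
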
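8. Neuronal Structure Segmentation in Electron Microscopy Images Based on Group-Depth U-Net (cited by: 2)
Authors: 李玉慧, 梁创学, 李军. 《中国医学物理学杂志》 (CSCD), 2020, No. 6, pp. 720-725 (6 pages)
Abstract: Electron microscopy (EM) imaging suffers from damaged boundaries and uneven blur, and the contours and textures of neuronal structures are themselves complex and hard to localize. To address these problems, a deep convolutional neural network model, Group-Depth U-Net, is proposed for automatic segmentation of neuronal structures in EM images. The model uses a deeper U-Net architecture as its backbone to capture richer image feature information, and adopts a grouped convolution structure to make the model more efficient and prevent overfitting, thereby improving segmentation accuracy and efficiency. Experiments on a public dataset show that the model achieves better segmentation accuracy than U-Net.
Keywords: deep convolutional neural network; grouped convolutional network; neuronal structure segmentation; electron microscopy imaging; Group-depth U-Net
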
9. 1D-CNN: Speech Emotion Recognition System Using a Stacked Network with Dilated CNN Features (cited by: 6)
Authors: Mustaqeem, Soonil Kwon. Computers, Materials & Continua (SCIE, EI), 2021, No. 6, pp. 4039-4059 (21 pages)
Abstract: Emotion recognition from speech data is an active and emerging area of research that plays an important role in numerous applications, such as robotics, virtual reality, behavior assessment, and emergency call centers. Researchers have recently developed many techniques in this field to improve accuracy using deep learning approaches, but the recognition rate is still not convincing. Our main aim is to develop a new technique that increases the recognition rate at a reasonable computational cost. In this paper, we propose a one-dimensional dilated convolutional neural network (1D-DCNN) for speech emotion recognition (SER) that uses hierarchical features learning blocks (HFLBs) with a bi-directional gated recurrent unit (BiGRU). We designed a one-dimensional CNN that enhances the speech signals using spectral analysis and extracts hidden patterns from them; these are fed into a stacked one-dimensional dilated network whose blocks are called HFLBs. Each HFLB contains one dilated convolution layer (DCL), one batch normalization (BN) layer, and one leaky_relu (Relu) layer in order to extract emotional features using a hierarchical correlation strategy. The learned emotional features are then fed into a BiGRU in order to adjust the global weights and recognize the temporal cues. The final state of the deep BiGRU is passed through a softmax classifier to produce the emotion probabilities. The proposed model was evaluated on three benchmark datasets, IEMOCAP, EMO-DB, and RAVDESS, and achieved 72.75%, 91.14%, and 78.01% accuracy, respectively.
Keywords: affective computing; one-dimensional dilated convolutional neural network; emotion recognition; gated recurrent unit; raw audio clips

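Entry 9 relies on stacked one-dimensional dilated convolutions. As a rough illustration of what dilation does, and not the authors' network, the sketch below applies a single dilated filter to a toy signal and computes the receptive field of a stack of such layers. The kernel, the signal, and the dilation rates are assumptions made for this example.

```python
def dilated_conv1d(signal, kernel, dilation=1):
    """Valid (no padding) 1D cross-correlation with gaps of `dilation`
    samples between kernel taps, as in deep learning frameworks."""
    span = (len(kernel) - 1) * dilation  # distance covered by the kernel
    return [
        sum(w * signal[start + j * dilation] for j, w in enumerate(kernel))
        for start in range(len(signal) - span)
    ]


def receptive_field(kernel_size, dilations):
    """Input samples seen by one output of a stack of stride-1 dilated layers."""
    return 1 + sum((kernel_size - 1) * d for d in dilations)


# Toy ramp signal and a difference kernel (hypothetical values).
signal = [0, 1, 2, 3, 4, 5, 6, 7]
output = dilated_conv1d(signal, kernel=[1, 0, -1], dilation=2)
field = receptive_field(kernel_size=3, dilations=[1, 2, 4])
print(output, field)
```

With dilation 2 the three-tap kernel spans five input samples at no extra parameter cost, which is why stacking layers with growing dilation rates widens temporal context quickly.
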
10. Automatic depth matching method of well log based on deep reinforcement learning (cited by: 5)
Authors: XIONG Wenjun, XIAO Lizhi, YUAN Jiangru, YUE Wenzheng. Petroleum Exploration and Development (SCIE), 2024, No. 3, pp. 634-646 (13 pages)
Abstract: Traditional well log depth matching requires manual adjustment, which is highly labor-intensive for multiple wells and leads to low efficiency. This paper introduces a multi-agent deep reinforcement learning (MARL) method to automate the depth matching of multi-well logs. The method defines multiple top-down dual sliding windows based on a convolutional neural network (CNN) to extract and capture similar feature sequences on well logs, and it establishes an interaction mechanism between agents and the environment to control the depth matching process. Specifically, each agent selects an action to translate or scale the feature sequence based on the double deep Q-network (DDQN). Using the reward signal as feedback, it evaluates the effectiveness of each action, aiming to obtain the optimal strategy and improve matching accuracy. Our experiments show that MARL can automatically perform depth matching for well logs in multiple wells and reduce manual intervention. In an oil field application, a comparative analysis of dynamic time warping (DTW), deep Q-learning network (DQN), and DDQN methods showed that the DDQN algorithm, with its dual-network evaluation mechanism, significantly improves performance by identifying and aligning more details in the well log feature sequences, thus achieving higher depth matching accuracy.
Keywords: artificial intelligence; machine learning; depth matching; well log; multi-agent deep reinforcement learning; convolutional neural network; double deep Q-network

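The difference between DQN and DDQN mentioned in entry 10 lies in how the bootstrap target is built. The sketch below shows the standard double DQN target, in which the online network selects the next action and the target network evaluates it. It is a generic illustration, not the paper's implementation; the Q-values, the reward, and the discount factor are made up.

```python
def dqn_target(reward, next_q_target, gamma=0.99, done=False):
    """Standard DQN target: the target network both selects and evaluates
    the next action, which tends to overestimate Q-values."""
    if done:
        return reward
    return reward + gamma * max(next_q_target)


def double_dqn_target(reward, next_q_online, next_q_target, gamma=0.99, done=False):
    """Double DQN target: the online network picks the action,
    the target network supplies its value."""
    if done:
        return reward
    best_action = max(range(len(next_q_online)), key=lambda a: next_q_online[a])
    return reward + gamma * next_q_target[best_action]


# Hypothetical Q-values for three actions in the next state.
next_q_online = [0.2, 0.9, 0.5]
next_q_target = [0.3, 0.4, 1.5]
plain = dqn_target(1.0, next_q_target, gamma=0.5)
double = double_dqn_target(1.0, next_q_online, next_q_target, gamma=0.5)
print(plain, double)
```

In this toy case the plain target takes the largest target-network value, while the double target uses the value of the action the online network prefers, giving a more conservative estimate.
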
11. Hand segmentation from a single depth image based on histogram threshold selection and shallow CNN (cited by: 1)
Authors: XU Zhengze, ZHANG Wenjun. 《上海大学学报(自然科学版)》 (CAS, CSCD, PKU Core), 2018, No. 5, pp. 675-685 (11 pages)
Abstract: Real-time hand gesture recognition significantly improves the user experience in virtual reality/augmented reality (VR/AR) applications, and it relies on identifying the orientation of the hand in captured images or videos. A new three-stage pipeline for fast and accurate hand segmentation from a single depth image is proposed. First, a depth frame is segmented into several regions by a histogram-based threshold selection algorithm and by tracing the exterior boundaries of objects after thresholding. Second, each segmentation proposal is evaluated by a three-layer shallow convolutional neural network (CNN) to determine whether the boundary is associated with the hand. Finally, all hand components are merged into the hand segmentation result. Compared with algorithms based on random decision forests (RDF), the experimental results demonstrate that the approach achieves better performance, with high accuracy (88.34% mean intersection over union, mIoU) and a shorter processing time (≤8 ms).
Keywords: hand segmentation; histogram threshold selection; convolutional neural network (CNN); depth map

12. No-reference synthetic image quality assessment with convolutional neural network and local image saliency (cited by: 3)
Authors: Xiaochuan Wang, Xiaohui Liang, Bailin Yang, Frederick W. B. Li. Computational Visual Media (CSCD), 2019, No. 2, pp. 193-208 (16 pages)
Abstract: Depth-image-based rendering (DIBR) is widely used in 3DTV, free-viewpoint video, and interactive 3D graphics applications. Synthetic images generated by DIBR-based systems typically contain various distortions, particularly geometric distortions induced by object dis-occlusion. Ensuring the quality of synthetic images is critical to maintaining adequate system service. However, traditional 2D image quality metrics are ineffective for evaluating synthetic images because they are not sensitive to geometric distortion. In this paper, we propose a novel no-reference image quality assessment method for synthetic images based on convolutional neural networks, introducing local image saliency as prediction weights. Because existing training data are lacking, we construct a new DIBR synthetic image dataset as part of our contribution. Experiments were conducted on both the public benchmark IRCCyN/IVC DIBR image dataset and our own dataset. Results demonstrate that our proposed metric outperforms traditional 2D image quality metrics and state-of-the-art DIBR-related metrics.
Keywords: image quality assessment; synthetic image; depth-image-based rendering (DIBR); convolutional neural network; local image saliency

13. A method to generate foggy optical images based on unsupervised depth estimation
Authors: WANG Xiangjun, LIU Linghao, NI Yubo, WANG Lin. Journal of Measurement Science and Instrumentation (CAS, CSCD), 2021, No. 1, pp. 44-52 (9 pages)
Abstract: For traffic object detection in foggy environments based on convolutional neural networks (CNNs), data sets collected in fog-free environments are generally used to train the network directly. As a result, the network cannot learn object characteristics in fog from the training set, and detection performance is poor. To improve traffic object detection in foggy environments, we propose a method for generating foggy images from fog-free images, approaching the problem from the perspective of data set construction. First, taking the KITTI object detection data set as the original fog-free images, we generate a depth image for each original image using an improved Monodepth unsupervised depth estimation method. Then, a geometric prior depth template is constructed and fused with the depth image, using image entropy as the weight. After that, a foggy image is generated from the depth image based on the atmospheric scattering model. Finally, we take two typical object detection frameworks, the two-stage Faster region-based convolutional neural network (Faster-RCNN) and the one-stage network YOLOv4, and train them on the original data set, the foggy data set, and the mixed data set, respectively. According to test results on the RESIDE-RTTS data set, captured in outdoor natural fog, the models trained on the mixed data set perform best. The mean average precision (mAP) values increase by 5.6% and 5.0% for the YOLOv4 model and the Faster-RCNN network, respectively. This shows that the proposed method can effectively improve object identification in foggy environments.
Keywords: traffic object detection; foggy image generation; unsupervised depth estimation; YOLOv4 model; Faster region-based convolutional neural network (Faster-RCNN)

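Entry 13 synthesizes fog from a depth image using the atmospheric scattering model. A minimal sketch of the commonly used form of that model, I = J·t + A·(1 − t) with transmission t = exp(−β·d), is given below. The scattering coefficient, the airlight value, and the tiny grayscale image are assumptions for illustration, and the paper's depth template fusion step is not reproduced.

```python
import math


def transmission(depth_m, beta):
    """Fraction of scene radiance that survives scattering over depth_m metres."""
    return math.exp(-beta * depth_m)


def add_fog(pixel, depth_m, beta=0.05, airlight=0.9):
    """Atmospheric scattering model for one pixel with intensity in [0, 1]:
    attenuated scene radiance plus scattered atmospheric light."""
    t = transmission(depth_m, beta)
    return pixel * t + airlight * (1.0 - t)


def fog_image(image, depth, beta=0.05, airlight=0.9):
    """Apply the model to a grayscale image given a per-pixel depth map."""
    return [
        [add_fog(p, d, beta, airlight) for p, d in zip(row_p, row_d)]
        for row_p, row_d in zip(image, depth)
    ]


# Hypothetical 2x2 grayscale image and depth map in metres.
clear = [[0.2, 0.8], [0.5, 0.1]]
depth = [[0.0, 10.0], [50.0, 1000.0]]
foggy = fog_image(clear, depth)
print(foggy)
```

Pixels at zero depth are unchanged, while distant pixels converge to the airlight value, which matches the washed-out look of far objects in fog.
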
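14. A Fast Book Retrieval Algorithm Based on Deep Mining of User Data Features (cited by: 1)
Authors: 窦淑庆, 刘思豆. 《现代电子技术》 (PKU Core), 2025, No. 14, pp. 137-142 (6 pages)
Abstract: Traditional book recommendation systems produce results that lag behind real-time demand and have low accuracy. To address these shortcomings, this paper proposes a fast book retrieval algorithm based on user profile data. In the user profile construction stage, the algorithm models static attribute extraction and dynamic tag behavior. In the book feature extraction model, BERT-Word2Vec serves as the base framework for multimodal feature extraction, and a two-tower deep matching model is built with an MLP tower for users and an improved CNN tower for books, enabling thorough, fine-grained multidimensional analysis of the features. The model fuses a real-time feedback mechanism, the Kafka-Redis stream processing algorithm, with session attention weighting to deliver scenario-aware recommendations. Experimental results show that NDCG@10 improves by about 21.0% over the best baseline, and that behavior feedback latency is at most 3.5 s under a peak load of 500 QPS. This indicates that the proposed algorithm can provide an information recommendation solution for knowledge service scenarios that combines accuracy, timeliness, and scenario adaptability.
Keywords: user profile; bidirectional encoder representation technique (BERT); two-tower deep matching model; multilayer perceptron; convolutional neural network; recommendation algorithm
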
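15. Deep Feature Diagnosis of Intake and Exhaust Blockage in Aviation Piston Engines from Conventional and Combustion Views
Authors: 徐劲松, 王博, 韦宝涛, 盛润. 《电子测量与仪器学报》 (PKU Core), 2025, No. 11, pp. 234-245 (12 pages)
Abstract: Different degrees of intake and exhaust blockage degrade the performance of aviation piston engines. To address this, a dual-channel deep-view feature fusion diagnosis model based on conventional intake/exhaust data and in-cylinder combustion data is designed. To strengthen the extraction of combustion features, a self-attention (SA) mechanism is introduced into the combustion-view channel of the dual-channel deep convolutional neural network (DCNN) diagnosis architecture. Using five defined health levels of intake and exhaust blockage, performance degradation data sets are obtained from ground bench tests at an altitude of 1920 m and from AMESim+Simulink co-simulation of the engine, covering two typical operating conditions, takeoff and cruise. Taking the takeoff condition at a propeller speed of 2300 r/min as a case study, the paper analyzes the trend of cylinder pressure under different blockage levels, the t-SNE distribution of deep features at each network layer, and the classification diagnosis results, and further verifies the soundness of the architecture through ablation experiments on model components. The results show that, for the intake and exhaust blockage case, the dual-channel self-attention deep convolutional neural network (SA-DCNN) model achieves average accuracies of 98.95% and 98.62%, respectively, in diagnosing the five health levels, indicating high accuracy.
Keywords: aviation piston engine; intake and exhaust blockage; deep feature diagnosis from conventional and combustion views; self-attention deep convolutional neural network
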
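16. Facial Expression Recognition Method Using a Deep Metric Attention Hybrid Model
Authors: 姚丽莎. 《计算机工程与应用》 (PKU Core), 2025, No. 7, pp. 245-254 (10 pages)
Abstract: Deep learning networks are widely used in facial expression recognition, but expression images are complex and variable, affected by illumination, individual differences, and other factors, so the recognition performance of existing methods needs improvement. To improve the representational power of the network, a convolutional neural network model with a deep attention feedback mechanism that fuses position information is proposed, incorporating the positional features of key facial regions. Because inter-class differences between expression features are small, a metric learning method is introduced to make features more discriminative and improve the classifier's learning ability, reducing distances within a class and increasing distances between classes. Metric learning maps the features of facial expression images into a new, expression-discriminative feature space, in which the expression class of each sample is determined. The pipeline is as follows. Face detection is performed on the original image and the key facial region is cropped, removing interference from hair, background, and other factors. The CNN model with the deep attention feedback mechanism then learns deep facial expression features from the key region. Next, a discriminative metric learning method maps the feature vectors through a metric matrix to new learned feature vectors. Finally, the extracted expression features are passed to a fully connected layer, and a Softmax classifier assigns them to one of seven predefined basic expressions. Experiments on the CK+ and RAF-DB databases show that the method achieves average recognition rates of 98.69% and 87.68%, improving the classifier's learning ability.
Keywords: deep attention; expression recognition; convolutional neural network; metric learning
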
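17. Construction and Application of a Convolution-Enhanced Vision Mamba Model (cited by: 1)
Authors: 俞焕友, 范静, 黄凡. 《计算机技术与发展》, 2025, No. 8, pp. 45-52 (8 pages)
Abstract: To address the limitations of the Vision Mamba (Vim) model, this paper proposes an improved model, Convolutional Vision Mamba (CvM). The model discards the image partitioning and position encoding mechanisms of Vim and replaces them with convolution operations, in order to process global visual information more efficiently. It also optimizes the position embedding module of Vim to address its inherently high computation and memory consumption. The CvM model is then applied to medical image classification, with experiments on data sets of blood cell images, brain tumor images, chest CT scans, pathological myopia fundus images, and pneumonia X-ray images. The results show that, compared with Vim and five other neural network models, CvM achieves higher accuracy and clear advantages in memory usage and parameter count. Ablation experiments show that depthwise separable convolution uses fewer parameters and less GPU memory than standard convolution, while also significantly improving accuracy on medical image classification tasks such as blood cell and brain tumor images. These results demonstrate the advantages and feasibility of the CvM model.
Keywords: deep learning; Vision Mamba; convolutional neural network; depthwise separable convolution; medical image classification
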
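18. Time-Series Prediction Method for Port Hoisting Equipment Fault Data Using a Fused Multi-Scale Attention Neural Network (cited by: 3)
Authors: 雷鹏, 谢敬玲, 许洪祖, 焦锋, 魏立明, 张忠岩, 吕成兴. 《机电工程》 (PKU Core), 2025, No. 2, pp. 277-286 (10 pages)
Abstract: In recent years, deep neural networks have been widely used in bearing time-series prediction. To further improve the accuracy of time-series prediction models for rolling bearings in port hoisting equipment, rolling bearings at key locations were modeled using the portal crane at Qingdao Port as an example, and a multi-scale attention prediction method for port equipment fault time-series data that incorporates improved variational mode decomposition was proposed. First, an improved grey wolf optimizer (IGWO) combining a nonlinear strategy with chaotic mapping adaptively determined the number of modes and the penalty factor of variational mode decomposition (VMD). Then, the intrinsic mode functions produced by VMD were used as time-series inputs to the fused multi-scale attention neural network (FMANN) model for multi-scale channel feature fusion. Finally, the predictions for the individual intrinsic mode functions were fused to obtain the final prediction. The results show that, on the slewing mechanism data set, the FMANN model achieves a root mean square error (RMSE) of 0.00112, a mean absolute percentage error (MAPE) of 6.3963%, and a coefficient of determination of 0.9998; compared with other prediction models, FMANN fits the actual data more closely. The FMANN model can accurately predict the time-series vibration of equipment bearings and may offer a new approach for future industrial production.
Keywords: rolling bearing; fault diagnosis; variational mode decomposition; attention mechanism; grey wolf optimization algorithm; fused multi-scale attention neural network; depthwise separable convolution
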
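19. Infrared Thermography Detection of Defects in GFRP/NOMEX Honeycomb Sandwich Structures Based on Convolutional Neural Networks (cited by: 1)
Authors: 唐庆菊, 谷卓妍, 卜红茹, 徐贵鹏, 谭鑫杰, 谢锐. 《光谱学与光谱分析》 (PKU Core), 2025, No. 2, pp. 542-550 (9 pages)
Abstract: Honeycomb sandwich structures are one of the important structural forms in composite materials. Because their fabrication is complex and their service environments are harsh, they are prone to defects such as delamination and debonding, which seriously shorten service life. To ensure the performance and safety of such components, regular quality monitoring and flaw detection with a suitable nondestructive testing technique are necessary, and quantitative defect detection is fundamental to preventing and resolving these problems. Using infrared thermography, pulsed infrared thermal wave nondestructive testing experiments were carried out on GFRP/NOMEX honeycomb sandwich specimens containing prefabricated delamination and debonding defects. Several frames of surface temperature distribution thermograms were acquired, and the temperature signals of pixels in a number of defective and healthy regions were used to build a sample data set, which was randomly split into training and validation sets; the horizontal line through the centers of the fourth row of defects served as the test set. Convolutional neural networks were then used to detect and identify defects and predict their depth. The one-dimensional CNN structure was analyzed, and multi-scale dilated convolution, residual modules, and an attention mechanism were introduced to build a one-dimensional CNN prediction model, which was trained on the temperature signal data set. Training results show that the loss and RMSE of the validation and training sets follow the same trend; the final validation loss is 1.67×10^(-5) and the RMSE is 0.0058, with no overfitting. The test set was then fed to the trained network. The results show that the network effectively identifies defects, with the depth prediction error at defect centers kept within 2%. Combining convolutional neural networks with infrared thermography enables reliable detection of defects in GFRP/NOMEX honeycomb sandwich structures and stable prediction of defect depth, and provides a reference for defect identification and quantitative detection in other composite materials.
Keywords: honeycomb sandwich structure; depth prediction; convolutional neural network; infrared thermography
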
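20. Credibility-Based Non-Line-of-Sight Identification and Positioning Algorithm
Authors: 刘林, 宋雨昊. 《中国惯性技术学报》 (PKU Core), 2025, No. 10, pp. 972-978 (7 pages)
Abstract: To improve ultra-wideband (UWB) positioning accuracy in non-line-of-sight (NLOS) scenarios, a credibility-based NLOS identification and positioning algorithm is proposed. First, real-time channel impulse response features and range measurements are extracted from the UWB diagnostic registers, and a one-dimensional convolutional neural network performs NLOS identification, estimating the probability that each range measurement is line-of-sight or NLOS. This probability is then used to construct a credibility measure, which drives base station selection and an improved positioning algorithm: a credibility-based weighted least squares-Taylor (WLS-Taylor) fusion filtering algorithm. Static and dynamic test data were collected in multiple scenarios for validation. The experimental results show that the proposed algorithm effectively suppresses the effect of NLOS on positioning results, with a mean positioning error below 10 cm in NLOS environments; in relatively severe NLOS environments, its positioning error is 76.94 cm lower than that of the distance-weighted WLS algorithm.
Keywords: ultra-wideband; channel response features; non-line-of-sight identification; one-dimensional deep convolutional neural network; credibility
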
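Entry 20 weights range measurements by a credibility derived from the NLOS probability. The sketch below shows only a plain weighted least squares position fix in 2D, as one way such weights could be used. It is not the paper's WLS-Taylor fusion filter, and the anchor layout, the weights, and the function name `weighted_ls_position` are hypothetical.

```python
import math


def weighted_ls_position(anchors, ranges, weights):
    """2D weighted least squares fix from range measurements.

    The range equations are linearized by subtracting the equation of the
    first anchor, so the first anchor should be the most credible one
    (its own weight is not used). Each remaining equation is weighted by
    the credibility of its range measurement.
    """
    (x0, y0), d0 = anchors[0], ranges[0]
    a11 = a12 = a22 = b1 = b2 = 0.0
    for (xi, yi), di, wi in zip(anchors[1:], ranges[1:], weights[1:]):
        ax = 2.0 * (xi - x0)
        ay = 2.0 * (yi - y0)
        rhs = d0**2 - di**2 + xi**2 + yi**2 - x0**2 - y0**2
        # Accumulate the weighted normal equations (A^T W A) p = A^T W b.
        a11 += wi * ax * ax
        a12 += wi * ax * ay
        a22 += wi * ay * ay
        b1 += wi * ax * rhs
        b2 += wi * ay * rhs
    det = a11 * a22 - a12 * a12
    if abs(det) < 1e-12:
        raise ValueError("anchor geometry is degenerate")
    return (a22 * b1 - a12 * b2) / det, (a11 * b2 - a12 * b1) / det


# Hypothetical anchors (metres), noise-free ranges to the point (3, 4),
# and made-up credibilities standing in for line-of-sight probabilities.
anchors = [(0.0, 0.0), (10.0, 0.0), (0.0, 10.0), (10.0, 10.0)]
truth = (3.0, 4.0)
ranges = [math.dist(a, truth) for a in anchors]
weights = [1.0, 0.9, 0.8, 0.2]
x, y = weighted_ls_position(anchors, ranges, weights)
print(round(x, 6), round(y, 6))
```

With noise-free ranges any positive weights recover the true position; the weights matter once some ranges carry NLOS bias, because low-credibility measurements then pull the solution less.
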