期刊文献+
共找到1,110篇文章
< 1 2 56 >
每页显示 20 50 100
3D Data Scattergram Image Classification Based Protection for Transmission Line Connecting BESS Using Depth-wise Separable Convolution Based CNN
1
作者 Yingyu Liang Yi Ren +1 位作者 Xiaoyang Yang Wenting Zha 《Journal of Modern Power Systems and Clean Energy》 2025年第2期609-621,共13页
The distinctive fault characteristics of battery energy storage stations(BESSs)significantly affect the reliability of conventional protection methods for transmission lines.In this paper,the three-dimensional(3D)data... The distinctive fault characteristics of battery energy storage stations(BESSs)significantly affect the reliability of conventional protection methods for transmission lines.In this paper,the three-dimensional(3D)data scattergrams are constructed using current data from both sides of the transmission line and their sum.Following a comprehensive analysis of the varying characteristics of 3D data scattergrams under different conditions,a 3D data scattergram image classification based protection method is developed.The depth-wise separable convolution is used to ensure a lightweight convolutional neural network(CNN)structure without compromising performance.In addition,a Bayesian hyperparameter optimization algorithm is used to achieve a hyperparametric search to simplify the training process.Compared with artificial neural networks and CNNs,the depth-wise separable convolution based CNN(DPCNN)achieves a higher recognition accuracy.The 3D data scattergram image classification based protection method using DPCNN can accurately separate internal faults from other disturbances and identify fault phases under different operating states and fault conditions.The proposed protection method also shows first-class tolerability against current transformer(CT)saturation and CT measurement errors. 展开更多
关键词 convolutional neural network(CNN) battery energy storage station(BESS) depth-wise separable convolution hyperparameter optimization fault classification line protection
原文传递
SEFormer:A Lightweight CNN-Transformer Based on Separable Multiscale Depthwise Convolution and Efficient Self-Attention for Rotating Machinery Fault Diagnosis 被引量:1
2
作者 Hongxing Wang Xilai Ju +1 位作者 Hua Zhu Huafeng Li 《Computers, Materials & Continua》 SCIE EI 2025年第1期1417-1437,共21页
Traditional data-driven fault diagnosis methods depend on expert experience to manually extract effective fault features of signals,which has certain limitations.Conversely,deep learning techniques have gained promine... Traditional data-driven fault diagnosis methods depend on expert experience to manually extract effective fault features of signals,which has certain limitations.Conversely,deep learning techniques have gained prominence as a central focus of research in the field of fault diagnosis by strong fault feature extraction ability and end-to-end fault diagnosis efficiency.Recently,utilizing the respective advantages of convolution neural network(CNN)and Transformer in local and global feature extraction,research on cooperating the two have demonstrated promise in the field of fault diagnosis.However,the cross-channel convolution mechanism in CNN and the self-attention calculations in Transformer contribute to excessive complexity in the cooperative model.This complexity results in high computational costs and limited industrial applicability.To tackle the above challenges,this paper proposes a lightweight CNN-Transformer named as SEFormer for rotating machinery fault diagnosis.First,a separable multiscale depthwise convolution block is designed to extract and integrate multiscale feature information from different channel dimensions of vibration signals.Then,an efficient self-attention block is developed to capture critical fine-grained features of the signal from a global perspective.Finally,experimental results on the planetary gearbox dataset and themotor roller bearing dataset prove that the proposed framework can balance the advantages of robustness,generalization and lightweight compared to recent state-of-the-art fault diagnosis models based on CNN and Transformer.This study presents a feasible strategy for developing a lightweight rotating machinery fault diagnosis framework aimed at economical deployment. 展开更多
关键词 CNN-Transformer separable multiscale depthwise convolution efficient self-attention fault diagnosis
在线阅读 下载PDF
Remaining Useful Life Prediction of Rail Based on Improved Pulse Separable Convolution Enhanced Transformer Encoder
3
作者 Zhongmei Wang Min Li +2 位作者 Jing He Jianhua Liu Lin Jia 《Journal of Transportation Technologies》 2024年第2期137-160,共24页
In order to prevent possible casualties and economic loss, it is critical to accurate prediction of the Remaining Useful Life (RUL) in rail prognostics health management. However, the traditional neural networks is di... In order to prevent possible casualties and economic loss, it is critical to accurate prediction of the Remaining Useful Life (RUL) in rail prognostics health management. However, the traditional neural networks is difficult to capture the long-term dependency relationship of the time series in the modeling of the long time series of rail damage, due to the coupling relationship of multi-channel data from multiple sensors. Here, in this paper, a novel RUL prediction model with an enhanced pulse separable convolution is used to solve this issue. Firstly, a coding module based on the improved pulse separable convolutional network is established to effectively model the relationship between the data. To enhance the network, an alternate gradient back propagation method is implemented. And an efficient channel attention (ECA) mechanism is developed for better emphasizing the useful pulse characteristics. Secondly, an optimized Transformer encoder was designed to serve as the backbone of the model. It has the ability to efficiently understand relationship between the data itself and each other at each time step of long time series with a full life cycle. More importantly, the Transformer encoder is improved by integrating pulse maximum pooling to retain more pulse timing characteristics. Finally, based on the characteristics of the front layer, the final predicted RUL value was provided and served as the end-to-end solution. The empirical findings validate the efficacy of the suggested approach in forecasting the rail RUL, surpassing various existing data-driven prognostication techniques. Meanwhile, the proposed method also shows good generalization performance on PHM2012 bearing data set. 展开更多
关键词 Equipment Health Prognostics Remaining Useful Life Prediction Pulse separable convolution Attention Mechanism Transformer Encoder
在线阅读 下载PDF
Validation Research on the Application of Depthwise Separable Convolutional Al Facial Expression Recognition in Non-pharmacological Treatment of BPSD
4
作者 Xiangyu Liu 《Journal of Clinical and Nursing Research》 2021年第4期31-37,共7页
One of the most obvious clinical reasons of dementia or The Behavioral and Psychological Symptoms of Dementia(BPSD)are the lack of emotional expression,the increased frequency of negative emotions,and the impermanence... One of the most obvious clinical reasons of dementia or The Behavioral and Psychological Symptoms of Dementia(BPSD)are the lack of emotional expression,the increased frequency of negative emotions,and the impermanence of emotions.Observing the reduction of BPSD in dementia through emotions can be considered effective and widely used in the field of non-pharmacological therapy.At present,this article will verify whether the image recognition artificial intelligence(AI)system can correctly reflect the emotional performance of the elderly with dementia through a questionnaire survey of three professional elderly nursing staff.The ANOVA(sig.=0.50)is used to determine that the judgment given by the nursing staff has no obvious deviation,and then Kendall's test(0.722**)and spearman's test(0.863**)are used to verify the judgment severity of the emotion recognition system and the nursing staff unanimously.This implies the usability of the tool.Additionally,it can be expected to be further applied in the research related to BPSD elderly emotion detection. 展开更多
关键词 depth-wise separable convolution EMOTION BPSD DEMENTIA Nursing
暂未订购
Coal/Gangue Volume Estimation with Convolutional Neural Network and Separation Based on Predicted Volume and Weight
5
作者 Zenglun Guan Murad S.Alfarzaeai +2 位作者 Eryi Hu Taqiaden Alshmeri Wang Peng 《Computers, Materials & Continua》 SCIE EI 2024年第4期279-306,共28页
In the coal mining industry,the gangue separation phase imposes a key challenge due to the high visual similaritybetween coal and gangue.Recently,separation methods have become more intelligent and efficient,using new... In the coal mining industry,the gangue separation phase imposes a key challenge due to the high visual similaritybetween coal and gangue.Recently,separation methods have become more intelligent and efficient,using newtechnologies and applying different features for recognition.One such method exploits the difference in substancedensity,leading to excellent coal/gangue recognition.Therefore,this study uses density differences to distinguishcoal from gangue by performing volume prediction on the samples.Our training samples maintain a record of3-side images as input,volume,and weight as the ground truth for the classification.The prediction process relieson a Convolutional neural network(CGVP-CNN)model that receives an input of a 3-side image and then extractsthe needed features to estimate an approximation for the volume.The classification was comparatively performedvia ten different classifiers,namely,K-Nearest Neighbors(KNN),Linear Support Vector Machines(Linear SVM),Radial Basis Function(RBF)SVM,Gaussian Process,Decision Tree,Random Forest,Multi-Layer Perceptron(MLP),Adaptive Boosting(AdaBosst),Naive Bayes,and Quadratic Discriminant Analysis(QDA).After severalexperiments on testing and training data,results yield a classification accuracy of 100%,92%,95%,96%,100%,100%,100%,96%,81%,and 92%,respectively.The test reveals the best timing with KNN,which maintained anaccuracy level of 100%.Assessing themodel generalization capability to newdata is essential to ensure the efficiencyof the model,so by applying a cross-validation experiment,the model generalization was measured.The useddataset was isolated based on the volume values to ensure the model generalization not only on new images of thesame volume but with a volume outside the trained range.Then,the predicted volume values were passed to theclassifiers group,where classification reported accuracy was found to be(100%,100%,100%,98%,88%,87%,100%,87%,97%,100%),respectively.Although obtaining a classification with high accuracy is the main motive,this workhas a remarkable reduction in the data preprocessing time compared to related works.The CGVP-CNN modelmanaged to reduce the data preprocessing time of previous works to 0.017 s while maintaining high classificationaccuracy using the estimated volume value. 展开更多
关键词 COAL coal gangue convolutional neural network CNN object classification volume estimation separation system
在线阅读 下载PDF
Fire Detection Method Based on Depthwise Separable Convolution and YOLOv3 被引量:6
6
作者 Yue-Yan Qin Jiang-Tao Cao Xiao-Fei Ji 《International Journal of Automation and computing》 EI CSCD 2021年第2期300-310,共11页
Recently,video-based fire detection technology has become an important research topic in the field of machine vision.This paper proposes a method of combining the classification model and target detection model in dee... Recently,video-based fire detection technology has become an important research topic in the field of machine vision.This paper proposes a method of combining the classification model and target detection model in deep learning for fire detection.Firstly,the depthwise separable convolution is used to classify fire images,which saves a lot of detection time under the premise of ensuring detection accuracy.Secondly,You Only Look Once version 3(YOLOv3)target regression function is used to output the fire position information for the images whose classification result is fire,which avoids the problem that the accuracy of detection cannot be guaranteed by using YOLOv3 for target classification and position regression.At the same time,the detection time of target regression for images without fire is greatly reduced saved.The experiments were tested using a network public database.The detection accuracy reached 98%and the detection rate reached 38fps.This method not only saves the workload of manually extracting flame characteristics,reduces the calculation cost,and reduces the amount of parameters,but also improves the detection accuracy and detection rate. 展开更多
关键词 Fire detection depthwise separable convolution fire classification You Only Look Once version 3(YOLOv3) target regression
原文传递
MSSTNet:Multi-scale facial videos pulse extraction network based on separable spatiotemporal convolution and dimension separable attention
7
作者 Changchen ZHAO Hongsheng WANG Yuanjing FENG 《Virtual Reality & Intelligent Hardware》 2023年第2期124-141,共18页
Background The use of remote photoplethysmography(rPPG)to estimate blood volume pulse in a noncontact manner has been an active research topic in recent years.Existing methods are primarily based on a singlescale regi... Background The use of remote photoplethysmography(rPPG)to estimate blood volume pulse in a noncontact manner has been an active research topic in recent years.Existing methods are primarily based on a singlescale region of interest(ROI).However,some noise signals that are not easily separated in a single-scale space can be easily separated in a multi-scale space.Also,existing spatiotemporal networks mainly focus on local spatiotemporal information and do not emphasize temporal information,which is crucial in pulse extraction problems,resulting in insufficient spatiotemporal feature modelling.Methods Here,we propose a multi-scale facial video pulse extraction network based on separable spatiotemporal convolution(SSTC)and dimension separable attention(DSAT).First,to solve the problem of a single-scale ROI,we constructed a multi-scale feature space for initial signal separation.Second,SSTC and DSAT were designed for efficient spatiotemporal correlation modeling,which increased the information interaction between the long-span time and space dimensions;this placed more emphasis on temporal features.Results The signal-to-noise ratio(SNR)of the proposed network reached 9.58dB on the PURE dataset and 6.77dB on the UBFC-rPPG dataset,outperforming state-of-the-art algorithms.Conclusions The results showed that fusing multi-scale signals yielded better results than methods based on only single-scale signals.The proposed SSTC and dimension-separable attention mechanism will contribute to more accurate pulse signal extraction. 展开更多
关键词 Remote photoplethysmography Heart rate separable spatiotemporal convolution Dimension separable attention MULTI-SCALE Neural network
在线阅读 下载PDF
A Lightweight Convolutional Neural Network with Hierarchical Multi-Scale Feature Fusion for Image Classification 被引量:2
8
作者 Adama Dembele Ronald Waweru Mwangi Ananda Omutokoh Kube 《Journal of Computer and Communications》 2024年第2期173-200,共28页
Convolutional neural networks (CNNs) are widely used in image classification tasks, but their increasing model size and computation make them challenging to implement on embedded systems with constrained hardware reso... Convolutional neural networks (CNNs) are widely used in image classification tasks, but their increasing model size and computation make them challenging to implement on embedded systems with constrained hardware resources. To address this issue, the MobileNetV1 network was developed, which employs depthwise convolution to reduce network complexity. MobileNetV1 employs a stride of 2 in several convolutional layers to decrease the spatial resolution of feature maps, thereby lowering computational costs. However, this stride setting can lead to a loss of spatial information, particularly affecting the detection and representation of smaller objects or finer details in images. To maintain the trade-off between complexity and model performance, a lightweight convolutional neural network with hierarchical multi-scale feature fusion based on the MobileNetV1 network is proposed. The network consists of two main subnetworks. The first subnetwork uses a depthwise dilated separable convolution (DDSC) layer to learn imaging features with fewer parameters, which results in a lightweight and computationally inexpensive network. Furthermore, depthwise dilated convolution in DDSC layer effectively expands the field of view of filters, allowing them to incorporate a larger context. The second subnetwork is a hierarchical multi-scale feature fusion (HMFF) module that uses parallel multi-resolution branches architecture to process the input feature map in order to extract the multi-scale feature information of the input image. Experimental results on the CIFAR-10, Malaria, and KvasirV1 datasets demonstrate that the proposed method is efficient, reducing the network parameters and computational cost by 65.02% and 39.78%, respectively, while maintaining the network performance compared to the MobileNetV1 baseline. 展开更多
关键词 MobileNet Image Classification Lightweight convolutional Neural Network Depthwise Dilated separable convolution Hierarchical Multi-Scale Feature Fusion
在线阅读 下载PDF
SepFE:Separable Fusion Enhanced Network for Retinal Vessel Segmentation 被引量:2
9
作者 Yun Wu Ge Jiao Jiahao Liu 《Computer Modeling in Engineering & Sciences》 SCIE EI 2023年第9期2465-2485,共21页
The accurate and automatic segmentation of retinal vessels fromfundus images is critical for the early diagnosis and prevention ofmany eye diseases,such as diabetic retinopathy(DR).Existing retinal vessel segmentation... The accurate and automatic segmentation of retinal vessels fromfundus images is critical for the early diagnosis and prevention ofmany eye diseases,such as diabetic retinopathy(DR).Existing retinal vessel segmentation approaches based on convolutional neural networks(CNNs)have achieved remarkable effectiveness.Here,we extend a retinal vessel segmentation model with low complexity and high performance based on U-Net,which is one of the most popular architectures.In view of the excellent work of depth-wise separable convolution,we introduce it to replace the standard convolutional layer.The complexity of the proposed model is reduced by decreasing the number of parameters and calculations required for themodel.To ensure performance while lowering redundant parameters,we integrate the pre-trained MobileNet V2 into the encoder.Then,a feature fusion residual module(FFRM)is designed to facilitate complementary strengths by enhancing the effective fusion between adjacent levels,which alleviates extraneous clutter introduced by direct fusion.Finally,we provide detailed comparisons between the proposed SepFE and U-Net in three retinal image mainstream datasets(DRIVE,STARE,and CHASEDB1).The results show that the number of SepFE parameters is only 3%of U-Net,the Flops are only 8%of U-Net,and better segmentation performance is obtained.The superiority of SepFE is further demonstrated through comparisons with other advanced methods. 展开更多
关键词 Retinal vessel segmentation U-Net depth-wise separable convolution feature fusion
暂未订购
A WEIGHTED GENERAL DISCRETE FOURIER TRANSFORM FOR THE FREQUENCY-DOMAIN BLIND SOURCE SEPARATION OF CONVOLUTIVE MIXTURES 被引量:1
10
作者 Wang Chao Fang Yong Feng Jiuchao 《Journal of Electronics(China)》 2008年第6期830-833,共4页
This letter deals with the frequency domain Blind Source Separation of Convolutive Mixtures (CMBSS). From the frequency representation of the "overlap and save", a Weighted General Discrete Fourier Transform... This letter deals with the frequency domain Blind Source Separation of Convolutive Mixtures (CMBSS). From the frequency representation of the "overlap and save", a Weighted General Discrete Fourier Transform (WGDFT) is derived to replace the traditional Discrete Fourier Transform (DFT). The mixing matrix on each frequency bin could be estimated more precisely from WGDFT coefficients than from DFT coefficients, which improves separation performance. Simulation results verify the validity of WGDFT for frequency domain blind source separation of convolutive mixtures. 展开更多
关键词 Blind Source separation of convolutive Mixtures (CMBSS) Frequency representation of overlap and save Weighted General Discrete Fourier Transform (WGDFT)
在线阅读 下载PDF
Maximum Likelihood Blind Separation of Convolutively Mixed Discrete Sources
11
作者 辜方林 张杭 朱德生 《China Communications》 SCIE CSCD 2013年第6期60-67,共8页
In this paper,a Maximum Likelihood(ML) approach,implemented by Expectation-Maximization(EM) algorithm,is proposed to blind separation of convolutively mixed discrete sources.In order to carry out the expectation proce... In this paper,a Maximum Likelihood(ML) approach,implemented by Expectation-Maximization(EM) algorithm,is proposed to blind separation of convolutively mixed discrete sources.In order to carry out the expectation procedure of the EM algorithm with a less computational load,the algorithm named Iterative Maximum Likelihood algorithm(IML) is proposed to calculate the likelihood and recover the source signals.An important feature of the ML approach is that it has robust performance in noise environments by treating the covariance matrix of the additive Gaussian noise as a parameter.Another striking feature of the ML approach is that it is possible to separate more sources than sensors by exploiting the finite alphabet property of the sources.Simulation results show that the proposed ML approach works well either in determined mixtures or underdetermined mixtures.Furthermore,the performance of the proposed ML algorithm is close to the performance with perfect knowledge of the channel filters. 展开更多
关键词 Blind Source separation convolutive mixture EM Finite Alphabet
在线阅读 下载PDF
AN NMF ALGORITHM FOR BLIND SEPARATION OF CONVOLUTIVE MIXED SOURCE SIGNALS WITH LEAST CORRELATION CONSTRAINS
12
作者 Zhang Ye Fang Yong 《Journal of Electronics(China)》 2009年第4期557-563,共7页
Most of the existing algorithms for blind sources separation have a limitation that sources are statistically independent. However, in many practical applications, the source signals are non- negative and mutual stati... Most of the existing algorithms for blind sources separation have a limitation that sources are statistically independent. However, in many practical applications, the source signals are non- negative and mutual statistically dependent signals. When the observations are nonnegative linear combinations of nonnegative sources, the correlation coefficients of the observations are larger than these of source signals. In this letter, a novel Nonnegative Matrix Factorization (NMF) algorithm with least correlated component constraints to blind separation of convolutive mixed sources is proposed. The algorithm relaxes the source independence assumption and has low-complexity algebraic com- putations. Simulation results on blind source separation including real face image data indicate that the sources can be successfully recovered with the algorithm. 展开更多
关键词 Nonnegative matrix factorization convolutive blind source separation Correlation constrain
在线阅读 下载PDF
A Framework of Lightweight Deep Cross-Connected Convolution Kernel Mapping Support Vector Machines
13
作者 Qi Wang Zhaoying Liu +3 位作者 Ting Zhang Shanshan Tu Yujian Li Muhammad Waqas 《Journal on Artificial Intelligence》 2022年第1期37-48,共12页
Deep kernel mapping support vector machines have achieved good results in numerous tasks by mapping features from a low-dimensional space to a high-dimensional space and then using support vector machines for classifi... Deep kernel mapping support vector machines have achieved good results in numerous tasks by mapping features from a low-dimensional space to a high-dimensional space and then using support vector machines for classification.However,the depth kernel mapping support vector machine does not take into account the connection of different dimensional spaces and increases the model parameters.To further improve the recognition capability of deep kernel mapping support vector machines while reducing the number of model parameters,this paper proposes a framework of Lightweight Deep Convolutional Cross-Connected Kernel Mapping Support Vector Machines(LC-CKMSVM).The framework consists of a feature extraction module and a classification module.The feature extraction module first maps the data from low-dimensional to high-dimensional space by fusing the representations of different dimensional spaces through cross-connections;then,it uses depthwise separable convolution to replace part of the original convolution to reduce the number of parameters in the module;The classification module uses a soft margin support vector machine for classification.The results on 6 different visual datasets show that LC-CKMSVM obtains better classification accuracies on most cases than the other five models. 展开更多
关键词 convolutional neural network cross-connected lightweight framework depthwise separable convolution
在线阅读 下载PDF
基于YOLOv5s的轻量化森林火灾探测算法 被引量:2
14
作者 刘惠临 方琼 +3 位作者 江宇 魏华章 王涛 张树川 《中国安全科学学报》 北大核心 2025年第1期75-83,共9页
为解决当前基于深度学习的森林火灾探测算法存在结构复杂、规模庞大,且难以兼顾检测精度和效率的问题,提出一种基于YOLOv5s的轻量化森林火灾探测算法。首先,采用优化的背景差分技术消除背景图像中类火物体的干扰,减少分析图像所需的时间... 为解决当前基于深度学习的森林火灾探测算法存在结构复杂、规模庞大,且难以兼顾检测精度和效率的问题,提出一种基于YOLOv5s的轻量化森林火灾探测算法。首先,采用优化的背景差分技术消除背景图像中类火物体的干扰,减少分析图像所需的时间;其次,设计分组混洗策略优化常规卷积,并在特征提取的C3模块中融入高效通道注意力(ECA)机制和深度可分离卷积,增强图像特征提取与融合能力的同时有效降低模型的参数量;然后,采用动态非单调聚焦机制优化Wise-交并比(WIOU)损失函数,减少低质量样本产生的有害梯度;最后,在构建的森林火灾数据集上将所提算法与其他算法做充分的试验对比。结果表明:所提算法在各类场景均展现出良好的泛化性,对火焰目标的检测精度达到86.1%,较标准YOLOv5s检测精度提升2.7%,检测速度提升11.4%,有效降低了火灾误报率,增强了模型的检测性能。 展开更多
关键词 YOLOv5s 轻量化 森林火灾探测 深度可分离卷积 注意力 Wise-交并比(WIOU)
原文传递
Fault Diagnosis for Wind Turbine Flange Bolts Based on One-Dimensional Depthwise Separable Convolutions
15
作者 Yongchao Liu Shuqing Dong +3 位作者 Qingfeng Wang Wenhe Cai Ruizhuo Song Qinglai Wei 《The International Journal of Intelligent Control and Systems》 2024年第1期42-47,共6页
In this paper,a new bolt fault diagnosis method is developed to solve the fault diagnosis problem of wind turbine flange bolts using one-dimensional depthwise separable convolutions.The main idea is to use a one-dimen... In this paper,a new bolt fault diagnosis method is developed to solve the fault diagnosis problem of wind turbine flange bolts using one-dimensional depthwise separable convolutions.The main idea is to use a one-dimensional convolutional neural network model to classify and identify the acoustic vibration signals of bolts,which represent different bolt damage states.Through the methods of knock test and modal simulation,it is concluded that the damage state of wind turbine flange bolt is related to the natural frequency distribution of acoustic vibration signal.It is found that the bolt damage state affects the modal shape of the structure,and then affects the natural frequency distribution of the bolt vibration signal.Therefore,the damage state can be obtained by identifying the natural frequency distribution of the bolt acoustic vibration signal.In the present one-dimensional depth-detachable convolutional neural network model,the one-dimensional vector is first convolved into multiple channels,and then each channel is separately learned by depth-detachable convolution,which can effectively improve the feature quality and the effect of data classification.From the perspective of the realization mechanism of convolution operation,the depthwise separable convolution operation has fewer parameters and faster computing speed,making it easier to build lightweight models and deploy them to mobile devices. 展开更多
关键词 Wind turbine flange bolts one-dimensional convolutional neural network(1DCNN)model depthwise separable convolutions damage identification
在线阅读 下载PDF
基于DDE-BIT的无人机高速公路护栏损坏检测 被引量:2
16
作者 王洋 郭杜杜 帅洪波 《现代电子技术》 北大核心 2025年第4期123-129,共7页
针对现有方法对无人机高速公路护栏损坏检测存在边缘信息提取效果差、识别精度低的问题,提出一种基于深度学习的变化检测模型DDE-BIT。首先,采用深度可分离卷积优化主干网络Resnet18,减少模型的参数数量,降低计算成本;然后,在主干网络... 针对现有方法对无人机高速公路护栏损坏检测存在边缘信息提取效果差、识别精度低的问题,提出一种基于深度学习的变化检测模型DDE-BIT。首先,采用深度可分离卷积优化主干网络Resnet18,减少模型的参数数量,降低计算成本;然后,在主干网络输出部分引入ECA注意力模块,在仅增加少量参数的情况下提高模型的跨通道信息捕捉能力;最后,通过跳跃连接方式对BIT双时空图像转换器的输出特征进行堆叠,提高模型的上下文信息理解能力。以采集的无人机高速公路护栏损坏图像为实验数据,实验结果表明:DDE-BIT模型的交并比和F1分数分别为90.99%、95.28%,相较于原始模型分别提高了2.71%、1.51%,能够有效地提取护栏损坏的边缘信息。 展开更多
关键词 护栏损坏检测 无人机 ECA注意力机制 深度可分离卷积 图像处理 信息提取
在线阅读 下载PDF
基于轻量化PPINET的花生荚果实时识别方法 被引量:1
17
作者 员玉良 黄劲龙 +2 位作者 李德豪 王方艳 马德新 《农业工程学报》 北大核心 2025年第12期182-190,共9页
传统CNN算法在花生荚果外观识别任务中存在内存密集型和计算密集型问题,以及其在资源受限的边缘终端上部署困难,基于此,该研究提出了一种高效的花生荚果识别模型——PPINET(peanut pod identification network),以适应嵌入式设备的资源... 传统CNN算法在花生荚果外观识别任务中存在内存密集型和计算密集型问题,以及其在资源受限的边缘终端上部署困难,基于此,该研究提出了一种高效的花生荚果识别模型——PPINET(peanut pod identification network),以适应嵌入式设备的资源限制需求。该模型通过结合深度可分离卷积和倒残差结构显著降低参数量和计算量,同时保留特征提取能力,并引入MQA(multi-query attention)模块增强关键特征提取,并利用TuNAS(easy-to-tune and scalable implementation of efficient neural architecture search with weight sharing)策略优化模型结构,使其在资源受限设备上表现优异。此外,采用ResNet(residual neural network)进行知识蒸馏配合三折交叉验证训练提升精度,最终量化为RKNN格式并在瑞芯微RK3588上实现NPU加速部署。PPINET模型尺寸仅为1.85 MB,参数量为0.49 M,浮点运算数为0.30G。PPINET在花生荚果分类中表现优异,准确率达98.65%,在RK3588上推理速度达321 fps。该模型具备较高的识别准确率和快速的识别速度,能够实现花生荚果的实时精准检测。 展开更多
关键词 花生荚果 深度可分离卷积 三折交叉验证 知识蒸馏 嵌入式部署
在线阅读 下载PDF
基于自注意力机制的高分遥感影像语义分割 被引量:2
18
作者 杨军 张金影 康玥 《哈尔滨工程大学学报》 北大核心 2025年第2期344-354,共11页
针对遥感影像多尺度特征提取困难、上下文信息利用不足的问题,本文结合自注意力机制和深度可分离卷积提出一种线性多头自注意力网络模型,适用于高分辨率遥感影像语义分割。在自注意力模块之前引入深度可分离卷积,减少计算量的同时有助... 针对遥感影像多尺度特征提取困难、上下文信息利用不足的问题,本文结合自注意力机制和深度可分离卷积提出一种线性多头自注意力网络模型,适用于高分辨率遥感影像语义分割。在自注意力模块之前引入深度可分离卷积,减少计算量的同时有助于捕获局部特征;在编码器分支中提出线性的多头自注意力模块以降低模型的计算复杂度;设计一个解码器来恢复特征图分辨率,通过级联操作整合各层级的特征并生成高分辨率的语义分割结果。所提算法在ISPRS Vaihingen和Potsdam数据集上的分割结果的mF1分别达到了90.77%和92.36%,与目前主流算法相比,不透水表面、建筑、低矮植物、树木类的分割准确率及总体分割准确率均有提高。本文算法构建的线性多头自注意力网络是一种高效的高分辨率遥感影像语义分割模型。 展开更多
关键词 高分辨率遥感影像 多头自注意力 深度可分离卷积 语义分割 特征提取 卷积神经网络 编码器 解码器
在线阅读 下载PDF
基于深度可分离卷积混合网络模型的地浸采铀注液量预测研究 被引量:2
19
作者 刘志锋 唐俊贤 +1 位作者 林芝宁 周义朋 《铀矿冶》 2025年第1期9-17,共9页
地浸采铀作为铀矿的绿色开采技术,在生产运行中产生海量数据,利用这些海量数据进行大数据分析和趋势预测,能够提升技术人员制定生产计划的可靠性。目前采用的基于编码器-解码器结构的时序预测模型,由于存在注意力机制,导致计算复杂、内... 地浸采铀作为铀矿的绿色开采技术,在生产运行中产生海量数据,利用这些海量数据进行大数据分析和趋势预测,能够提升技术人员制定生产计划的可靠性。目前采用的基于编码器-解码器结构的时序预测模型,由于存在注意力机制,导致计算复杂、内存消耗大。本研究提出深度可分离卷积混合模型,通过动态序列分割模块降低固定分割带来的语义破坏,通过深度可分离卷积混合模块降低模型运行时间并捕获局部和全局特征。结果表明,深度可分离卷积混合网络模型的均方误差(Mean Square Error,MSE)与平均绝对误差(Mean Absolute Error,MAE)相较于时间序列分块自注意力模型(Patch Time Series Transformer,PatchTST)分别降低了1.04%和4.13%,提出的动态序列分割模块的MSE与MAE相较于原有模型分别降低了7.32%和5.03%;在性能对比分析上,深度可分离卷积混合模型的训练速度相较于趋势季节分解线性模型(Decomposition Linear,DLinear)提高了59.91%。建立的模型能够准确预测采区生产运行中硫酸注液量的变化趋势,改善了现有预测模型针对地浸铀矿数据集存在的运行时间长、运行内存大、数据拟合差的问题,可为地浸铀矿生产决策提供理论和实践参考。 展开更多
关键词 地浸采铀 注液量预测 深度可分离卷积 预测模型
在线阅读 下载PDF
一种基于ASPPUnet的道路裂缝检测模型 被引量:1
20
作者 曹一冰 张江水 +1 位作者 张政 赵鑫科 《测绘科学技术学报》 2025年第1期49-56,共8页
为了更加精确高效地对道路裂缝进行分割提取,提出一种基于多尺度特征与上下文信息融合的ASPPUnet道路裂缝检测模型。ASPPUnet通过U形编码解码器进行多尺度特征的提取,通过引入ASPP模块进行不同范围上下文信息的融合;同时模型还引入了深... 为了更加精确高效地对道路裂缝进行分割提取,提出一种基于多尺度特征与上下文信息融合的ASPPUnet道路裂缝检测模型。ASPPUnet通过U形编码解码器进行多尺度特征的提取,通过引入ASPP模块进行不同范围上下文信息的融合;同时模型还引入了深度可分离卷积模块,用以实现模型的轻量化;采用融合Dice和交叉熵的损失函数,均衡模型的查全率和查准率;采用动态数据集增广方法,使得模型在小数据集上也能实现良好的检测效果。通过与Unet等模型的实验对比可以看出,ASPPUnet拥有更好的检测效果和可塑性,具有较好的应用价值。 展开更多
关键词 裂缝检测 图像分割 深度可分离卷积 损失函数 ASPP模块 Unet模型
在线阅读 下载PDF
上一页 1 2 56 下一页 到第
使用帮助 返回顶部