期刊文献+
共找到1,149篇文章
< 1 2 58 >
每页显示 20 50 100
3D Data Scattergram Image Classification Based Protection for Transmission Line Connecting BESS Using Depth-wise Separable Convolution Based CNN 被引量:1
1
作者 Yingyu Liang Yi Ren +1 位作者 Xiaoyang Yang Wenting Zha 《Journal of Modern Power Systems and Clean Energy》 2025年第2期609-621,共13页
The distinctive fault characteristics of battery energy storage stations(BESSs)significantly affect the reliability of conventional protection methods for transmission lines.In this paper,the three-dimensional(3D)data... The distinctive fault characteristics of battery energy storage stations(BESSs)significantly affect the reliability of conventional protection methods for transmission lines.In this paper,the three-dimensional(3D)data scattergrams are constructed using current data from both sides of the transmission line and their sum.Following a comprehensive analysis of the varying characteristics of 3D data scattergrams under different conditions,a 3D data scattergram image classification based protection method is developed.The depth-wise separable convolution is used to ensure a lightweight convolutional neural network(CNN)structure without compromising performance.In addition,a Bayesian hyperparameter optimization algorithm is used to achieve a hyperparametric search to simplify the training process.Compared with artificial neural networks and CNNs,the depth-wise separable convolution based CNN(DPCNN)achieves a higher recognition accuracy.The 3D data scattergram image classification based protection method using DPCNN can accurately separate internal faults from other disturbances and identify fault phases under different operating states and fault conditions.The proposed protection method also shows first-class tolerability against current transformer(CT)saturation and CT measurement errors. 展开更多
关键词 convolutional neural network(CNN) battery energy storage station(BESS) depth-wise separable convolution hyperparameter optimization fault classification line protection
原文传递
SEFormer:A Lightweight CNN-Transformer Based on Separable Multiscale Depthwise Convolution and Efficient Self-Attention for Rotating Machinery Fault Diagnosis 被引量:3
2
作者 Hongxing Wang Xilai Ju +1 位作者 Hua Zhu Huafeng Li 《Computers, Materials & Continua》 SCIE EI 2025年第1期1417-1437,共21页
Traditional data-driven fault diagnosis methods depend on expert experience to manually extract effective fault features of signals,which has certain limitations.Conversely,deep learning techniques have gained promine... Traditional data-driven fault diagnosis methods depend on expert experience to manually extract effective fault features of signals,which has certain limitations.Conversely,deep learning techniques have gained prominence as a central focus of research in the field of fault diagnosis by strong fault feature extraction ability and end-to-end fault diagnosis efficiency.Recently,utilizing the respective advantages of convolution neural network(CNN)and Transformer in local and global feature extraction,research on cooperating the two have demonstrated promise in the field of fault diagnosis.However,the cross-channel convolution mechanism in CNN and the self-attention calculations in Transformer contribute to excessive complexity in the cooperative model.This complexity results in high computational costs and limited industrial applicability.To tackle the above challenges,this paper proposes a lightweight CNN-Transformer named as SEFormer for rotating machinery fault diagnosis.First,a separable multiscale depthwise convolution block is designed to extract and integrate multiscale feature information from different channel dimensions of vibration signals.Then,an efficient self-attention block is developed to capture critical fine-grained features of the signal from a global perspective.Finally,experimental results on the planetary gearbox dataset and themotor roller bearing dataset prove that the proposed framework can balance the advantages of robustness,generalization and lightweight compared to recent state-of-the-art fault diagnosis models based on CNN and Transformer.This study presents a feasible strategy for developing a lightweight rotating machinery fault diagnosis framework aimed at economical deployment. 展开更多
关键词 CNN-Transformer separable multiscale depthwise convolution efficient self-attention fault diagnosis
在线阅读 下载PDF
Validation Research on the Application of Depthwise Separable Convolutional Al Facial Expression Recognition in Non-pharmacological Treatment of BPSD
3
作者 Xiangyu Liu 《Journal of Clinical and Nursing Research》 2021年第4期31-37,共7页
One of the most obvious clinical reasons of dementia or The Behavioral and Psychological Symptoms of Dementia(BPSD)are the lack of emotional expression,the increased frequency of negative emotions,and the impermanence... One of the most obvious clinical reasons of dementia or The Behavioral and Psychological Symptoms of Dementia(BPSD)are the lack of emotional expression,the increased frequency of negative emotions,and the impermanence of emotions.Observing the reduction of BPSD in dementia through emotions can be considered effective and widely used in the field of non-pharmacological therapy.At present,this article will verify whether the image recognition artificial intelligence(AI)system can correctly reflect the emotional performance of the elderly with dementia through a questionnaire survey of three professional elderly nursing staff.The ANOVA(sig.=0.50)is used to determine that the judgment given by the nursing staff has no obvious deviation,and then Kendall's test(0.722**)and spearman's test(0.863**)are used to verify the judgment severity of the emotion recognition system and the nursing staff unanimously.This implies the usability of the tool.Additionally,it can be expected to be further applied in the research related to BPSD elderly emotion detection. 展开更多
关键词 depth-wise separable convolution EMOTION BPSD DEMENTIA Nursing
暂未订购
Fire Detection Method Based on Depthwise Separable Convolution and YOLOv3 被引量:6
4
作者 Yue-Yan Qin Jiang-Tao Cao Xiao-Fei Ji 《International Journal of Automation and computing》 EI CSCD 2021年第2期300-310,共11页
Recently,video-based fire detection technology has become an important research topic in the field of machine vision.This paper proposes a method of combining the classification model and target detection model in dee... Recently,video-based fire detection technology has become an important research topic in the field of machine vision.This paper proposes a method of combining the classification model and target detection model in deep learning for fire detection.Firstly,the depthwise separable convolution is used to classify fire images,which saves a lot of detection time under the premise of ensuring detection accuracy.Secondly,You Only Look Once version 3(YOLOv3)target regression function is used to output the fire position information for the images whose classification result is fire,which avoids the problem that the accuracy of detection cannot be guaranteed by using YOLOv3 for target classification and position regression.At the same time,the detection time of target regression for images without fire is greatly reduced saved.The experiments were tested using a network public database.The detection accuracy reached 98%and the detection rate reached 38fps.This method not only saves the workload of manually extracting flame characteristics,reduces the calculation cost,and reduces the amount of parameters,but also improves the detection accuracy and detection rate. 展开更多
关键词 Fire detection depthwise separable convolution fire classification You Only Look Once version 3(YOLOv3) target regression
原文传递
MSSTNet:Multi-scale facial videos pulse extraction network based on separable spatiotemporal convolution and dimension separable attention
5
作者 Changchen ZHAO Hongsheng WANG Yuanjing FENG 《Virtual Reality & Intelligent Hardware》 2023年第2期124-141,共18页
Background The use of remote photoplethysmography(rPPG)to estimate blood volume pulse in a noncontact manner has been an active research topic in recent years.Existing methods are primarily based on a singlescale regi... Background The use of remote photoplethysmography(rPPG)to estimate blood volume pulse in a noncontact manner has been an active research topic in recent years.Existing methods are primarily based on a singlescale region of interest(ROI).However,some noise signals that are not easily separated in a single-scale space can be easily separated in a multi-scale space.Also,existing spatiotemporal networks mainly focus on local spatiotemporal information and do not emphasize temporal information,which is crucial in pulse extraction problems,resulting in insufficient spatiotemporal feature modelling.Methods Here,we propose a multi-scale facial video pulse extraction network based on separable spatiotemporal convolution(SSTC)and dimension separable attention(DSAT).First,to solve the problem of a single-scale ROI,we constructed a multi-scale feature space for initial signal separation.Second,SSTC and DSAT were designed for efficient spatiotemporal correlation modeling,which increased the information interaction between the long-span time and space dimensions;this placed more emphasis on temporal features.Results The signal-to-noise ratio(SNR)of the proposed network reached 9.58dB on the PURE dataset and 6.77dB on the UBFC-rPPG dataset,outperforming state-of-the-art algorithms.Conclusions The results showed that fusing multi-scale signals yielded better results than methods based on only single-scale signals.The proposed SSTC and dimension-separable attention mechanism will contribute to more accurate pulse signal extraction. 展开更多
关键词 Remote photoplethysmography Heart rate separable spatiotemporal convolution Dimension separable attention MULTI-SCALE Neural network
在线阅读 下载PDF
Remaining Useful Life Prediction of Rail Based on Improved Pulse Separable Convolution Enhanced Transformer Encoder
6
作者 Zhongmei Wang Min Li +2 位作者 Jing He Jianhua Liu Lin Jia 《Journal of Transportation Technologies》 2024年第2期137-160,共24页
In order to prevent possible casualties and economic loss, it is critical to accurate prediction of the Remaining Useful Life (RUL) in rail prognostics health management. However, the traditional neural networks is di... In order to prevent possible casualties and economic loss, it is critical to accurate prediction of the Remaining Useful Life (RUL) in rail prognostics health management. However, the traditional neural networks is difficult to capture the long-term dependency relationship of the time series in the modeling of the long time series of rail damage, due to the coupling relationship of multi-channel data from multiple sensors. Here, in this paper, a novel RUL prediction model with an enhanced pulse separable convolution is used to solve this issue. Firstly, a coding module based on the improved pulse separable convolutional network is established to effectively model the relationship between the data. To enhance the network, an alternate gradient back propagation method is implemented. And an efficient channel attention (ECA) mechanism is developed for better emphasizing the useful pulse characteristics. Secondly, an optimized Transformer encoder was designed to serve as the backbone of the model. It has the ability to efficiently understand relationship between the data itself and each other at each time step of long time series with a full life cycle. More importantly, the Transformer encoder is improved by integrating pulse maximum pooling to retain more pulse timing characteristics. Finally, based on the characteristics of the front layer, the final predicted RUL value was provided and served as the end-to-end solution. The empirical findings validate the efficacy of the suggested approach in forecasting the rail RUL, surpassing various existing data-driven prognostication techniques. Meanwhile, the proposed method also shows good generalization performance on PHM2012 bearing data set. 展开更多
关键词 Equipment Health Prognostics Remaining Useful Life Prediction Pulse separable convolution Attention Mechanism Transformer Encoder
在线阅读 下载PDF
Coal/Gangue Volume Estimation with Convolutional Neural Network and Separation Based on Predicted Volume and Weight
7
作者 Zenglun Guan Murad S.Alfarzaeai +2 位作者 Eryi Hu Taqiaden Alshmeri Wang Peng 《Computers, Materials & Continua》 SCIE EI 2024年第4期279-306,共28页
In the coal mining industry,the gangue separation phase imposes a key challenge due to the high visual similaritybetween coal and gangue.Recently,separation methods have become more intelligent and efficient,using new... In the coal mining industry,the gangue separation phase imposes a key challenge due to the high visual similaritybetween coal and gangue.Recently,separation methods have become more intelligent and efficient,using newtechnologies and applying different features for recognition.One such method exploits the difference in substancedensity,leading to excellent coal/gangue recognition.Therefore,this study uses density differences to distinguishcoal from gangue by performing volume prediction on the samples.Our training samples maintain a record of3-side images as input,volume,and weight as the ground truth for the classification.The prediction process relieson a Convolutional neural network(CGVP-CNN)model that receives an input of a 3-side image and then extractsthe needed features to estimate an approximation for the volume.The classification was comparatively performedvia ten different classifiers,namely,K-Nearest Neighbors(KNN),Linear Support Vector Machines(Linear SVM),Radial Basis Function(RBF)SVM,Gaussian Process,Decision Tree,Random Forest,Multi-Layer Perceptron(MLP),Adaptive Boosting(AdaBosst),Naive Bayes,and Quadratic Discriminant Analysis(QDA).After severalexperiments on testing and training data,results yield a classification accuracy of 100%,92%,95%,96%,100%,100%,100%,96%,81%,and 92%,respectively.The test reveals the best timing with KNN,which maintained anaccuracy level of 100%.Assessing themodel generalization capability to newdata is essential to ensure the efficiencyof the model,so by applying a cross-validation experiment,the model generalization was measured.The useddataset was isolated based on the volume values to ensure the model generalization not only on new images of thesame volume but with a volume outside the trained range.Then,the predicted volume values were passed to theclassifiers group,where classification reported accuracy was found to be(100%,100%,100%,98%,88%,87%,100%,87%,97%,100%),respectively.Although obtaining a classification with high accuracy is the main motive,this workhas a remarkable reduction in the data preprocessing time compared to related works.The CGVP-CNN modelmanaged to reduce the data preprocessing time of previous works to 0.017 s while maintaining high classificationaccuracy using the estimated volume value. 展开更多
关键词 COAL coal gangue convolutional neural network CNN object classification volume estimation separation system
在线阅读 下载PDF
SepFE:Separable Fusion Enhanced Network for Retinal Vessel Segmentation 被引量:2
8
作者 Yun Wu Ge Jiao Jiahao Liu 《Computer Modeling in Engineering & Sciences》 SCIE EI 2023年第9期2465-2485,共21页
The accurate and automatic segmentation of retinal vessels fromfundus images is critical for the early diagnosis and prevention ofmany eye diseases,such as diabetic retinopathy(DR).Existing retinal vessel segmentation... The accurate and automatic segmentation of retinal vessels fromfundus images is critical for the early diagnosis and prevention ofmany eye diseases,such as diabetic retinopathy(DR).Existing retinal vessel segmentation approaches based on convolutional neural networks(CNNs)have achieved remarkable effectiveness.Here,we extend a retinal vessel segmentation model with low complexity and high performance based on U-Net,which is one of the most popular architectures.In view of the excellent work of depth-wise separable convolution,we introduce it to replace the standard convolutional layer.The complexity of the proposed model is reduced by decreasing the number of parameters and calculations required for themodel.To ensure performance while lowering redundant parameters,we integrate the pre-trained MobileNet V2 into the encoder.Then,a feature fusion residual module(FFRM)is designed to facilitate complementary strengths by enhancing the effective fusion between adjacent levels,which alleviates extraneous clutter introduced by direct fusion.Finally,we provide detailed comparisons between the proposed SepFE and U-Net in three retinal image mainstream datasets(DRIVE,STARE,and CHASEDB1).The results show that the number of SepFE parameters is only 3%of U-Net,the Flops are only 8%of U-Net,and better segmentation performance is obtained.The superiority of SepFE is further demonstrated through comparisons with other advanced methods. 展开更多
关键词 Retinal vessel segmentation U-Net depth-wise separable convolution feature fusion
暂未订购
A depth-wise separable residual neural network for PCDH8 status prediction in thyroid cancer pathological images
9
作者 Linlin Qi Xiangyu Li +2 位作者 Zhihong Liu Pei Zhang Liangliang Liu 《Intelligent Oncology》 2025年第4期290-298,共9页
Introduction:Accurate prediction of protocadherin 8(PCDH8)gene expression status from whole-slide images(WSIs)is critical for thyroid cancer diagnosis and prognosis,as PCDH8 overexpression is associated with tumor agg... Introduction:Accurate prediction of protocadherin 8(PCDH8)gene expression status from whole-slide images(WSIs)is critical for thyroid cancer diagnosis and prognosis,as PCDH8 overexpression is associated with tumor aggressiveness and poor outcomes.Existing methods for PCDH8 detection are often costly,time-consuming,or require specialized expertise.To address these limitations,we developed a novel depth-wise separable residual neural network(DSRNet)for noninvasive PCDH8 status prediction directly from WSIs.Materials and methods:We collected 403 thyroid cancer WSIs from The Cancer Genome Atlas(TCGA),with PCDH8 expression status classified as high or low based on median expression values.Each WSI was divided into 512×512 pixel tiles,with the top 100 non-white tiles selected per slide.DSRNet integrates depth-wise separable convolutions,residual connections,and a deformable convolutional pyramid pooling module to efficiently capture multiscale and long-range features in gigapixel WSIs.The model was trained using tenfold cross-validation.Results:DSRNet achieved state-of-the-art performance with 92.76%accuracy,91.92%precision,92.69%recall,and 0.93 area under the curve on the thyroid cancer dataset(TCGA-THCA),significantly outperforming leading convolutional neural networks and Transformer models.Ablation studies confirmed the contributions of each component,and attention visualization showed that DSRNet focuses on biologically relevant regions.The model also generalized well to a breast cancer dataset(TCGA-BRCA),achieving 89.13%accuracy.Conclusions:We developed DSRNet,a deep learning-based model for predicting PCDH8 status directly from routine hematoxylin and eosin-stained pathological images.DSRNet combines the efficiency of convolutional operations with enhanced long-range dependency modeling,providing a noninvasive,accurate,and interpretable tool for auxiliary thyroid cancer diagnosis and prognosis.The results demonstrate its strong potential for clinical translation,though further multicenter validation is warranted. 展开更多
关键词 Thyroid cancer Biomarker Whole-slide image depth-wise separable convolution Residual mechanism
在线阅读 下载PDF
A WEIGHTED GENERAL DISCRETE FOURIER TRANSFORM FOR THE FREQUENCY-DOMAIN BLIND SOURCE SEPARATION OF CONVOLUTIVE MIXTURES 被引量:1
10
作者 Wang Chao Fang Yong Feng Jiuchao 《Journal of Electronics(China)》 2008年第6期830-833,共4页
This letter deals with the frequency domain Blind Source Separation of Convolutive Mixtures (CMBSS). From the frequency representation of the "overlap and save", a Weighted General Discrete Fourier Transform... This letter deals with the frequency domain Blind Source Separation of Convolutive Mixtures (CMBSS). From the frequency representation of the "overlap and save", a Weighted General Discrete Fourier Transform (WGDFT) is derived to replace the traditional Discrete Fourier Transform (DFT). The mixing matrix on each frequency bin could be estimated more precisely from WGDFT coefficients than from DFT coefficients, which improves separation performance. Simulation results verify the validity of WGDFT for frequency domain blind source separation of convolutive mixtures. 展开更多
关键词 Blind Source separation of convolutive Mixtures (CMBSS) Frequency representation of overlap and save Weighted General Discrete Fourier Transform (WGDFT)
在线阅读 下载PDF
A Lightweight Convolutional Neural Network with Hierarchical Multi-Scale Feature Fusion for Image Classification 被引量:2
11
作者 Adama Dembele Ronald Waweru Mwangi Ananda Omutokoh Kube 《Journal of Computer and Communications》 2024年第2期173-200,共28页
Convolutional neural networks (CNNs) are widely used in image classification tasks, but their increasing model size and computation make them challenging to implement on embedded systems with constrained hardware reso... Convolutional neural networks (CNNs) are widely used in image classification tasks, but their increasing model size and computation make them challenging to implement on embedded systems with constrained hardware resources. To address this issue, the MobileNetV1 network was developed, which employs depthwise convolution to reduce network complexity. MobileNetV1 employs a stride of 2 in several convolutional layers to decrease the spatial resolution of feature maps, thereby lowering computational costs. However, this stride setting can lead to a loss of spatial information, particularly affecting the detection and representation of smaller objects or finer details in images. To maintain the trade-off between complexity and model performance, a lightweight convolutional neural network with hierarchical multi-scale feature fusion based on the MobileNetV1 network is proposed. The network consists of two main subnetworks. The first subnetwork uses a depthwise dilated separable convolution (DDSC) layer to learn imaging features with fewer parameters, which results in a lightweight and computationally inexpensive network. Furthermore, depthwise dilated convolution in DDSC layer effectively expands the field of view of filters, allowing them to incorporate a larger context. The second subnetwork is a hierarchical multi-scale feature fusion (HMFF) module that uses parallel multi-resolution branches architecture to process the input feature map in order to extract the multi-scale feature information of the input image. Experimental results on the CIFAR-10, Malaria, and KvasirV1 datasets demonstrate that the proposed method is efficient, reducing the network parameters and computational cost by 65.02% and 39.78%, respectively, while maintaining the network performance compared to the MobileNetV1 baseline. 展开更多
关键词 MobileNet Image Classification Lightweight convolutional Neural Network Depthwise Dilated separable convolution Hierarchical Multi-Scale Feature Fusion
在线阅读 下载PDF
Maximum Likelihood Blind Separation of Convolutively Mixed Discrete Sources
12
作者 辜方林 张杭 朱德生 《China Communications》 SCIE CSCD 2013年第6期60-67,共8页
In this paper,a Maximum Likelihood(ML) approach,implemented by Expectation-Maximization(EM) algorithm,is proposed to blind separation of convolutively mixed discrete sources.In order to carry out the expectation proce... In this paper,a Maximum Likelihood(ML) approach,implemented by Expectation-Maximization(EM) algorithm,is proposed to blind separation of convolutively mixed discrete sources.In order to carry out the expectation procedure of the EM algorithm with a less computational load,the algorithm named Iterative Maximum Likelihood algorithm(IML) is proposed to calculate the likelihood and recover the source signals.An important feature of the ML approach is that it has robust performance in noise environments by treating the covariance matrix of the additive Gaussian noise as a parameter.Another striking feature of the ML approach is that it is possible to separate more sources than sensors by exploiting the finite alphabet property of the sources.Simulation results show that the proposed ML approach works well either in determined mixtures or underdetermined mixtures.Furthermore,the performance of the proposed ML algorithm is close to the performance with perfect knowledge of the channel filters. 展开更多
关键词 Blind Source separation convolutive mixture EM Finite Alphabet
在线阅读 下载PDF
AN NMF ALGORITHM FOR BLIND SEPARATION OF CONVOLUTIVE MIXED SOURCE SIGNALS WITH LEAST CORRELATION CONSTRAINS
13
作者 Zhang Ye Fang Yong 《Journal of Electronics(China)》 2009年第4期557-563,共7页
Most of the existing algorithms for blind sources separation have a limitation that sources are statistically independent. However, in many practical applications, the source signals are non- negative and mutual stati... Most of the existing algorithms for blind sources separation have a limitation that sources are statistically independent. However, in many practical applications, the source signals are non- negative and mutual statistically dependent signals. When the observations are nonnegative linear combinations of nonnegative sources, the correlation coefficients of the observations are larger than these of source signals. In this letter, a novel Nonnegative Matrix Factorization (NMF) algorithm with least correlated component constraints to blind separation of convolutive mixed sources is proposed. The algorithm relaxes the source independence assumption and has low-complexity algebraic com- putations. Simulation results on blind source separation including real face image data indicate that the sources can be successfully recovered with the algorithm. 展开更多
关键词 Nonnegative matrix factorization convolutive blind source separation Correlation constrain
在线阅读 下载PDF
A Framework of Lightweight Deep Cross-Connected Convolution Kernel Mapping Support Vector Machines
14
作者 Qi Wang Zhaoying Liu +3 位作者 Ting Zhang Shanshan Tu Yujian Li Muhammad Waqas 《Journal on Artificial Intelligence》 2022年第1期37-48,共12页
Deep kernel mapping support vector machines have achieved good results in numerous tasks by mapping features from a low-dimensional space to a high-dimensional space and then using support vector machines for classifi... Deep kernel mapping support vector machines have achieved good results in numerous tasks by mapping features from a low-dimensional space to a high-dimensional space and then using support vector machines for classification.However,the depth kernel mapping support vector machine does not take into account the connection of different dimensional spaces and increases the model parameters.To further improve the recognition capability of deep kernel mapping support vector machines while reducing the number of model parameters,this paper proposes a framework of Lightweight Deep Convolutional Cross-Connected Kernel Mapping Support Vector Machines(LC-CKMSVM).The framework consists of a feature extraction module and a classification module.The feature extraction module first maps the data from low-dimensional to high-dimensional space by fusing the representations of different dimensional spaces through cross-connections;then,it uses depthwise separable convolution to replace part of the original convolution to reduce the number of parameters in the module;The classification module uses a soft margin support vector machine for classification.The results on 6 different visual datasets show that LC-CKMSVM obtains better classification accuracies on most cases than the other five models. 展开更多
关键词 convolutional neural network cross-connected lightweight framework depthwise separable convolution
在线阅读 下载PDF
频率感知驱动的深度鲁棒图像水印
15
作者 张国富 李鑫 +2 位作者 苏兆品 方涵 廉晨思 《中国图象图形学报》 北大核心 2026年第1期197-211,共15页
目的近年来,基于深度学习的水印方法得到了广泛研究。现有方法通常对特征图的低频和高频部分同等对待,忽视了不同频率成分之间的重要差异,导致模型在处理多样化攻击时缺乏灵活性,难以同时实现水印的高保真性和强鲁棒性。为此,本文提出... 目的近年来,基于深度学习的水印方法得到了广泛研究。现有方法通常对特征图的低频和高频部分同等对待,忽视了不同频率成分之间的重要差异,导致模型在处理多样化攻击时缺乏灵活性,难以同时实现水印的高保真性和强鲁棒性。为此,本文提出一种频率感知驱动的深度鲁棒图像水印技术(deep robust image watermarking driven by frequency awareness,RIWFP)。方法通过差异化机制处理低频和高频成分,提升水印性能。具体而言,低频成分通过小波卷积神经网络进行建模,利用宽感受野卷积在粗粒度层面高效学习全局结构和上下文信息;高频成分则采用深度可分离卷积和注意力机制组成的特征蒸馏块进行精炼,强化图像细节,在细粒度层面高效捕捉高频信息。此外,本文使用多频率小波损失函数,引导模型聚焦于不同频带的特征分布,进一步提升生成图像的质量。结果实验结果表明,提出的频率感知驱动的深度鲁棒图像水印技术在多个数据集上均表现出优越性能。在COCO(common objects in context)数据集上,RIWFP在随机丢弃攻击下的准确率达到91.4%;在椒盐噪声和中值滤波攻击下,RIWFP分别以100%和99.5%的准确率达到了最高水平,展现了其对高频信息的高效学习能力。在Ima⁃geNet数据集上,RIWFP在裁剪攻击下的准确率为93.4%;在JPEG压缩攻击下的准确率为99.6%,均显著优于其他对比方法。综合来看,RIWFP在COCO和ImageNet数据集上的平均准确率分别为96.7%和96.9%,均高于其他对比方法。结论本文所提方法通过频率感知的粗到细处理策略,显著增强了水印的不可见性和鲁棒性,在处理多种攻击时表现出优越性能。 展开更多
关键词 鲁棒图像水印 小波卷积神经网络 深度可分离卷积 注意力机制 多频率小波损失
原文传递
基于改进ConvNeXt Block的新型双域融合图像隐写
16
作者 段新涛 徐凯欧 +4 位作者 白鹿伟 张萌 保梦茹 武银行 秦川 《郑州大学学报(理学版)》 北大核心 2026年第1期1-9,共9页
针对图像隐写中不可感知性差、安全性不足和隐写容量低的问题,提出一种基于改进ConvNeXt Block的新型双域融合图像隐写方案。首先,改进后的深度可分离卷积模块可以学习到更为细节的图像特征信息。其次,设计一种新型的空间域和频域信息... 针对图像隐写中不可感知性差、安全性不足和隐写容量低的问题,提出一种基于改进ConvNeXt Block的新型双域融合图像隐写方案。首先,改进后的深度可分离卷积模块可以学习到更为细节的图像特征信息。其次,设计一种新型的空间域和频域信息融合方式来提高图像的不可感知性和安全性。最后,采用多个损失函数对网络进行级联约束。实验结果表明,相比其他隐写方案,所提方案在峰值信噪比上平均提高3~4 dB,结构相似性和学习感知图像块相似度的平均值分别为0.99和0.001;抗隐写分析能力更接近50%,具有更高的安全性,且大容量隐藏时仍具有较好效果。 展开更多
关键词 图像隐写 深度可分离卷积 空间域 频域 安全性 大容量
在线阅读 下载PDF
基于改进YOLOv5s的自动导引运输车托盘孔位视觉定位方法
17
作者 崔升 唐芳丽 +2 位作者 郑亮宇 曾伟理 曲伟伟 《食品与机械》 北大核心 2026年第1期79-85,共7页
[目的]自动导引运输车在搬运过程中,需定位的托盘孔位中存在的细小、形变、低对比度孔位的视觉定位不准的问题。因此,提出一种基于改进YOLOv5s的自动导引运输车托盘孔位视觉定位方法。[方法]结合ShuffleNetV2的通道混洗操作改进和CBAM... [目的]自动导引运输车在搬运过程中,需定位的托盘孔位中存在的细小、形变、低对比度孔位的视觉定位不准的问题。因此,提出一种基于改进YOLOv5s的自动导引运输车托盘孔位视觉定位方法。[方法]结合ShuffleNetV2的通道混洗操作改进和CBAM注意力机制改进,对基本YOLOv5s框架进行改进,使其聚焦于形变关键区域中亚像素级边界模糊的孔位区域;基于SloU损失函数关注微小孔位,并计算托盘孔位在相机坐标系下的空间三维坐标,得到相机坐标系到孔位区域坐标系的变换关系,采用改进的YOLOv5s框架输出AGV机械臂坐标系下的托盘孔位三维坐标。[结果]试验方法可有效捕捉亚像素级定位精度边界,绝对误差<0.03 cm,相对误差<0.83%;F1分数、mAP指标分别为95.2%、94.8%;浮点运算次数、参数量和模型体积分别为4.8 G、2.6 M、4.28 MB。[结论]试验方法有效解决了需定位托盘孔位中存在的细小、形变、低对比度孔位的视觉定位难题,提升了自动导引运输车托盘搬运效率。 展开更多
关键词 YOLOv5s 自动导引运输车 托盘孔位定位 深度可分离卷积 CBAM注意力
在线阅读 下载PDF
基于扩散先验的脑部MRI超分辨率重建
18
作者 熊承义 曹雨轩 高志荣 《中南民族大学学报(自然科学版)》 2026年第2期202-211,共10页
现有基于Transformer的MRI超分辨率方法虽具有良好的全局建模能力,但忽略了深度先验约束建模的重要性.为此,提出了一种基于扩散先验的脑部MRI超分辨率方法,利用潜在扩散模型生成的先验来引导Transformer进行超分辨率重建,以提升MRI细节... 现有基于Transformer的MRI超分辨率方法虽具有良好的全局建模能力,但忽略了深度先验约束建模的重要性.为此,提出了一种基于扩散先验的脑部MRI超分辨率方法,利用潜在扩散模型生成的先验来引导Transformer进行超分辨率重建,以提升MRI细节重建能力.具体而言,采用两阶段协同训练策略:第一阶段通过真实图像潜编码构建内容先验;第二阶段引入扩散模型重构先验,并联合优化去噪与重建过程,实现无监督条件下的图像超分辨率.此外,采用深度可分离卷积与置换自注意力机制,实现编码器的高效建模与感受野扩展.在IXI多模态MRI数据集上的4倍超分辨率实验表明:所提出方法在提升重建图像主客观质量与重建效率方面优于已有方法 . 展开更多
关键词 MRI超分辨率 扩散先验 置换自注意力 深度可分离卷积
在线阅读 下载PDF
一种基于FL-TransCNN神经网络的水声智能频谱感知算法
19
作者 李玉芳 王锴 +2 位作者 张力良 徐凌伟 Thomas Aaron Gulliver 《电讯技术》 北大核心 2026年第1期11-20,共10页
为了提高频谱利用率,提出了一种基于联邦学习(Federated Learning,FL)、Transformer和卷积神经网络(Convolutional Neural Network,CNN)的水声智能频谱感知算法。首先,基于FL实现数据隔离状态下的信息共享,并应用Paillier加密技术进行... 为了提高频谱利用率,提出了一种基于联邦学习(Federated Learning,FL)、Transformer和卷积神经网络(Convolutional Neural Network,CNN)的水声智能频谱感知算法。首先,基于FL实现数据隔离状态下的信息共享,并应用Paillier加密技术进行权重加密保障;其次,本地感知数据经连续小波变换构建为时频谱图;最后,融合CNN与Transformer构建了TransCNN感知器,通过并行分支实现了高精度感知。在信噪比-18~0 dB范围内,与RepVGG、Swin-Transformer、YOLOv7、MobileNet算法相比,所提的水声智能频谱感知算法的平均检测概率提升了4%~10%,平均虚警概率降低了2%~9%。 展开更多
关键词 海洋物联网 智能频谱感知 联邦学习 连续小波变换 深度可分离卷积
在线阅读 下载PDF
CPViG-Net:基于局部跨阶段视觉图卷积的学生课堂行为识别
20
作者 张浩鹏 施铮 +1 位作者 刘峰 宋婉茹 《计算机科学》 北大核心 2026年第2期57-66,共10页
随着教育范式从“人机协同”向“人智协同共育”演进,课堂教学的智能化评价也面临着新的要求和挑战,其中以学生行为为出发点的任务近些年来获得了广泛的关注。针对真实课堂环境中存在的学生行为多样、遮挡频繁及背景干扰严重等问题,提... 随着教育范式从“人机协同”向“人智协同共育”演进,课堂教学的智能化评价也面临着新的要求和挑战,其中以学生行为为出发点的任务近些年来获得了广泛的关注。针对真实课堂环境中存在的学生行为多样、遮挡频繁及背景干扰严重等问题,提出一种局部跨阶段视觉图卷积模型,旨在提升复杂课堂环境下的学生行为识别精度。该模型以经典目标检测算法为基准框架,通过融合视觉图卷积神经网络的动态特征建模能力,构建了局部最大相对图卷积模块(PMG)与局部跨阶段融合(CPF)模块。其中,PMG模块通过嵌入最大相对图卷积来捕捉节点间特征差异最大的邻域信息,进而针对性地解决局部区域遮挡引起的信息丢失问题,并结合了深度可分离卷积降低图卷积算法的计算开销;CPF模块利用全连接层重构特征结构,并通过C2f模块的跨阶段连接机制,实现多层级的特征融合,从而增强模型对小尺度目标的识别能力。此外,模型通过近邻K值优化,提出针对不同数据集的优化策略。在公开数据集SCB03-S上,CPViG-Net的mAP@50达到70.9%,较基准模型提升2个百分点;在多个公开数据集上的实验表明,该模型在处理真实课堂情境下学生行为识别面临的诸多问题中表现出较好的性能和较高的鲁棒性。 展开更多
关键词 学生行为 最大相对图卷积 多尺度目标识别 遮挡 深度可分离卷积
在线阅读 下载PDF
上一页 1 2 58 下一页 到第
使用帮助 返回顶部