In recent years,deep learning has been introduced into the field of Single-pixel imaging(SPI),garnering significant attention.However,conventional networks still exhibit limitations in preserving image details.To addr...In recent years,deep learning has been introduced into the field of Single-pixel imaging(SPI),garnering significant attention.However,conventional networks still exhibit limitations in preserving image details.To address this issue,we integrate Large Kernel Convolution(LKconv)into the U-Net framework,proposing an enhanced network structure named U-LKconv network,which significantly enhances the capability to recover image details even under low sampling conditions.展开更多
Significant advancements have been achieved in the field of Single Image Super-Resolution(SISR)through the utilization of Convolutional Neural Networks(CNNs)to attain state-of-the-art performance.Recent efforts have e...Significant advancements have been achieved in the field of Single Image Super-Resolution(SISR)through the utilization of Convolutional Neural Networks(CNNs)to attain state-of-the-art performance.Recent efforts have explored the incorporation of Transformers to augment network performance in SISR.However,the high computational cost of Transformers makes them less suitable for deployment on lightweight devices.Moreover,the majority of enhancements for CNNs rely predominantly on small spatial convolutions,thereby neglecting the potential advantages of large kernel convolution.In this paper,the authors propose a Multi-Perception Large Kernel convNet(MPLKN)which delves into the exploration of large kernel convolution.Specifically,the authors have architected a Multi-Perception Large Kernel(MPLK)module aimed at extracting multi-scale features and employ a stepwise feature fusion strategy to seamlessly integrate these features.In addition,to enhance the network's capacity for nonlinear spatial information processing,the authors have designed a Spatial-Channel Gated Feed-forward Network(SCGFN)that is capable of adapting to feature interactions across both spatial and channel dimensions.Experimental results demonstrate that MPLKN outperforms other lightweight image super-resolution models while maintaining a minimal number of parameters and FLOPs.展开更多
Robust watermarking requires finding invariant features under multiple attacks to ensure correct extraction.Deep learning has extremely powerful in extracting features,and watermarking algorithms based on deep learnin...Robust watermarking requires finding invariant features under multiple attacks to ensure correct extraction.Deep learning has extremely powerful in extracting features,and watermarking algorithms based on deep learning have attracted widespread attention.Most existing methods use 3×3 small kernel convolution to extract image features and embed the watermarking.However,the effective perception fields for small kernel convolution are extremely confined,so the pixels that each watermarking can affect are restricted,thus limiting the performance of the watermarking.To address these problems,we propose a watermarking network based on large kernel convolution and adaptive weight assignment for loss functions.It uses large-kernel depth-wise convolution to extract features for learning large-scale image information and subsequently projects the watermarking into a highdimensional space by 1×1 convolution to achieve adaptability in the channel dimension.Subsequently,the modification of the embedded watermarking on the cover image is extended to more pixels.Because the magnitude and convergence rates of each loss function are different,an adaptive loss weight assignment strategy is proposed to make theweights participate in the network training together and adjust theweight dynamically.Further,a high-frequency wavelet loss is proposed,by which the watermarking is restricted to only the low-frequency wavelet sub-bands,thereby enhancing the robustness of watermarking against image compression.The experimental results show that the peak signal-to-noise ratio(PSNR)of the encoded image reaches 40.12,the structural similarity(SSIM)reaches 0.9721,and the watermarking has good robustness against various types of noise.展开更多
Aiming at the difficulty of fault identification caused by manual extraction of fault features of rotating machinery,a one-dimensional multi-scale convolutional auto-encoder fault diagnosis model is proposed,based on ...Aiming at the difficulty of fault identification caused by manual extraction of fault features of rotating machinery,a one-dimensional multi-scale convolutional auto-encoder fault diagnosis model is proposed,based on the standard convolutional auto-encoder.In this model,the parallel convolutional and deconvolutional kernels of different scales are used to extract the features from the input signal and reconstruct the input signal;then the feature map extracted by multi-scale convolutional kernels is used as the input of the classifier;and finally the parameters of the whole model are fine-tuned using labeled data.Experiments on one set of simulation fault data and two sets of rolling bearing fault data are conducted to validate the proposed method.The results show that the model can achieve 99.75%,99.3%and 100%diagnostic accuracy,respectively.In addition,the diagnostic accuracy and reconstruction error of the one-dimensional multi-scale convolutional auto-encoder are compared with traditional machine learning,convolutional neural networks and a traditional convolutional auto-encoder.The final results show that the proposed model has a better recognition effect for rolling bearing fault data.展开更多
针对残差网络存在特征冗余、有效感受野不足等问题,提出了特征融合模块。该模块可以在模型通道扩增过程中,实现不同尺度特征融合,从而构建出信息更丰富的多尺度特征,提高通道利用率。并且该模块还包含了少量大核卷积,其有助于扩大模型...针对残差网络存在特征冗余、有效感受野不足等问题,提出了特征融合模块。该模块可以在模型通道扩增过程中,实现不同尺度特征融合,从而构建出信息更丰富的多尺度特征,提高通道利用率。并且该模块还包含了少量大核卷积,其有助于扩大模型的有效感受野,实现性能和计算效率的平衡。同时还提出了轻量化的下采样和混排压缩模块,可充分降低模型的参数,使整个方法更高效。将特征融合、下采样以及混排压缩模块引入残差网络可以构建出特征融合网络(FFNet:Feature Fusion Network)。其具有更快的收敛速度、更大的有效感受野,以及更好的性能表现。经在CIFAR(Canadian Institute for Advanced Research)、ImageNet以及COCO(Microsoft Common Objects in Context)数据集的大量实验结果证明了其能在仅增加少量参数和FLOPs(Floating Point Operations)的前提下,在分类、目标检测以及实例分割任务上使其性能显著提升。展开更多
为解决吊装作业数据集获取困难与吊装作业过程中重要对象(吊物与吊钩)监管难题,提出虚实结合的方法构建数据集,基于SketchUp软件建立虚拟吊装作业场景获取虚拟吊装作业图片,同时从网络获取吊装作业图片及现场作业视频截图,将真实作业场...为解决吊装作业数据集获取困难与吊装作业过程中重要对象(吊物与吊钩)监管难题,提出虚实结合的方法构建数据集,基于SketchUp软件建立虚拟吊装作业场景获取虚拟吊装作业图片,同时从网络获取吊装作业图片及现场作业视频截图,将真实作业场景的图片与虚拟作业场景的图片共同组成虚实结合的数据集。引入可改变核卷积(Arbitrary Kernel Convolution,AKConv)和鬼魅空洞可分离卷积(Concentrated-Comprehensive Convolution with GhostBottleneck,C3Ghost)改进目标检测算法模型YOLOv5(You Only Look Once version 5),改进后的模型比原始模型在精确率上高出2.6百分点,在推理速度上高出9.1帧/s,且模型所占存储容量降低1.9 MB。搭建可视化操作界面,与优化好的模型整合成吊装作业实时监测系统,实现对吊物和吊钩的安全状态识别和风险预警,及时进行风险管控。展开更多
为解决现有深度学习网络结构对红外弱小目标的识别针对性不足问题,提出了一种基于改进Yolov8的红外弱小目标识别算法(Yolov8n based on UniRepLK Block and Triplet Attention,UT-Yolov8)。该算法通过特征融合网络输出端的检测头引入三...为解决现有深度学习网络结构对红外弱小目标的识别针对性不足问题,提出了一种基于改进Yolov8的红外弱小目标识别算法(Yolov8n based on UniRepLK Block and Triplet Attention,UT-Yolov8)。该算法通过特征融合网络输出端的检测头引入三重注意力机制,为特征融合网络内部添加新的小目标检测层、检测头,以及在特征提取网络的空间池化金字塔内结合大内核卷积,针对红外弱小目标的成像特性进行改进。算法在真实红外图像数据上进行验证,实验结果表明,UT-Yolov8算法在保持高检测速度的同时,有效提高了网络对于红外弱小目标识别精度,平均精度均值mAP@0.5达到了95.9%。展开更多
文摘In recent years,deep learning has been introduced into the field of Single-pixel imaging(SPI),garnering significant attention.However,conventional networks still exhibit limitations in preserving image details.To address this issue,we integrate Large Kernel Convolution(LKconv)into the U-Net framework,proposing an enhanced network structure named U-LKconv network,which significantly enhances the capability to recover image details even under low sampling conditions.
文摘Significant advancements have been achieved in the field of Single Image Super-Resolution(SISR)through the utilization of Convolutional Neural Networks(CNNs)to attain state-of-the-art performance.Recent efforts have explored the incorporation of Transformers to augment network performance in SISR.However,the high computational cost of Transformers makes them less suitable for deployment on lightweight devices.Moreover,the majority of enhancements for CNNs rely predominantly on small spatial convolutions,thereby neglecting the potential advantages of large kernel convolution.In this paper,the authors propose a Multi-Perception Large Kernel convNet(MPLKN)which delves into the exploration of large kernel convolution.Specifically,the authors have architected a Multi-Perception Large Kernel(MPLK)module aimed at extracting multi-scale features and employ a stepwise feature fusion strategy to seamlessly integrate these features.In addition,to enhance the network's capacity for nonlinear spatial information processing,the authors have designed a Spatial-Channel Gated Feed-forward Network(SCGFN)that is capable of adapting to feature interactions across both spatial and channel dimensions.Experimental results demonstrate that MPLKN outperforms other lightweight image super-resolution models while maintaining a minimal number of parameters and FLOPs.
基金supported,in part,by the National Nature Science Foundation of China under grant numbers 62272236in part,by the Natural Science Foundation of Jiangsu Province under grant numbers BK20201136,BK20191401in part,by the Priority Academic Program Development of Jiangsu Higher Education Institutions (PAPD)fund.
文摘Robust watermarking requires finding invariant features under multiple attacks to ensure correct extraction.Deep learning has extremely powerful in extracting features,and watermarking algorithms based on deep learning have attracted widespread attention.Most existing methods use 3×3 small kernel convolution to extract image features and embed the watermarking.However,the effective perception fields for small kernel convolution are extremely confined,so the pixels that each watermarking can affect are restricted,thus limiting the performance of the watermarking.To address these problems,we propose a watermarking network based on large kernel convolution and adaptive weight assignment for loss functions.It uses large-kernel depth-wise convolution to extract features for learning large-scale image information and subsequently projects the watermarking into a highdimensional space by 1×1 convolution to achieve adaptability in the channel dimension.Subsequently,the modification of the embedded watermarking on the cover image is extended to more pixels.Because the magnitude and convergence rates of each loss function are different,an adaptive loss weight assignment strategy is proposed to make theweights participate in the network training together and adjust theweight dynamically.Further,a high-frequency wavelet loss is proposed,by which the watermarking is restricted to only the low-frequency wavelet sub-bands,thereby enhancing the robustness of watermarking against image compression.The experimental results show that the peak signal-to-noise ratio(PSNR)of the encoded image reaches 40.12,the structural similarity(SSIM)reaches 0.9721,and the watermarking has good robustness against various types of noise.
基金The National Natural Science Foundation of China(No.51675098)
文摘Aiming at the difficulty of fault identification caused by manual extraction of fault features of rotating machinery,a one-dimensional multi-scale convolutional auto-encoder fault diagnosis model is proposed,based on the standard convolutional auto-encoder.In this model,the parallel convolutional and deconvolutional kernels of different scales are used to extract the features from the input signal and reconstruct the input signal;then the feature map extracted by multi-scale convolutional kernels is used as the input of the classifier;and finally the parameters of the whole model are fine-tuned using labeled data.Experiments on one set of simulation fault data and two sets of rolling bearing fault data are conducted to validate the proposed method.The results show that the model can achieve 99.75%,99.3%and 100%diagnostic accuracy,respectively.In addition,the diagnostic accuracy and reconstruction error of the one-dimensional multi-scale convolutional auto-encoder are compared with traditional machine learning,convolutional neural networks and a traditional convolutional auto-encoder.The final results show that the proposed model has a better recognition effect for rolling bearing fault data.
文摘针对残差网络存在特征冗余、有效感受野不足等问题,提出了特征融合模块。该模块可以在模型通道扩增过程中,实现不同尺度特征融合,从而构建出信息更丰富的多尺度特征,提高通道利用率。并且该模块还包含了少量大核卷积,其有助于扩大模型的有效感受野,实现性能和计算效率的平衡。同时还提出了轻量化的下采样和混排压缩模块,可充分降低模型的参数,使整个方法更高效。将特征融合、下采样以及混排压缩模块引入残差网络可以构建出特征融合网络(FFNet:Feature Fusion Network)。其具有更快的收敛速度、更大的有效感受野,以及更好的性能表现。经在CIFAR(Canadian Institute for Advanced Research)、ImageNet以及COCO(Microsoft Common Objects in Context)数据集的大量实验结果证明了其能在仅增加少量参数和FLOPs(Floating Point Operations)的前提下,在分类、目标检测以及实例分割任务上使其性能显著提升。
文摘为解决吊装作业数据集获取困难与吊装作业过程中重要对象(吊物与吊钩)监管难题,提出虚实结合的方法构建数据集,基于SketchUp软件建立虚拟吊装作业场景获取虚拟吊装作业图片,同时从网络获取吊装作业图片及现场作业视频截图,将真实作业场景的图片与虚拟作业场景的图片共同组成虚实结合的数据集。引入可改变核卷积(Arbitrary Kernel Convolution,AKConv)和鬼魅空洞可分离卷积(Concentrated-Comprehensive Convolution with GhostBottleneck,C3Ghost)改进目标检测算法模型YOLOv5(You Only Look Once version 5),改进后的模型比原始模型在精确率上高出2.6百分点,在推理速度上高出9.1帧/s,且模型所占存储容量降低1.9 MB。搭建可视化操作界面,与优化好的模型整合成吊装作业实时监测系统,实现对吊物和吊钩的安全状态识别和风险预警,及时进行风险管控。
文摘为解决现有深度学习网络结构对红外弱小目标的识别针对性不足问题,提出了一种基于改进Yolov8的红外弱小目标识别算法(Yolov8n based on UniRepLK Block and Triplet Attention,UT-Yolov8)。该算法通过特征融合网络输出端的检测头引入三重注意力机制,为特征融合网络内部添加新的小目标检测层、检测头,以及在特征提取网络的空间池化金字塔内结合大内核卷积,针对红外弱小目标的成像特性进行改进。算法在真实红外图像数据上进行验证,实验结果表明,UT-Yolov8算法在保持高检测速度的同时,有效提高了网络对于红外弱小目标识别精度,平均精度均值mAP@0.5达到了95.9%。