653 articles found
A Remote Sensing Image Semantic Segmentation Method by Combining Deformable Convolution with Conditional Random Fields  Cited by: 13
1
Authors: Zongcheng ZUO, Wen ZHANG, Dongying ZHANG 《Journal of Geodesy and Geoinformation Science》 2020, No. 3, pp. 39-49 (11 pages)
Deep convolutional neural networks have made great progress in semantic segmentation. Because the geometry of the convolution kernel is fixed, standard convolutional neural networks have limited ability to model geometric transformations. A deformable convolution is therefore introduced to improve the adaptability of convolutional networks to spatial transformations. In addition, because of the pooling layers in the architecture, deep convolutional neural networks cannot adequately segment local objects at the output layer. To overcome this shortcoming, the coarse segmentation predicted by the output layer is refined by fully connected conditional random fields. The proposed method can be trained end to end with standard backpropagation. It is tested on the ISPRS dataset, and the results show that it effectively handles the complex structure of the segmented objects and obtains state-of-the-art accuracy on the ISPRS Vaihingen 2D semantic labeling dataset.
Keywords: high-resolution remote sensing image; semantic segmentation; deformable convolution network; conditional random fields
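The abstract above does not spell out how a deformable convolution samples its input, so the following is only a minimal sketch of the general technique, not the authors' network. It assumes a single-channel feature map stored as a list of rows, a 3x3 kernel, zero padding outside the map, and one (dy, dx) offset per kernel tap; the function names are hypothetical. In a real network the offsets are predicted by an extra convolution layer rather than passed in by hand.

```python
import math

def bilinear_sample(feature, y, x):
    """Sample a 2D feature map at a fractional (y, x); points outside count as zero."""
    h, w = len(feature), len(feature[0])
    y0, x0 = math.floor(y), math.floor(x)
    value = 0.0
    for yy in (y0, y0 + 1):
        for xx in (x0, x0 + 1):
            if 0 <= yy < h and 0 <= xx < w:
                # Bilinear weight falls off linearly with distance to the grid point.
                weight = (1 - abs(y - yy)) * (1 - abs(x - xx))
                value += weight * feature[yy][xx]
    return value

def deformable_conv_at(feature, kernel, offsets, cy, cx):
    """3x3 deformable convolution output at one location (cy, cx).

    offsets[k] is the (dy, dx) shift of the k-th kernel tap in row-major order.
    With all offsets equal to (0, 0) this reduces to an ordinary 3x3 convolution.
    """
    out = 0.0
    k = 0
    for ky in (-1, 0, 1):
        for kx in (-1, 0, 1):
            dy, dx = offsets[k]
            out += kernel[ky + 1][kx + 1] * bilinear_sample(
                feature, cy + ky + dy, cx + kx + dx
            )
            k += 1
    return out

if __name__ == "__main__":
    feature = [[float(4 * i + j) for j in range(4)] for i in range(4)]
    ones = [[1.0] * 3 for _ in range(3)]
    print(deformable_conv_at(feature, ones, [(0.0, 0.0)] * 9, 1, 1))
    print(deformable_conv_at(feature, ones, [(0.5, 0.5)] * 9, 1, 1))
```

Because each tap reads from a shifted, interpolated position, the effective receptive field can follow the shape of an object instead of staying on a fixed square grid.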
Multi-Layer Feature Extraction with Deformable Convolution for Fabric Defect Detection  Cited by: 1
2
Authors: Jielin Jiang, Chao Cui, Xiaolong Xu, Yan Cui 《Intelligent Automation & Soft Computing》 2024, No. 4, pp. 725-744 (20 pages)
In the textile industry, defects on the fabric surface are an essential factor in determining fabric quality, so identifying them is a crucial part of fabric production. Traditional fabric defect detection algorithms can only handle specific materials and specific defect types; their detection efficiency is low and their results are relatively poor. Deep learning-based methods have many advantages in fabric defect detection, but they are less effective at identifying multi-scale defects and defects with complex shapes. We therefore propose multi-layer feature extraction combined with deformable convolution (MFDC) for fabric defect detection. In MFDC, multi-layer feature extraction fuses low-level location features with high-level classification features through a laterally connected top-down architecture to improve the detection of multi-scale defects. On this basis, a deformable convolution is added to address the algorithm's weak detection of irregularly shaped defects. RoI Align and Cascade-RCNN are integrated to improve adaptability to materials with complex patterned backgrounds. Experimental results show that MFDC achieves good detection results for both multi-scale defects and defects with complex shapes, at the expense of a small increase in detection time.
Keywords: fabric defect detection; multi-layer features; deformable convolution
CW-HRNet: Constrained Deformable Sampling and Wavelet-Guided Enhancement for Lightweight Crack Segmentation
3
Authors: Dewang Ma 《Journal of Electronic Research and Application》 2025, No. 5, pp. 269-280 (12 pages)
This paper presents CW-HRNet, a high-resolution, lightweight crack segmentation network designed for complex scenes with slender, deformable, and blurred crack structures. The model incorporates two key modules: Constrained Deformable Convolution (CDC), which stabilizes geometric alignment by applying a tanh limiter and a learnable scaling factor to the predicted offsets, and the Wavelet Frequency Enhancement Module (WFEM), which decomposes features using Haar wavelets to preserve low-frequency structures while enhancing high-frequency boundaries and textures. Evaluations on the CrackSeg9k benchmark show that CW-HRNet achieves 82.39% mIoU with only 7.49M parameters and 10.34 GFLOPs, outperforming HrSegNet-B48 by 1.83% in segmentation accuracy with minimal complexity overhead. The model also generalizes well across datasets, achieving 60.01% mIoU and 66.22% F1 on Asphalt3k without fine-tuning. These results highlight CW-HRNet's favorable accuracy-efficiency trade-off for real-world crack segmentation.
Keywords: crack segmentation; lightweight semantic segmentation; deformable convolution; wavelet transform; road infrastructure
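The CW-HRNet abstract describes limiting predicted offsets with a tanh and a learnable scaling factor but gives no formula. One plausible reading, shown here purely as an assumption, is offset = scale * tanh(raw_offset), which keeps every offset component inside [-scale, scale]. The function name and the fixed `scale` value are hypothetical; in training, `scale` would be a learned parameter.

```python
import math

def constrain_offsets(raw_offsets, scale):
    """Bound each (dy, dx) offset to [-scale, scale] using a tanh limiter.

    Assumed form: constrained = scale * tanh(raw). Small raw offsets pass
    through almost linearly; large ones saturate instead of growing unbounded.
    """
    return [(scale * math.tanh(dy), scale * math.tanh(dx)) for dy, dx in raw_offsets]

if __name__ == "__main__":
    raw = [(0.0, 0.0), (0.1, -0.1), (100.0, -100.0)]
    for dy, dx in constrain_offsets(raw, scale=2.0):
        print(round(dy, 4), round(dx, 4))
```

Bounding the offsets this way prevents sampling points from drifting far away from the kernel centre, which is one way to read the abstract's claim about stabilized geometric alignment.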
Optical Flow with Learning Feature for Deformable Medical Image Registration  Cited by: 1
4
Authors: Jinrong Hu, Lujin Li, Ying Fu, Maoyang Zou, Jiliu Zhou, Shanhui Sun 《Computers, Materials & Continua》 SCIE EI 2022, No. 5, pp. 2773-2788 (16 pages)
Deformable medical image registration plays a vital role in medical image applications, such as aligning images taken at different times to the same time point or images of different modalities to the same coordinate system. Various strategies have been developed to meet the growing need for deformable registration. One popular method estimates the displacement field by computing the optical flow between two images. The motion field (flow field) is computed from either gray values or handcrafted descriptors such as the scale-invariant feature transform (SIFT). These methods assume that illumination is constant between images, an assumption that medical images do not always satisfy. In this study, we propose a metric learning-based motion estimation method called Siamese Flow for deformable medical image registration. We train metric learners using a Siamese network, which produces an image patch descriptor that yields a smaller feature distance between two similar anatomical structures and a larger feature distance between two dissimilar ones. In the proposed registration framework, the flow field is computed from these features and is close to the real deformation field owing to the feature representation ability of the Siamese network. Experimental results demonstrate that the proposed method outperforms Demons, SIFT Flow, Elastix, and VoxelMorph in registration accuracy and robustness, particularly with large deformations.
Keywords: deformation registration; feature extraction; optical flow; convolutional neural network
A Deformable Network with Attention Mechanism for Retinal Vessel Segmentation
5
Authors: Xiaolong Zhu, Wenjian Li, Weihang Zhang, Dongwei Li, Huiqi Li 《Journal of Beijing Institute of Technology》 EI CAS 2024, No. 3, pp. 186-193 (8 pages)
The intensive application of deep learning in medical image processing has advanced research on automatic retinal vessel segmentation. To overcome the limitation that traditional U-shaped vessel segmentation networks fail to extract sufficient features from fundus images, we propose a novel network (DSeU-net) based on deformable convolution and a squeeze-and-excitation residual module. The deformable convolution dynamically adjusts the receptive field for feature extraction of retinal vessels. The squeeze-and-excitation residual module scales the weights of the low-level features so that the network efficiently learns the complex relationships among feature layers. We validate DSeU-net on three public retinal vessel segmentation datasets, DRIVE, CHASEDB1, and STARE, and the experimental results demonstrate satisfactory segmentation performance.
Keywords: retinal vessel segmentation; deformable convolution; attention mechanism; deep learning
DSD-MatchingNet: Deformable sparse-to-dense feature matching for learning accurate correspondences
6
Authors: Yicheng ZHAO, Han ZHANG, Ping LU, Ping LI, Enhua WU, Bin SHENG 《Virtual Reality & Intelligent Hardware》 2022, No. 5, pp. 432-443 (12 pages)
Background: Exploring correspondences across multiview images is the basis of various computer vision tasks. However, most existing methods have limited accuracy under challenging conditions. Method: To learn more robust and accurate correspondences, we propose DSD-MatchingNet for local feature matching. First, we develop a deformable feature extraction module to obtain multilevel feature maps, which harvest contextual information from dynamic receptive fields. The dynamic receptive fields provided by the deformable convolution network ensure that our method obtains dense and robust correspondences. Second, we use sparse-to-dense matching with symmetry of correspondence to implement accurate pixel-level matching. Result: Experiments show that DSD-MatchingNet achieves better performance on the image matching benchmark as well as on the visual localization benchmark. Specifically, our method achieved 91.3% mean matching accuracy on the HPatches dataset and 99.3% visual localization recall on the Aachen Day-Night dataset.
Keywords: image matching; deformable convolution network; sparse-to-dense matching
3D Data Scattergram Image Classification Based Protection for Transmission Line Connecting BESS Using Depth-wise Separable Convolution Based CNN  Cited by: 1
7
Authors: Yingyu Liang, Yi Ren, Xiaoyang Yang, Wenting Zha 《Journal of Modern Power Systems and Clean Energy》 2025, No. 2, pp. 609-621 (13 pages)
The distinctive fault characteristics of battery energy storage stations (BESSs) significantly affect the reliability of conventional protection methods for transmission lines. In this paper, three-dimensional (3D) data scattergrams are constructed from the current data on both sides of the transmission line and their sum. Following a comprehensive analysis of how the 3D data scattergrams vary under different conditions, a protection method based on 3D data scattergram image classification is developed. Depth-wise separable convolution is used to keep the convolutional neural network (CNN) lightweight without compromising performance. In addition, a Bayesian hyperparameter optimization algorithm performs the hyperparameter search to simplify training. Compared with artificial neural networks and CNNs, the depth-wise separable convolution based CNN (DPCNN) achieves higher recognition accuracy. The proposed protection method using DPCNN can accurately separate internal faults from other disturbances and identify fault phases under different operating states and fault conditions. It also shows excellent tolerance of current transformer (CT) saturation and CT measurement errors.
Keywords: convolutional neural network (CNN); battery energy storage station (BESS); depth-wise separable convolution; hyperparameter optimization; fault classification; line protection
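To illustrate why depth-wise separable convolution keeps a CNN lightweight, the sketch below compares weight counts for a standard convolution and its depth-wise separable counterpart. It uses the usual textbook formulas, ignores bias terms, and the layer sizes (3x3 kernel, 64 input channels, 128 output channels) are arbitrary example values, not figures from the paper.

```python
def standard_conv_params(kernel_size, in_channels, out_channels):
    """Weights in a standard convolution: one k x k x C_in filter per output channel."""
    return kernel_size * kernel_size * in_channels * out_channels

def depthwise_separable_params(kernel_size, in_channels, out_channels):
    """Weights in a depth-wise separable convolution.

    Depth-wise step: one k x k filter per input channel.
    Point-wise step: a 1 x 1 convolution mixing C_in channels into C_out.
    """
    depthwise = kernel_size * kernel_size * in_channels
    pointwise = in_channels * out_channels
    return depthwise + pointwise

if __name__ == "__main__":
    std = standard_conv_params(3, 64, 128)
    sep = depthwise_separable_params(3, 64, 128)
    print(std, sep, round(std / sep, 2))
```

For these example sizes the separable layer needs roughly one eighth of the weights, which is the kind of saving the abstract refers to.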
Validation Research on the Application of Depthwise Separable Convolutional AI Facial Expression Recognition in Non-pharmacological Treatment of BPSD
8
Authors: Xiangyu Liu 《Journal of Clinical and Nursing Research》 2021, No. 4, pp. 31-37 (7 pages)
Among the most obvious clinical features of dementia, or the behavioral and psychological symptoms of dementia (BPSD), are a lack of emotional expression, an increased frequency of negative emotions, and emotional instability. Observing the reduction of BPSD through emotions is considered effective and is widely used in non-pharmacological therapy. This article verifies whether an image recognition artificial intelligence (AI) system can correctly reflect the emotional state of elderly people with dementia, using a questionnaire survey of three professional elderly-care nurses. ANOVA (sig. = 0.50) is used to establish that the judgments given by the nursing staff show no obvious deviation, and then Kendall's test (0.722**) and Spearman's test (0.863**) are used to verify that the severity judgments of the emotion recognition system agree with those of the nursing staff. This implies that the tool is usable, and it can be expected to find further application in research on emotion detection in elderly people with BPSD.
Keywords: depth-wise separable convolution; emotion; BPSD; dementia; nursing
Pore network modeling of gas-water two-phase flow in deformed multi-scale fracture-porous media
9
Authors: Dai-Gang Wang, Yu-Shan Ma, Zhe Hu, Tong Wu, Ji-Rui Hou, Zhen-Chang Jiang, Xin-Xuan Qi, Kao-Ping Song, Fang-zhou Liu 《Petroleum Science》 2025, No. 5, pp. 2096-2108 (13 pages)
Two rock samples drilled from a typical ultra-deep hydrocarbon reservoir in the Tarim Basin are selected for in-situ stress-loading micro-focus CT scanning experiments, and gray images of the rock microstructure at different stress loading stages are obtained. The U-Net fully convolutional neural network is used to achieve fine semantic segmentation of rock skeleton, pore space, and micro-fractures from CT slice images of the deep rocks. Three-dimensional digital rock models of the deformed multi-scale fractured-porous media at different stress loading stages are then reconstructed, and the equivalent fracture-pore network models are extracted to explore the underlying mechanisms of gas-water two-phase flow at the pore scale. Results indicate that, during in-situ stress loading, both rocks pass through three stages: linear elastic deformation, nonlinear plastic deformation, and shear failure. The micro-mechanical behavior greatly affects the dynamic deformation of the rock microstructure and the gas-water two-phase flow. In the linear elastic deformation stage, as in-situ stress increases, both rocks are gradually compacted, leading to decreases in average pore radius, pore throat ratio, tortuosity, and water-phase relative permeability, while the coordination number remains nearly unchanged. In the plastic deformation stage, the combined influence of rock compaction and the presence of micro-fractures has a strong effect on pore-throat topological properties and gas-water relative permeability. In the shear failure stage, the generation and propagation of micro-fractures inside the rock improve topological connectivity, increase the number of fluid flow paths, and promote flow conductivity, leading to sharp increases in average pore radius and coordination number, rapid decreases in pore throat ratio and tortuosity, and a marked improvement in the relative permeability of both the gas phase and the water phase.
Keywords: ultra-deep reservoir; in-situ stress loading; U-Net fully convolutional neural network; CT scanning; microstructure deformation; pore-scale fluid flow
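MRI Image Recognition Model for Spinal Lesions Based on Multiple Attention Mechanisms
10
Authors: 周慧, 宋新景 《计算机科学与探索》 PKU Core 2026, No. 1, pp. 291-300 (10 pages)
Manual detection of spinal lesions is time-consuming and depends heavily on domain experts, so automatic recognition of spinal lesions is needed. However, spinal lesions vary widely in size, location, and structure, and spinal tumors closely resemble the rare disease brucellosis on imaging, which makes accurate localization and classification challenging. To address these challenges, an improved MRI image recognition model for spinal lesions is proposed. A bidirectional feature pyramid backbone built on ResNet-101 is introduced, and deformable convolution replaces conventional convolution at different layers to obtain more information from the feature layers. Multiple attention mechanisms, including self-attention and soft attention, are added to different modules to fuse the parts of the features that contribute most. To overcome the data imbalance among spinal tumors, infectious lesions, and brucellosis, an improved balanced cross-entropy loss function is introduced. Validated on a clinical dataset provided by a hospital in Dalian, the model achieves a recognition precision of 94.2% and a recall of 90.8%. Comparative experiments show that the method outperforms other recognition models.
Keywords: spinal lesion recognition; bidirectional feature pyramid; multiple attention mechanisms; deformable convolution; multi-feature fusion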
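A Moso Bamboo Forest Segmentation Algorithm Using Cross-Domain Adaptation and Offset Guidance
11
Authors: 叶李波, 季志利, 朱珊, 宋俊锋, 叶振, 王国相 《东北林业大学学报》 PKU Core 2026, No. 1, pp. 61-67, 90 (8 pages)
Existing methods segment moso bamboo forests in UAV imagery with poor accuracy and robustness because the forests have complex shapes and textures. To address this, BFSNet, a moso bamboo forest segmentation network using cross-domain adaptation and offset guidance, is proposed. With Baishanzu National Park as the study area, a dataset was built from UAV images of the surrounding moso bamboo forests. To strengthen feature extraction, a cross-domain adaptation module exploits the strong feature extraction ability of a source model and combines it with self-learning to extract features suited to the segmentation task, so that the two complement each other. To improve recognition and localization of bamboo forests of different shapes, an offset guidance module based on deformable convolution introduces learnable offset parameters that adapt to targets of different shapes. BFSNet was trained and tested on the DeepGlobe Land Cover Classification Challenge dataset and a self-built dataset and compared with several mainstream image segmentation methods. The results show that BFSNet achieved the best performance on all four metrics (intersection over union, Dice coefficient, precision, and recall), with intersection over union values of 76.04% and 71.93%, respectively. Compared with mainstream image segmentation models, BFSNet segments moso bamboo forests best, and its accurate modeling of forest shape lets it handle bamboo forests of different forms.
Keywords: moso bamboo forest segmentation; cross-domain adaptation; offset guidance; deformable convolution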
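Intersection over union (IoU) and the Dice coefficient, two of the segmentation metrics named in the entry above, can be sketched as follows. This is a minimal illustration using the standard definitions on flattened binary masks; the function names are hypothetical, and returning 1.0 when both masks are empty is a convention assumed here, not something stated in the source.

```python
def iou(pred, target):
    """Intersection over union of two binary masks given as flat 0/1 lists."""
    intersection = sum(1 for p, t in zip(pred, target) if p and t)
    union = sum(1 for p, t in zip(pred, target) if p or t)
    # Assumed convention: two empty masks count as a perfect match.
    return intersection / union if union else 1.0

def dice(pred, target):
    """Dice coefficient: 2 * |A and B| / (|A| + |B|) for flat 0/1 lists."""
    intersection = sum(1 for p, t in zip(pred, target) if p and t)
    total = sum(1 for p in pred if p) + sum(1 for t in target if t)
    return 2 * intersection / total if total else 1.0

if __name__ == "__main__":
    pred = [1, 1, 0, 0]
    target = [1, 0, 1, 0]
    print(iou(pred, target), dice(pred, target))
```

The two metrics are related by Dice = 2 * IoU / (1 + IoU), so Dice is never lower than IoU for the same pair of masks.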
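BurdenNet: A Prior-Information-Guided Network for Detecting Multi-State Blast Furnace Burden Surfaces in Complex Environments
12
Authors: 倪梓明, 陈先中, 侯庆文, 张洁 《工程科学学报》 PKU Core 2026, No. 1, pp. 26-38 (13 pages)
Conventional single-state burden surface detection networks do not account for the alternating smelting states of a blast furnace, so their overall accuracy in complex environments is low. To address this, this paper proposes BurdenNet, a prior-information-guided network for detecting multi-state burden surfaces. First, an image pre-classification method based on the range-direction accuracy of the raw signal is proposed; burden surface image datasets for three typical states are constructed, and the pre-classified state is used as prior information to prune network paths. Second, taking the slender, low-curvature shape of the burden surface and the sparsity of the radar sampling signal as prior information, an atrous vertical deformable convolution (AVDC) module is proposed to extract multi-state burden surface features. On this basis, mechanical stock rod data are used to build a prior spatial attention feature map, and a prior focusing attention (PFA) module is proposed so that the network focuses first on the burden surface region of the image. Finally, for bounding box regression, a band intersection over union (BIOU) loss function is proposed to further improve detection speed and accuracy. Experiments on measured data from a steel company's blast furnace show that, compared with single-state detection networks, BurdenNet improves overall precision on the multi-state burden surface dataset by 13.9% and 5.2% and F1-score by 8.1% and 4.3%, providing a more accurate method for detecting multi-state burden surface images in complex environments.
Keywords: multi-state burden surface; prior information; atrous vertical deformable convolution; prior focusing attention; network pruning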
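Raw Silk Defect Detection Algorithm Based on Deformable Convolution and Attention Mechanism
13
Authors: 易娇娇, 孙卫红, 梁曼, 邵铁锋 《棉纺织技术》 2026, No. 1, pp. 36-42 (7 pages)
Raw silk defects are small and variable in shape, which causes false and missed detections. To address this, a raw silk defect detection algorithm based on deformable convolution and an attention mechanism is proposed. With YOLOv8n as the baseline, deformable convolution DCNv2 is first integrated into C2f in the backbone to form a new C2f-DCN module, whose arbitrary sampling shape adaptively fits the geometry of defects and improves feature extraction for irregular defects. Second, an ECA attention mechanism is added at the end of the backbone; its cross-channel interaction suppresses useless information such as background noise and raises the model's attention to defect features. Finally, a P2 detection head is added to the neck to capture shallow semantic information, forming a four-branch detection structure that improves the response to small targets. Experiments show that, compared with the original algorithm, mAP@0.5 and mAP@0.5:0.95 reach 95.4% and 75.9%, improvements of 3.3 and 9.0 percentage points, and inference speed reaches 65.2 frames/s. The algorithm detects defects effectively, reduces false and missed detections, and is fast enough for real-time detection.
Keywords: defect detection; deformable convolution; YOLOv8n; attention mechanism; object detection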
An intelligent singular value diagnostic method for concrete dam deformation monitoring  Cited by: 5
14
Authors: Jie Yang, Xu-dong Qu, Meng Chang 《Water Science and Engineering》 EI CAS CSCD 2019, No. 3, pp. 205-212 (8 pages)
Extracting implicit anomaly information by mining deformation monitoring data is highly significant for determining dam safety status. When used for intelligent singular value diagnosis in concrete dam deformation monitoring, shallow neural network models suffer from local optima and overfitting and require manual feature extraction. To obtain an intelligent singular value diagnosis model that can be used for dam safety monitoring, a convolutional neural network (CNN) model with the advantages of deep learning (DL), such as automatic feature extraction, good model fitting, and strong generalizability, was trained in this study. An engineering example shows that the predicted result of the CNN-based method is highly compatible with the confusion matrix, with a precision of 92.41%, receiver operating characteristic (ROC) coordinates of (0.03, 0.97), an area-under-curve (AUC) value of 0.99, and an F1-score of 0.91. Moreover, the CNN model performs better than models based on decision tree (DT) and k-nearest neighbor (KNN) methods. The CNN-based intelligent singular value diagnostic method is therefore simple to operate, highly intelligent, and highly reliable, and it has high potential for engineering application.
Keywords: singular value diagnosis; convolutional neural network; artificial intelligence; deformation monitoring; concrete dam
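Precision and F1-score, reported in the entry above, both derive from confusion matrix counts. The sketch below uses the standard definitions; the counts in the example are made-up illustration values, not data from the paper, and the function name is hypothetical.

```python
def precision_recall_f1(tp, fp, fn):
    """Compute precision, recall and F1 from true-positive, false-positive
    and false-negative counts. Returns 0.0 for any undefined ratio."""
    precision = tp / (tp + fp) if (tp + fp) else 0.0
    recall = tp / (tp + fn) if (tp + fn) else 0.0
    denom = precision + recall
    # F1 is the harmonic mean of precision and recall.
    f1 = 2 * precision * recall / denom if denom else 0.0
    return precision, recall, f1

if __name__ == "__main__":
    # Hypothetical counts: 8 anomalies found correctly, 2 false alarms, 2 missed.
    print(precision_recall_f1(tp=8, fp=2, fn=2))
```

Because F1 is a harmonic mean, it stays close to the lower of precision and recall, so a high precision alone does not guarantee a high F1.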
LDNet: structure-focused lane detection based on line deformation
15
Authors: ZHANG Jun, WANG Xingbin, GUO Binglei 《High Technology Letters》 EI CAS 2022, No. 3, pp. 307-316 (10 pages)
Lane detection is a fundamental task for autonomous driving. Conventional methods mainly treat lane detection as a pixel-wise segmentation problem, which struggles with uncontrollable road environments and needs post-processing to extract the lane parameters. In this work, a series of lines is used to represent traffic lanes, and a novel line deformation network (LDNet) is proposed to directly predict the coordinates of lane line points. Inspired by the dynamic behavior of classic snake algorithms, LDNet uses a neural network to iteratively deform an initial lane line to match the lane markings. To capture the long and discontinuous structures of lane lines, LDNet uses 1D convolution for structured feature learning along the lane lines. Based on LDNet, a two-stage pipeline is developed for lane marking detection: (1) initial lane line proposal, which predicts a list of lane line candidates, and (2) lane line deformation, which obtains the coordinates of lane line points. Experiments show that the proposed approach achieves competitive performance on the TuSimple dataset while being efficient enough for real-time applications on a GTX 1650 GPU. In particular, the accuracy of LDNet with annotated starting and ending points is up to 99.45%, which indicates that an improved initial lane line proposal method can further enhance the performance of LDNet.
Keywords: autonomous driving; convolutional neural networks (CNNs); lane detection; line deformation
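Traffic Scene Instance Segmentation Algorithm Based on Improved YOLOv8  Cited by: 4
16
Authors: 赵南南, 高翡晨 《计算机工程》 PKU Core 2025, No. 1, pp. 198-207 (10 pages)
An instance segmentation algorithm based on an improved YOLOv8, DE-YOLO, is proposed. To reduce interference from complex image backgrounds, an efficient multi-scale attention mechanism is introduced, whose cross-dimensional interaction distributes spatial semantic features evenly within each feature group. In the backbone, deformable convolution DCNv2 is combined with the C2f convolution layer to overcome the limits of standard convolution and improve deformability. To reduce harmful gradients and improve detector accuracy, the Wise-IoU (WIoU) loss with a dynamic non-monotonic focusing mechanism replaces the Complete-IoU (CIoU) loss for quality assessment, improving box localization and segmentation accuracy. Mixup data augmentation is also enabled to enrich the dataset and the training features and improve the model's learning ability. Experiments show that, on the Cityscapes urban scene dataset, DE-YOLO improves mask mean average precision (mAPmask) by 2.0 percentage points over the baseline YOLOv8n-seg and average precision at an IoU threshold of 0.5 by 3.2 percentage points. The algorithm improves accuracy while keeping good detection speed and a small parameter count, which is 2.2 to 31.3 percentage points lower than that of comparable models.
Keywords: YOLOv8 network; instance segmentation; efficient multi-scale attention; deformable convolution; loss function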
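Real-Time Monitoring of Mine Personnel Protective Equipment Using Improved YOLOv8  Cited by: 3
17
Authors: 张磊, 孙志鹏, 陶虹京, 郝尚凯, 燕倩如, 李熙尉 《煤炭科学技术》 PKU Core 2025, No. S1, pp. 354-365 (12 pages)
Wearing personal protective equipment is an important means of keeping mine workers safe, and monitoring that equipment is an important part of coal mine safety management. The underground environment is complex, and video surveillance is easily disturbed by noise, lighting, and dust, so existing object detection methods suffer from low accuracy, poor real-time performance, and high model complexity when detecting protective equipment. To address this, a real-time monitoring method based on an improved YOLOv8, called DBE-YOLO, is proposed. First, DBE-YOLO combines deformable convolution (DCNv2) with the CBS module of the baseline backbone to form a DBS module. This gives the convolution deformable sampling that follows the true shape and size of the detected object more closely and is more robust, which improves feature capture for targets at different scales, helps the model extract more features of protective equipment, and raises detection accuracy. Second, a weighted bidirectional feature pyramid network (BiFPN) is integrated into the feature enhancement network. It removes inefficient feature transfer nodes during multi-scale fusion to achieve higher-level fusion and better fusion efficiency across scales, and it introduces a learnable weight that helps the network learn the importance of different input features. Finally, WIoUv3 is used as the loss function. By dynamically allocating gradient gain it focuses on anchor boxes of ordinary quality, reduces the harmful gradients produced by low-quality anchor boxes during training, and further improves performance. Experiments show that DBE-YOLO performs well in monitoring mine personnel protective equipment: precision, recall, and average precision are 93.1%, 93.0%, and 95.8%, which are 0.8%, 2.9%, and 2.9% higher than the baseline, and detection speed rises to 65 f/s, an improvement of 8.3%. The parameter count, floating-point operations, and model size are 2 M, 6.6 G, and 4.4 MB, which are 33.3%, 18.5%, and 30.2% lower than those of the original model. Validation on surveillance video of coal mine field operations shows that the improved model effectively reduces missed and false detections, providing a technical means of improving the operational safety of mine personnel.
Keywords: deformable convolution; object detection; loss function; deep learning; real-time monitoring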
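The learnable per-input weight mentioned for BiFPN can be illustrated with the fast normalized fusion rule described for BiFPN in the EfficientDet work: each weight is clipped to be non-negative and the weighted sum is divided by the sum of weights plus a small epsilon. Whether DBE-YOLO uses exactly this variant is an assumption; the function name is hypothetical, and features are represented as flat lists of equal length for simplicity.

```python
def fast_normalized_fusion(features, weights, eps=1e-4):
    """Fuse same-shaped feature vectors with learnable non-negative weights.

    out = sum_i(w_i * x_i) / (eps + sum_i(w_i)), with w_i clipped at zero
    so that each normalized weight lies between 0 and 1.
    """
    clipped = [max(0.0, w) for w in weights]
    total = eps + sum(clipped)
    length = len(features[0])
    return [
        sum(w * f[i] for w, f in zip(clipped, features)) / total
        for i in range(length)
    ]

if __name__ == "__main__":
    low_level = [1.0, 2.0]
    high_level = [3.0, 4.0]
    # Equal weights give (almost exactly) the average of the two inputs.
    print(fast_normalized_fusion([low_level, high_level], [1.0, 1.0]))
    # A larger weight pulls the result toward that input.
    print(fast_normalized_fusion([low_level, high_level], [0.2, 1.8]))
```

During training the weights would be updated by backpropagation, which is how the network learns how much each input scale should contribute.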
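Lightweight and Modal-Matching Optimization Design of High-Speed EMU Car Bodies  Cited by: 1
18
Authors: 王浩, 李凡松, 杜翔, 王成强, 邬平波 《中南大学学报(自然科学版)》 PKU Core 2025, No. 4, pp. 1658-1672 (15 pages)
Lightweight design of high-speed electric multiple unit (EMU) car bodies lowers the first-order diamond-mode frequency. Starting from car body design, this study investigates how to raise the first-order diamond-mode frequency of a car body whose underframe has no cross beams. The car body structure is optimized based on sensitivity and mode-shape analysis, and the thicknesses of the car body cross-section profiles are optimized using a convolutional neural network surrogate model and the covariance matrix adaptation evolution strategy. A rigid-flexible coupled dynamics model of the EMU with an elastic car body is built from wheel and rail profiles measured on the line. The results show that the optimized car body frame is 680 kg lighter, a mass reduction of 6.4%, and that the first-order diamond-mode frequency in the fully equipped condition rises by 1.66 Hz, or 19.1%. The optimized car body is not only lighter but also effectively suppresses abnormal elastic vibration of the car body and improves passenger ride comfort.
Keywords: EMU; car body shaking; structural optimization; diamond mode; convolutional neural network; optimization algorithm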
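Application of an Improved YOLOv8 Detection Network to UAV Aerial Image Recognition  Cited by: 2
19
Authors: 冉险生, 刘圣斌 《现代电子技术》 PKU Core 2025, No. 7, pp. 48-56 (9 pages)
Existing object detection algorithms for UAV aerial images have low detection accuracy and large errors on small-scale vehicles. To address this, an improved YOLOv8 vehicle detection algorithm for UAVs, Improve-YOLOv8, is proposed. First, the deformable convolution module DCNv2 is introduced into the C2f convolution layers of the backbone, improving its ability to adapt to irregular spatial structures and strengthening detection of occluded and overlapping small targets. Second, drawing on Large Separable Kernel Attention, an SPPF-LSKA module with long-range dependency and adaptive capability is proposed, which effectively reduces background interference in aerial image detection. Third, a DyHead detection head is introduced, fusing scale, spatial, and task attention mechanisms to improve detection performance. Finally, WIoUv3 is used as the bounding box regression loss, and its wise gradient allocation strategy improves localization. Experiments on the Mapsai dataset show that, compared with the baseline, Improve-YOLOv8 improves precision, recall, and average precision by 5.1%, 6.1%, and 5.1%, respectively, showing good detection performance and potential for practical application.
Keywords: UAV aerial images; small targets; YOLOv8; object detection; deformable convolution; attention mechanism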
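Multi-Scene Fire Sign Detection Algorithm Based on EE-YOLOv8s  Cited by: 4
20
Authors: 崔克彬, 耿佳昌 《图学学报》 PKU Core 2025, No. 1, pp. 13-27 (15 pages)
In smoke and fire scenes, lighting changes, the dynamic nature of smoke and flames, complex backgrounds, and very small targets cause false and missed detections of fire signs. To address this, an improved YOLOv8s model, EE-YOLOv8s, is proposed. An MBConv-Block convolution module is designed and integrated into the YOLOv8 Backbone to build the EfficientNetEasy feature extraction network, which keeps the model lightweight while improving image feature extraction. The large separable kernel attention mechanism LSKA is introduced to improve the SPPELAN module, turning the spatial pyramid part into SPP_LSKA_ELAN, which captures spatial detail over a large range and extracts more complete features in complex, changing fire scenes, so that targets can be distinguished from similar objects. In the Neck, deformable convolution (DCN) and the cross-spatial efficient multi-scale attention EMA are introduced to build the C2f_DCN_EMA deformable convolution calibration module, which improves adaptation to changes in the edge contours of smoke and fire targets, promotes feature fusion and calibration, and highlights target features. In the Head, a small-target detection head carrying the lightweight, parameter-free attention mechanism SimAM is added, and the detection head channel counts are re-planned, strengthening multi-size target representation while reducing redundancy to use parameters more effectively. Experiments show that, compared with the original model, the improved EE-YOLOv8s has 13.6% fewer parameters, 6.8% higher precision, 7.3% higher recall, and 5.4% higher mAP, improving fire sign detection while maintaining detection speed.
Keywords: smoke and fire target detection; EfficientNetEasy backbone; large separable kernel attention mechanism; deformable convolution calibration module; small target detection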