期刊文献+
共找到676篇文章
< 1 2 34 >
每页显示 20 50 100
A Remote Sensing Image Semantic Segmentation Method by Combining Deformable Convolution with Conditional Random Fields 被引量:13
1
作者 Zongcheng ZUO Wen ZHANG Dongying ZHANG 《Journal of Geodesy and Geoinformation Science》 2020年第3期39-49,共11页
Currently,deep convolutional neural networks have made great progress in the field of semantic segmentation.Because of the fixed convolution kernel geometry,standard convolution neural networks have been limited the a... Currently,deep convolutional neural networks have made great progress in the field of semantic segmentation.Because of the fixed convolution kernel geometry,standard convolution neural networks have been limited the ability to simulate geometric transformations.Therefore,a deformable convolution is introduced to enhance the adaptability of convolutional networks to spatial transformation.Considering that the deep convolutional neural networks cannot adequately segment the local objects at the output layer due to using the pooling layers in neural network architecture.To overcome this shortcoming,the rough prediction segmentation results of the neural network output layer will be processed by fully connected conditional random fields to improve the ability of image segmentation.The proposed method can easily be trained by end-to-end using standard backpropagation algorithms.Finally,the proposed method is tested on the ISPRS dataset.The results show that the proposed method can effectively overcome the influence of the complex structure of the segmentation object and obtain state-of-the-art accuracy on the ISPRS Vaihingen 2D semantic labeling dataset. 展开更多
关键词 high-resolution remote sensing image semantic segmentation deformable convolution network conditions random fields
在线阅读 下载PDF
Multi-Layer Feature Extraction with Deformable Convolution for Fabric Defect Detection 被引量:1
2
作者 Jielin Jiang Chao Cui +1 位作者 Xiaolong Xu Yan Cui 《Intelligent Automation & Soft Computing》 2024年第4期725-744,共20页
In the textile industry,the presence of defects on the surface of fabric is an essential factor in determining fabric quality.Therefore,identifying fabric defects forms a crucial part of the fabric production process.... In the textile industry,the presence of defects on the surface of fabric is an essential factor in determining fabric quality.Therefore,identifying fabric defects forms a crucial part of the fabric production process.Traditional fabric defect detection algorithms can only detect specific materials and specific fabric defect types;in addition,their detection efficiency is low,and their detection results are relatively poor.Deep learning-based methods have many advantages in the field of fabric defect detection,however,such methods are less effective in identifying multiscale fabric defects and defects with complex shapes.Therefore,we propose an effective algorithm,namely multilayer feature extraction combined with deformable convolution(MFDC),for fabric defect detection.In MFDC,multi-layer feature extraction is used to fuse the underlying location features with high-level classification features through a horizontally connected top-down architecture to improve the detection of multi-scale fabric defects.On this basis,a deformable convolution is added to solve the problem of the algorithm’s weak detection ability of irregularly shaped fabric defects.In this approach,Roi Align and Cascade-RCNN are integrated to enhance the adaptability of the algorithm in materials with complex patterned backgrounds.The experimental results show that the MFDC algorithm can achieve good detection results for both multi-scale fabric defects and defects with complex shapes,at the expense of a small increase in detection time. 展开更多
关键词 Fabric defect detection multi-layer features deformable convolution
在线阅读 下载PDF
CW-HRNet:Constrained Deformable Sampling and Wavelet-Guided Enhancement for Lightweight Crack Segmentation
3
作者 Dewang Ma 《Journal of Electronic Research and Application》 2025年第5期269-280,共12页
This paper presents CW-HRNet,a high-resolution,lightweight crack segmentation network designed to address challenges in complex scenes with slender,deformable,and blurred crack structures.The model incorporates two ke... This paper presents CW-HRNet,a high-resolution,lightweight crack segmentation network designed to address challenges in complex scenes with slender,deformable,and blurred crack structures.The model incorporates two key modules:Constrained Deformable Convolution(CDC),which stabilizes geometric alignment by applying a tanh limiter and learnable scaling factor to the predicted offsets,and the Wavelet Frequency Enhancement Module(WFEM),which decomposes features using Haar wavelets to preserve low-frequency structures while enhancing high-frequency boundaries and textures.Evaluations on the CrackSeg9k benchmark demonstrate CW-HRNet’s superior performance,achieving 82.39%mIoU with only 7.49M parameters and 10.34 GFLOPs,outperforming HrSegNet-B48 by 1.83% in segmentation accuracy with minimal complexity overhead.The model also shows strong cross-dataset generalization,achieving 60.01%mIoU and 66.22%F1 on Asphalt3k without fine-tuning.These results highlight CW-HRNet’s favorable accuracyefficiency trade-off for real-world crack segmentation tasks. 展开更多
关键词 Crack segmentation Lightweight semantic segmentation deformable convolution Wavelet transform Road infrastructure
在线阅读 下载PDF
Attention-Augmented YOLOv8 with Ghost Convolution for Real-Time Vehicle Detection in Intelligent Transportation Systems
4
作者 Syed Sajid Ullah Muhammad Zunair Zamir +1 位作者 Ahsan Ishfaq Salman Khan 《Journal on Artificial Intelligence》 2025年第1期255-274,共20页
Accurate vehicle detection is essential for autonomous driving,traffic monitoring,and intelligent transportation systems.This paper presents an enhanced YOLOv8n model that incorporates the Ghost Module,Convolutional B... Accurate vehicle detection is essential for autonomous driving,traffic monitoring,and intelligent transportation systems.This paper presents an enhanced YOLOv8n model that incorporates the Ghost Module,Convolutional Block Attention Module(CBAM),and Deformable Convolutional Networks v2(DCNv2).The Ghost Module streamlines feature generation to reduce redundancy,CBAM applies channel and spatial attention to improve feature focus,and DCNv2 enables adaptability to geometric variations in vehicle shapes.These components work together to improve both accuracy and computational efficiency.Evaluated on the KITTI dataset,the proposed model achieves 95.4%mAP@0.5—an 8.97% gain over standard YOLOv8n—along with 96.2% precision,93.7% recall,and a 94.93%F1-score.Comparative analysis with seven state-of-the-art detectors demonstrates consistent superiority in key performance metrics.An ablation study is also conducted to quantify the individual and combined contributions of GhostModule,CBAM,and DCNv2,highlighting their effectiveness in improving detection performance.By addressing feature redundancy,attention refinement,and spatial adaptability,the proposed model offers a robust and scalable solution for vehicle detection across diverse traffic scenarios. 展开更多
关键词 YOLOv8n vehicle detection deformable convolutional networks(DCNv2) ghost module convolutional block attention module(CBAM) attention mechanisms
在线阅读 下载PDF
Optical Flow with Learning Feature for Deformable Medical Image Registration 被引量:1
5
作者 Jinrong Hu Lujin Li +3 位作者 Ying Fu Maoyang Zou Jiliu Zhou Shanhui Sun 《Computers, Materials & Continua》 SCIE EI 2022年第5期2773-2788,共16页
Deformable medical image registration plays a vital role in medical image applications,such as placing different temporal images at the same time point or different modality images into the same coordinate system.Vari... Deformable medical image registration plays a vital role in medical image applications,such as placing different temporal images at the same time point or different modality images into the same coordinate system.Various strategies have been developed to satisfy the increasing needs of deformable medical image registration.One popular registration method is estimating the displacement field by computing the optical flow between two images.The motion field(flow field)is computed based on either gray-value or handcrafted descriptors such as the scale-invariant feature transform(SIFT).These methods assume that illumination is constant between images.However,medical images may not always satisfy this assumption.In this study,we propose a metric learning-based motion estimation method called Siamese Flow for deformable medical image registration.We train metric learners using a Siamese network,which produces an image patch descriptor that guarantees a smaller feature distance in two similar anatomical structures and a larger feature distance in two dissimilar anatomical structures.In the proposed registration framework,the flow field is computed based on such features and is close to the real deformation field due to the excellent feature representation ability of the Siamese network.Experimental results demonstrate that the proposed method outperforms the Demons,SIFT Flow,Elastix,and VoxelMorph networks regarding registration accuracy and robustness,particularly with large deformations. 展开更多
关键词 deformation registration feature extraction optical flow convolutional neural network
在线阅读 下载PDF
A Deformable Network with Attention Mechanism for Retinal Vessel Segmentation
6
作者 Xiaolong Zhu Wenjian Li +2 位作者 Weihang Zhang Dongwei Li Huiqi Li 《Journal of Beijing Institute of Technology》 EI CAS 2024年第3期186-193,共8页
The intensive application of deep learning in medical image processing has facilitated the advancement of automatic retinal vessel segmentation research.To overcome the limitation that traditional U-shaped vessel segm... The intensive application of deep learning in medical image processing has facilitated the advancement of automatic retinal vessel segmentation research.To overcome the limitation that traditional U-shaped vessel segmentation networks fail to extract features in fundus image sufficiently,we propose a novel network(DSeU-net)based on deformable convolution and squeeze excitation residual module.The deformable convolution is utilized to dynamically adjust the receptive field for the feature extraction of retinal vessel.And the squeeze excitation residual module is used to scale the weights of the low-level features so that the network learns the complex relationships of the different feature layers efficiently.We validate the DSeU-net on three public retinal vessel segmentation datasets including DRIVE,CHASEDB1,and STARE,and the experimental results demonstrate the satisfactory segmentation performance of the network. 展开更多
关键词 retinal vessel segmentation deformable convolution attention mechanism deep learning
暂未订购
DSD-MatchingNet:Deformable sparse-to-dense feature matching for learning accurate correspondences
7
作者 Yicheng ZHAO Han ZHANG +3 位作者 Ping LU Ping LI Enhua WU Bin SHENG 《Virtual Reality & Intelligent Hardware》 2022年第5期432-443,共12页
Background Exploring correspondences across multiview images is the basis of various computer vision tasks.However,most existing methods have limited accuracy under challenging conditions.Method To learn more robust a... Background Exploring correspondences across multiview images is the basis of various computer vision tasks.However,most existing methods have limited accuracy under challenging conditions.Method To learn more robust and accurate correspondences,we propose DSD-MatchingNet for local feature matching in this study.First,we develop a deformable feature extraction module to obtain multilevel feature maps,which harvest contextual information from dynamic receptive fields.The dynamic receptive fields provided by the deformable convolution network ensure that our method obtains dense and robust correspondence.Second,we utilize sparse-to-dense matching with symmetry of correspondence to implement accurate pixel-level matching,which enables our method to produce more accurate correspondences.Result Experiments show that our proposed DSD-MatchingNet achieves a better performance on the image matching benchmark,as well as on the visual localization benchmark.Specifically,our method achieved 91.3%mean matching accuracy on the HPatches dataset and 99.3%visual localization recalls on the Aachen Day-Night dataset. 展开更多
关键词 Image matching deformable convolution network Sparse-to-dense matching
在线阅读 下载PDF
3D Data Scattergram Image Classification Based Protection for Transmission Line Connecting BESS Using Depth-wise Separable Convolution Based CNN 被引量:1
8
作者 Yingyu Liang Yi Ren +1 位作者 Xiaoyang Yang Wenting Zha 《Journal of Modern Power Systems and Clean Energy》 2025年第2期609-621,共13页
The distinctive fault characteristics of battery energy storage stations(BESSs)significantly affect the reliability of conventional protection methods for transmission lines.In this paper,the three-dimensional(3D)data... The distinctive fault characteristics of battery energy storage stations(BESSs)significantly affect the reliability of conventional protection methods for transmission lines.In this paper,the three-dimensional(3D)data scattergrams are constructed using current data from both sides of the transmission line and their sum.Following a comprehensive analysis of the varying characteristics of 3D data scattergrams under different conditions,a 3D data scattergram image classification based protection method is developed.The depth-wise separable convolution is used to ensure a lightweight convolutional neural network(CNN)structure without compromising performance.In addition,a Bayesian hyperparameter optimization algorithm is used to achieve a hyperparametric search to simplify the training process.Compared with artificial neural networks and CNNs,the depth-wise separable convolution based CNN(DPCNN)achieves a higher recognition accuracy.The 3D data scattergram image classification based protection method using DPCNN can accurately separate internal faults from other disturbances and identify fault phases under different operating states and fault conditions.The proposed protection method also shows first-class tolerability against current transformer(CT)saturation and CT measurement errors. 展开更多
关键词 convolutional neural network(CNN) battery energy storage station(BESS) depth-wise separable convolution hyperparameter optimization fault classification line protection
原文传递
Validation Research on the Application of Depthwise Separable Convolutional Al Facial Expression Recognition in Non-pharmacological Treatment of BPSD
9
作者 Xiangyu Liu 《Journal of Clinical and Nursing Research》 2021年第4期31-37,共7页
One of the most obvious clinical reasons of dementia or The Behavioral and Psychological Symptoms of Dementia(BPSD)are the lack of emotional expression,the increased frequency of negative emotions,and the impermanence... One of the most obvious clinical reasons of dementia or The Behavioral and Psychological Symptoms of Dementia(BPSD)are the lack of emotional expression,the increased frequency of negative emotions,and the impermanence of emotions.Observing the reduction of BPSD in dementia through emotions can be considered effective and widely used in the field of non-pharmacological therapy.At present,this article will verify whether the image recognition artificial intelligence(AI)system can correctly reflect the emotional performance of the elderly with dementia through a questionnaire survey of three professional elderly nursing staff.The ANOVA(sig.=0.50)is used to determine that the judgment given by the nursing staff has no obvious deviation,and then Kendall's test(0.722**)and spearman's test(0.863**)are used to verify the judgment severity of the emotion recognition system and the nursing staff unanimously.This implies the usability of the tool.Additionally,it can be expected to be further applied in the research related to BPSD elderly emotion detection. 展开更多
关键词 depth-wise separable convolution EMOTION BPSD DEMENTIA Nursing
暂未订购
Pore network modeling of gas-water two-phase flow in deformed multi-scale fracture-porous media
10
作者 Dai-Gang Wang Yu-Shan Ma +6 位作者 Zhe Hu Tong Wu Ji-Rui Hou Zhen-Chang Jiang Xin-Xuan Qi Kao-Ping Song Fang-zhou Liu 《Petroleum Science》 2025年第5期2096-2108,共13页
Two actual rocks drilled from a typical ultra-deep hydrocarbon reservoir in the Tarim Basin are selected to conduct in-situ stress-loading micro-focus CT scanning experiments.The gray images of rock microstructure at ... Two actual rocks drilled from a typical ultra-deep hydrocarbon reservoir in the Tarim Basin are selected to conduct in-situ stress-loading micro-focus CT scanning experiments.The gray images of rock microstructure at different stress loading stages are obtained.The U-Net fully convolutional neural network is utilized to achieve fine semantic segmentation of rock skeleton,pore space,and microfractures based on CT slice images of deep rocks.The three-dimensional digital rock models of deformed multiscale fractured-porous media at different stress loading stages are thereafter reconstructed,and the equivalent fracture-pore network models are finally extracted to explore the underlying mechanisms of gas-water two-phase flow at the pore-scale.Results indicate that,in the process of insitu stress loading,both the deep rocks have experienced three stages:linear elastic deformation,nonlinear plastic deformation,and shear failure.The micro-mechanical behavior greatly affects the dynamic deformation of rock microstructure and gas-water two-phase flow.In the linear elastic deformation stage,with the increase in in-situ stress,both the deep rocks are gradually compacted,leading to decreases in average pore radius,pore throat ratio,tortuosity,and water-phase relative permeability,while the coordination number nearly remains unchanged.In the plastic deformation stage,the synergistic influence of rock compaction and existence of micro-fractures typically exert a great effect on pore-throat topological properties and gas-water relative permeability.In the shear failure stage,due to the generation and propagation of micro-fractures inside the deep rock,the topological connectivity becomes better,fluid flow paths increase,and flow conductivity is promoted,thus leading to sharp increases in average pore radius and coordination number,rapid decreases in pore throat ratio and tortuosity,as well as remarkable improvement in relative permeability of gas phase and waterphase. 展开更多
关键词 Ultra-deep reservoir In-situ stress loading U-Netfully convolutional neural network CTscanning Microstructure deformation Pore-scalefluid flow
原文传递
基于多模态语义信息的文本生成图像方法
11
作者 杨冰 周家辉 +1 位作者 姚金良 向学勤 《浙江大学学报(工学版)》 北大核心 2026年第2期360-369,共10页
针对文本语义与图像语义不一致以及图像细节表现不足的问题,提出新的文本生成图像方法.基于多模态语义信息建立鉴别依据,在文本语义基础上引入真实图像语义,以解决文本描述信息密度低的问题,有效缓解生成图像细节缺失或失真的现象.在生... 针对文本语义与图像语义不一致以及图像细节表现不足的问题,提出新的文本生成图像方法.基于多模态语义信息建立鉴别依据,在文本语义基础上引入真实图像语义,以解决文本描述信息密度低的问题,有效缓解生成图像细节缺失或失真的现象.在生成器中集成可变形卷积和星模块卷积,增强生成器表达能力,提高生成图像的细节表现和整体质量.为了验证所提方法的有效性,在CUB数据集和COCO数据集上进行模型训练及评估.与生成式对抗对比语言-图像预训练模型(GALIP)相比,所提方法在保证高效生成的同时,在细节表现、语义一致性及整体质量上具有显著优势. 展开更多
关键词 文本生成图像 多模态语义 可变形卷积 星模块卷积 语义对齐鉴别器
在线阅读 下载PDF
基于可变形卷积和注意力机制的路面裂缝检测
12
作者 谢永华 方育才 彭银佳 《计算机工程与设计》 北大核心 2026年第1期279-285,共7页
为解决路面裂缝检测中图像边缘特征难以学习和背景噪声干扰的问题,提出一个基于可变形卷积和注意力机制的可端到端训练的路面裂缝检测网络。该网络基于U-Net结构设计,在特征融合部分添加边缘感知模块来增强裂缝边缘的检测能力;在编码器... 为解决路面裂缝检测中图像边缘特征难以学习和背景噪声干扰的问题,提出一个基于可变形卷积和注意力机制的可端到端训练的路面裂缝检测网络。该网络基于U-Net结构设计,在特征融合部分添加边缘感知模块来增强裂缝边缘的检测能力;在编码器部分使用空洞残差模块扩大感受野并保留更多细节信息;在解码器部分添加注意力机制提高对裂缝特征的关注度,抑制背景噪声。实验结果表明,该网络在MPA、mIoU和F1值这3项指标上均优于其它对比网络,验证了该网络的有效性。 展开更多
关键词 裂缝检测 语义分割 编码解码 可变形卷积 空洞卷积 残差连接 注意力机制
在线阅读 下载PDF
基于改进YOLOv7的遥感图像目标检测方法
13
作者 陈辉 田博 +2 位作者 赵永红 瞿海平 梁建虎 《兰州理工大学学报》 北大核心 2026年第1期93-100,共8页
为了解决遥感图像中小目标规模大、目标分布密集以及容易产生漏检和误检等问题,提出了一种基于改进YOLOv7模型的遥感图像目标检测方法.该方法首先在YOLOv7模型中引入DCNv2结构和残差结构,重新构建了新的骨干网络,以增强目标浅层特征信... 为了解决遥感图像中小目标规模大、目标分布密集以及容易产生漏检和误检等问题,提出了一种基于改进YOLOv7模型的遥感图像目标检测方法.该方法首先在YOLOv7模型中引入DCNv2结构和残差结构,重新构建了新的骨干网络,以增强目标浅层特征信息的提取,并提高网络的准确性.其次,在颈部网络中采用新的特征融合模块,并通过SimAM注意力机制,自适应调节浅层特征的纹理信息和深层语义信息的融合权重,更有针对性地抑制提取浅层特征时带来的噪声.最后,采用归一化高斯瓦瑟斯坦距离损失作为模型的回归损失函数,取代传统的IOU,以提高多尺度目标的检测能力.该算法在DOTAv1.0数据集上小目标平均精度达到20.1%,在DIOR数据集上小目标平均精度达到29.0%.同时,与YOLOv7、YOLOv6等方法相比,该算法展现出了较强的竞争力. 展开更多
关键词 遥感图像 目标检测 可变形卷积网络 SimAM注意力机制 高斯瓦瑟斯坦距离
在线阅读 下载PDF
基于多注意力机制的脊柱病灶MRI影像识别模型
14
作者 周慧 宋新景 《计算机科学与探索》 北大核心 2026年第1期291-300,共10页
人工检测脊柱病变是一项耗时的工作,并且高度依赖于该领域的专家,因此脊柱病灶的自动识别是非常必要的。然而,因为脊柱病灶的大小、位置和结构存在着广泛的差异,同时脊柱肿瘤与稀有病布鲁氏菌在影像上高度相似,所以脊柱病灶的准确定位... 人工检测脊柱病变是一项耗时的工作,并且高度依赖于该领域的专家,因此脊柱病灶的自动识别是非常必要的。然而,因为脊柱病灶的大小、位置和结构存在着广泛的差异,同时脊柱肿瘤与稀有病布鲁氏菌在影像上高度相似,所以脊柱病灶的准确定位和分类是一项具有挑战性的工作。为了应对这些挑战,提出了一种改进的脊柱病灶MRI影像识别模型。引入以ResNet-101为基础的双向特征金字塔主干网络,利用可变卷积在不同层替代传统的卷积神经网络,从特征层中获得更多的特征信息。在不同的模块中加入了多重注意力,包括自注意力机制和柔性注意力机制,有效地融合特征中贡献较大的部分。为了克服脊柱肿瘤、感染性病变、稀有病布鲁氏菌的数据不平衡问题,引入了改进的平衡交叉熵损失函数。在大连某医院提供的临床数据集上进行验证,识别精确率达到了94.2%,识别召回率达到90.8%。与其他识别模型进行对比实验,结果说明了该方法相对于其他模型识别性能更好。 展开更多
关键词 脊柱病灶识别 双向特征金字塔 多注意力机制 可变卷积 多特征融合
在线阅读 下载PDF
基于改进YOLOv10的反应堆压力容器主螺栓孔螺纹缺陷视觉检测方法研究
15
作者 盛倍宁 崔淑梅 +3 位作者 杜华 杨泽 王炳炎 陈书华 《核动力工程》 北大核心 2026年第1期235-241,共7页
反应堆压力容器主螺栓孔表面状态影响着压力容器密封性能与运行安全,其在换料检修过程中容易产生接触损伤,而传统的人工目测检测方法存在效率低、精度低的问题,因此需对检测方法进行研究。为此,本文提出了一种基于改进YOLOv10的反应堆... 反应堆压力容器主螺栓孔表面状态影响着压力容器密封性能与运行安全,其在换料检修过程中容易产生接触损伤,而传统的人工目测检测方法存在效率低、精度低的问题,因此需对检测方法进行研究。为此,本文提出了一种基于改进YOLOv10的反应堆压力容器主螺栓孔螺纹缺陷视觉检测方法,能够有效提升螺纹缺陷的检测精度。通过分析螺纹缺陷特征,发现限制检测精度的主要因素为不规则和尺寸方差大的缺陷。针对缺陷形态不规则的问题,结合可变形卷积提出新的C2fDcn模块,增强模型对形状不规则缺陷的检测精度;针对缺陷尺寸方差大的问题,在模型低层次网络中引出新的检测头,增强模型对小缺陷的检测精度。实验结果表明:改进YOLOv10的平均精度为90.2%,较YOLOv10提升4.1%,小缺陷平均检测精度为90.3%,提升2.6%;划痕的平均检测精度为85.9%,提升11.3%。本文所提出的改进方法可以有效应对螺纹缺陷形状不规则、尺寸方差大难以检出的问题,对类似工业小缺陷、不规则缺陷的检测研究具有一定的参考价值。 展开更多
关键词 螺纹缺陷检测 改进YOLOv10 可变形卷积 检测头
原文传递
应用跨领域适应和偏移量引导的毛竹林分割算法
16
作者 叶李波 季志利 +3 位作者 朱珊 宋俊锋 叶振 王国相 《东北林业大学学报》 北大核心 2026年第1期61-67,90,共8页
为解决由于无人机视角下毛竹林的形状和纹理复杂,现有方法在分割精度和鲁棒性方面表现不佳的问题,提出了一种应用跨领域适应和偏移量引导的毛竹林分割网络——BFSNet。以百山祖国家公园为试验区,利用无人机拍摄周边毛竹林图像构建数据... 为解决由于无人机视角下毛竹林的形状和纹理复杂,现有方法在分割精度和鲁棒性方面表现不佳的问题,提出了一种应用跨领域适应和偏移量引导的毛竹林分割网络——BFSNet。以百山祖国家公园为试验区,利用无人机拍摄周边毛竹林图像构建数据集。为增强模型的特征提取能力,提出跨领域适应模块以有效利用源模型的强特征提取能力,并结合自主学习提取适用于毛竹林分割任务的特征,利用两者的优势进行互补。为提高模型对于不同形状毛竹林的识别和定位能力,结合可变形卷积的偏移量引导模块,引入可学习的偏移量参数,以适应不同形状的毛竹林目标。将BFSNet在DeepGlobe Land Cover Classification Challenge和自制数据集上进行模型训练和测试,并与多种主流图像分割方法进行对比。结果表明:BFSNet在交并比、Dice系数、精确率和召回率4项指标上均取得了最优的性能表现,分别获得了76.04%和71.93%的交并比。与多种主流的图像分割模型相比,BFSNet在毛竹林的分割效果方面表现最为出色,对毛竹林形状的精确建模能力能够有效地应对不同形态的毛竹林。 展开更多
关键词 毛竹林分割 跨领域适应 偏移量引导 可变形卷积
在线阅读 下载PDF
基于在线知识蒸馏与伪特征模拟的跨模态融合遥感图像建筑提取方法
17
作者 黄子恒 芮杰 +2 位作者 金飞 王淑香 林雨准 《地球信息科学学报》 北大核心 2026年第2期451-469,共19页
【目的】针对光学与SAR影像在实际应用中因数据缺失导致融合模型性能下降的问题,本研究旨在实现一种仅需单一模态输入即可获得接近双模态融合模型性能的轻量化建筑提取方法。【方法】提出了一种基于在线知识蒸馏的跨模态融合框架,其核... 【目的】针对光学与SAR影像在实际应用中因数据缺失导致融合模型性能下降的问题,本研究旨在实现一种仅需单一模态输入即可获得接近双模态融合模型性能的轻量化建筑提取方法。【方法】提出了一种基于在线知识蒸馏的跨模态融合框架,其核心包含一个双模态输入的教师网络和一个单模态输入的学生网络。框架的主要特点在于:在教师网络中设计了自适应门控注意力机制(AGAFM)以实现光学与SAR特征的有效互补;在学生网络中引入了伪特征生成模块(LDAF/ESAR)以模拟缺失模态的信息;并构建了特征层与输出层的多层次知识蒸馏损失,以迫使学生网络学习教师网络的融合表征能力。针对建筑几何特性,在网络中引入了可变形卷积(DCM)和边界感知增强模块(MAC-BEM)。【结果】在DDHRNet_DATA数据集的山东省和韩国浦项市子数据集上的实验表明:在SAR模态缺失时,学生网络的IoU分别达到83.68%和77.24%,相较于次优算法分别提升了3.06%和2.66%;在光学模态缺失时,学生网络的IoU分别达到77.78%和77.20%,相较于次优算法分别提升了4.01%和1.31%,性能显著优于SegNet、Deeplabv3、Deeplabv3+、UNetFormer、MFFDeeplabV3+、SC_Deep等单模态对比模型,消融实验验证了各核心模块的有效性。【结论】本文方法有效解决了测试阶段模态缺失的实用化瓶颈,为多模态遥感建筑提取技术的实际部署提供了可靠、高效的解决方案。 展开更多
关键词 知识蒸馏 跨模态融合 缺失模态 建筑提取 语义分割 注意力机制 特征模拟 可变形卷积
原文传递
轻量且高精度的飞行器关键点检测改进网络GMD-YOLO
18
作者 刘鹏飞 孙世岩 +1 位作者 李池 张瑜 《海军工程大学学报》 北大核心 2026年第1期76-84,共9页
针对空中飞行器关键点检测中存在的实时性要求高、低分辨率、多尺度分布及部分遮挡等挑战,本文提出了一种基于YOLOv11n-pose架构的轻量化高精度检测算法GMD-YOLO。首先,设计了双门控融合网络,通过中值增强通道注意力与动态门控瓶颈卷积... 针对空中飞行器关键点检测中存在的实时性要求高、低分辨率、多尺度分布及部分遮挡等挑战,本文提出了一种基于YOLOv11n-pose架构的轻量化高精度检测算法GMD-YOLO。首先,设计了双门控融合网络,通过中值增强通道注意力与动态门控瓶颈卷积双分支协同机制,增强复杂光照下的特征鲁棒性;其次,构建轻量动态特征融合模块,采用双阶段注意力实现跨层特征自适应加权,缓解多尺度目标错位问题;再次,引入可变形卷积增强的C2PSA模块,通过动态采样网格提升形变关键点建模能力;最后,提出自适应图卷积姿态头,显式编码关键点间刚体约束以优化空间一致性。在自建的飞行器仿真数据集上的实验结果表明:GMD-YOLO仅以3.50 MB参数量实现91.9%均值平均精度P_(mA)@0.5与81.7%的P_(mA)@0.5∶0.95,较基准模型分别提升了6.0%与5.3%,在复杂场景下展现出显著精度优势与工程应用潜力。 展开更多
关键词 关键点检测 固定翼飞行器 YOLOv11 可变形卷积 图卷积网络
在线阅读 下载PDF
BurdenNet:先验信息导引的复杂环境下高炉多态料面目标检测网络
19
作者 倪梓明 陈先中 +1 位作者 侯庆文 张洁 《工程科学学报》 北大核心 2026年第1期26-38,共13页
传统的单一状态料面目标检测网络未能考虑高炉冶炼状态的交替变化,在复杂环境下整体准确度较低,针对上述问题,本文提出一种先验信息导引的多态料面目标检测网络BurdenNet.首先,提出基于原始信号距离向精度的图像预分类方法,构建三类典... 传统的单一状态料面目标检测网络未能考虑高炉冶炼状态的交替变化,在复杂环境下整体准确度较低,针对上述问题,本文提出一种先验信息导引的多态料面目标检测网络BurdenNet.首先,提出基于原始信号距离向精度的图像预分类方法,构建三类典型状态的料面图像数据集,并以预分类的状态为先验信息对网络通路进行剪枝.其次,将料面细长低曲率的形状特征与雷达采样信号的稀疏性质作为先验信息,提出空洞垂直偏移卷积(Atrous vertical deformable convolution,AVDC)模块提取多态料面特征.在此基础上,利用机械探尺数据构建先验空间注意力特征图,提出先验聚焦注意力(Prior focusing attention,PFA)模块,使网络优先聚焦于图像中的料面区域.最后对于边界框的回归,提出条带交并比(Band intersection over union,BIOU)损失函数进一步提升目标检测的速度与准确性.在钢铁公司高炉的实测数据上进行实验,结果表明,本文的BurdenNet相较于单一状态目标检测网络,在多态料面数据集上整体精确率提升了13.9%与5.2%,综合性能(F1-Score)提升了8.1%与4.3%,为复杂环境下多态料面图像的目标检测提供更准确的方法. 展开更多
关键词 多态料面 先验信息 空洞垂直偏移卷积 先验聚焦注意力 网络剪枝
在线阅读 下载PDF
基于改进YOLO11的生活垃圾检测模型
20
作者 任梦晗 赵海燕 宋佳智 《电子测量技术》 北大核心 2026年第1期247-256,共10页
随着城市化进程的加快,生活垃圾量的持续攀升对生态环境形成严峻挑战,因此基于目标检测的智能分拣技术成为关键解决方案。针对现有检测模型在复杂场景下精度不足和部署效率低的问题,提出一种改进的YOLO11生活垃圾检测模型。通过引入可... 随着城市化进程的加快,生活垃圾量的持续攀升对生态环境形成严峻挑战,因此基于目标检测的智能分拣技术成为关键解决方案。针对现有检测模型在复杂场景下精度不足和部署效率低的问题,提出一种改进的YOLO11生活垃圾检测模型。通过引入可变形卷积和自主设计的三分支坐标注意力机制,构建了增强型可变形卷积模块,并用其重构骨干网络中的C3k2,显著提升了模型对复杂背景中目标的特征提取能力。此外,采用内容感知特征重组算子替代颈部网络中的上采样,增强特征重建效果。引入指数移动平均滑动损失函数,有效提升检测精度并加速模型收敛。在优化后的华为云生活垃圾数据集上进行的实验表明,改进模型在mAP@0.5和mAP@0.5:0.95指标上分别达到76.5%和64.6%,较基线模型提升1.8%和1.7%。相比其他主流检测算法,改进模型参数量仅为2.8 M,更适合移动端部署。 展开更多
关键词 YOLO11 可变形卷积 注意力机制 生活垃圾 目标检测
原文传递
上一页 1 2 34 下一页 到第
使用帮助 返回顶部