期刊文献+
共找到695篇文章
< 1 2 35 >
每页显示 20 50 100
A Remote Sensing Image Semantic Segmentation Method by Combining Deformable Convolution with Conditional Random Fields 被引量:13
1
作者 Zongcheng ZUO Wen ZHANG Dongying ZHANG 《Journal of Geodesy and Geoinformation Science》 2020年第3期39-49,共11页
Currently,deep convolutional neural networks have made great progress in the field of semantic segmentation.Because of the fixed convolution kernel geometry,standard convolution neural networks have been limited the a... Currently,deep convolutional neural networks have made great progress in the field of semantic segmentation.Because of the fixed convolution kernel geometry,standard convolution neural networks have been limited the ability to simulate geometric transformations.Therefore,a deformable convolution is introduced to enhance the adaptability of convolutional networks to spatial transformation.Considering that the deep convolutional neural networks cannot adequately segment the local objects at the output layer due to using the pooling layers in neural network architecture.To overcome this shortcoming,the rough prediction segmentation results of the neural network output layer will be processed by fully connected conditional random fields to improve the ability of image segmentation.The proposed method can easily be trained by end-to-end using standard backpropagation algorithms.Finally,the proposed method is tested on the ISPRS dataset.The results show that the proposed method can effectively overcome the influence of the complex structure of the segmentation object and obtain state-of-the-art accuracy on the ISPRS Vaihingen 2D semantic labeling dataset. 展开更多
关键词 high-resolution remote sensing image semantic segmentation deformable convolution network conditions random fields
在线阅读 下载PDF
Multi-Layer Feature Extraction with Deformable Convolution for Fabric Defect Detection 被引量:1
2
作者 Jielin Jiang Chao Cui +1 位作者 Xiaolong Xu Yan Cui 《Intelligent Automation & Soft Computing》 2024年第4期725-744,共20页
In the textile industry,the presence of defects on the surface of fabric is an essential factor in determining fabric quality.Therefore,identifying fabric defects forms a crucial part of the fabric production process.... In the textile industry,the presence of defects on the surface of fabric is an essential factor in determining fabric quality.Therefore,identifying fabric defects forms a crucial part of the fabric production process.Traditional fabric defect detection algorithms can only detect specific materials and specific fabric defect types;in addition,their detection efficiency is low,and their detection results are relatively poor.Deep learning-based methods have many advantages in the field of fabric defect detection,however,such methods are less effective in identifying multiscale fabric defects and defects with complex shapes.Therefore,we propose an effective algorithm,namely multilayer feature extraction combined with deformable convolution(MFDC),for fabric defect detection.In MFDC,multi-layer feature extraction is used to fuse the underlying location features with high-level classification features through a horizontally connected top-down architecture to improve the detection of multi-scale fabric defects.On this basis,a deformable convolution is added to solve the problem of the algorithm’s weak detection ability of irregularly shaped fabric defects.In this approach,Roi Align and Cascade-RCNN are integrated to enhance the adaptability of the algorithm in materials with complex patterned backgrounds.The experimental results show that the MFDC algorithm can achieve good detection results for both multi-scale fabric defects and defects with complex shapes,at the expense of a small increase in detection time. 展开更多
关键词 Fabric defect detection multi-layer features deformable convolution
在线阅读 下载PDF
A depth-wise separable residual neural network for PCDH8 status prediction in thyroid cancer pathological images
3
作者 Linlin Qi Xiangyu Li +2 位作者 Zhihong Liu Pei Zhang Liangliang Liu 《Intelligent Oncology》 2025年第4期290-298,共9页
Introduction:Accurate prediction of protocadherin 8(PCDH8)gene expression status from whole-slide images(WSIs)is critical for thyroid cancer diagnosis and prognosis,as PCDH8 overexpression is associated with tumor agg... Introduction:Accurate prediction of protocadherin 8(PCDH8)gene expression status from whole-slide images(WSIs)is critical for thyroid cancer diagnosis and prognosis,as PCDH8 overexpression is associated with tumor aggressiveness and poor outcomes.Existing methods for PCDH8 detection are often costly,time-consuming,or require specialized expertise.To address these limitations,we developed a novel depth-wise separable residual neural network(DSRNet)for noninvasive PCDH8 status prediction directly from WSIs.Materials and methods:We collected 403 thyroid cancer WSIs from The Cancer Genome Atlas(TCGA),with PCDH8 expression status classified as high or low based on median expression values.Each WSI was divided into 512×512 pixel tiles,with the top 100 non-white tiles selected per slide.DSRNet integrates depth-wise separable convolutions,residual connections,and a deformable convolutional pyramid pooling module to efficiently capture multiscale and long-range features in gigapixel WSIs.The model was trained using tenfold cross-validation.Results:DSRNet achieved state-of-the-art performance with 92.76%accuracy,91.92%precision,92.69%recall,and 0.93 area under the curve on the thyroid cancer dataset(TCGA-THCA),significantly outperforming leading convolutional neural networks and Transformer models.Ablation studies confirmed the contributions of each component,and attention visualization showed that DSRNet focuses on biologically relevant regions.The model also generalized well to a breast cancer dataset(TCGA-BRCA),achieving 89.13%accuracy.Conclusions:We developed DSRNet,a deep learning-based model for predicting PCDH8 status directly from routine hematoxylin and eosin-stained pathological images.DSRNet combines the efficiency of convolutional operations with enhanced long-range dependency modeling,providing a noninvasive,accurate,and interpretable tool for auxiliary thyroid cancer diagnosis and prognosis.The results demonstrate its strong potential for clinical translation,though further multicenter validation is warranted. 展开更多
关键词 Thyroid cancer Biomarker Whole-slide image depth-wise separable convolution Residual mechanism
在线阅读 下载PDF
CW-HRNet:Constrained Deformable Sampling and Wavelet-Guided Enhancement for Lightweight Crack Segmentation
4
作者 Dewang Ma 《Journal of Electronic Research and Application》 2025年第5期269-280,共12页
This paper presents CW-HRNet,a high-resolution,lightweight crack segmentation network designed to address challenges in complex scenes with slender,deformable,and blurred crack structures.The model incorporates two ke... This paper presents CW-HRNet,a high-resolution,lightweight crack segmentation network designed to address challenges in complex scenes with slender,deformable,and blurred crack structures.The model incorporates two key modules:Constrained Deformable Convolution(CDC),which stabilizes geometric alignment by applying a tanh limiter and learnable scaling factor to the predicted offsets,and the Wavelet Frequency Enhancement Module(WFEM),which decomposes features using Haar wavelets to preserve low-frequency structures while enhancing high-frequency boundaries and textures.Evaluations on the CrackSeg9k benchmark demonstrate CW-HRNet’s superior performance,achieving 82.39%mIoU with only 7.49M parameters and 10.34 GFLOPs,outperforming HrSegNet-B48 by 1.83% in segmentation accuracy with minimal complexity overhead.The model also shows strong cross-dataset generalization,achieving 60.01%mIoU and 66.22%F1 on Asphalt3k without fine-tuning.These results highlight CW-HRNet’s favorable accuracyefficiency trade-off for real-world crack segmentation tasks. 展开更多
关键词 Crack segmentation Lightweight semantic segmentation deformable convolution Wavelet transform Road infrastructure
在线阅读 下载PDF
Attention-Augmented YOLOv8 with Ghost Convolution for Real-Time Vehicle Detection in Intelligent Transportation Systems
5
作者 Syed Sajid Ullah Muhammad Zunair Zamir +1 位作者 Ahsan Ishfaq Salman Khan 《Journal on Artificial Intelligence》 2025年第1期255-274,共20页
Accurate vehicle detection is essential for autonomous driving,traffic monitoring,and intelligent transportation systems.This paper presents an enhanced YOLOv8n model that incorporates the Ghost Module,Convolutional B... Accurate vehicle detection is essential for autonomous driving,traffic monitoring,and intelligent transportation systems.This paper presents an enhanced YOLOv8n model that incorporates the Ghost Module,Convolutional Block Attention Module(CBAM),and Deformable Convolutional Networks v2(DCNv2).The Ghost Module streamlines feature generation to reduce redundancy,CBAM applies channel and spatial attention to improve feature focus,and DCNv2 enables adaptability to geometric variations in vehicle shapes.These components work together to improve both accuracy and computational efficiency.Evaluated on the KITTI dataset,the proposed model achieves 95.4%mAP@0.5—an 8.97% gain over standard YOLOv8n—along with 96.2% precision,93.7% recall,and a 94.93%F1-score.Comparative analysis with seven state-of-the-art detectors demonstrates consistent superiority in key performance metrics.An ablation study is also conducted to quantify the individual and combined contributions of GhostModule,CBAM,and DCNv2,highlighting their effectiveness in improving detection performance.By addressing feature redundancy,attention refinement,and spatial adaptability,the proposed model offers a robust and scalable solution for vehicle detection across diverse traffic scenarios. 展开更多
关键词 YOLOv8n vehicle detection deformable convolutional networks(DCNv2) ghost module convolutional block attention module(CBAM) attention mechanisms
在线阅读 下载PDF
Optical Flow with Learning Feature for Deformable Medical Image Registration 被引量:1
6
作者 Jinrong Hu Lujin Li +3 位作者 Ying Fu Maoyang Zou Jiliu Zhou Shanhui Sun 《Computers, Materials & Continua》 SCIE EI 2022年第5期2773-2788,共16页
Deformable medical image registration plays a vital role in medical image applications,such as placing different temporal images at the same time point or different modality images into the same coordinate system.Vari... Deformable medical image registration plays a vital role in medical image applications,such as placing different temporal images at the same time point or different modality images into the same coordinate system.Various strategies have been developed to satisfy the increasing needs of deformable medical image registration.One popular registration method is estimating the displacement field by computing the optical flow between two images.The motion field(flow field)is computed based on either gray-value or handcrafted descriptors such as the scale-invariant feature transform(SIFT).These methods assume that illumination is constant between images.However,medical images may not always satisfy this assumption.In this study,we propose a metric learning-based motion estimation method called Siamese Flow for deformable medical image registration.We train metric learners using a Siamese network,which produces an image patch descriptor that guarantees a smaller feature distance in two similar anatomical structures and a larger feature distance in two dissimilar anatomical structures.In the proposed registration framework,the flow field is computed based on such features and is close to the real deformation field due to the excellent feature representation ability of the Siamese network.Experimental results demonstrate that the proposed method outperforms the Demons,SIFT Flow,Elastix,and VoxelMorph networks regarding registration accuracy and robustness,particularly with large deformations. 展开更多
关键词 deformation registration feature extraction optical flow convolutional neural network
在线阅读 下载PDF
A Deformable Network with Attention Mechanism for Retinal Vessel Segmentation
7
作者 Xiaolong Zhu Wenjian Li +2 位作者 Weihang Zhang Dongwei Li Huiqi Li 《Journal of Beijing Institute of Technology》 EI CAS 2024年第3期186-193,共8页
The intensive application of deep learning in medical image processing has facilitated the advancement of automatic retinal vessel segmentation research.To overcome the limitation that traditional U-shaped vessel segm... The intensive application of deep learning in medical image processing has facilitated the advancement of automatic retinal vessel segmentation research.To overcome the limitation that traditional U-shaped vessel segmentation networks fail to extract features in fundus image sufficiently,we propose a novel network(DSeU-net)based on deformable convolution and squeeze excitation residual module.The deformable convolution is utilized to dynamically adjust the receptive field for the feature extraction of retinal vessel.And the squeeze excitation residual module is used to scale the weights of the low-level features so that the network learns the complex relationships of the different feature layers efficiently.We validate the DSeU-net on three public retinal vessel segmentation datasets including DRIVE,CHASEDB1,and STARE,and the experimental results demonstrate the satisfactory segmentation performance of the network. 展开更多
关键词 retinal vessel segmentation deformable convolution attention mechanism deep learning
暂未订购
DSD-MatchingNet:Deformable sparse-to-dense feature matching for learning accurate correspondences
8
作者 Yicheng ZHAO Han ZHANG +3 位作者 Ping LU Ping LI Enhua WU Bin SHENG 《Virtual Reality & Intelligent Hardware》 2022年第5期432-443,共12页
Background Exploring correspondences across multiview images is the basis of various computer vision tasks.However,most existing methods have limited accuracy under challenging conditions.Method To learn more robust a... Background Exploring correspondences across multiview images is the basis of various computer vision tasks.However,most existing methods have limited accuracy under challenging conditions.Method To learn more robust and accurate correspondences,we propose DSD-MatchingNet for local feature matching in this study.First,we develop a deformable feature extraction module to obtain multilevel feature maps,which harvest contextual information from dynamic receptive fields.The dynamic receptive fields provided by the deformable convolution network ensure that our method obtains dense and robust correspondence.Second,we utilize sparse-to-dense matching with symmetry of correspondence to implement accurate pixel-level matching,which enables our method to produce more accurate correspondences.Result Experiments show that our proposed DSD-MatchingNet achieves a better performance on the image matching benchmark,as well as on the visual localization benchmark.Specifically,our method achieved 91.3%mean matching accuracy on the HPatches dataset and 99.3%visual localization recalls on the Aachen Day-Night dataset. 展开更多
关键词 Image matching deformable convolution network Sparse-to-dense matching
在线阅读 下载PDF
3D Data Scattergram Image Classification Based Protection for Transmission Line Connecting BESS Using Depth-wise Separable Convolution Based CNN 被引量:1
9
作者 Yingyu Liang Yi Ren +1 位作者 Xiaoyang Yang Wenting Zha 《Journal of Modern Power Systems and Clean Energy》 2025年第2期609-621,共13页
The distinctive fault characteristics of battery energy storage stations(BESSs)significantly affect the reliability of conventional protection methods for transmission lines.In this paper,the three-dimensional(3D)data... The distinctive fault characteristics of battery energy storage stations(BESSs)significantly affect the reliability of conventional protection methods for transmission lines.In this paper,the three-dimensional(3D)data scattergrams are constructed using current data from both sides of the transmission line and their sum.Following a comprehensive analysis of the varying characteristics of 3D data scattergrams under different conditions,a 3D data scattergram image classification based protection method is developed.The depth-wise separable convolution is used to ensure a lightweight convolutional neural network(CNN)structure without compromising performance.In addition,a Bayesian hyperparameter optimization algorithm is used to achieve a hyperparametric search to simplify the training process.Compared with artificial neural networks and CNNs,the depth-wise separable convolution based CNN(DPCNN)achieves a higher recognition accuracy.The 3D data scattergram image classification based protection method using DPCNN can accurately separate internal faults from other disturbances and identify fault phases under different operating states and fault conditions.The proposed protection method also shows first-class tolerability against current transformer(CT)saturation and CT measurement errors. 展开更多
关键词 convolutional neural network(CNN) battery energy storage station(BESS) depth-wise separable convolution hyperparameter optimization fault classification line protection
原文传递
Validation Research on the Application of Depthwise Separable Convolutional Al Facial Expression Recognition in Non-pharmacological Treatment of BPSD
10
作者 Xiangyu Liu 《Journal of Clinical and Nursing Research》 2021年第4期31-37,共7页
One of the most obvious clinical reasons of dementia or The Behavioral and Psychological Symptoms of Dementia(BPSD)are the lack of emotional expression,the increased frequency of negative emotions,and the impermanence... One of the most obvious clinical reasons of dementia or The Behavioral and Psychological Symptoms of Dementia(BPSD)are the lack of emotional expression,the increased frequency of negative emotions,and the impermanence of emotions.Observing the reduction of BPSD in dementia through emotions can be considered effective and widely used in the field of non-pharmacological therapy.At present,this article will verify whether the image recognition artificial intelligence(AI)system can correctly reflect the emotional performance of the elderly with dementia through a questionnaire survey of three professional elderly nursing staff.The ANOVA(sig.=0.50)is used to determine that the judgment given by the nursing staff has no obvious deviation,and then Kendall's test(0.722**)and spearman's test(0.863**)are used to verify the judgment severity of the emotion recognition system and the nursing staff unanimously.This implies the usability of the tool.Additionally,it can be expected to be further applied in the research related to BPSD elderly emotion detection. 展开更多
关键词 depth-wise separable convolution EMOTION BPSD DEMENTIA Nursing
暂未订购
Pore network modeling of gas-water two-phase flow in deformed multi-scale fracture-porous media
11
作者 Dai-Gang Wang Yu-Shan Ma +6 位作者 Zhe Hu Tong Wu Ji-Rui Hou Zhen-Chang Jiang Xin-Xuan Qi Kao-Ping Song Fang-zhou Liu 《Petroleum Science》 2025年第5期2096-2108,共13页
Two actual rocks drilled from a typical ultra-deep hydrocarbon reservoir in the Tarim Basin are selected to conduct in-situ stress-loading micro-focus CT scanning experiments.The gray images of rock microstructure at ... Two actual rocks drilled from a typical ultra-deep hydrocarbon reservoir in the Tarim Basin are selected to conduct in-situ stress-loading micro-focus CT scanning experiments.The gray images of rock microstructure at different stress loading stages are obtained.The U-Net fully convolutional neural network is utilized to achieve fine semantic segmentation of rock skeleton,pore space,and microfractures based on CT slice images of deep rocks.The three-dimensional digital rock models of deformed multiscale fractured-porous media at different stress loading stages are thereafter reconstructed,and the equivalent fracture-pore network models are finally extracted to explore the underlying mechanisms of gas-water two-phase flow at the pore-scale.Results indicate that,in the process of insitu stress loading,both the deep rocks have experienced three stages:linear elastic deformation,nonlinear plastic deformation,and shear failure.The micro-mechanical behavior greatly affects the dynamic deformation of rock microstructure and gas-water two-phase flow.In the linear elastic deformation stage,with the increase in in-situ stress,both the deep rocks are gradually compacted,leading to decreases in average pore radius,pore throat ratio,tortuosity,and water-phase relative permeability,while the coordination number nearly remains unchanged.In the plastic deformation stage,the synergistic influence of rock compaction and existence of micro-fractures typically exert a great effect on pore-throat topological properties and gas-water relative permeability.In the shear failure stage,due to the generation and propagation of micro-fractures inside the deep rock,the topological connectivity becomes better,fluid flow paths increase,and flow conductivity is promoted,thus leading to sharp increases in average pore radius and coordination number,rapid decreases in pore throat ratio and tortuosity,as well as remarkable improvement in relative permeability of gas phase and waterphase. 展开更多
关键词 Ultra-deep reservoir In-situ stress loading U-Netfully convolutional neural network CTscanning Microstructure deformation Pore-scalefluid flow
原文传递
基于多模态语义信息的文本生成图像方法
12
作者 杨冰 周家辉 +1 位作者 姚金良 向学勤 《浙江大学学报(工学版)》 北大核心 2026年第2期360-369,共10页
针对文本语义与图像语义不一致以及图像细节表现不足的问题,提出新的文本生成图像方法.基于多模态语义信息建立鉴别依据,在文本语义基础上引入真实图像语义,以解决文本描述信息密度低的问题,有效缓解生成图像细节缺失或失真的现象.在生... 针对文本语义与图像语义不一致以及图像细节表现不足的问题,提出新的文本生成图像方法.基于多模态语义信息建立鉴别依据,在文本语义基础上引入真实图像语义,以解决文本描述信息密度低的问题,有效缓解生成图像细节缺失或失真的现象.在生成器中集成可变形卷积和星模块卷积,增强生成器表达能力,提高生成图像的细节表现和整体质量.为了验证所提方法的有效性,在CUB数据集和COCO数据集上进行模型训练及评估.与生成式对抗对比语言-图像预训练模型(GALIP)相比,所提方法在保证高效生成的同时,在细节表现、语义一致性及整体质量上具有显著优势. 展开更多
关键词 文本生成图像 多模态语义 可变形卷积 星模块卷积 语义对齐鉴别器
在线阅读 下载PDF
面向面部动作单元的自适应图注意力微表情检测网络
13
作者 马飞 安佳祺 +1 位作者 杨飞霞 徐光宪 《计算机科学与探索》 北大核心 2026年第4期1193-1206,共14页
微表情检测旨在视频中定位幅度微弱、时间短暂的表情区间。其难点在于有效提取面部区域间的动态关联特征和多尺度时序特征,进而精准捕捉面部各区域微小动作之间的关联。针对这些问题,提出了一种融合自适应图注意力和多尺度可变空洞卷积... 微表情检测旨在视频中定位幅度微弱、时间短暂的表情区间。其难点在于有效提取面部区域间的动态关联特征和多尺度时序特征,进而精准捕捉面部各区域微小动作之间的关联。针对这些问题,提出了一种融合自适应图注意力和多尺度可变空洞卷积的微表情检测网络(AG-DDNet)。通过引入参数可学习矩阵来实现键值对的特征变换,通过计算面部区域特征向量间的相似度得到动态邻接矩阵,并结合图注意力机制计算区域间权重系数,实现特征的动态融合;采用了多尺度可变空洞卷积模块,通过自适应池化与卷积组合的预测器生成动态感受野,从而实现多尺度的特征提取;引入基于Fisher信息矩阵的自然梯度优化机制,通过Fisher Adam优化器有效捕捉参数空间的几何结构信息,实现学习率的精确自适应调整,从而显著增强了模型对微表情和宏表情的协同检测能力。在微表情检测任务中,该算法与同类代表性算法相比,在CAS(ME)2数据集和SAMM Long Videos数据集上的性能分别提升了54.20%和20.11%。与最新算法相比,两个数据集上的提升幅度分别为38.43%和6.81%,有效证明了该方法在长视频微表情检测任务上的优越性能。 展开更多
关键词 微表情检测 自适应图注意力 多尺度可变空洞卷积 面部动作单元 长视频分析
在线阅读 下载PDF
基于可变形卷积和注意力机制的路面裂缝检测
14
作者 谢永华 方育才 彭银佳 《计算机工程与设计》 北大核心 2026年第1期279-285,共7页
为解决路面裂缝检测中图像边缘特征难以学习和背景噪声干扰的问题,提出一个基于可变形卷积和注意力机制的可端到端训练的路面裂缝检测网络。该网络基于U-Net结构设计,在特征融合部分添加边缘感知模块来增强裂缝边缘的检测能力;在编码器... 为解决路面裂缝检测中图像边缘特征难以学习和背景噪声干扰的问题,提出一个基于可变形卷积和注意力机制的可端到端训练的路面裂缝检测网络。该网络基于U-Net结构设计,在特征融合部分添加边缘感知模块来增强裂缝边缘的检测能力;在编码器部分使用空洞残差模块扩大感受野并保留更多细节信息;在解码器部分添加注意力机制提高对裂缝特征的关注度,抑制背景噪声。实验结果表明,该网络在MPA、mIoU和F1值这3项指标上均优于其它对比网络,验证了该网络的有效性。 展开更多
关键词 裂缝检测 语义分割 编码解码 可变形卷积 空洞卷积 残差连接 注意力机制
在线阅读 下载PDF
基于改进YOLOv7的遥感图像目标检测方法
15
作者 陈辉 田博 +2 位作者 赵永红 瞿海平 梁建虎 《兰州理工大学学报》 北大核心 2026年第1期93-100,共8页
为了解决遥感图像中小目标规模大、目标分布密集以及容易产生漏检和误检等问题,提出了一种基于改进YOLOv7模型的遥感图像目标检测方法.该方法首先在YOLOv7模型中引入DCNv2结构和残差结构,重新构建了新的骨干网络,以增强目标浅层特征信... 为了解决遥感图像中小目标规模大、目标分布密集以及容易产生漏检和误检等问题,提出了一种基于改进YOLOv7模型的遥感图像目标检测方法.该方法首先在YOLOv7模型中引入DCNv2结构和残差结构,重新构建了新的骨干网络,以增强目标浅层特征信息的提取,并提高网络的准确性.其次,在颈部网络中采用新的特征融合模块,并通过SimAM注意力机制,自适应调节浅层特征的纹理信息和深层语义信息的融合权重,更有针对性地抑制提取浅层特征时带来的噪声.最后,采用归一化高斯瓦瑟斯坦距离损失作为模型的回归损失函数,取代传统的IOU,以提高多尺度目标的检测能力.该算法在DOTAv1.0数据集上小目标平均精度达到20.1%,在DIOR数据集上小目标平均精度达到29.0%.同时,与YOLOv7、YOLOv6等方法相比,该算法展现出了较强的竞争力. 展开更多
关键词 遥感图像 目标检测 可变形卷积网络 SimAM注意力机制 高斯瓦瑟斯坦距离
在线阅读 下载PDF
基于MTF-DSGT的复杂支路串联故障电弧检测
16
作者 余琼芳 谭文新 +1 位作者 吴琼 张宇海 《兵器装备工程学报》 北大核心 2026年第3期235-242,291,共9页
针对低压交流配电系统中复杂支路串联电弧故障检测困难、易引发电气火灾的挑战,提出了基于马尔可夫变迁场与可变形自引导Transformer(Markov transition field and deformable convolutional self-guided transformer,MTF-DSGT)的检测... 针对低压交流配电系统中复杂支路串联电弧故障检测困难、易引发电气火灾的挑战,提出了基于马尔可夫变迁场与可变形自引导Transformer(Markov transition field and deformable convolutional self-guided transformer,MTF-DSGT)的检测方案。利用马尔可夫变迁场将一维电流信号转换为图像,融合可变形卷积网络(deformable convolutional network,DCN)提取局部特征及自引导Transformer捕捉全局信息,以提高故障识别精度。实验结果显示,该方案在复杂支路电路中检测准确率达99.88%,在Jetson Orin Nano平台测试耗时仅7.78 ms。该方案能高效辨识串联电弧故障,具备实时处理能力,适合边缘设备部署。 展开更多
关键词 串联故障电弧 可变形卷积 自引导Transformer 马尔可夫变迁场 复杂支路
在线阅读 下载PDF
基于多尺度与可变形卷积的无人机图像匹配定位
17
作者 武建亮 陈赓 张文博 《无线电工程》 2026年第2期342-349,共8页
在无人机绝对视觉定位任务中,图像匹配技术是实现高精度定位的核心环节。目前的图像匹配算法普遍对多尺度特征的提取能力不足,难以适应视角变化引起的几何形变,因此在处理视觉定位任务中的大尺度差异和视角差异图像时匹配精度较低,进而... 在无人机绝对视觉定位任务中,图像匹配技术是实现高精度定位的核心环节。目前的图像匹配算法普遍对多尺度特征的提取能力不足,难以适应视角变化引起的几何形变,因此在处理视觉定位任务中的大尺度差异和视角差异图像时匹配精度较低,进而影响视觉定位系统的定位精度。针对上述问题,提出了一种融合多尺度特征编码与可变形卷积的图像匹配方法。通过嵌入空洞空间金字塔池化(Atrous Spatial Pyramid Pooling,ASPP)的多尺度特征编码模块,利用不同膨胀率的空洞卷积并行提取多尺度特征,在避免信息丢失的同时增强对大尺度差异图像的表征能力;在描述子解码模块中引入可变形卷积,通过学习采样偏移量自适应调整卷积核位置,提升描述子在视角变换与几何形变场景下的一致性和区分度,并在Hpatches数据集和遥感图像上设计多项实验,以验证所提模型在特定问题上的有效性。 展开更多
关键词 无人机 绝对视觉定位 图像匹配 可变形卷积
在线阅读 下载PDF
基于多注意力机制的脊柱病灶MRI影像识别模型
18
作者 周慧 宋新景 《计算机科学与探索》 北大核心 2026年第1期291-300,共10页
人工检测脊柱病变是一项耗时的工作,并且高度依赖于该领域的专家,因此脊柱病灶的自动识别是非常必要的。然而,因为脊柱病灶的大小、位置和结构存在着广泛的差异,同时脊柱肿瘤与稀有病布鲁氏菌在影像上高度相似,所以脊柱病灶的准确定位... 人工检测脊柱病变是一项耗时的工作,并且高度依赖于该领域的专家,因此脊柱病灶的自动识别是非常必要的。然而,因为脊柱病灶的大小、位置和结构存在着广泛的差异,同时脊柱肿瘤与稀有病布鲁氏菌在影像上高度相似,所以脊柱病灶的准确定位和分类是一项具有挑战性的工作。为了应对这些挑战,提出了一种改进的脊柱病灶MRI影像识别模型。引入以ResNet-101为基础的双向特征金字塔主干网络,利用可变卷积在不同层替代传统的卷积神经网络,从特征层中获得更多的特征信息。在不同的模块中加入了多重注意力,包括自注意力机制和柔性注意力机制,有效地融合特征中贡献较大的部分。为了克服脊柱肿瘤、感染性病变、稀有病布鲁氏菌的数据不平衡问题,引入了改进的平衡交叉熵损失函数。在大连某医院提供的临床数据集上进行验证,识别精确率达到了94.2%,识别召回率达到90.8%。与其他识别模型进行对比实验,结果说明了该方法相对于其他模型识别性能更好。 展开更多
关键词 脊柱病灶识别 双向特征金字塔 多注意力机制 可变卷积 多特征融合
在线阅读 下载PDF
基于改进YOLOv8n的钢材表面缺陷检测算法
19
作者 陈辉 李淑婷 《山东理工大学学报(自然科学版)》 2026年第4期34-42,共9页
针对现有钢材表面缺陷检测方法存在特征提取能力不足,对形状不规则、尺度变化大等复杂缺陷检测精度低的问题,提出一种改进YOLOv8n的钢材表面缺陷检测算法YOLOv8n-OMD。首先,在主干网络利用在线重参数化卷积OREPA将复杂的多卷积层重参数... 针对现有钢材表面缺陷检测方法存在特征提取能力不足,对形状不规则、尺度变化大等复杂缺陷检测精度低的问题,提出一种改进YOLOv8n的钢材表面缺陷检测算法YOLOv8n-OMD。首先,在主干网络利用在线重参数化卷积OREPA将复杂的多卷积层重参数化为单卷积层,在保持特征提取能力的同时具有较低的计算成本;其次,设计中值增强通道空间注意力机制MECS并添加到主干网络末端,从而增强对重要特征的提取,提高检测准确性;最后,结合可变形卷积DCNv4构建C2f_DCNv4模块并引入到颈部网络,以增加有效感受野和有效位置的采样,更准确地捕获复杂形状特征的详细信息。实验结果表明:YOLOv8n-OMD算法在钢材表面缺陷数据集NEU-DET上,mAP@0.5达到82.2%,mAP@0.5∶0.95达到48.8%,计算量为7.1×10^(9);较基准算法YOLOv8n,mAP@0.5和mAP@0.5∶0.95分别提高了3.3%和1.5%,计算量下降了12.3%,证明了YOLOv8n-OMD算法对钢材表面缺陷检测的有效性和实用性。 展开更多
关键词 缺陷检测 YOLOv8n OREPA 可变形卷积 注意力机制
在线阅读 下载PDF
基于改进堆叠沙漏网络的人体姿态估计
20
作者 吕超 马歌谣 《计算机应用研究》 北大核心 2026年第3期948-953,共6页
针对堆叠沙漏网络在复杂场景人体姿态估计任务中存在注意力机制缺失导致语义感知能力弱、固定卷积采样模式导致几何建模能力差、结构冗余影响推理效率问题,设计了一种轻量高效的改进网络架构。该网络采用空间-通道双路径协同注意力模块... 针对堆叠沙漏网络在复杂场景人体姿态估计任务中存在注意力机制缺失导致语义感知能力弱、固定卷积采样模式导致几何建模能力差、结构冗余影响推理效率问题,设计了一种轻量高效的改进网络架构。该网络采用空间-通道双路径协同注意力模块,从空间维度增强关键点感知、抑制背景干扰,同时在通道维度筛选高语义特征,实现多维特征优化;引入多态线性可变形卷积瓶颈模块,通过异构初始采样形状提升对复杂姿态结构的几何建模能力;构建ELA-PCCW沙漏模块,在保持特征完整性的同时显著降低模型复杂度以提升模型推理效率。在MPII与COCO2017两个主流数据集上进行性能评估,结果显示,所提方法在MPII数据集上PCKh@0.5提高2.3个百分点,参数量和计算量分别减少9.1M和6 GFLOPs,在精度与复杂度之间形成良好平衡。对比实验和可视化分析进一步验证了该方法在多种复杂场景人体姿态估计任务中的优越性。 展开更多
关键词 人体姿态估计 堆叠沙漏网络 轻量化模型 注意力机制 线性可变形卷积 几何建模能力 特征融合 模型推理效率
在线阅读 下载PDF
上一页 1 2 35 下一页 到第
使用帮助 返回顶部