1,025 articles found
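Medical Named Entity Recognition Combining Adversarial Training and IDCNN
1
Authors: 陈雪松, 李洋洋, 王浩畅 《计算机与现代化》 2026, No. 1, pp. 53-59, 100 (8 pages)
In the medical domain, traditional named entity recognition (NER) models cannot extract global and local features at the same time. To address this, the paper proposes an NER model that combines global and local features for medical NER. First, the pre-trained language model Chinese-BERT-wwm-ext produces initial vector representations of the input text. Second, perturbations are added to these initial representations to generate adversarial examples, which improves the model's robustness. Third, the initial representations and the adversarial examples are fed together into the feature extraction layer, which combines an iterated dilated convolutional neural network (IDCNN) and a bidirectional long short-term memory network (BiLSTM) to capture the local and global features of the text, respectively; a self-attention mechanism then fuses the extracted feature vectors so that information at every level is fully used. Finally, a CRF generates the predicted sequence. By combining the feature fusion module with the adversarial training module, the model achieves a precision of 66.31%, a recall of 68.84%, and an F1 score of 67.55% on named entities in the CMeEE medical text dataset; compared with the baseline models, it shows higher recognition accuracy and is suitable for medical NER.
Keywords: named entity recognition; pre-trained language model; adversarial training; IDCNN; BiLSTM; self-attention mechanism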
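The three scores reported in this abstract can be cross-checked, because F1 is the harmonic mean of precision and recall. A minimal sketch follows; the function name `f1_score` is illustrative and is not taken from the paper.

```python
def f1_score(precision: float, recall: float) -> float:
    """Harmonic mean of precision and recall (both given on the same scale)."""
    if precision + recall == 0:
        return 0.0  # avoid division by zero when both scores are zero
    return 2 * precision * recall / (precision + recall)


# Precision and recall as reported in the abstract, in percent.
f1 = f1_score(66.31, 68.84)
print(round(f1, 2))  # 67.55
```

The result agrees with the F1 value stated in the abstract.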
Multi-Head Attention Enhanced Parallel Dilated Convolution and Residual Learning for Network Traffic Anomaly Detection (Cited by: 1)
2
Authors: Guorong Qi, Jian Mao, Kai Huang, Zhengxian You, Jinliang Lin 《Computers, Materials & Continua》 2025, No. 2, pp. 2159-2176 (18 pages)
Abnormal network traffic, as a frequent security risk, requires a series of techniques to categorize and detect it. Existing network traffic anomaly detection still faces challenges: the inability to fully extract local and global features, and the lack of effective mechanisms to capture complex interactions between features. Additionally, when enlarging the receptive field to obtain deeper feature representations, the reliance on increasing network depth leads to a significant increase in computational resource consumption, affecting the efficiency and performance of detection. To address these issues, this paper first proposes a network traffic anomaly detection model based on parallel dilated convolution and residual learning (Res-PDC). To better explore the interactive relationships between features, the traffic samples are converted into a two-dimensional matrix. A module combining parallel dilated convolutions and residual learning (Res-PDC) is designed to extract local and global features of traffic at different scales. By using Res-PDC modules with different dilation rates, the model can effectively capture spatial features at different scales and explore feature dependencies spanning wider regions without increasing computational resources. Second, to focus on and integrate the information in different feature subspaces, and to further enhance and extract the interactions among the features, multi-head attention is added to Res-PDC, resulting in the final model: multi-head attention enhanced parallel dilated convolution and residual learning (MHA-Res-PDC) for network traffic anomaly detection. Finally, comparisons with other machine learning and deep learning algorithms are conducted on the NSL-KDD and CIC-IDS-2018 datasets. The experimental results demonstrate that the proposed method can effectively improve detection performance.
Keywords: network traffic anomaly detection; multi-head attention; parallel dilated convolution; residual learning
DcNet: Dilated Convolutional Neural Networks for Side-Scan Sonar Image Semantic Segmentation (Cited by: 2)
3
Authors: ZHAO Xiaohong, QIN Rixia, ZHANG Qilei, YU Fei, WANG Qi, HE Bo 《Journal of Ocean University of China》 SCIE CAS CSCD 2021, No. 5, pp. 1089-1096 (8 pages)
In ocean exploration, side-scan sonar (SSS) plays a very important role and can quickly depict seabed topography. Mounting the SSS on an autonomous underwater vehicle (AUV) and performing semantic segmentation of SSS images in real time enables online recognition of submarine geomorphology or targets, which is conducive to submarine detection. However, because of the complexity of the marine environment, various noises in the ocean pollute the sonar image, which also suffers from intensity inhomogeneity. In this paper, we propose a novel neural network architecture named dilated convolutional neural network (DcNet) that can run in real time while addressing the above-mentioned issues and providing accurate semantic segmentation. The proposed architecture is an encoder-decoder network that gradually reduces the spatial dimension of the input image and recovers the details of the target, respectively. The core of our network is a novel block connection named DCblock, which mainly uses dilated convolution and depthwise separable convolution between the encoder and decoder to attain more context while still retaining high accuracy. Furthermore, our proposed method performs super-resolution reconstruction to enlarge the dataset with high-quality images. We compared our network with other common semantic segmentation networks on an NVIDIA Jetson TX2 using our sonar image datasets. Experimental results show that the inference speed of the proposed network significantly outperforms state-of-the-art architectures while its accuracy remains comparable, which indicates its potential applications not only in AUVs equipped with SSS but also in marine exploration.
Keywords: side-scan sonar (SSS); semantic segmentation; dilated convolutions; super-resolution
Channel-Attention DenseNet with Dilated Convolutions for MRI Brain Tumor Classification
4
Authors: Abdu Salam, Mohammad Abrar, Raja Waseem Anwer, Farhan Amin, Faizan Ullah, Isabel de la Torre, Gerardo Mendez Mezquita, Henry Fabian Gongora 《Computer Modeling in Engineering & Sciences》 2025, No. 11, pp. 2457-2479 (23 pages)
Brain tumors pose significant diagnostic challenges due to their diverse types and complex anatomical locations. The rise of precise image-based diagnostic tools, driven by advances in artificial intelligence (AI) and deep learning, offers the potential to improve diagnostic accuracy, especially with Magnetic Resonance Imaging (MRI). However, traditional state-of-the-art models lack the sensitivity essential for reliable tumor identification and segmentation. Thus, our research aims to enhance brain tumor diagnosis in MRI by proposing an advanced model. The proposed model incorporates dilated convolutions to optimize brain tumor segmentation and classification. It is trained and then evaluated on the BraTS 2020 dataset. Preprocessing consists of normalization, noise reduction, and data augmentation to improve model robustness. The attention mechanism and dilated convolutions are introduced to increase the model's focus on critical regions and capture finer spatial details without compromising image resolution. We performed experiments to measure performance using various metrics, including accuracy, sensitivity, and area under the receiver operating characteristic curve (AUC-ROC). The proposed model achieved an accuracy of 94%, a sensitivity of 93%, a specificity of 92%, and an AUC-ROC of 0.98, outperforming traditional diagnostic models in brain tumor detection. The proposed model accurately identifies tumor regions, while dilated convolutions enhance the segmentation accuracy, especially for complex tumor structures. The proposed model demonstrates significant potential for clinical application, providing reliable and precise brain tumor detection in MRI.
Keywords: artificial intelligence; MRI analysis; deep learning; dilated convolution; DenseNet; brain tumor detection; brain tumor segmentation
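Core Image Edge Detection Based on the DC-HED Network and Skeleton Extraction
5
Authors: 潘少伟, 杨怡婷, 尚娅敏, 郭智, 蔡文斌 《中国石油大学学报(自然科学版)》 PKU Core 2025, No. 3, pp. 97-107 (11 pages)
The holistically-nested edge detection (HED) network is a widely used, well-performing deep network model for image edge detection, but the edges it detects can be missing, redundant, or blurred. To address this, a deep network model, DC-HED, is proposed that combines dilated convolution (DC) with the HED network. First, the pooling layers in the last two layers of the original HED network are removed to better preserve edge information; dilated convolution is then added to enlarge the receptive field and better restore edge details, giving the redesigned DC-HED network. The Zhang-Suen algorithm is then used to extract skeletons from its edge detection results. DC-HED and skeleton extraction are applied to edge detection on cast thin-section images of different cores (core images for short) from oilfield S in northern Shaanxi, China, with good experimental results. The results show that, compared with methods in the existing literature, the traditional Canny operator, the traditional Sobel operator, and the original HED network, the edges detected by DC-HED are more complete and better connected; the mean squared error, structural similarity, and peak signal-to-noise ratio obtained by DC-HED in testing are 0.1106, 0.7997, and 9.5611, respectively, a considerable improvement over the other methods. Finally, applying image skeleton extraction to the detected edges removes cluttered edge information and yields clear, continuous centerline contours of the image edges.
Keywords: core cast thin-section images; edge detection; core digitization; HED network; dilated convolution; skeleton extraction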
An improved deep dilated convolutional neural network for seismic facies interpretation (Cited by: 1)
6
Authors: Na-Xia Yang, Guo-Fa Li, Ting-Hui Li, Dong-Feng Zhao, Wei-Wei Gu 《Petroleum Science》 SCIE EI CAS CSCD 2024, No. 3, pp. 1569-1583 (15 pages)
With the successful application and breakthrough of deep learning technology in image segmentation, there has been continuous development in the field of seismic facies interpretation using convolutional neural networks. These intelligent and automated methods significantly reduce manual labor, particularly in the laborious task of manually labeling seismic facies. However, the extensive demand for training data limits their wider application. To overcome this challenge, we adopt the UNet architecture as the foundational network structure for seismic facies classification, which has demonstrated effective segmentation results even with small-sample training data. Additionally, we integrate spatial pyramid pooling and dilated convolution modules into the network architecture to enhance the perception of spatial information across a broader range. The seismic facies classification test on the public data from the F3 block verifies the superior performance of our proposed improved network structure in delineating seismic facies boundaries. Comparative analysis against the traditional UNet model reveals that our method achieves more accurate predictive classification results, as evidenced by various evaluation metrics for image segmentation. Notably, the classification accuracy reaches 96%. Furthermore, the results of seismic facies classification in the seismic slice dimension provide further confirmation of the superior performance of our proposed method, which accurately defines the range of different seismic facies. This approach holds significant potential for analyzing geological patterns and extracting valuable depositional information.
Keywords: seismic facies interpretation; dilated convolution; spatial pyramid pooling; internal feature maps; compound loss function
Long Text Classification Algorithm Using a Hybrid Model of Bidirectional Encoder Representation from Transformers-Hierarchical Attention Networks-Dilated Convolutions Network (Cited by: 1)
7
Authors: ZHAO Yuanyuan, GAO Shining, LIU Yang, GONG Xiaohui 《Journal of Donghua University(English Edition)》 CAS 2021, No. 4, pp. 341-350 (10 pages)
Text accounts for most of the resources on the Internet, which places ever higher demands on the accuracy of text classification. In this manuscript, we therefore first design a hybrid model of bidirectional encoder representation from transformers-hierarchical attention networks-dilated convolutions networks (BERT_HAN_DCN), which is based on the BERT pre-trained model and its superior feature extraction ability. The model draws on the advantages of both the HAN model and the DCN model, which helps it gain abundant semantic information by fusing context semantic features and hierarchical characteristics. Second, the traditional softmax algorithm increases the learning difficulty of samples of the same kind, making it more difficult to distinguish similar features. For this reason, AM-softmax is introduced to replace the traditional softmax. Finally, the fused model is validated on two datasets, where it achieves higher accuracy and F1-score than general single models, such as HAN and DCN, built on the BERT pre-trained model. In addition, the improved AM-softmax network model is superior to the general softmax network model.
Keywords: long text classification; dilated convolution; BERT; fusing context semantic features; hierarchical characteristics; BERT_HAN_DCN; AM-softmax
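Cable Core Temperature Prediction Based on DC-CNN-PE-SSA-Informer (Cited by: 2)
8
Authors: 鲍克勤, 赵欣妍, 刘擘, 王仕博, 郝海斌 《昆明理工大学学报(自然科学版)》 PKU Core 2025, No. 2, pp. 116-125 (10 pages)
Because the core temperature of a cable is hard to measure directly and existing prediction accuracy is insufficient, this paper proposes a hybrid prediction model, DC-CNN-PE-SSA-Informer. The model uses a dilated causal convolutional network (DC-CNN) to strengthen the capture of local features in time-series data and passes the extracted features to the Informer module to capture long-term dependencies; relative positional encoding (PE) is introduced to strengthen the Informer's ability to capture relative position information in the time series; finally, the sparrow search algorithm (SSA) optimizes the parameters. Finite element analysis of the cable temperature field is used to compute core temperatures under different conditions, which serve as sample data for the simulation experiments. The simulation results show that DC-CNN-PE-SSA-Informer predicts cable core temperature more accurately than common prediction models, providing a basis for choosing operating modes in power dispatching.
Keywords: power cable; temperature prediction; dilated causal convolutional network (DC-CNN); Informer; sparrow search algorithm (SSA); positional encoding (PE)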
Multi-Scale Dilated Convolutional Neural Network for Hyperspectral Image Classification
9
Authors: Shanshan Zheng, Wen Liu, Rui Shan, Jingyi Zhao, Guoqian Jiang, Zhi Zhang 《Journal of Harbin Institute of Technology(New Series)》 CAS 2021, No. 4, pp. 25-32 (8 pages)
To address the problem of image information loss, dilated convolution is introduced and a novel multi-scale dilated convolutional neural network (MDCNN) is proposed. Dilated convolution can aggregate multi-scale image information without reducing the resolution. The first layer of the network uses a spectral convolution step to reduce dimensionality. The multi-scale aggregation then extracts multi-scale features by applying dilated convolution and shortcut connections. The extracted features, which represent properties of the data, are fed through Softmax to predict the samples. MDCNN achieves overall accuracies of 99.58% and 99.92% on two public datasets, Indian Pines and Pavia University, respectively. Compared with four other existing models, the results show that MDCNN extracts more discriminative features and achieves higher classification performance.
Keywords: multi-scale aggregation; dilated convolution; hyperspectral image classification (HSIC); shortcut connection
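CADCNet: An Improved Retinal Vessel Segmentation Algorithm (Cited by: 1)
10
Authors: 岳昱超, 王迎美, 秦嘉川 《科学技术与工程》 PKU Core 2025, No. 3, pp. 962-968 (7 pages)
Traditional retinal vessel segmentation methods suffer from mis-segmentation caused by confusion with the optic disc, a lack of continuity in the segmentation results, and imprecise segmentation of fine-detail regions. To address these problems, a UNet-based retinal vessel segmentation algorithm is proposed. The algorithm replaces the traditional square convolution with a fusion of two one-dimensional convolutions (horizontal and vertical) and a two-dimensional square convolution, which improves the representation of the eyeball region; multi-scale branches increase the diversity of the feature space and improve the network's feature learning and representation ability. In addition, to further improve segmentation, multi-layer dilated convolutions are introduced into the deep structure of the autoencoder in place of simple pooling, enlarging the convolution kernel size and the receptive field and fusing multi-scale shallow and deep feature information. The algorithm is evaluated on the public DRIVE and CHASE_DB1 datasets, where it reaches a precision of 0.9568 and 0.9598 and an F1 of 0.8326 and 0.8304, respectively. Compared with the traditional UNet and several recent UNet-based retinal vessel segmentation networks, the algorithm shows some advantage in accuracy, sensitivity, specificity, and F1, which confirms that the proposed model segments vessels precisely.
Keywords: retinal vessel segmentation; consecutive dilated convolution; deep learning; asymmetric convolution; UNet model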
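Rolling Bearing Fault Diagnosis Based on MSDCNN-BiGRU-SVM
11
Authors: 洪乐, 文传博 (advisor) 《上海电机学院学报》 2025, No. 1, pp. 1-6 (6 pages)
To address the insufficient feature extraction of traditional fault diagnosis methods and their low diagnostic accuracy in complex scenarios, a fault diagnosis method is proposed that combines the feature extraction ability of neural networks with the classification performance of the support vector machine (SVM). First, a wide convolution kernel extracts low-frequency information from the features, and a multi-scale dilated convolutional neural network (MSDCNN) performs adaptive feature extraction. Second, a coordinate attention (CA) mechanism adaptively determines the feature weights of different channels, and a bidirectional gated recurrent unit (BiGRU) further extracts temporal features from the vibration signal. Finally, the extracted features are normalized and fed to the SVM classifier, which outputs the fault diagnosis result. Experiments show that, compared with other intelligent diagnosis methods, the method diagnoses faults better under noise interference and varying load.
Keywords: bearing fault diagnosis; support vector machine; multi-scale dilated convolutional neural network; coordinate attention mechanism; bidirectional gated recurrent unit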
Magnetic Resonance Imaging Reconstruction Based on Butterfly Dilated Geometric Distillation
12
Authors: DUO Lin, XU Boyu, REN Yong, YANG Xin 《Journal of Shanghai Jiaotong University (Science)》 2025, No. 3, pp. 590-599 (10 pages)
To improve the reconstruction accuracy of magnetic resonance imaging (MRI), an accurate natural image compressed sensing (CS) reconstruction network is proposed, which combines the advantages of model-based and deep learning-based CS-MRI methods. In theory, enhancing geometric texture details in linear reconstruction is possible. First, the optimization problem is decomposed into two problems: linear approximation and geometric compensation. The data consistency module handles the image linear approximation problem. Since this processing loses texture details, a neural network layer that explicitly combines image and frequency feature representations is proposed, named the butterfly dilated geometric distillation network. The network introduces the idea of the butterfly operation, integrates features of the image domain and the frequency domain, and avoids the loss of texture details that occurs when features are extracted in a single domain. Finally, a channel feature fusion module is designed by combining a channel attention mechanism and dilated convolution. Channel attention makes the final output feature map focus on the more important parts, improving the feature representation ability. The dilated convolution enlarges the receptive field, thereby obtaining denser image feature data. The experimental results show that the peak signal-to-noise ratio of the network is 5.43 dB, 5.24 dB, and 3.89 dB higher than that of the ISTA-Net+, FISTA, and DGDN networks, respectively, on the brain dataset with a Cartesian sampling mask at a CS ratio of 10%.
Keywords: butterfly geometric distillation; dilation convolution; channel attention; image reconstruction
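Student Performance Prediction and Analysis Method Based on Dilated Causal Convolution
13
Authors: 赖英旭, 张亚薇, 庄俊玺, 刘静 《北京工业大学学报》 PKU Core 2026, No. 3, pp. 252-267 (16 pages)
Using recurrent neural networks to extract features from long sequences of student behavior data suffers from vanishing or exploding gradients and a limited ability to extract long-term dependencies, and deep learning models lack interpretability. To address these problems, a grade prediction and analysis method based on dilated causal convolution (DCC) for long-sequence data is proposed. First, a generative adversarial network (GAN) generates new samples that follow the distribution of the original behavior data of minority-class students, and the new samples are added to the student dataset to balance it. Then, a DCC-based grade prediction model is proposed, in which combining DCC with a gated recurrent unit (GRU) improves the model's ability to extract dependencies in long-sequence data. Finally, the Shapley additive explanations (SHAP) method, together with the three-factor theory, is used to analyze and explain the importance of the factors that affect student grades. Experiments on a public dataset show that the proposed method improves the weighted F1 score by about 6 percentage points over the baseline methods in the grade prediction task, and further verify the effectiveness of the key modules and the model's generalization ability. In addition, comparing the learning characteristics of excellent and at-risk students shows that good study habits, initiative in classroom learning, and different behavioral environments have an important influence on student grades.
Keywords: student performance prediction; dilated causal convolution (DCC); imbalanced data; generative adversarial network (GAN); Shapley additive explanations (SHAP) method; analysis of factors affecting grades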
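To make the central operation of this abstract concrete, here is a minimal pure-Python sketch of a one-dimensional dilated causal convolution. It illustrates the general technique only, not the authors' model (which combines DCC with a GRU); the function name, the zero-padding choice, and the toy input are all assumptions.

```python
from typing import List, Sequence


def dilated_causal_conv1d(
    x: Sequence[float], weights: Sequence[float], dilation: int = 1
) -> List[float]:
    """Compute y[t] = sum_k weights[k] * x[t - k * dilation].

    Samples before the start of the sequence are treated as zero, so the
    output at time t never depends on inputs later than t (causality).
    """
    out = []
    for t in range(len(x)):
        acc = 0.0
        for k, w in enumerate(weights):
            idx = t - k * dilation  # look back k * dilation steps
            if idx >= 0:
                acc += w * x[idx]
        out.append(acc)
    return out


# Toy input (hypothetical data): a two-tap kernel with dilation 2
# adds each sample to the sample two steps earlier.
x = [1.0, 2.0, 3.0, 4.0, 5.0]
print(dilated_causal_conv1d(x, [1.0, 1.0], dilation=2))
# [1.0, 2.0, 4.0, 6.0, 8.0]
```

Increasing `dilation` lets the same two weights reach further into the past, which is why stacks of such layers can cover long sequences with few parameters.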
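State-of-Health Estimation Method for Lithium-Ion Batteries Based on the SA-CDC-GRU-AE Model
14
Authors: 胡钰航, 廖宇, 崔琨, 李景聪 《湖北民族大学学报(自然科学版)》 2025, No. 2, pp. 266-271 (6 pages)
To address the insufficient accuracy of state-of-health (SOH) prediction for electric-vehicle lithium-ion batteries and the poor generalization of traditional models, an SOH estimation method is proposed based on a self-attention, causal dilated convolution, gated recurrent unit, autoencoder (SA-CDC-GRU-AE) model. A CDC module is introduced into the convolution module and combined with the SA mechanism, which preserves causality in prediction and suppresses the interference of the capacity regeneration phenomenon on the prediction results. In addition, an AE module is introduced to optimize the GRU model so that it can both extract hidden features and capture long-term dependencies. Validation on two public datasets shows that the average root mean square error (RMSE) of the SA-CDC-GRU-AE model is 1.009% and 0.488%, and the average mean absolute error (MAE) is 0.780% and 0.432%, respectively. The model estimates lithium-ion battery SOH accurately and has important engineering value for battery management systems.
Keywords: lithium-ion battery; state-of-health estimation; capacity regeneration; causal convolution; dilated convolution; autoencoder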
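The two error metrics quoted in this abstract have standard definitions, sketched below in plain Python. The function names and the sample values are hypothetical and only illustrate the formulas; they are not data from the paper.

```python
import math
from typing import Sequence


def rmse(y_true: Sequence[float], y_pred: Sequence[float]) -> float:
    """Root mean square error: sqrt(mean((y_true - y_pred)^2))."""
    if len(y_true) != len(y_pred) or not y_true:
        raise ValueError("inputs must be non-empty and of equal length")
    squared = [(a - b) ** 2 for a, b in zip(y_true, y_pred)]
    return math.sqrt(sum(squared) / len(squared))


def mae(y_true: Sequence[float], y_pred: Sequence[float]) -> float:
    """Mean absolute error: mean(|y_true - y_pred|)."""
    if len(y_true) != len(y_pred) or not y_true:
        raise ValueError("inputs must be non-empty and of equal length")
    return sum(abs(a - b) for a, b in zip(y_true, y_pred)) / len(y_true)


# Hypothetical SOH values (fractions of rated capacity), for illustration only.
soh_true = [1.00, 0.95, 0.90, 0.85]
soh_pred = [0.99, 0.96, 0.88, 0.85]
print(rmse(soh_true, soh_pred), mae(soh_true, soh_pred))
```

RMSE penalizes large individual errors more heavily than MAE, so RMSE is never smaller than MAE on the same data, which is consistent with the paired values reported in the abstract.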
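Water Body Segmentation Method Based on an Improved Convolutional Neural Network
15
Authors: 张永宏, 孙岩, 田伟, 马光义, 朱灵龙 《计算机应用与软件》 PKU Core 2026, No. 2, pp. 164-174, 188 (12 pages)
Because water bodies in remote sensing images have complex multi-scale features, traditional methods are prone to false detections and missed detections when extracting them. To address this, a new network structure that fuses local and global information is proposed. At the encoder, the network uses a residual module with an attention mechanism to obtain global and local information for the feature at each position, and multi-path dilated convolution to extract multi-scale water body features. To improve segmentation accuracy at water body boundaries, a refined attention fusion module is designed at the decoder. Experiments show that the network's recall, precision, and F1-score are 95.78%, 94.24%, and 93.75%, respectively, which are 1.56, 1.72, and 1.62 percentage points higher than those of a traditional convolutional neural network.
Keywords: water body segmentation; global attention mechanism; multi-path dilated convolution; local and global information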
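Pavement Crack Detection Based on Deformable Convolution and Attention Mechanism
16
Authors: 谢永华, 方育才, 彭银佳 《计算机工程与设计》 PKU Core 2026, No. 1, pp. 279-285 (7 pages)
To address the difficulty of learning image edge features and the interference of background noise in pavement crack detection, an end-to-end trainable pavement crack detection network based on deformable convolution and an attention mechanism is proposed. The network follows the U-Net structure: an edge-aware module is added to the feature fusion part to strengthen detection of crack edges; a dilated residual module is used in the encoder to enlarge the receptive field and retain more detail; and an attention mechanism is added to the decoder to focus on crack features and suppress background noise. Experiments show that the network outperforms the other compared networks on all three metrics (MPA, mIoU, and F1), confirming its effectiveness.
Keywords: crack detection; semantic segmentation; encoder-decoder; deformable convolution; dilated convolution; residual connection; attention mechanism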
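Deblurring Network Based on Deep-Shallow Dual-Branch Feature Fusion
17
Authors: 徐志京, 曾泓键 《计算机工程与应用》 PKU Core 2026, No. 3, pp. 254-264 (11 pages)
Existing image deblurring methods suffer from loss of edge information, artifacts between patches, and the high cost of large models. To address this, a deep-shallow deblur network (DSDN) with dual-branch feature fusion is constructed, which extracts blur features from a deep branch and a shallow branch. The frequency-domain self-attention and cascaded dilated convolution modules designed for the deep branch locate blur features in the frequency domain and enhance them, while enlarging the receptive field without increasing the kernel size. The shallow branch efficiently extracts fine blur-detail features and fuses them with the deep features through residual connections, which avoids vanishing gradients. A weighted joint loss over the spatial and frequency domains guides network training in both domains and limits frequency-domain discrepancies in the restored image. In experiments on the public GOPRO and HIDE datasets, the method achieves higher scores and restores more prominent image details, outperforming existing mainstream deblurring methods in both objective metrics and subjective inspection.
Keywords: image deblurring; dual branch; frequency-domain information; attention mechanism; dilated convolution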
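The claim that cascaded dilated convolutions enlarge the receptive field without a larger kernel can be checked with simple arithmetic. The sketch below assumes stride-1 layers that all share one kernel size; the function name and the dilation schedules are illustrative and are not taken from the paper.

```python
from typing import Sequence


def receptive_field(kernel_size: int, dilations: Sequence[int]) -> int:
    """Receptive field of stacked stride-1 convolutions.

    Each layer with dilation d widens the field by (kernel_size - 1) * d.
    """
    rf = 1
    for d in dilations:
        rf += (kernel_size - 1) * d
    return rf


# Three 3-tap layers: plain convolutions versus a 1-2-4 dilation cascade.
print(receptive_field(3, [1, 1, 1]))  # 7
print(receptive_field(3, [1, 2, 4]))  # 15
```

Both stacks use the same number of weights per layer, yet the dilated cascade covers roughly twice the span.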
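Express Parcel Defect Detection Based on an Improved YOLOv8n
18
Authors: 杨慧敏, 高小雯, 李瑞涛, 王汉霞 《电子测量技术》 PKU Core 2026, No. 3, pp. 66-76 (11 pages)
To address the limited ability to recognize complex parcel types and fine details in express parcel defect detection, and the shortcomings of existing models in accuracy and real-time performance, an express parcel defect detection algorithm based on an improved YOLOv8n is proposed. First, the C2f module in the network is fused with frequency-adaptive dilated convolution to form a C2f-FADC module, which adjusts flexibly in multi-scale, multi-frequency defect detection tasks, optimizing feature extraction and improving representation ability. Second, a SimSPPF module replaces the original SPPF module, simplifying the structure while strengthening multi-scale feature fusion and improving the perception of small targets. Finally, the bounding-box regression loss is replaced with Shape-IoU to model the shape and scale differences between predicted boxes and ground-truth (GT) boxes more accurately and improve localization. On a self-built parcel defect dataset, the improved algorithm reaches a detection precision of 96.3%, raises mAP50 by 4.4% over the original algorithm, and reaches a detection speed of 98 frames; overall it has a clear advantage over other algorithms, confirming the effectiveness and superiority of the method.
Keywords: defect detection; express parcel; YOLOv8n; frequency-adaptive dilated convolution (FADC); SimSPPF; Shape-IoU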
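Infrared and Visible Image Fusion with Progressive Multi-Scale Feature Extraction and Fusion
19
Authors: 许光宇, 吴淑雅 《齐鲁工业大学学报》 2026, No. 1, pp. 45-56 (12 pages)
To address weakened target information and lost detail in fused images, this paper proposes an infrared and visible image fusion algorithm with progressive multi-scale feature extraction and fusion. The method builds a dual-branch generator network with symmetric structure and independent parameters. First, the source images and their enhanced versions are fed into a dilated convolution module that extracts contextual features at different scales to fully exploit multi-scale information. Second, a multi-attention complementary residual aggregation module improves feature selectivity, strengthening salient information and suppressing redundant features, and a progressive interaction mechanism achieves cross-scale fusion and complementarity. For the discriminator, a dual-discriminator structure models the infrared and visible image distributions separately, which mitigates the contrast shift and detail weakening that a single discriminator produces in multimodal adversarial training. Experiments show that the method outperforms existing mainstream algorithms on multiple subjective and objective metrics, and that the fused images retain more texture detail and look better.
Keywords: image fusion; dilated convolution; image enhancement; attention mechanism; dual discriminator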
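Lung Nodule Detection Method Based on YOLOv8-DLung
20
Authors: 李澄非, 蔡于斐 《光学与光电技术》 2026, No. 1, pp. 11-18 (8 pages)
Lung cancer is a serious public health problem, with the highest incidence and mortality among all cancer types. Accurate detection of lung nodules is essential for early intervention and for preventing the cancer from spreading. A deep learning network, YOLOv8-DLung, is therefore proposed to improve lung nodule detection accuracy and reduce the misdiagnosis rate, thereby improving patients' chances of survival. The model follows the overall architecture of YOLOv8 and adds dilated convolution to the backbone, enlarging the region covered by the filters in order to capture wide-ranging contextual information. SENet is applied after the spatial pyramid pooling-fast (SPPF) module to further filter and fuse the information extracted by the backbone, making effective use of both the spatial information of lesions in lung nodule CT images and the information across channels. Results on the public LUNA16 dataset show a precision of 94.1% and an mAP of 95.5%; in addition, the average inference time per image on the test set is 25 ms, so the model can detect lung nodule regions effectively.
Keywords: deep learning; lung nodule; object detection; dilated convolution; attention mechanism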