期刊文献+
共找到982篇文章
< 1 2 50 >
每页显示 20 50 100
Channel-Attention DenseNet with Dilated Convolutions for MRI Brain Tumor Classification
1
作者 Abdu Salam Mohammad Abrar +5 位作者 Raja Waseem Anwer Farhan Amin Faizan Ullah Isabel de la Torre Gerardo Mendez Mezquita Henry Fabian Gongora 《Computer Modeling in Engineering & Sciences》 2025年第11期2457-2479,共23页
Brain tumors pose significant diagnostic challenges due to their diverse types and complex anatomical locations.Due to the increase in precision image-based diagnostic tools,driven by advancements in artificial intell... Brain tumors pose significant diagnostic challenges due to their diverse types and complex anatomical locations.Due to the increase in precision image-based diagnostic tools,driven by advancements in artificial intelligence(AI)and deep learning,there has been potential to improve diagnostic accuracy,especially with Magnetic Resonance Imaging(MRI).However,traditional state-of-the-art models lack the sensitivity essential for reliable tumor identification and segmentation.Thus,our research aims to enhance brain tumor diagnosis in MRI by proposing an advanced model.The proposed model incorporates dilated convolutions to optimize the brain tumor segmentation and classification.The proposed model is first trained and later evaluated using the BraTS 2020 dataset.In our proposed model preprocessing consists of normalization,noise reduction,and data augmentation to improve model robustness.The attention mechanism and dilated convolutions were introduced to increase the model’s focus on critical regions and capture finer spatial details without compromising image resolution.We have performed experimentation to measure efficiency.For this,we have used various metrics including accuracy,sensitivity,and curve(AUC-ROC).The proposed model achieved a high accuracy of 94%,a sensitivity of 93%,a specificity of 92%,and an AUC-ROC of 0.98,outperforming traditional diagnostic models in brain tumor detection.The proposed model accurately identifies tumor regions,while dilated convolutions enhanced the segmentation accuracy,especially for complex tumor structures.The proposed model demonstrates significant potential for clinical application,providing reliable and precise brain tumor detection in MRI. 展开更多
关键词 Artificial intelligence MRI analysis deep learning dilated convolution DenseNet brain tumor detection brain tumor segmentation
在线阅读 下载PDF
Multi-Head Attention Enhanced Parallel Dilated Convolution and Residual Learning for Network Traffic Anomaly Detection 被引量:1
2
作者 Guorong Qi Jian Mao +2 位作者 Kai Huang Zhengxian You Jinliang Lin 《Computers, Materials & Continua》 2025年第2期2159-2176,共18页
Abnormal network traffic, as a frequent security risk, requires a series of techniques to categorize and detect it. Existing network traffic anomaly detection still faces challenges: the inability to fully extract loc... Abnormal network traffic, as a frequent security risk, requires a series of techniques to categorize and detect it. Existing network traffic anomaly detection still faces challenges: the inability to fully extract local and global features, as well as the lack of effective mechanisms to capture complex interactions between features;Additionally, when increasing the receptive field to obtain deeper feature representations, the reliance on increasing network depth leads to a significant increase in computational resource consumption, affecting the efficiency and performance of detection. Based on these issues, firstly, this paper proposes a network traffic anomaly detection model based on parallel dilated convolution and residual learning (Res-PDC). To better explore the interactive relationships between features, the traffic samples are converted into two-dimensional matrix. A module combining parallel dilated convolutions and residual learning (res-pdc) was designed to extract local and global features of traffic at different scales. By utilizing res-pdc modules with different dilation rates, we can effectively capture spatial features at different scales and explore feature dependencies spanning wider regions without increasing computational resources. Secondly, to focus and integrate the information in different feature subspaces, further enhance and extract the interactions among the features, multi-head attention is added to Res-PDC, resulting in the final model: multi-head attention enhanced parallel dilated convolution and residual learning (MHA-Res-PDC) for network traffic anomaly detection. Finally, comparisons with other machine learning and deep learning algorithms are conducted on the NSL-KDD and CIC-IDS-2018 datasets. The experimental results demonstrate that the proposed method in this paper can effectively improve the detection performance. 展开更多
关键词 Network traffic anomaly detection multi-head attention parallel dilated convolution residual learning
在线阅读 下载PDF
Long Text Classification Algorithm Using a Hybrid Model of Bidirectional Encoder Representation from Transformers-Hierarchical Attention Networks-Dilated Convolutions Network 被引量:1
3
作者 ZHAO Yuanyuan GAO Shining +1 位作者 LIU Yang GONG Xiaohui 《Journal of Donghua University(English Edition)》 CAS 2021年第4期341-350,共10页
Text format information is full of most of the resources of Internet,which puts forward higher and higher requirements for the accuracy of text classification.Therefore,in this manuscript,firstly,we design a hybrid mo... Text format information is full of most of the resources of Internet,which puts forward higher and higher requirements for the accuracy of text classification.Therefore,in this manuscript,firstly,we design a hybrid model of bidirectional encoder representation from transformers-hierarchical attention networks-dilated convolutions networks(BERT_HAN_DCN)which based on BERT pre-trained model with superior ability of extracting characteristic.The advantages of HAN model and DCN model are taken into account which can help gain abundant semantic information,fusing context semantic features and hierarchical characteristics.Secondly,the traditional softmax algorithm increases the learning difficulty of the same kind of samples,making it more difficult to distinguish similar features.Based on this,AM-softmax is introduced to replace the traditional softmax.Finally,the fused model is validated,which shows superior performance in the accuracy rate and F1-score of this hybrid model on two datasets and the experimental analysis shows the general single models such as HAN,DCN,based on BERT pre-trained model.Besides,the improved AM-softmax network model is superior to the general softmax network model. 展开更多
关键词 long text classification dilated convolution BERT fusing context semantic features hierarchical characteristics BERT_HAN_DCN AM-softmax
在线阅读 下载PDF
DcNet: Dilated Convolutional Neural Networks for Side-Scan Sonar Image Semantic Segmentation 被引量:2
4
作者 ZHAO Xiaohong QIN Rixia +3 位作者 ZHANG Qilei YU Fei WANG Qi HE Bo 《Journal of Ocean University of China》 SCIE CAS CSCD 2021年第5期1089-1096,共8页
In ocean explorations,side-scan sonar(SSS)plays a very important role and can quickly depict seabed topography.As-sembling the SSS to an autonomous underwater vehicle(AUV)and performing semantic segmentation of an SSS... In ocean explorations,side-scan sonar(SSS)plays a very important role and can quickly depict seabed topography.As-sembling the SSS to an autonomous underwater vehicle(AUV)and performing semantic segmentation of an SSS image in real time can realize online submarine geomorphology or target recognition,which is conducive to submarine detection.However,because of the complexity of the marine environment,various noises in the ocean pollute the sonar image,which also encounters the intensity inhomogeneity problem.In this paper,we propose a novel neural network architecture named dilated convolutional neural network(DcNet)that can run in real time while addressing the above-mentioned issues and providing accurate semantic segmentation.The proposed architecture presents an encoder-decoder network to gradually reduce the spatial dimension of the input image and recover the details of the target,respectively.The core of our network is a novel block connection named DCblock,which mainly uses dilated convolution and depthwise separable convolution between the encoder and decoder to attain more context while still retaining high accuracy.Furthermore,our proposed method performs a super-resolution reconstruction to enlarge the dataset with high-quality im-ages.We compared our network to other common semantic segmentation networks performed on an NVIDIA Jetson TX2 using our sonar image datasets.Experimental results show that while the inference speed of the proposed network significantly outperforms state-of-the-art architectures,the accuracy of our method is still comparable,which indicates its potential applications not only in AUVs equipped with SSS but also in marine exploration. 展开更多
关键词 side-scan sonar(SSS) semantic segmentation dilated convolutions SUPER-RESOLUTION
在线阅读 下载PDF
An improved deep dilated convolutional neural network for seismic facies interpretation 被引量:1
5
作者 Na-Xia Yang Guo-Fa Li +2 位作者 Ting-Hui Li Dong-Feng Zhao Wei-Wei Gu 《Petroleum Science》 SCIE EI CAS CSCD 2024年第3期1569-1583,共15页
With the successful application and breakthrough of deep learning technology in image segmentation,there has been continuous development in the field of seismic facies interpretation using convolutional neural network... With the successful application and breakthrough of deep learning technology in image segmentation,there has been continuous development in the field of seismic facies interpretation using convolutional neural networks.These intelligent and automated methods significantly reduce manual labor,particularly in the laborious task of manually labeling seismic facies.However,the extensive demand for training data imposes limitations on their wider application.To overcome this challenge,we adopt the UNet architecture as the foundational network structure for seismic facies classification,which has demonstrated effective segmentation results even with small-sample training data.Additionally,we integrate spatial pyramid pooling and dilated convolution modules into the network architecture to enhance the perception of spatial information across a broader range.The seismic facies classification test on the public data from the F3 block verifies the superior performance of our proposed improved network structure in delineating seismic facies boundaries.Comparative analysis against the traditional UNet model reveals that our method achieves more accurate predictive classification results,as evidenced by various evaluation metrics for image segmentation.Obviously,the classification accuracy reaches an impressive 96%.Furthermore,the results of seismic facies classification in the seismic slice dimension provide further confirmation of the superior performance of our proposed method,which accurately defines the range of different seismic facies.This approach holds significant potential for analyzing geological patterns and extracting valuable depositional information. 展开更多
关键词 Seismic facies interpretation dilated convolution Spatial pyramid pooling Internal feature maps Compound loss function
原文传递
Magnetic Resonance Imaging Reconstruction Based on Butterfly Dilated Geometric Distillation
6
作者 DUO Lin XU Boyu +1 位作者 REN Yong YANG Xin 《Journal of Shanghai Jiaotong university(Science)》 2025年第3期590-599,共10页
In order to improve the reconstruction accuracy of magnetic resonance imaging(MRI),an accurate natural image compressed sensing(CS)reconstruction network is proposed,which combines the advantages of model-based and de... In order to improve the reconstruction accuracy of magnetic resonance imaging(MRI),an accurate natural image compressed sensing(CS)reconstruction network is proposed,which combines the advantages of model-based and deep learning-based CS-MRI methods.In theory,enhancing geometric texture details in linear reconstruction is possible.First,the optimization problem is decomposed into two problems:linear approximation and geometric compensation.Aimed at the problem of image linear approximation,the data consistency module is used to deal with it.Since the processing process will lose texture details,a neural network layer that explicitly combines image and frequency feature representation is proposed,which is named butterfly dilated geometric distillation network.The network introduces the idea of butterfly operation,skillfully integrates the features of image domain and frequency domain,and avoids the loss of texture details when extracting features in a single domain.Finally,a channel feature fusion module is designed by combining channel attention mechanism and dilated convolution.The attention of the channel makes the final output feature map focus on the more important part,thus improving the feature representation ability.The dilated convolution enlarges the receptive field,thereby obtaining more dense image feature data.The experimental results show that the peak signal-to-noise ratio of the network is 5.43 dB,5.24 dB and 3.89 dB higher than that of ISTA-Net+,FISTA and DGDN networks on the brain data set with a Cartesian sampling mask CS ratio of 10%. 展开更多
关键词 butterfly geometric distillation dilation convolution channel attention image reconstruction
原文传递
Multi⁃Scale Dilated Convolutional Neural Network for Hyperspectral Image Classification
7
作者 Shanshan Zheng Wen Liu +3 位作者 Rui Shan Jingyi Zhao Guoqian Jiang Zhi Zhang 《Journal of Harbin Institute of Technology(New Series)》 CAS 2021年第4期25-32,共8页
Aiming at the problem of image information loss,dilated convolution is introduced and a novel multi⁃scale dilated convolutional neural network(MDCNN)is proposed.Dilated convolution can polymerize image multi⁃scale inf... Aiming at the problem of image information loss,dilated convolution is introduced and a novel multi⁃scale dilated convolutional neural network(MDCNN)is proposed.Dilated convolution can polymerize image multi⁃scale information without reducing the resolution.The first layer of the network used spectral convolutional step to reduce dimensionality.Then the multi⁃scale aggregation extracted multi⁃scale features through applying dilated convolution and shortcut connection.The extracted features which represent properties of data were fed through Softmax to predict the samples.MDCNN achieved the overall accuracy of 99.58% and 99.92% on two public datasets,Indian Pines and Pavia University.Compared with four other existing models,the results illustrate that MDCNN can extract better discriminative features and achieve higher classification performance. 展开更多
关键词 multi⁃scale aggregation dilated convolution hyperspectral image classification(HSIC) shortcut connection
在线阅读 下载PDF
1D-CNN:Speech Emotion Recognition System Using a Stacked Network with Dilated CNN Features 被引量:6
8
作者 Mustaqeem Soonil Kwon 《Computers, Materials & Continua》 SCIE EI 2021年第6期4039-4059,共21页
Emotion recognition from speech data is an active and emerging area of research that plays an important role in numerous applications,such as robotics,virtual reality,behavior assessments,and emergency call centers.Re... Emotion recognition from speech data is an active and emerging area of research that plays an important role in numerous applications,such as robotics,virtual reality,behavior assessments,and emergency call centers.Recently,researchers have developed many techniques in this field in order to ensure an improvement in the accuracy by utilizing several deep learning approaches,but the recognition rate is still not convincing.Our main aim is to develop a new technique that increases the recognition rate with reasonable cost computations.In this paper,we suggested a new technique,which is a one-dimensional dilated convolutional neural network(1D-DCNN)for speech emotion recognition(SER)that utilizes the hierarchical features learning blocks(HFLBs)with a bi-directional gated recurrent unit(BiGRU).We designed a one-dimensional CNN network to enhance the speech signals,which uses a spectral analysis,and to extract the hidden patterns from the speech signals that are fed into a stacked one-dimensional dilated network that are called HFLBs.Each HFLB contains one dilated convolution layer(DCL),one batch normalization(BN),and one leaky_relu(Relu)layer in order to extract the emotional features using a hieratical correlation strategy.Furthermore,the learned emotional features are feed into a BiGRU in order to adjust the global weights and to recognize the temporal cues.The final state of the deep BiGRU is passed from a softmax classifier in order to produce the probabilities of the emotions.The proposed model was evaluated over three benchmarked datasets that included the IEMOCAP,EMO-DB,and RAVDESS,which achieved 72.75%,91.14%,and 78.01%accuracy,respectively. 展开更多
关键词 Affective computing one-dimensional dilated convolutional neural network emotion recognition gated recurrent unit raw audio clips
在线阅读 下载PDF
Advanced Face Mask Detection Model Using Hybrid Dilation Convolution Based Method 被引量:1
9
作者 Shaohan Wang Xiangyu Wang Xin Guo 《Journal of Software Engineering and Applications》 2023年第1期1-19,共19页
A face-mask object detection model incorporating hybrid dilation convolutional network termed ResNet Hybrid-dilation-convolution Face-mask-detector (RHF) is proposed in this paper. Furthermore, a lightweight face-mask... A face-mask object detection model incorporating hybrid dilation convolutional network termed ResNet Hybrid-dilation-convolution Face-mask-detector (RHF) is proposed in this paper. Furthermore, a lightweight face-mask dataset named Light Masked Face Dataset (LMFD) and a medium-sized face-mask dataset named Masked Face Dataset (MFD) with data augmentation methods applied is also constructed in this paper. The hybrid dilation convolutional network is able to expand the perception of the convolutional kernel without concern about the discontinuity of image information during the convolution process. For the given two datasets being constructed above, the trained models are significantly optimized in terms of detection performance, training time, and other related metrics. By using the MFD dataset of 55,905 images, the RHF model requires roughly 10 hours less training time compared to ResNet50 with better detection results with mAP of 93.45%. 展开更多
关键词 Face Mask Detection Object Detection Hybrid dilation convolution Computer Vision
在线阅读 下载PDF
基于空洞因果卷积的学生成绩预测及分析方法
10
作者 赖英旭 张亚薇 +1 位作者 庄俊玺 刘静 《北京工业大学学报》 北大核心 2026年第3期252-267,共16页
针对使用循环神经网络对学生长序列行为数据进行特征提取存在梯度消失或爆炸、长期依赖关系提取能力不足、深度学习模型缺乏可解释性等问题,提出一种面向长序列数据的空洞因果卷积(dilated causal convolution,DCC)成绩预测及分析方法... 针对使用循环神经网络对学生长序列行为数据进行特征提取存在梯度消失或爆炸、长期依赖关系提取能力不足、深度学习模型缺乏可解释性等问题,提出一种面向长序列数据的空洞因果卷积(dilated causal convolution,DCC)成绩预测及分析方法。首先,采用生成对抗网络(generative adversarial network,GAN)生成符合少数类学生原始行为数据分布规律的新样本,并将新样本加入学生数据集中以达到均衡数据集的目的;然后,提出一种基于DCC的成绩预测模型,DCC和门控循环单元(gated recurrent unit,GRU)相结合的结构提高了模型对长序列数据依赖关系的提取能力;最后,使用沙普利加性解释(Shapley additive explanations,SHAP)方法并结合三因素理论对影响学生成绩的因素进行重要性分析和解释。在公开数据集上的实验结果表明,在成绩预测任务中提出的方法与基线方法相比,加权F1分数提高了约6个百分点,并进一步验证了所提方法中关键模块的有效性和模型的泛化能力。此外,通过对比优秀学生和风险学生的学习特点发现,良好的学习习惯、课堂学习的主动性以及不同行为环境等因素会对学生成绩产生重要影响。 展开更多
关键词 学生成绩预测 空洞因果卷积(dilated causal convolution DCC) 不均衡数据 生成对抗网络(generative adversarial network GAN) 沙普利加性解释(Shapley additive explanations SHAP)方法 成绩影响因素分析
在线阅读 下载PDF
基于改进卷积神经网络的水体分割方法
11
作者 张永宏 孙岩 +2 位作者 田伟 马光义 朱灵龙 《计算机应用与软件》 北大核心 2026年第2期164-174,188,共12页
由于遥感图像中水体具有复杂的多尺度特征,传统方法在提取水体过程中容易产生误判和漏判现象。针对这一问题,提出一种融合局部和全局信息的新网络结构。该网络首先在编码端设计一个带有注意机制的残差模块,用于获取每个位置特征的全局... 由于遥感图像中水体具有复杂的多尺度特征,传统方法在提取水体过程中容易产生误判和漏判现象。针对这一问题,提出一种融合局部和全局信息的新网络结构。该网络首先在编码端设计一个带有注意机制的残差模块,用于获取每个位置特征的全局和局部信息,采用多路径扩张卷积实现多尺度水体特征提取。为了提高水体边界处的分割精度,在网络解码端设计细化注意力融合模块。实验结果显示该网络的召回率、精准率、F1-scores分别为95.78%、94.24%、93.75%,与传统卷积神经网络相比,评价指标分别提高1.56百分点、1.72百分点、1.62百分点。 展开更多
关键词 水体分割 全局注意力机制 多路径扩张卷积 局部和全局信息
在线阅读 下载PDF
基于可变形卷积和注意力机制的路面裂缝检测
12
作者 谢永华 方育才 彭银佳 《计算机工程与设计》 北大核心 2026年第1期279-285,共7页
为解决路面裂缝检测中图像边缘特征难以学习和背景噪声干扰的问题,提出一个基于可变形卷积和注意力机制的可端到端训练的路面裂缝检测网络。该网络基于U-Net结构设计,在特征融合部分添加边缘感知模块来增强裂缝边缘的检测能力;在编码器... 为解决路面裂缝检测中图像边缘特征难以学习和背景噪声干扰的问题,提出一个基于可变形卷积和注意力机制的可端到端训练的路面裂缝检测网络。该网络基于U-Net结构设计,在特征融合部分添加边缘感知模块来增强裂缝边缘的检测能力;在编码器部分使用空洞残差模块扩大感受野并保留更多细节信息;在解码器部分添加注意力机制提高对裂缝特征的关注度,抑制背景噪声。实验结果表明,该网络在MPA、mIoU和F1值这3项指标上均优于其它对比网络,验证了该网络的有效性。 展开更多
关键词 裂缝检测 语义分割 编码解码 可变形卷积 空洞卷积 残差连接 注意力机制
在线阅读 下载PDF
基于深浅双分支特征融合的去模糊网络
13
作者 徐志京 曾泓键 《计算机工程与应用》 北大核心 2026年第3期254-264,共11页
针对现有的图像去模糊方法存在边缘信息损失,分块间伪影以及大模型高成本的问题,构建了深浅双分支特征融合的去模糊网络(deep-shallow deblur network,DSDN),从深层和浅层两个分支提取模糊特征。在深层分支中设计的频域自注意力和级联... 针对现有的图像去模糊方法存在边缘信息损失,分块间伪影以及大模型高成本的问题,构建了深浅双分支特征融合的去模糊网络(deep-shallow deblur network,DSDN),从深层和浅层两个分支提取模糊特征。在深层分支中设计的频域自注意力和级联扩张卷积模块,能够在频域有效定位模糊特征并进行特征增强,同时在不增加核大小的前提下有效增大感受野。浅层分支高效提取模糊细节特征,通过残差连接的方式与深层特征融合,能够有效避免梯度消失。提出的空频双域加权联合的损失函数,能够在双域内引导优化网络训练,有效限制复原图像频域差异。在公开数据集GOPRO和HIDE上进行实验,所提方法取得了更高的指标,复原的图像细节更突出,在客观指标和主观观察上均优于现有的主流去模糊方法。 展开更多
关键词 图像去模糊 双分支 频域信息 注意力机制 扩张卷积
在线阅读 下载PDF
基于改进YOLOv8n的快递包裹缺陷检测方法研究
14
作者 杨慧敏 高小雯 +1 位作者 李瑞涛 王汉霞 《电子测量技术》 北大核心 2026年第3期66-76,共11页
为解决快递包裹缺陷检测中对复杂包裹类型和细节特征的识别能力有限,以及现有模型在精度和实时性方面的不足,提出一种基于改进YOLOv8n的快递包裹缺陷检测算法。首先,将网络中的C2f模块融合频率自适应空洞卷积设计了C2f-FADC模块,在处理... 为解决快递包裹缺陷检测中对复杂包裹类型和细节特征的识别能力有限,以及现有模型在精度和实时性方面的不足,提出一种基于改进YOLOv8n的快递包裹缺陷检测算法。首先,将网络中的C2f模块融合频率自适应空洞卷积设计了C2f-FADC模块,在处理多尺度、多频率缺陷检测任务时灵活调整,优化特征提取过程和提高表征能力;其次,引入SimSPPF模块替代原有SPPF模块,简化结构的同时增强多尺度特征融合能力,改善对小尺寸目标的感知效果;最后,将边界框回归损失函数替换为Shape-IoU,以更精准地建模预测框与GT框之间的形状与尺度差异,优化检测定位性能。在自制的包裹缺陷数据集上,改进后的算法检测精度为96.3%,与原算法相比mAP50提高了4.4%,检测速度达到98帧,综合考量较其他算法具有明显优势,验证了该方法的有效性和优越性。 展开更多
关键词 缺陷检测 快递包裹 YOLOv8n 频率自适应空洞卷积(FADC) SimSPPF Shape-IoU
原文传递
渐进式多尺度特征提取与融合的红外与可见光图像融合
15
作者 许光宇 吴淑雅 《齐鲁工业大学学报》 2026年第1期45-56,共12页
针对融合图像中存在目标信息减弱、细节信息丢失的问题,本文提出一种渐进式多尺度特征提取与融合的红外与可见光图像融合算法。该方法构建了结构对称、参数独立的双分支生成网络,首先将原图像及其增强形式输入空洞卷积模块,从不同尺度... 针对融合图像中存在目标信息减弱、细节信息丢失的问题,本文提出一种渐进式多尺度特征提取与融合的红外与可见光图像融合算法。该方法构建了结构对称、参数独立的双分支生成网络,首先将原图像及其增强形式输入空洞卷积模块,从不同尺度提取上下文特征,以充分挖掘多尺度信息;其次,引入多注意力互补残差聚合模块,有效提升特征选择性,强化显著信息、抑制冗余特征,并通过渐进交互机制实现跨尺度融合与互补。在判别器设计上,采用双判别器结构,分别对红外与可见光图像分布建模,以减缓单一判别器在多模态对抗训练中产生的对比度偏移与细节削弱问题。实验结果表明,所提方法在多个主客观评估指标上优于现有主流算法,融合图像保留了更多的纹理细节,视觉效果更佳。 展开更多
关键词 图像融合 空洞卷积 图像增强 注意力机制 双鉴别器
在线阅读 下载PDF
基于YOLOv8-DLung的肺结节检测方法
16
作者 李澄非 蔡于斐 《光学与光电技术》 2026年第1期11-18,共8页
肺癌作为一种严重的公共卫生问题,其发病率和死亡率在所有癌症类型中均居首位。肺结节的准确检测对于肺癌的早期干预和防止其扩散至关重要。因此,提出了一种深度学习网络YOLOv8-DLung,通过使用深度学习方法提升对肺结节的检测精度,降低... 肺癌作为一种严重的公共卫生问题,其发病率和死亡率在所有癌症类型中均居首位。肺结节的准确检测对于肺癌的早期干预和防止其扩散至关重要。因此,提出了一种深度学习网络YOLOv8-DLung,通过使用深度学习方法提升对肺结节的检测精度,降低误诊率,从而提高患者的生存几率。首先,模型参考了YOLOv8模型的整体架构,在主干网络中增加膨胀卷积,扩大滤波器的区域,目的是捕获广泛的关联信息。同时在空间金字塔池化(Spatial Pyramid Pooling-Fast,SPPF)模块后使用SENet对主干网络提取到的信息进一步筛选和融合。有效地利用肺结节CT图像病灶的空间信息和通道之间的信息。经过在LUNA16公开数据集中的结果表明,模型的精确度为94.1%,mAP为95.5%,此外,测试集中平均每幅图片的推理速度在25 ms,能有效检测肺结节区域。 展开更多
关键词 深度学习 肺结节 目标检测 膨胀卷积 注意力机制
原文传递
结合对抗训练和IDCNN的医疗命名实体识别
17
作者 陈雪松 李洋洋 王浩畅 《计算机与现代化》 2026年第1期53-59,100,共8页
在医疗领域,传统的命名实体识别模型,无法兼顾全局特征与局部特征的提取,为了解决这个问题,本文提出一种结合全局特征与局部特征的命名实体识别模型用于处理医疗领域的命名实体识别任务。首先,使用预训练语言模型Chinese-BERT-wwm-ext... 在医疗领域,传统的命名实体识别模型,无法兼顾全局特征与局部特征的提取,为了解决这个问题,本文提出一种结合全局特征与局部特征的命名实体识别模型用于处理医疗领域的命名实体识别任务。首先,使用预训练语言模型Chinese-BERT-wwm-ext得到输入文本的初始向量表示;其次,在初始向量的表示上添加一些扰动来生成对抗样本,可提升模型的鲁棒性;再次,将初始向量表示与对抗样本一同依次输入到特征提取层,特征提取层结合了空洞卷积神经网络(Iterated Dilated Convolutional Neural Network,IDCNN)和双向长短时记忆网络(Bi-directional Long Short-Term Memory,BiLSTM)模型,共同生成特征向量,分别捕捉文本的局部和全局特征,使用自注意力机制将抽取的特征向量进行融合,从而充分利用各层次的信息;最后,利用CRF算法生成预测序列。通过结合特征融合模块与对抗训练模块,该模型对于医疗文本CMeEE中命名实体的识别精确率为66.31%,召回率为68.84%,F1值为67.55%;与基线模型相比,表现出较高的识别精度,适用于医疗领域命名实体识别任务。 展开更多
关键词 命名实体识别 预训练语言模型 对抗训练 IDCNN BiLSTM 自注意力机制
在线阅读 下载PDF
基于DenseNet和多域特征融合的表面肌电手势识别研究
18
作者 金亚辉 刘鑫 +1 位作者 连大山 郭一娜 《传感器与微系统》 北大核心 2026年第1期30-34,共5页
基于表面肌电(sEMG)信号的手势识别在人机交互领域应用广泛,快速准确识别手势动作可以提供更好的用户体验。由于个体差异性导致在多任务中,整体识别准确率低。提出一种密集连接卷积网络(DenseNet)和多域特征融合的sEMG手势识别方法。首... 基于表面肌电(sEMG)信号的手势识别在人机交互领域应用广泛,快速准确识别手势动作可以提供更好的用户体验。由于个体差异性导致在多任务中,整体识别准确率低。提出一种密集连接卷积网络(DenseNet)和多域特征融合的sEMG手势识别方法。首先,从sEMG信号中提取时域和频域特征构成特征集,并与原始信号融合作为网络输入,增强网络输入数据的表达能力。其次,使用融合挤压—激励(SE)注意力和多尺度空洞卷积的DenseNet进行特征提取与分类识别。实验结果表明,在NinaPro DB2数据集中,手势识别整体准确率达到了88.06%,在整体和分类运动中识别性能都有所提升。 展开更多
关键词 表面肌电信号 手势识别 挤压—激励注意力 多尺度空洞卷积 密集连接卷积网络
在线阅读 下载PDF
基于深度学习的红树林遥感图像信息提取的研究
19
作者 杨春月 王修信 《计算机应用与软件》 北大核心 2026年第1期249-256,共8页
红树林是生态环境系统的重要组成部分,对于保护和净化环境具有重要作用。由于过度开发等原因导致红树林的生存环境严重受损,因此监测红树林的状况十分重要。针对深度学习模型从遥感图像提取红树林信息性能较差的问题,结合Shuffle Transf... 红树林是生态环境系统的重要组成部分,对于保护和净化环境具有重要作用。由于过度开发等原因导致红树林的生存环境严重受损,因此监测红树林的状况十分重要。针对深度学习模型从遥感图像提取红树林信息性能较差的问题,结合Shuffle Transformer和卷积神经网络的优势,加入ASPP Embedding模块提取特征信息和跳跃注意力融合深层特征与浅层特征提取遥感图像中红树林信息。结果表明,提出的模型对红树林信息提取的精度为97.64%,相比U-Net网络提高了1.38百分点,实验结果证明此方法在红树林遥感图像信息提取中具有比较大的优势。 展开更多
关键词 红树林 遥感图像 深度学习 TRANSFORMER 空洞卷积 信息提取
在线阅读 下载PDF
融合可形变卷积与注意力检测头的交通多目标检测
20
作者 周泽睿 张友兵 +2 位作者 周奎 王鑫威 杨博超 《江苏理工学院学报》 2026年第1期115-128,共14页
目标检测作为环境感知的关键环节,在复杂交通场景中需应对多尺度目标识别和目标严重遮挡等挑战,这些问题往往会导致YOLO模型边界框定位精度下降,最终出现漏检和误检的情况。为应对上述挑战,设计并改进了基于YOLOv11架构的多目标检测方案... 目标检测作为环境感知的关键环节,在复杂交通场景中需应对多尺度目标识别和目标严重遮挡等挑战,这些问题往往会导致YOLO模型边界框定位精度下降,最终出现漏检和误检的情况。为应对上述挑战,设计并改进了基于YOLOv11架构的多目标检测方案:设计矩形自校准扩张多尺度融合模块(DMSFM),通过空洞卷积与多尺度融合策略实现三重特征提取与融合,提升模型对不同尺寸目标的特征感知能力;引入Shape NWD损失函数,突破传统IoU的局限,以形状加权和归一化Wasserstein距离的几何匹配准则,优化不同尺寸目标锚框的定位精度与敏感度;融合含并行补丁感知注意力机制的检测头,通过多分支与注意力策略,强化模型对多尺度目标特征的适应性及分类决策能力。在CODA自动驾驶道路目标检测数据集上的实验结果表明,所提方法相较于基准模型,平均召回率相对提升26.5%,mAP50和mAP50-95分别相对提升24.1%和16.9%;消融实验验证了三个核心组件的有效协同,进一步证实了本方案在复杂交通场景多尺度目标检测任务中的高鲁棒性。 展开更多
关键词 目标检测 多尺度融合 空洞卷积 Shape-NWD DMSFM
在线阅读 下载PDF
上一页 1 2 50 下一页 到第
使用帮助 返回顶部