期刊文献+
共找到979篇文章
< 1 2 49 >
每页显示 20 50 100
Multi-Head Attention Enhanced Parallel Dilated Convolution and Residual Learning for Network Traffic Anomaly Detection 被引量:1
1
作者 Guorong Qi Jian Mao +2 位作者 Kai Huang Zhengxian You Jinliang Lin 《Computers, Materials & Continua》 2025年第2期2159-2176,共18页
Abnormal network traffic, as a frequent security risk, requires a series of techniques to categorize and detect it. Existing network traffic anomaly detection still faces challenges: the inability to fully extract loc... Abnormal network traffic, as a frequent security risk, requires a series of techniques to categorize and detect it. Existing network traffic anomaly detection still faces challenges: the inability to fully extract local and global features, as well as the lack of effective mechanisms to capture complex interactions between features;Additionally, when increasing the receptive field to obtain deeper feature representations, the reliance on increasing network depth leads to a significant increase in computational resource consumption, affecting the efficiency and performance of detection. Based on these issues, firstly, this paper proposes a network traffic anomaly detection model based on parallel dilated convolution and residual learning (Res-PDC). To better explore the interactive relationships between features, the traffic samples are converted into two-dimensional matrix. A module combining parallel dilated convolutions and residual learning (res-pdc) was designed to extract local and global features of traffic at different scales. By utilizing res-pdc modules with different dilation rates, we can effectively capture spatial features at different scales and explore feature dependencies spanning wider regions without increasing computational resources. Secondly, to focus and integrate the information in different feature subspaces, further enhance and extract the interactions among the features, multi-head attention is added to Res-PDC, resulting in the final model: multi-head attention enhanced parallel dilated convolution and residual learning (MHA-Res-PDC) for network traffic anomaly detection. Finally, comparisons with other machine learning and deep learning algorithms are conducted on the NSL-KDD and CIC-IDS-2018 datasets. The experimental results demonstrate that the proposed method in this paper can effectively improve the detection performance. 展开更多
关键词 Network traffic anomaly detection multi-head attention parallel dilated convolution residual learning
在线阅读 下载PDF
Channel-Attention DenseNet with Dilated Convolutions for MRI Brain Tumor Classification
2
作者 Abdu Salam Mohammad Abrar +5 位作者 Raja Waseem Anwer Farhan Amin Faizan Ullah Isabel de la Torre Gerardo Mendez Mezquita Henry Fabian Gongora 《Computer Modeling in Engineering & Sciences》 2025年第11期2457-2479,共23页
Brain tumors pose significant diagnostic challenges due to their diverse types and complex anatomical locations.Due to the increase in precision image-based diagnostic tools,driven by advancements in artificial intell... Brain tumors pose significant diagnostic challenges due to their diverse types and complex anatomical locations.Due to the increase in precision image-based diagnostic tools,driven by advancements in artificial intelligence(AI)and deep learning,there has been potential to improve diagnostic accuracy,especially with Magnetic Resonance Imaging(MRI).However,traditional state-of-the-art models lack the sensitivity essential for reliable tumor identification and segmentation.Thus,our research aims to enhance brain tumor diagnosis in MRI by proposing an advanced model.The proposed model incorporates dilated convolutions to optimize the brain tumor segmentation and classification.The proposed model is first trained and later evaluated using the BraTS 2020 dataset.In our proposed model preprocessing consists of normalization,noise reduction,and data augmentation to improve model robustness.The attention mechanism and dilated convolutions were introduced to increase the model’s focus on critical regions and capture finer spatial details without compromising image resolution.We have performed experimentation to measure efficiency.For this,we have used various metrics including accuracy,sensitivity,and curve(AUC-ROC).The proposed model achieved a high accuracy of 94%,a sensitivity of 93%,a specificity of 92%,and an AUC-ROC of 0.98,outperforming traditional diagnostic models in brain tumor detection.The proposed model accurately identifies tumor regions,while dilated convolutions enhanced the segmentation accuracy,especially for complex tumor structures.The proposed model demonstrates significant potential for clinical application,providing reliable and precise brain tumor detection in MRI. 展开更多
关键词 Artificial intelligence MRI analysis deep learning dilated convolution DenseNet brain tumor detection brain tumor segmentation
在线阅读 下载PDF
Advanced Face Mask Detection Model Using Hybrid Dilation Convolution Based Method 被引量:1
3
作者 Shaohan Wang Xiangyu Wang Xin Guo 《Journal of Software Engineering and Applications》 2023年第1期1-19,共19页
A face-mask object detection model incorporating hybrid dilation convolutional network termed ResNet Hybrid-dilation-convolution Face-mask-detector (RHF) is proposed in this paper. Furthermore, a lightweight face-mask... A face-mask object detection model incorporating hybrid dilation convolutional network termed ResNet Hybrid-dilation-convolution Face-mask-detector (RHF) is proposed in this paper. Furthermore, a lightweight face-mask dataset named Light Masked Face Dataset (LMFD) and a medium-sized face-mask dataset named Masked Face Dataset (MFD) with data augmentation methods applied is also constructed in this paper. The hybrid dilation convolutional network is able to expand the perception of the convolutional kernel without concern about the discontinuity of image information during the convolution process. For the given two datasets being constructed above, the trained models are significantly optimized in terms of detection performance, training time, and other related metrics. By using the MFD dataset of 55,905 images, the RHF model requires roughly 10 hours less training time compared to ResNet50 with better detection results with mAP of 93.45%. 展开更多
关键词 Face Mask Detection Object Detection Hybrid dilation convolution Computer Vision
在线阅读 下载PDF
DcNet: Dilated Convolutional Neural Networks for Side-Scan Sonar Image Semantic Segmentation 被引量:2
4
作者 ZHAO Xiaohong QIN Rixia +3 位作者 ZHANG Qilei YU Fei WANG Qi HE Bo 《Journal of Ocean University of China》 SCIE CAS CSCD 2021年第5期1089-1096,共8页
In ocean explorations,side-scan sonar(SSS)plays a very important role and can quickly depict seabed topography.As-sembling the SSS to an autonomous underwater vehicle(AUV)and performing semantic segmentation of an SSS... In ocean explorations,side-scan sonar(SSS)plays a very important role and can quickly depict seabed topography.As-sembling the SSS to an autonomous underwater vehicle(AUV)and performing semantic segmentation of an SSS image in real time can realize online submarine geomorphology or target recognition,which is conducive to submarine detection.However,because of the complexity of the marine environment,various noises in the ocean pollute the sonar image,which also encounters the intensity inhomogeneity problem.In this paper,we propose a novel neural network architecture named dilated convolutional neural network(DcNet)that can run in real time while addressing the above-mentioned issues and providing accurate semantic segmentation.The proposed architecture presents an encoder-decoder network to gradually reduce the spatial dimension of the input image and recover the details of the target,respectively.The core of our network is a novel block connection named DCblock,which mainly uses dilated convolution and depthwise separable convolution between the encoder and decoder to attain more context while still retaining high accuracy.Furthermore,our proposed method performs a super-resolution reconstruction to enlarge the dataset with high-quality im-ages.We compared our network to other common semantic segmentation networks performed on an NVIDIA Jetson TX2 using our sonar image datasets.Experimental results show that while the inference speed of the proposed network significantly outperforms state-of-the-art architectures,the accuracy of our method is still comparable,which indicates its potential applications not only in AUVs equipped with SSS but also in marine exploration. 展开更多
关键词 side-scan sonar(SSS) semantic segmentation dilated convolutions SUPER-RESOLUTION
在线阅读 下载PDF
An improved deep dilated convolutional neural network for seismic facies interpretation 被引量:1
5
作者 Na-Xia Yang Guo-Fa Li +2 位作者 Ting-Hui Li Dong-Feng Zhao Wei-Wei Gu 《Petroleum Science》 SCIE EI CAS CSCD 2024年第3期1569-1583,共15页
With the successful application and breakthrough of deep learning technology in image segmentation,there has been continuous development in the field of seismic facies interpretation using convolutional neural network... With the successful application and breakthrough of deep learning technology in image segmentation,there has been continuous development in the field of seismic facies interpretation using convolutional neural networks.These intelligent and automated methods significantly reduce manual labor,particularly in the laborious task of manually labeling seismic facies.However,the extensive demand for training data imposes limitations on their wider application.To overcome this challenge,we adopt the UNet architecture as the foundational network structure for seismic facies classification,which has demonstrated effective segmentation results even with small-sample training data.Additionally,we integrate spatial pyramid pooling and dilated convolution modules into the network architecture to enhance the perception of spatial information across a broader range.The seismic facies classification test on the public data from the F3 block verifies the superior performance of our proposed improved network structure in delineating seismic facies boundaries.Comparative analysis against the traditional UNet model reveals that our method achieves more accurate predictive classification results,as evidenced by various evaluation metrics for image segmentation.Obviously,the classification accuracy reaches an impressive 96%.Furthermore,the results of seismic facies classification in the seismic slice dimension provide further confirmation of the superior performance of our proposed method,which accurately defines the range of different seismic facies.This approach holds significant potential for analyzing geological patterns and extracting valuable depositional information. 展开更多
关键词 Seismic facies interpretation dilated convolution Spatial pyramid pooling Internal feature maps Compound loss function
原文传递
Long Text Classification Algorithm Using a Hybrid Model of Bidirectional Encoder Representation from Transformers-Hierarchical Attention Networks-Dilated Convolutions Network 被引量:1
6
作者 ZHAO Yuanyuan GAO Shining +1 位作者 LIU Yang GONG Xiaohui 《Journal of Donghua University(English Edition)》 CAS 2021年第4期341-350,共10页
Text format information is full of most of the resources of Internet,which puts forward higher and higher requirements for the accuracy of text classification.Therefore,in this manuscript,firstly,we design a hybrid mo... Text format information is full of most of the resources of Internet,which puts forward higher and higher requirements for the accuracy of text classification.Therefore,in this manuscript,firstly,we design a hybrid model of bidirectional encoder representation from transformers-hierarchical attention networks-dilated convolutions networks(BERT_HAN_DCN)which based on BERT pre-trained model with superior ability of extracting characteristic.The advantages of HAN model and DCN model are taken into account which can help gain abundant semantic information,fusing context semantic features and hierarchical characteristics.Secondly,the traditional softmax algorithm increases the learning difficulty of the same kind of samples,making it more difficult to distinguish similar features.Based on this,AM-softmax is introduced to replace the traditional softmax.Finally,the fused model is validated,which shows superior performance in the accuracy rate and F1-score of this hybrid model on two datasets and the experimental analysis shows the general single models such as HAN,DCN,based on BERT pre-trained model.Besides,the improved AM-softmax network model is superior to the general softmax network model. 展开更多
关键词 long text classification dilated convolution BERT fusing context semantic features hierarchical characteristics BERT_HAN_DCN AM-softmax
在线阅读 下载PDF
Multi⁃Scale Dilated Convolutional Neural Network for Hyperspectral Image Classification
7
作者 Shanshan Zheng Wen Liu +3 位作者 Rui Shan Jingyi Zhao Guoqian Jiang Zhi Zhang 《Journal of Harbin Institute of Technology(New Series)》 CAS 2021年第4期25-32,共8页
Aiming at the problem of image information loss,dilated convolution is introduced and a novel multi⁃scale dilated convolutional neural network(MDCNN)is proposed.Dilated convolution can polymerize image multi⁃scale inf... Aiming at the problem of image information loss,dilated convolution is introduced and a novel multi⁃scale dilated convolutional neural network(MDCNN)is proposed.Dilated convolution can polymerize image multi⁃scale information without reducing the resolution.The first layer of the network used spectral convolutional step to reduce dimensionality.Then the multi⁃scale aggregation extracted multi⁃scale features through applying dilated convolution and shortcut connection.The extracted features which represent properties of data were fed through Softmax to predict the samples.MDCNN achieved the overall accuracy of 99.58% and 99.92% on two public datasets,Indian Pines and Pavia University.Compared with four other existing models,the results illustrate that MDCNN can extract better discriminative features and achieve higher classification performance. 展开更多
关键词 multi⁃scale aggregation dilated convolution hyperspectral image classification(HSIC) shortcut connection
在线阅读 下载PDF
Convolution-Transformer for Image Feature Extraction 被引量:2
8
作者 Lirong Yin Lei Wang +10 位作者 Siyu Lu Ruiyang Wang Youshuai Yang Bo Yang Shan Liu Ahmed AlSanad Salman A.AlQahtani Zhengtong Yin Xiaolu Li Xiaobing Chen Wenfeng Zheng 《Computer Modeling in Engineering & Sciences》 SCIE EI 2024年第10期87-106,共20页
This study addresses the limitations of Transformer models in image feature extraction,particularly their lack of inductive bias for visual structures.Compared to Convolutional Neural Networks(CNNs),the Transformers a... This study addresses the limitations of Transformer models in image feature extraction,particularly their lack of inductive bias for visual structures.Compared to Convolutional Neural Networks(CNNs),the Transformers are more sensitive to different hyperparameters of optimizers,which leads to a lack of stability and slow convergence.To tackle these challenges,we propose the Convolution-based Efficient Transformer Image Feature Extraction Network(CEFormer)as an enhancement of the Transformer architecture.Our model incorporates E-Attention,depthwise separable convolution,and dilated convolution to introduce crucial inductive biases,such as translation invariance,locality,and scale invariance,into the Transformer framework.Additionally,we implement a lightweight convolution module to process the input images,resulting in faster convergence and improved stability.This results in an efficient convolution combined Transformer image feature extraction network.Experimental results on the ImageNet1k Top-1 dataset demonstrate that the proposed network achieves better accuracy while maintaining high computational speed.It achieves up to 85.0%accuracy across various model sizes on image classification,outperforming various baseline models.When integrated into the Mask Region-ConvolutionalNeuralNetwork(R-CNN)framework as a backbone network,CEFormer outperforms other models and achieves the highest mean Average Precision(mAP)scores.This research presents a significant advancement in Transformer-based image feature extraction,balancing performance and computational efficiency. 展开更多
关键词 TRANSFORMER E-Attention depth convolution dilated convolution CEFormer
在线阅读 下载PDF
TSCND:Temporal Subsequence-Based Convolutional Network with Difference for Time Series Forecasting 被引量:1
9
作者 Haoran Huang Weiting Chen Zheming Fan 《Computers, Materials & Continua》 SCIE EI 2024年第3期3665-3681,共17页
Time series forecasting plays an important role in various fields, such as energy, finance, transport, and weather. Temporal convolutional networks (TCNs) based on dilated causal convolution have been widely used in t... Time series forecasting plays an important role in various fields, such as energy, finance, transport, and weather. Temporal convolutional networks (TCNs) based on dilated causal convolution have been widely used in time series forecasting. However, two problems weaken the performance of TCNs. One is that in dilated casual convolution, causal convolution leads to the receptive fields of outputs being concentrated in the earlier part of the input sequence, whereas the recent input information will be severely lost. The other is that the distribution shift problem in time series has not been adequately solved. To address the first problem, we propose a subsequence-based dilated convolution method (SDC). By using multiple convolutional filters to convolve elements of neighboring subsequences, the method extracts temporal features from a growing receptive field via a growing subsequence rather than a single element. Ultimately, the receptive field of each output element can cover the whole input sequence. To address the second problem, we propose a difference and compensation method (DCM). The method reduces the discrepancies between and within the input sequences by difference operations and then compensates the outputs for the information lost due to difference operations. Based on SDC and DCM, we further construct a temporal subsequence-based convolutional network with difference (TSCND) for time series forecasting. The experimental results show that TSCND can reduce prediction mean squared error by 7.3% and save runtime, compared with state-of-the-art models and vanilla TCN. 展开更多
关键词 DIFFERENCE data prediction time series temporal convolutional network dilated convolution
在线阅读 下载PDF
Multi-Classification of Polyps in Colonoscopy Images Based on an Improved Deep Convolutional Neural Network 被引量:1
10
作者 Shuang Liu Xiao Liu +9 位作者 Shilong Chang Yufeng Sun Kaiyuan Li Ya Hou Shiwei Wang Jie Meng Qingliang Zhao Sibei Wu Kun Yang Linyan Xue 《Computers, Materials & Continua》 SCIE EI 2023年第6期5837-5852,共16页
Achieving accurate classification of colorectal polyps during colonoscopy can avoid unnecessary endoscopic biopsy or resection.This study aimed to develop a deep learning model that can automatically classify colorect... Achieving accurate classification of colorectal polyps during colonoscopy can avoid unnecessary endoscopic biopsy or resection.This study aimed to develop a deep learning model that can automatically classify colorectal polyps histologically on white-light and narrow-band imaging(NBI)colonoscopy images based on World Health Organization(WHO)and Workgroup serrAted polypS and Polyposis(WASP)classification criteria for colorectal polyps.White-light and NBI colonoscopy images of colorectal polyps exhibiting pathological results were firstly collected and classified into four categories:conventional adenoma,hyperplastic polyp,sessile serrated adenoma/polyp(SSAP)and normal,among which conventional adenoma could be further divided into three sub-categories of tubular adenoma,villous adenoma and villioustublar adenoma,subsequently the images were re-classified into six categories.In this paper,we proposed a novel convolutional neural network termed Polyp-DedNet for the four-and six-category classification tasks of colorectal polyps.Based on the existing classification network ResNet50,Polyp-DedNet adopted dilated convolution to retain more high-dimensional spatial information and an Efficient Channel Attention(ECA)module to improve the classification performance further.To eliminate gridding artifacts caused by dilated convolutions,traditional convolutional layers were used instead of the max pooling layer,and two convolutional layers with progressively decreasing dilation were added at the end of the network.Due to the inevitable imbalance of medical image data,a regularization method DropBlock and a Class-Balanced(CB)Loss were performed to prevent network overfitting.Furthermore,the 5-fold cross-validation was adopted to estimate the performance of Polyp-DedNet for the multi-classification task of colorectal polyps.Mean accuracies of the proposed Polyp-DedNet for the four-and six-category classifications of colorectal polyps were 89.91%±0.92%and 85.13%±1.10%,respectively.The metrics of precision,recall and F1-score were also improved by 1%∼2%compared to the baseline ResNet50.The proposed Polyp-DedNet presented state-of-the-art performance for colorectal polyp classifying on white-light and NBI colonoscopy images,highlighting its considerable potential as an AI-assistant system for accurate colorectal polyp diagnosis in colonoscopy. 展开更多
关键词 Colorectal polyps four-and six-category classifications convolutional neural network dilated residual network
在线阅读 下载PDF
A Lightweight Convolutional Neural Network with Hierarchical Multi-Scale Feature Fusion for Image Classification 被引量:2
11
作者 Adama Dembele Ronald Waweru Mwangi Ananda Omutokoh Kube 《Journal of Computer and Communications》 2024年第2期173-200,共28页
Convolutional neural networks (CNNs) are widely used in image classification tasks, but their increasing model size and computation make them challenging to implement on embedded systems with constrained hardware reso... Convolutional neural networks (CNNs) are widely used in image classification tasks, but their increasing model size and computation make them challenging to implement on embedded systems with constrained hardware resources. To address this issue, the MobileNetV1 network was developed, which employs depthwise convolution to reduce network complexity. MobileNetV1 employs a stride of 2 in several convolutional layers to decrease the spatial resolution of feature maps, thereby lowering computational costs. However, this stride setting can lead to a loss of spatial information, particularly affecting the detection and representation of smaller objects or finer details in images. To maintain the trade-off between complexity and model performance, a lightweight convolutional neural network with hierarchical multi-scale feature fusion based on the MobileNetV1 network is proposed. The network consists of two main subnetworks. The first subnetwork uses a depthwise dilated separable convolution (DDSC) layer to learn imaging features with fewer parameters, which results in a lightweight and computationally inexpensive network. Furthermore, depthwise dilated convolution in DDSC layer effectively expands the field of view of filters, allowing them to incorporate a larger context. The second subnetwork is a hierarchical multi-scale feature fusion (HMFF) module that uses parallel multi-resolution branches architecture to process the input feature map in order to extract the multi-scale feature information of the input image. Experimental results on the CIFAR-10, Malaria, and KvasirV1 datasets demonstrate that the proposed method is efficient, reducing the network parameters and computational cost by 65.02% and 39.78%, respectively, while maintaining the network performance compared to the MobileNetV1 baseline. 展开更多
关键词 MobileNet Image Classification Lightweight convolutional Neural Network Depthwise dilated Separable convolution Hierarchical Multi-Scale Feature Fusion
在线阅读 下载PDF
Magnetic Resonance Imaging Reconstruction Based on Butterfly Dilated Geometric Distillation
12
作者 DUO Lin XU Boyu +1 位作者 REN Yong YANG Xin 《Journal of Shanghai Jiaotong university(Science)》 2025年第3期590-599,共10页
In order to improve the reconstruction accuracy of magnetic resonance imaging(MRI),an accurate natural image compressed sensing(CS)reconstruction network is proposed,which combines the advantages of model-based and de... In order to improve the reconstruction accuracy of magnetic resonance imaging(MRI),an accurate natural image compressed sensing(CS)reconstruction network is proposed,which combines the advantages of model-based and deep learning-based CS-MRI methods.In theory,enhancing geometric texture details in linear reconstruction is possible.First,the optimization problem is decomposed into two problems:linear approximation and geometric compensation.Aimed at the problem of image linear approximation,the data consistency module is used to deal with it.Since the processing process will lose texture details,a neural network layer that explicitly combines image and frequency feature representation is proposed,which is named butterfly dilated geometric distillation network.The network introduces the idea of butterfly operation,skillfully integrates the features of image domain and frequency domain,and avoids the loss of texture details when extracting features in a single domain.Finally,a channel feature fusion module is designed by combining channel attention mechanism and dilated convolution.The attention of the channel makes the final output feature map focus on the more important part,thus improving the feature representation ability.The dilated convolution enlarges the receptive field,thereby obtaining more dense image feature data.The experimental results show that the peak signal-to-noise ratio of the network is 5.43 dB,5.24 dB and 3.89 dB higher than that of ISTA-Net+,FISTA and DGDN networks on the brain data set with a Cartesian sampling mask CS ratio of 10%. 展开更多
关键词 butterfly geometric distillation dilation convolution channel attention image reconstruction
原文传递
基于空洞因果卷积的学生成绩预测及分析方法
13
作者 赖英旭 张亚薇 +1 位作者 庄俊玺 刘静 《北京工业大学学报》 北大核心 2026年第3期252-267,共16页
针对使用循环神经网络对学生长序列行为数据进行特征提取存在梯度消失或爆炸、长期依赖关系提取能力不足、深度学习模型缺乏可解释性等问题,提出一种面向长序列数据的空洞因果卷积(dilated causal convolution,DCC)成绩预测及分析方法... 针对使用循环神经网络对学生长序列行为数据进行特征提取存在梯度消失或爆炸、长期依赖关系提取能力不足、深度学习模型缺乏可解释性等问题,提出一种面向长序列数据的空洞因果卷积(dilated causal convolution,DCC)成绩预测及分析方法。首先,采用生成对抗网络(generative adversarial network,GAN)生成符合少数类学生原始行为数据分布规律的新样本,并将新样本加入学生数据集中以达到均衡数据集的目的;然后,提出一种基于DCC的成绩预测模型,DCC和门控循环单元(gated recurrent unit,GRU)相结合的结构提高了模型对长序列数据依赖关系的提取能力;最后,使用沙普利加性解释(Shapley additive explanations,SHAP)方法并结合三因素理论对影响学生成绩的因素进行重要性分析和解释。在公开数据集上的实验结果表明,在成绩预测任务中提出的方法与基线方法相比,加权F1分数提高了约6个百分点,并进一步验证了所提方法中关键模块的有效性和模型的泛化能力。此外,通过对比优秀学生和风险学生的学习特点发现,良好的学习习惯、课堂学习的主动性以及不同行为环境等因素会对学生成绩产生重要影响。 展开更多
关键词 学生成绩预测 空洞因果卷积(dilated causal convolution DCC) 不均衡数据 生成对抗网络(generative adversarial network GAN) 沙普利加性解释(Shapley additive explanations SHAP)方法 成绩影响因素分析
在线阅读 下载PDF
基于改进卷积神经网络的水体分割方法
14
作者 张永宏 孙岩 +2 位作者 田伟 马光义 朱灵龙 《计算机应用与软件》 北大核心 2026年第2期164-174,188,共12页
由于遥感图像中水体具有复杂的多尺度特征,传统方法在提取水体过程中容易产生误判和漏判现象。针对这一问题,提出一种融合局部和全局信息的新网络结构。该网络首先在编码端设计一个带有注意机制的残差模块,用于获取每个位置特征的全局... 由于遥感图像中水体具有复杂的多尺度特征,传统方法在提取水体过程中容易产生误判和漏判现象。针对这一问题,提出一种融合局部和全局信息的新网络结构。该网络首先在编码端设计一个带有注意机制的残差模块,用于获取每个位置特征的全局和局部信息,采用多路径扩张卷积实现多尺度水体特征提取。为了提高水体边界处的分割精度,在网络解码端设计细化注意力融合模块。实验结果显示该网络的召回率、精准率、F1-scores分别为95.78%、94.24%、93.75%,与传统卷积神经网络相比,评价指标分别提高1.56百分点、1.72百分点、1.62百分点。 展开更多
关键词 水体分割 全局注意力机制 多路径扩张卷积 局部和全局信息
在线阅读 下载PDF
基于可变形卷积和注意力机制的路面裂缝检测
15
作者 谢永华 方育才 彭银佳 《计算机工程与设计》 北大核心 2026年第1期279-285,共7页
为解决路面裂缝检测中图像边缘特征难以学习和背景噪声干扰的问题,提出一个基于可变形卷积和注意力机制的可端到端训练的路面裂缝检测网络。该网络基于U-Net结构设计,在特征融合部分添加边缘感知模块来增强裂缝边缘的检测能力;在编码器... 为解决路面裂缝检测中图像边缘特征难以学习和背景噪声干扰的问题,提出一个基于可变形卷积和注意力机制的可端到端训练的路面裂缝检测网络。该网络基于U-Net结构设计,在特征融合部分添加边缘感知模块来增强裂缝边缘的检测能力;在编码器部分使用空洞残差模块扩大感受野并保留更多细节信息;在解码器部分添加注意力机制提高对裂缝特征的关注度,抑制背景噪声。实验结果表明,该网络在MPA、mIoU和F1值这3项指标上均优于其它对比网络,验证了该网络的有效性。 展开更多
关键词 裂缝检测 语义分割 编码解码 可变形卷积 空洞卷积 残差连接 注意力机制
在线阅读 下载PDF
基于深浅双分支特征融合的去模糊网络
16
作者 徐志京 曾泓键 《计算机工程与应用》 北大核心 2026年第3期254-264,共11页
针对现有的图像去模糊方法存在边缘信息损失,分块间伪影以及大模型高成本的问题,构建了深浅双分支特征融合的去模糊网络(deep-shallow deblur network,DSDN),从深层和浅层两个分支提取模糊特征。在深层分支中设计的频域自注意力和级联... 针对现有的图像去模糊方法存在边缘信息损失,分块间伪影以及大模型高成本的问题,构建了深浅双分支特征融合的去模糊网络(deep-shallow deblur network,DSDN),从深层和浅层两个分支提取模糊特征。在深层分支中设计的频域自注意力和级联扩张卷积模块,能够在频域有效定位模糊特征并进行特征增强,同时在不增加核大小的前提下有效增大感受野。浅层分支高效提取模糊细节特征,通过残差连接的方式与深层特征融合,能够有效避免梯度消失。提出的空频双域加权联合的损失函数,能够在双域内引导优化网络训练,有效限制复原图像频域差异。在公开数据集GOPRO和HIDE上进行实验,所提方法取得了更高的指标,复原的图像细节更突出,在客观指标和主观观察上均优于现有的主流去模糊方法。 展开更多
关键词 图像去模糊 双分支 频域信息 注意力机制 扩张卷积
在线阅读 下载PDF
渐进式多尺度特征提取与融合的红外与可见光图像融合
17
作者 许光宇 吴淑雅 《齐鲁工业大学学报》 2026年第1期45-56,共12页
针对融合图像中存在目标信息减弱、细节信息丢失的问题,本文提出一种渐进式多尺度特征提取与融合的红外与可见光图像融合算法。该方法构建了结构对称、参数独立的双分支生成网络,首先将原图像及其增强形式输入空洞卷积模块,从不同尺度... 针对融合图像中存在目标信息减弱、细节信息丢失的问题,本文提出一种渐进式多尺度特征提取与融合的红外与可见光图像融合算法。该方法构建了结构对称、参数独立的双分支生成网络,首先将原图像及其增强形式输入空洞卷积模块,从不同尺度提取上下文特征,以充分挖掘多尺度信息;其次,引入多注意力互补残差聚合模块,有效提升特征选择性,强化显著信息、抑制冗余特征,并通过渐进交互机制实现跨尺度融合与互补。在判别器设计上,采用双判别器结构,分别对红外与可见光图像分布建模,以减缓单一判别器在多模态对抗训练中产生的对比度偏移与细节削弱问题。实验结果表明,所提方法在多个主客观评估指标上优于现有主流算法,融合图像保留了更多的纹理细节,视觉效果更佳。 展开更多
关键词 图像融合 空洞卷积 图像增强 注意力机制 双鉴别器
在线阅读 下载PDF
结合对抗训练和IDCNN的医疗命名实体识别
18
作者 陈雪松 李洋洋 王浩畅 《计算机与现代化》 2026年第1期53-59,100,共8页
在医疗领域,传统的命名实体识别模型,无法兼顾全局特征与局部特征的提取,为了解决这个问题,本文提出一种结合全局特征与局部特征的命名实体识别模型用于处理医疗领域的命名实体识别任务。首先,使用预训练语言模型Chinese-BERT-wwm-ext... 在医疗领域,传统的命名实体识别模型,无法兼顾全局特征与局部特征的提取,为了解决这个问题,本文提出一种结合全局特征与局部特征的命名实体识别模型用于处理医疗领域的命名实体识别任务。首先,使用预训练语言模型Chinese-BERT-wwm-ext得到输入文本的初始向量表示;其次,在初始向量的表示上添加一些扰动来生成对抗样本,可提升模型的鲁棒性;再次,将初始向量表示与对抗样本一同依次输入到特征提取层,特征提取层结合了空洞卷积神经网络(Iterated Dilated Convolutional Neural Network,IDCNN)和双向长短时记忆网络(Bi-directional Long Short-Term Memory,BiLSTM)模型,共同生成特征向量,分别捕捉文本的局部和全局特征,使用自注意力机制将抽取的特征向量进行融合,从而充分利用各层次的信息;最后,利用CRF算法生成预测序列。通过结合特征融合模块与对抗训练模块,该模型对于医疗文本CMeEE中命名实体的识别精确率为66.31%,召回率为68.84%,F1值为67.55%;与基线模型相比,表现出较高的识别精度,适用于医疗领域命名实体识别任务。 展开更多
关键词 命名实体识别 预训练语言模型 对抗训练 IDCNN BiLSTM 自注意力机制
在线阅读 下载PDF
基于DenseNet和多域特征融合的表面肌电手势识别研究
19
作者 金亚辉 刘鑫 +1 位作者 连大山 郭一娜 《传感器与微系统》 北大核心 2026年第1期30-34,共5页
基于表面肌电(sEMG)信号的手势识别在人机交互领域应用广泛,快速准确识别手势动作可以提供更好的用户体验。由于个体差异性导致在多任务中,整体识别准确率低。提出一种密集连接卷积网络(DenseNet)和多域特征融合的sEMG手势识别方法。首... 基于表面肌电(sEMG)信号的手势识别在人机交互领域应用广泛,快速准确识别手势动作可以提供更好的用户体验。由于个体差异性导致在多任务中,整体识别准确率低。提出一种密集连接卷积网络(DenseNet)和多域特征融合的sEMG手势识别方法。首先,从sEMG信号中提取时域和频域特征构成特征集,并与原始信号融合作为网络输入,增强网络输入数据的表达能力。其次,使用融合挤压—激励(SE)注意力和多尺度空洞卷积的DenseNet进行特征提取与分类识别。实验结果表明,在NinaPro DB2数据集中,手势识别整体准确率达到了88.06%,在整体和分类运动中识别性能都有所提升。 展开更多
关键词 表面肌电信号 手势识别 挤压—激励注意力 多尺度空洞卷积 密集连接卷积网络
在线阅读 下载PDF
基于深度学习的红树林遥感图像信息提取的研究
20
作者 杨春月 王修信 《计算机应用与软件》 北大核心 2026年第1期249-256,共8页
红树林是生态环境系统的重要组成部分,对于保护和净化环境具有重要作用。由于过度开发等原因导致红树林的生存环境严重受损,因此监测红树林的状况十分重要。针对深度学习模型从遥感图像提取红树林信息性能较差的问题,结合Shuffle Transf... 红树林是生态环境系统的重要组成部分,对于保护和净化环境具有重要作用。由于过度开发等原因导致红树林的生存环境严重受损,因此监测红树林的状况十分重要。针对深度学习模型从遥感图像提取红树林信息性能较差的问题,结合Shuffle Transformer和卷积神经网络的优势,加入ASPP Embedding模块提取特征信息和跳跃注意力融合深层特征与浅层特征提取遥感图像中红树林信息。结果表明,提出的模型对红树林信息提取的精度为97.64%,相比U-Net网络提高了1.38百分点,实验结果证明此方法在红树林遥感图像信息提取中具有比较大的优势。 展开更多
关键词 红树林 遥感图像 深度学习 TRANSFORMER 空洞卷积 信息提取
在线阅读 下载PDF
上一页 1 2 49 下一页 到第
使用帮助 返回顶部