期刊文献+
共找到937篇文章
< 1 2 47 >
每页显示 20 50 100
Multi-Head Attention Enhanced Parallel Dilated Convolution and Residual Learning for Network Traffic Anomaly Detection
1
作者 Guorong Qi Jian Mao +2 位作者 Kai Huang Zhengxian You Jinliang Lin 《Computers, Materials & Continua》 2025年第2期2159-2176,共18页
Abnormal network traffic, as a frequent security risk, requires a series of techniques to categorize and detect it. Existing network traffic anomaly detection still faces challenges: the inability to fully extract loc... Abnormal network traffic, as a frequent security risk, requires a series of techniques to categorize and detect it. Existing network traffic anomaly detection still faces challenges: the inability to fully extract local and global features, as well as the lack of effective mechanisms to capture complex interactions between features;Additionally, when increasing the receptive field to obtain deeper feature representations, the reliance on increasing network depth leads to a significant increase in computational resource consumption, affecting the efficiency and performance of detection. Based on these issues, firstly, this paper proposes a network traffic anomaly detection model based on parallel dilated convolution and residual learning (Res-PDC). To better explore the interactive relationships between features, the traffic samples are converted into two-dimensional matrix. A module combining parallel dilated convolutions and residual learning (res-pdc) was designed to extract local and global features of traffic at different scales. By utilizing res-pdc modules with different dilation rates, we can effectively capture spatial features at different scales and explore feature dependencies spanning wider regions without increasing computational resources. Secondly, to focus and integrate the information in different feature subspaces, further enhance and extract the interactions among the features, multi-head attention is added to Res-PDC, resulting in the final model: multi-head attention enhanced parallel dilated convolution and residual learning (MHA-Res-PDC) for network traffic anomaly detection. Finally, comparisons with other machine learning and deep learning algorithms are conducted on the NSL-KDD and CIC-IDS-2018 datasets. The experimental results demonstrate that the proposed method in this paper can effectively improve the detection performance. 展开更多
关键词 Network traffic anomaly detection multi-head attention parallel dilated convolution residual learning
在线阅读 下载PDF
Channel-Attention DenseNet with Dilated Convolutions for MRI Brain Tumor Classification
2
作者 Abdu Salam Mohammad Abrar +5 位作者 Raja Waseem Anwer Farhan Amin Faizan Ullah Isabel de la Torre Gerardo Mendez Mezquita Henry Fabian Gongora 《Computer Modeling in Engineering & Sciences》 2025年第11期2457-2479,共23页
Brain tumors pose significant diagnostic challenges due to their diverse types and complex anatomical locations.Due to the increase in precision image-based diagnostic tools,driven by advancements in artificial intell... Brain tumors pose significant diagnostic challenges due to their diverse types and complex anatomical locations.Due to the increase in precision image-based diagnostic tools,driven by advancements in artificial intelligence(AI)and deep learning,there has been potential to improve diagnostic accuracy,especially with Magnetic Resonance Imaging(MRI).However,traditional state-of-the-art models lack the sensitivity essential for reliable tumor identification and segmentation.Thus,our research aims to enhance brain tumor diagnosis in MRI by proposing an advanced model.The proposed model incorporates dilated convolutions to optimize the brain tumor segmentation and classification.The proposed model is first trained and later evaluated using the BraTS 2020 dataset.In our proposed model preprocessing consists of normalization,noise reduction,and data augmentation to improve model robustness.The attention mechanism and dilated convolutions were introduced to increase the model’s focus on critical regions and capture finer spatial details without compromising image resolution.We have performed experimentation to measure efficiency.For this,we have used various metrics including accuracy,sensitivity,and curve(AUC-ROC).The proposed model achieved a high accuracy of 94%,a sensitivity of 93%,a specificity of 92%,and an AUC-ROC of 0.98,outperforming traditional diagnostic models in brain tumor detection.The proposed model accurately identifies tumor regions,while dilated convolutions enhanced the segmentation accuracy,especially for complex tumor structures.The proposed model demonstrates significant potential for clinical application,providing reliable and precise brain tumor detection in MRI. 展开更多
关键词 Artificial intelligence MRI analysis deep learning dilated convolution DenseNet brain tumor detection brain tumor segmentation
在线阅读 下载PDF
An improved deep dilated convolutional neural network for seismic facies interpretation 被引量:1
3
作者 Na-Xia Yang Guo-Fa Li +2 位作者 Ting-Hui Li Dong-Feng Zhao Wei-Wei Gu 《Petroleum Science》 SCIE EI CAS CSCD 2024年第3期1569-1583,共15页
With the successful application and breakthrough of deep learning technology in image segmentation,there has been continuous development in the field of seismic facies interpretation using convolutional neural network... With the successful application and breakthrough of deep learning technology in image segmentation,there has been continuous development in the field of seismic facies interpretation using convolutional neural networks.These intelligent and automated methods significantly reduce manual labor,particularly in the laborious task of manually labeling seismic facies.However,the extensive demand for training data imposes limitations on their wider application.To overcome this challenge,we adopt the UNet architecture as the foundational network structure for seismic facies classification,which has demonstrated effective segmentation results even with small-sample training data.Additionally,we integrate spatial pyramid pooling and dilated convolution modules into the network architecture to enhance the perception of spatial information across a broader range.The seismic facies classification test on the public data from the F3 block verifies the superior performance of our proposed improved network structure in delineating seismic facies boundaries.Comparative analysis against the traditional UNet model reveals that our method achieves more accurate predictive classification results,as evidenced by various evaluation metrics for image segmentation.Obviously,the classification accuracy reaches an impressive 96%.Furthermore,the results of seismic facies classification in the seismic slice dimension provide further confirmation of the superior performance of our proposed method,which accurately defines the range of different seismic facies.This approach holds significant potential for analyzing geological patterns and extracting valuable depositional information. 展开更多
关键词 Seismic facies interpretation dilated convolution Spatial pyramid pooling Internal feature maps Compound loss function
原文传递
Magnetic Resonance Imaging Reconstruction Based on Butterfly Dilated Geometric Distillation
4
作者 DUO Lin XU Boyu +1 位作者 REN Yong YANG Xin 《Journal of Shanghai Jiaotong university(Science)》 2025年第3期590-599,共10页
In order to improve the reconstruction accuracy of magnetic resonance imaging(MRI),an accurate natural image compressed sensing(CS)reconstruction network is proposed,which combines the advantages of model-based and de... In order to improve the reconstruction accuracy of magnetic resonance imaging(MRI),an accurate natural image compressed sensing(CS)reconstruction network is proposed,which combines the advantages of model-based and deep learning-based CS-MRI methods.In theory,enhancing geometric texture details in linear reconstruction is possible.First,the optimization problem is decomposed into two problems:linear approximation and geometric compensation.Aimed at the problem of image linear approximation,the data consistency module is used to deal with it.Since the processing process will lose texture details,a neural network layer that explicitly combines image and frequency feature representation is proposed,which is named butterfly dilated geometric distillation network.The network introduces the idea of butterfly operation,skillfully integrates the features of image domain and frequency domain,and avoids the loss of texture details when extracting features in a single domain.Finally,a channel feature fusion module is designed by combining channel attention mechanism and dilated convolution.The attention of the channel makes the final output feature map focus on the more important part,thus improving the feature representation ability.The dilated convolution enlarges the receptive field,thereby obtaining more dense image feature data.The experimental results show that the peak signal-to-noise ratio of the network is 5.43 dB,5.24 dB and 3.89 dB higher than that of ISTA-Net+,FISTA and DGDN networks on the brain data set with a Cartesian sampling mask CS ratio of 10%. 展开更多
关键词 butterfly geometric distillation dilation convolution channel attention image reconstruction
原文传递
DcNet: Dilated Convolutional Neural Networks for Side-Scan Sonar Image Semantic Segmentation 被引量:2
5
作者 ZHAO Xiaohong QIN Rixia +3 位作者 ZHANG Qilei YU Fei WANG Qi HE Bo 《Journal of Ocean University of China》 SCIE CAS CSCD 2021年第5期1089-1096,共8页
In ocean explorations,side-scan sonar(SSS)plays a very important role and can quickly depict seabed topography.As-sembling the SSS to an autonomous underwater vehicle(AUV)and performing semantic segmentation of an SSS... In ocean explorations,side-scan sonar(SSS)plays a very important role and can quickly depict seabed topography.As-sembling the SSS to an autonomous underwater vehicle(AUV)and performing semantic segmentation of an SSS image in real time can realize online submarine geomorphology or target recognition,which is conducive to submarine detection.However,because of the complexity of the marine environment,various noises in the ocean pollute the sonar image,which also encounters the intensity inhomogeneity problem.In this paper,we propose a novel neural network architecture named dilated convolutional neural network(DcNet)that can run in real time while addressing the above-mentioned issues and providing accurate semantic segmentation.The proposed architecture presents an encoder-decoder network to gradually reduce the spatial dimension of the input image and recover the details of the target,respectively.The core of our network is a novel block connection named DCblock,which mainly uses dilated convolution and depthwise separable convolution between the encoder and decoder to attain more context while still retaining high accuracy.Furthermore,our proposed method performs a super-resolution reconstruction to enlarge the dataset with high-quality im-ages.We compared our network to other common semantic segmentation networks performed on an NVIDIA Jetson TX2 using our sonar image datasets.Experimental results show that while the inference speed of the proposed network significantly outperforms state-of-the-art architectures,the accuracy of our method is still comparable,which indicates its potential applications not only in AUVs equipped with SSS but also in marine exploration. 展开更多
关键词 side-scan sonar(SSS) semantic segmentation dilated convolutions SUPER-RESOLUTION
在线阅读 下载PDF
Long Text Classification Algorithm Using a Hybrid Model of Bidirectional Encoder Representation from Transformers-Hierarchical Attention Networks-Dilated Convolutions Network 被引量:1
6
作者 ZHAO Yuanyuan GAO Shining +1 位作者 LIU Yang GONG Xiaohui 《Journal of Donghua University(English Edition)》 CAS 2021年第4期341-350,共10页
Text format information is full of most of the resources of Internet,which puts forward higher and higher requirements for the accuracy of text classification.Therefore,in this manuscript,firstly,we design a hybrid mo... Text format information is full of most of the resources of Internet,which puts forward higher and higher requirements for the accuracy of text classification.Therefore,in this manuscript,firstly,we design a hybrid model of bidirectional encoder representation from transformers-hierarchical attention networks-dilated convolutions networks(BERT_HAN_DCN)which based on BERT pre-trained model with superior ability of extracting characteristic.The advantages of HAN model and DCN model are taken into account which can help gain abundant semantic information,fusing context semantic features and hierarchical characteristics.Secondly,the traditional softmax algorithm increases the learning difficulty of the same kind of samples,making it more difficult to distinguish similar features.Based on this,AM-softmax is introduced to replace the traditional softmax.Finally,the fused model is validated,which shows superior performance in the accuracy rate and F1-score of this hybrid model on two datasets and the experimental analysis shows the general single models such as HAN,DCN,based on BERT pre-trained model.Besides,the improved AM-softmax network model is superior to the general softmax network model. 展开更多
关键词 long text classification dilated convolution BERT fusing context semantic features hierarchical characteristics BERT_HAN_DCN AM-softmax
在线阅读 下载PDF
Multi⁃Scale Dilated Convolutional Neural Network for Hyperspectral Image Classification
7
作者 Shanshan Zheng Wen Liu +3 位作者 Rui Shan Jingyi Zhao Guoqian Jiang Zhi Zhang 《Journal of Harbin Institute of Technology(New Series)》 CAS 2021年第4期25-32,共8页
Aiming at the problem of image information loss,dilated convolution is introduced and a novel multi⁃scale dilated convolutional neural network(MDCNN)is proposed.Dilated convolution can polymerize image multi⁃scale inf... Aiming at the problem of image information loss,dilated convolution is introduced and a novel multi⁃scale dilated convolutional neural network(MDCNN)is proposed.Dilated convolution can polymerize image multi⁃scale information without reducing the resolution.The first layer of the network used spectral convolutional step to reduce dimensionality.Then the multi⁃scale aggregation extracted multi⁃scale features through applying dilated convolution and shortcut connection.The extracted features which represent properties of data were fed through Softmax to predict the samples.MDCNN achieved the overall accuracy of 99.58% and 99.92% on two public datasets,Indian Pines and Pavia University.Compared with four other existing models,the results illustrate that MDCNN can extract better discriminative features and achieve higher classification performance. 展开更多
关键词 multi⁃scale aggregation dilated convolution hyperspectral image classification(HSIC) shortcut connection
在线阅读 下载PDF
Convolution-Transformer for Image Feature Extraction 被引量:2
8
作者 Lirong Yin Lei Wang +10 位作者 Siyu Lu Ruiyang Wang Youshuai Yang Bo Yang Shan Liu Ahmed AlSanad Salman A.AlQahtani Zhengtong Yin Xiaolu Li Xiaobing Chen Wenfeng Zheng 《Computer Modeling in Engineering & Sciences》 SCIE EI 2024年第10期87-106,共20页
This study addresses the limitations of Transformer models in image feature extraction,particularly their lack of inductive bias for visual structures.Compared to Convolutional Neural Networks(CNNs),the Transformers a... This study addresses the limitations of Transformer models in image feature extraction,particularly their lack of inductive bias for visual structures.Compared to Convolutional Neural Networks(CNNs),the Transformers are more sensitive to different hyperparameters of optimizers,which leads to a lack of stability and slow convergence.To tackle these challenges,we propose the Convolution-based Efficient Transformer Image Feature Extraction Network(CEFormer)as an enhancement of the Transformer architecture.Our model incorporates E-Attention,depthwise separable convolution,and dilated convolution to introduce crucial inductive biases,such as translation invariance,locality,and scale invariance,into the Transformer framework.Additionally,we implement a lightweight convolution module to process the input images,resulting in faster convergence and improved stability.This results in an efficient convolution combined Transformer image feature extraction network.Experimental results on the ImageNet1k Top-1 dataset demonstrate that the proposed network achieves better accuracy while maintaining high computational speed.It achieves up to 85.0%accuracy across various model sizes on image classification,outperforming various baseline models.When integrated into the Mask Region-ConvolutionalNeuralNetwork(R-CNN)framework as a backbone network,CEFormer outperforms other models and achieves the highest mean Average Precision(mAP)scores.This research presents a significant advancement in Transformer-based image feature extraction,balancing performance and computational efficiency. 展开更多
关键词 TRANSFORMER E-Attention depth convolution dilated convolution CEFormer
在线阅读 下载PDF
A Lightweight Convolutional Neural Network with Hierarchical Multi-Scale Feature Fusion for Image Classification 被引量:2
9
作者 Adama Dembele Ronald Waweru Mwangi Ananda Omutokoh Kube 《Journal of Computer and Communications》 2024年第2期173-200,共28页
Convolutional neural networks (CNNs) are widely used in image classification tasks, but their increasing model size and computation make them challenging to implement on embedded systems with constrained hardware reso... Convolutional neural networks (CNNs) are widely used in image classification tasks, but their increasing model size and computation make them challenging to implement on embedded systems with constrained hardware resources. To address this issue, the MobileNetV1 network was developed, which employs depthwise convolution to reduce network complexity. MobileNetV1 employs a stride of 2 in several convolutional layers to decrease the spatial resolution of feature maps, thereby lowering computational costs. However, this stride setting can lead to a loss of spatial information, particularly affecting the detection and representation of smaller objects or finer details in images. To maintain the trade-off between complexity and model performance, a lightweight convolutional neural network with hierarchical multi-scale feature fusion based on the MobileNetV1 network is proposed. The network consists of two main subnetworks. The first subnetwork uses a depthwise dilated separable convolution (DDSC) layer to learn imaging features with fewer parameters, which results in a lightweight and computationally inexpensive network. Furthermore, depthwise dilated convolution in DDSC layer effectively expands the field of view of filters, allowing them to incorporate a larger context. The second subnetwork is a hierarchical multi-scale feature fusion (HMFF) module that uses parallel multi-resolution branches architecture to process the input feature map in order to extract the multi-scale feature information of the input image. Experimental results on the CIFAR-10, Malaria, and KvasirV1 datasets demonstrate that the proposed method is efficient, reducing the network parameters and computational cost by 65.02% and 39.78%, respectively, while maintaining the network performance compared to the MobileNetV1 baseline. 展开更多
关键词 MobileNet Image Classification Lightweight convolutional Neural Network Depthwise dilated Separable convolution Hierarchical Multi-Scale Feature Fusion
在线阅读 下载PDF
TSCND:Temporal Subsequence-Based Convolutional Network with Difference for Time Series Forecasting
10
作者 Haoran Huang Weiting Chen Zheming Fan 《Computers, Materials & Continua》 SCIE EI 2024年第3期3665-3681,共17页
Time series forecasting plays an important role in various fields, such as energy, finance, transport, and weather. Temporal convolutional networks (TCNs) based on dilated causal convolution have been widely used in t... Time series forecasting plays an important role in various fields, such as energy, finance, transport, and weather. Temporal convolutional networks (TCNs) based on dilated causal convolution have been widely used in time series forecasting. However, two problems weaken the performance of TCNs. One is that in dilated casual convolution, causal convolution leads to the receptive fields of outputs being concentrated in the earlier part of the input sequence, whereas the recent input information will be severely lost. The other is that the distribution shift problem in time series has not been adequately solved. To address the first problem, we propose a subsequence-based dilated convolution method (SDC). By using multiple convolutional filters to convolve elements of neighboring subsequences, the method extracts temporal features from a growing receptive field via a growing subsequence rather than a single element. Ultimately, the receptive field of each output element can cover the whole input sequence. To address the second problem, we propose a difference and compensation method (DCM). The method reduces the discrepancies between and within the input sequences by difference operations and then compensates the outputs for the information lost due to difference operations. Based on SDC and DCM, we further construct a temporal subsequence-based convolutional network with difference (TSCND) for time series forecasting. The experimental results show that TSCND can reduce prediction mean squared error by 7.3% and save runtime, compared with state-of-the-art models and vanilla TCN. 展开更多
关键词 DIFFERENCE data prediction time series temporal convolutional network dilated convolution
在线阅读 下载PDF
Advanced Face Mask Detection Model Using Hybrid Dilation Convolution Based Method 被引量:1
11
作者 Shaohan Wang Xiangyu Wang Xin Guo 《Journal of Software Engineering and Applications》 2023年第1期1-19,共19页
A face-mask object detection model incorporating hybrid dilation convolutional network termed ResNet Hybrid-dilation-convolution Face-mask-detector (RHF) is proposed in this paper. Furthermore, a lightweight face-mask... A face-mask object detection model incorporating hybrid dilation convolutional network termed ResNet Hybrid-dilation-convolution Face-mask-detector (RHF) is proposed in this paper. Furthermore, a lightweight face-mask dataset named Light Masked Face Dataset (LMFD) and a medium-sized face-mask dataset named Masked Face Dataset (MFD) with data augmentation methods applied is also constructed in this paper. The hybrid dilation convolutional network is able to expand the perception of the convolutional kernel without concern about the discontinuity of image information during the convolution process. For the given two datasets being constructed above, the trained models are significantly optimized in terms of detection performance, training time, and other related metrics. By using the MFD dataset of 55,905 images, the RHF model requires roughly 10 hours less training time compared to ResNet50 with better detection results with mAP of 93.45%. 展开更多
关键词 Face Mask Detection Object Detection Hybrid dilation convolution Computer Vision
在线阅读 下载PDF
1D-CNN:Speech Emotion Recognition System Using a Stacked Network with Dilated CNN Features 被引量:6
12
作者 Mustaqeem Soonil Kwon 《Computers, Materials & Continua》 SCIE EI 2021年第6期4039-4059,共21页
Emotion recognition from speech data is an active and emerging area of research that plays an important role in numerous applications,such as robotics,virtual reality,behavior assessments,and emergency call centers.Re... Emotion recognition from speech data is an active and emerging area of research that plays an important role in numerous applications,such as robotics,virtual reality,behavior assessments,and emergency call centers.Recently,researchers have developed many techniques in this field in order to ensure an improvement in the accuracy by utilizing several deep learning approaches,but the recognition rate is still not convincing.Our main aim is to develop a new technique that increases the recognition rate with reasonable cost computations.In this paper,we suggested a new technique,which is a one-dimensional dilated convolutional neural network(1D-DCNN)for speech emotion recognition(SER)that utilizes the hierarchical features learning blocks(HFLBs)with a bi-directional gated recurrent unit(BiGRU).We designed a one-dimensional CNN network to enhance the speech signals,which uses a spectral analysis,and to extract the hidden patterns from the speech signals that are fed into a stacked one-dimensional dilated network that are called HFLBs.Each HFLB contains one dilated convolution layer(DCL),one batch normalization(BN),and one leaky_relu(Relu)layer in order to extract the emotional features using a hieratical correlation strategy.Furthermore,the learned emotional features are feed into a BiGRU in order to adjust the global weights and to recognize the temporal cues.The final state of the deep BiGRU is passed from a softmax classifier in order to produce the probabilities of the emotions.The proposed model was evaluated over three benchmarked datasets that included the IEMOCAP,EMO-DB,and RAVDESS,which achieved 72.75%,91.14%,and 78.01%accuracy,respectively. 展开更多
关键词 Affective computing one-dimensional dilated convolutional neural network emotion recognition gated recurrent unit raw audio clips
在线阅读 下载PDF
Multi-Classification of Polyps in Colonoscopy Images Based on an Improved Deep Convolutional Neural Network 被引量:1
13
作者 Shuang Liu Xiao Liu +9 位作者 Shilong Chang Yufeng Sun Kaiyuan Li Ya Hou Shiwei Wang Jie Meng Qingliang Zhao Sibei Wu Kun Yang Linyan Xue 《Computers, Materials & Continua》 SCIE EI 2023年第6期5837-5852,共16页
Achieving accurate classification of colorectal polyps during colonoscopy can avoid unnecessary endoscopic biopsy or resection.This study aimed to develop a deep learning model that can automatically classify colorect... Achieving accurate classification of colorectal polyps during colonoscopy can avoid unnecessary endoscopic biopsy or resection.This study aimed to develop a deep learning model that can automatically classify colorectal polyps histologically on white-light and narrow-band imaging(NBI)colonoscopy images based on World Health Organization(WHO)and Workgroup serrAted polypS and Polyposis(WASP)classification criteria for colorectal polyps.White-light and NBI colonoscopy images of colorectal polyps exhibiting pathological results were firstly collected and classified into four categories:conventional adenoma,hyperplastic polyp,sessile serrated adenoma/polyp(SSAP)and normal,among which conventional adenoma could be further divided into three sub-categories of tubular adenoma,villous adenoma and villioustublar adenoma,subsequently the images were re-classified into six categories.In this paper,we proposed a novel convolutional neural network termed Polyp-DedNet for the four-and six-category classification tasks of colorectal polyps.Based on the existing classification network ResNet50,Polyp-DedNet adopted dilated convolution to retain more high-dimensional spatial information and an Efficient Channel Attention(ECA)module to improve the classification performance further.To eliminate gridding artifacts caused by dilated convolutions,traditional convolutional layers were used instead of the max pooling layer,and two convolutional layers with progressively decreasing dilation were added at the end of the network.Due to the inevitable imbalance of medical image data,a regularization method DropBlock and a Class-Balanced(CB)Loss were performed to prevent network overfitting.Furthermore,the 5-fold cross-validation was adopted to estimate the performance of Polyp-DedNet for the multi-classification task of colorectal polyps.Mean accuracies of the proposed Polyp-DedNet for the four-and six-category classifications of colorectal polyps were 89.91%±0.92%and 85.13%±1.10%,respectively.The metrics of precision,recall and F1-score were also improved by 1%∼2%compared to the baseline ResNet50.The proposed Polyp-DedNet presented state-of-the-art performance for colorectal polyp classifying on white-light and NBI colonoscopy images,highlighting its considerable potential as an AI-assistant system for accurate colorectal polyp diagnosis in colonoscopy. 展开更多
关键词 Colorectal polyps four-and six-category classifications convolutional neural network dilated residual network
在线阅读 下载PDF
基于空洞卷积U-Net的遥感影像道路提取方法 被引量:2
14
作者 林娜 张小青 +2 位作者 王岚 冯丽蓉 王伟 《测绘地理信息》 2025年第3期63-67,共5页
针对从遥感影像上提取道路出现的细节特征丢失、提取结果模糊的问题,本文提出了一种基于空洞卷积U-Net的遥感影像道路提取算法。首先,以U-Net为基础网络,将低层细节特征与高层语义特征进行多特征融合,更好地还原道路目标细节;其次,为了... 针对从遥感影像上提取道路出现的细节特征丢失、提取结果模糊的问题,本文提出了一种基于空洞卷积U-Net的遥感影像道路提取算法。首先,以U-Net为基础网络,将低层细节特征与高层语义特征进行多特征融合,更好地还原道路目标细节;其次,为了进一步提高网络对道路细节特征的识别能力,在U-Net中引入空洞卷积模块,学习更多语义信息来改善提取结果的模糊问题;最后,基于Massachusetts Roads数据集进行实验。结果表明,本文方法召回率、精度和F1得分分别达到82.5%、86.7%、84.5%。与基础的UNet相比,本文算法在解决细节特征丢失和提取结果模糊问题方面更具有应用价值。 展开更多
关键词 遥感影像 U-Net 道路提取 空洞卷积 深度学习
原文传递
融合注意机制的多尺度自适应空洞卷积面部情感识别方法 被引量:1
15
作者 王春影 孟天宇 +2 位作者 张震 葛雄心 杨继伟 《重庆理工大学学报(自然科学)》 北大核心 2025年第5期90-97,共8页
针对面部不连续动作单元的关联特征提取困难,以及不同面部区域对表情识别影响程度不一可能引入无用信息的问题,提出了一种基于双分支注意力机制的多尺度自适应空洞卷积模型(dual branching attention mechanism-adaptive multi-scale di... 针对面部不连续动作单元的关联特征提取困难,以及不同面部区域对表情识别影响程度不一可能引入无用信息的问题,提出了一种基于双分支注意力机制的多尺度自适应空洞卷积模型(dual branching attention mechanism-adaptive multi-scale dilated convolution,DAM-ADCNN)。模型通过双分支注意力机制生成特征映射,表征面部动作单元的局部和全局分布及关联关系;利用多尺度空洞卷积提取面部不连续动作单元的关键特征;采用自适应方式动态调整不同尺度关联特征的权重,以有效减少无用信息的干扰。结果表明,DAM-ADCNN模型在情感识别任务中的表现优于现有方法。在DEAP数据集的唤醒和效价维度上,模型的识别准确率分别提升了3.66%和3.99%。同时,在CK+数据集上,模型的识别准确率提高了3.93%。这些结果证明了DAM-ADCNN模型在面部表情情感识别中的有效性。 展开更多
关键词 面部情感识别 双分支注意力机制 空洞卷积 自适应权重
在线阅读 下载PDF
单目RGB穿衣人体的手部精细化重建
16
作者 张冀 任志鹏 +3 位作者 张荣华 苑朝 翟永杰 余正秦 《计算机应用研究》 北大核心 2025年第1期300-306,共7页
为解决单目穿衣人体在复杂姿态下手部形状重建存在遮挡和缺失的失真问题,提出了一种结合ECON与MANO手部模型,实现高效穿衣人体的手部精细化重建方法H-ECON(hand-focused explicit clothed humans obtained from normals)。具体而言,该... 为解决单目穿衣人体在复杂姿态下手部形状重建存在遮挡和缺失的失真问题,提出了一种结合ECON与MANO手部模型,实现高效穿衣人体的手部精细化重建方法H-ECON(hand-focused explicit clothed humans obtained from normals)。具体而言,该方法首先以类型无关的手部检测器聚焦手部区域并进行翻转和裁剪;然后,引入注意力机制用于增强对手部区域的感知能力,空洞螺旋卷积则更好地捕捉手部不同尺度的特征;最后,独特的融合模块确保了手部重建与整身模型的融合效果。在FreiHAND和HanCo公开数据集上与其他方法的定量定性对比结果表明了H-ECON的有效性,其独立手部模块明显优于ECON中的替代手部模块。H-ECON实现了对人体手部几何和姿态变化的精确描述,进一步缩小了2D图像生成到3D人体网格之间的差距。 展开更多
关键词 手部重建 穿衣人体 注意力机制 空洞螺旋卷积 深度几何学习
在线阅读 下载PDF
基于改进DeepLabv3+的安全帽佩戴分割算法
17
作者 邵晓艳 董文永 +2 位作者 赵雪专 李玲玲 薄树奎 《西南大学学报(自然科学版)》 北大核心 2025年第7期185-195,共11页
针对物流园区空间跨度大、作业设备繁多导致安全帽佩戴检测分割难度增加的问题,提出一种基于改进DeepLabv3+的安全帽佩戴分割算法。该算法采用ResNet-101膨胀残差网络进行特征提取;在编码阶段引入卷积注意力机制融合模块,有效增强特征... 针对物流园区空间跨度大、作业设备繁多导致安全帽佩戴检测分割难度增加的问题,提出一种基于改进DeepLabv3+的安全帽佩戴分割算法。该算法采用ResNet-101膨胀残差网络进行特征提取;在编码阶段引入卷积注意力机制融合模块,有效增强特征区域表征能力;在特征提取阶段引入图像特征网格化模块,将低分辨率图像进行平均切分,有助于获得局部图像的小目标特征。将该算法在SHWD(Safety Helmet Wearing Detect)数据集中训练测试,结果表明:算法的像素准确率达到89.23%,相比DeepLabv3+提升了2.21个百分点,有效提高了复杂场景下物流园区安全帽佩戴分割精度。 展开更多
关键词 神经网络 注意力机制 膨胀卷积 语义分割
原文传递
基于混合注意力的遥感图像超分辨率重建 被引量:1
18
作者 姚善化 潘品杨 王仲根 《安徽理工大学学报(自然科学版)》 2025年第1期64-73,98,共11页
目的为改善遥感图像局部区域模糊、部分细节信息重建丢失等问题。方法提出一种基于空洞卷积和混合注意力的遥感图像超分辨率重建算法。首先经过浅层特征提取模块得到浅层特征图,再利用卷积与空洞卷积以及非线性激活块相结合,扩大了整体... 目的为改善遥感图像局部区域模糊、部分细节信息重建丢失等问题。方法提出一种基于空洞卷积和混合注意力的遥感图像超分辨率重建算法。首先经过浅层特征提取模块得到浅层特征图,再利用卷积与空洞卷积以及非线性激活块相结合,扩大了整体感受野,提升了训练过程的稳定性,从而增强深层特征表达能力;其次,使用级联的空间注意力与通道注意力模块来改善高频信息缺失问题;最后,对所提取的特征进行上采样和重建获得高分辨率图像。结果在NWPU RESISC45和UCMerced-LandUse数据集上,仿真结果分析表明,该算法的峰值信噪比与结构相似性两项评价指标均优于所对比算法,在主观视觉效果上,重建图像也更能突出纹理细节信息。结论所提算法拥有更好的重建效果,提升了遥感图像的质量和可用性。 展开更多
关键词 超分辨率重建 遥感图像 空洞卷积 注意力机制 深度学习
在线阅读 下载PDF
基于SDFSN-HiFuse网络的减速器工件分类
19
作者 于智龙 张雪寒 +3 位作者 齐丽华 杨佳欣 于广滨 李忠刚 《光学精密工程》 北大核心 2025年第19期3093-3105,共13页
减速器相似工件的准确分类对于其精密装配至关重要。现有视觉分类方法在面对高度相似的工件时存在特征判别性不足、抗复杂背景干扰能力弱等问题,性能表现不佳,在精密装配中容易引入误差。针对减速器工件类内差异大、类间差异小的特点,... 减速器相似工件的准确分类对于其精密装配至关重要。现有视觉分类方法在面对高度相似的工件时存在特征判别性不足、抗复杂背景干扰能力弱等问题,性能表现不佳,在精密装配中容易引入误差。针对减速器工件类内差异大、类间差异小的特点,提出一种基于HiFuse的空域双焦协同网络(Spatial Dual-Focus Synergy Network,SDFSN)减速器工件分类方法。设计多分支空间自适应的膨胀率选择机制,使模型对形变区域自动选择最合适的感受野。构思双阶段几何-局部协同注意力机制,对每个膨胀分支的输出特征施加逐级精细的注意力引导,动态调整特征权重,有效增强模型对重要区域的判别能力,实现由粗到细的特征提取。引入可变形几何图,实现与几何拓扑适配的图结构,突破传统固定网格限制,在可变形卷积后引入曲率门控机制,继承几何形变的适应性特征,显著提升对复杂曲面区域的响应能力与表达精度。实验结果表明,SDFSN-HiFuse在自制数据集上的准确率比基线提高3.57%,精确度提高2.99%,而且满足工件分类的实时性要求,FPS达到300.39 frame/ms。 展开更多
关键词 减速器工件分类 深度学习 注意力机制 多尺度膨胀卷积
在线阅读 下载PDF
基于金字塔卷积和像素注意力的分割方法
20
作者 阴桂梅 肖易勇 +4 位作者 席鑫华 赵艳丽 谭淑平 强彦 罗士朝 《计算机应用与软件》 北大核心 2025年第6期241-248,289,共9页
针对医学图像分割任务中存在的分割目标大小变化跨度大且结构复杂,以及神经网络对目标边缘细节学习效果差这两个问题,在U-Net网络的基础上构造了金字塔空洞卷积与像素级注意力网络(DP-Net)。设计金字塔空洞卷积模块并替换传统的卷积操作... 针对医学图像分割任务中存在的分割目标大小变化跨度大且结构复杂,以及神经网络对目标边缘细节学习效果差这两个问题,在U-Net网络的基础上构造了金字塔空洞卷积与像素级注意力网络(DP-Net)。设计金字塔空洞卷积模块并替换传统的卷积操作,通过多种空洞卷积的组合扩展了网络感受野并编码得到全局上下文信息;提出像素级注意力模块,在通道注意力机制的基础上进一步编码像素间的依赖关系使网络能从不同通道的特征中学习到更丰富的局部上下文信息。通过在肺部公开数据集LIDC和私人肝肿瘤数据集上进行实验评估,所提出的DP-Net在三种评估指标上都获得优于当前方法的性能,证明所提出网络改进在分割精度方面的有效性。 展开更多
关键词 深度学习 医学图像处理 图像分割 注意力机制 空洞卷积
在线阅读 下载PDF
上一页 1 2 47 下一页 到第
使用帮助 返回顶部