期刊文献+
共找到1,243篇文章
< 1 2 63 >
每页显示 20 50 100
An RMD-YOLOv11 Approach for Typical Defect Detection of PV Modules
1
作者 Tao Geng Shuaibing Li +3 位作者 Yunyun Yun Yongqiang Kang Hongwei Li unmin Zhu 《Computers, Materials & Continua》 2026年第3期1804-1822,共19页
In order to address the challenges posed by complex background interference,high miss-detection rates of micro-scale defects,and limited model deployment efficiency in photovoltaic(PV)module defect detection,this pape... In order to address the challenges posed by complex background interference,high miss-detection rates of micro-scale defects,and limited model deployment efficiency in photovoltaic(PV)module defect detection,this paper proposes an efficient detection framework based on an improved YOLOv11 architecture.First,a Re-parameterized Convolution(RepConv)module is integrated into the backbone to enhance the model’s sensitivity to fine-grained defects—such as micro-cracks and hot spots—while maintaining high inference efficiency.Second,a Multi-Scale Feature Fusion Convolutional Block Attention Mechanism(MSFF-CBAM)is designed to guide the network toward critical defect regions by jointly modeling channel-wise and spatial attention.This mechanism effectively strengthens the specificity and robustness of feature representations.Third,a lightweight Dynamic Sampling Module(DySample)is employed to replace conventional upsampling operations,thereby improving the localization accuracy of small-scale defect targets.Experimental evaluations conducted on the PVEL-AD dataset demonstrate that the proposed RMDYOLOv11 model surpasses the baseline YOLOv11 in terms of mean Average Precision(mAP)@0.5,Precision,and Recall,achieving respective improvements of 4.70%,1.51%,and 5.50%.The model also exhibits notable advantages in inference speed and model compactness.Further validation on the ELPV dataset confirms the model’s generalization capability,showing respective performance gains of 1.99%,2.28%,and 1.45%across the same metrics.Overall,the enhanced model significantly improves the accuracy of micro-defect identification on PV module surfaces,effectively reducing both false negatives and false positives.This advancement provides a robust and reliable technical foundation for automated PV module defect detection. 展开更多
关键词 Photovoltaic(PV)modules YOLOv11 re-parameterization convolution attention mechanism dynamic upsampling
在线阅读 下载PDF
Enhanced Cutaneous Melanoma Segmentation in Dermoscopic Images Using a Dual U-Net Framework with Multi-Path Convolution Block Attention Module and SE-Res-Conv
2
作者 Kun Lan Feiyang Gao +2 位作者 Xiaoliang Jiang Jianzhen Cheng Simon Fong 《Computers, Materials & Continua》 2025年第9期4805-4824,共20页
With the continuous development of artificial intelligence and machine learning techniques,there have been effective methods supporting the work of dermatologist in the field of skin cancer detection.However,object si... With the continuous development of artificial intelligence and machine learning techniques,there have been effective methods supporting the work of dermatologist in the field of skin cancer detection.However,object significant challenges have been presented in accurately segmenting melanomas in dermoscopic images due to the objects that could interfere human observations,such as bubbles and scales.To address these challenges,we propose a dual U-Net network framework for skin melanoma segmentation.In our proposed architecture,we introduce several innovative components that aim to enhance the performance and capabilities of the traditional U-Net.First,we establish a novel framework that links two simplified U-Nets,enabling more comprehensive information exchange and feature integration throughout the network.Second,after cascading the second U-Net,we introduce a skip connection between the decoder and encoder networks,and incorporate a modified receptive field block(MRFB),which is designed to capture multi-scale spatial information.Third,to further enhance the feature representation capabilities,we add a multi-path convolution block attention module(MCBAM)to the first two layers of the first U-Net encoding,and integrate a new squeeze-and-excitation(SE)mechanism with residual connections in the second U-Net.To illustrate the performance of our proposed model,we conducted comprehensive experiments on widely recognized skin datasets.On the ISIC-2017 dataset,the IoU value of our proposed model increased from 0.6406 to 0.6819 and the Dice coefficient increased from 0.7625 to 0.8023.On the ISIC-2018 dataset,the IoU value of proposed model also improved from 0.7138 to 0.7709,while the Dice coefficient increased from 0.8285 to 0.8665.Furthermore,the generalization experiments conducted on the jaw cyst dataset from Quzhou People’s Hospital further verified the outstanding segmentation performance of the proposed model.These findings collectively affirm the potential of our approach as a valuable tool in supporting clinical decision-making in the field of skin cancer detection,as well as advancing research in medical image analysis. 展开更多
关键词 Dual U-Net skin lesion segmentation squeeze-and-excitation modified receptive field block multi-path convolution block attention module
在线阅读 下载PDF
ANC: Attention Network for COVID-19 Explainable Diagnosis Based on Convolutional Block Attention Module 被引量:10
3
作者 Yudong Zhang Xin Zhang Weiguo Zhu 《Computer Modeling in Engineering & Sciences》 SCIE EI 2021年第6期1037-1058,共22页
Aim: To diagnose COVID-19 more efficiently and more correctly, this study proposed a novel attention network forCOVID-19 (ANC). Methods: Two datasets were used in this study. An 18-way data augmentation was proposed t... Aim: To diagnose COVID-19 more efficiently and more correctly, this study proposed a novel attention network forCOVID-19 (ANC). Methods: Two datasets were used in this study. An 18-way data augmentation was proposed toavoid overfitting. Then, convolutional block attention module (CBAM) was integrated to our model, the structureof which is fine-tuned. Finally, Grad-CAM was used to provide an explainable diagnosis. Results: The accuracyof our ANC methods on two datasets are 96.32% ± 1.06%, and 96.00% ± 1.03%, respectively. Conclusions: Thisproposed ANC method is superior to 9 state-of-the-art approaches. 展开更多
关键词 Deep learning convolutional block attention module attention mechanism COVID-19 explainable diagnosis
在线阅读 下载PDF
MobileNet network optimization based on convolutional block attention module 被引量:3
4
作者 ZHAO Shuxu MEN Shiyao YUAN Lin 《Journal of Measurement Science and Instrumentation》 CAS CSCD 2022年第2期225-234,共10页
Deep learning technology is widely used in computer vision.Generally,a large amount of data is used to train the model weights in deep learning,so as to obtain a model with higher accuracy.However,massive data and com... Deep learning technology is widely used in computer vision.Generally,a large amount of data is used to train the model weights in deep learning,so as to obtain a model with higher accuracy.However,massive data and complex model structures require more calculating resources.Since people generally can only carry and use mobile and portable devices in application scenarios,neural networks have limitations in terms of calculating resources,size and power consumption.Therefore,the efficient lightweight model MobileNet is used as the basic network in this study for optimization.First,the accuracy of the MobileNet model is improved by adding methods such as the convolutional block attention module(CBAM)and expansion convolution.Then,the MobileNet model is compressed by using pruning and weight quantization algorithms based on weight size.Afterwards,methods such as Python crawlers and data augmentation are employed to create a garbage classification data set.Based on the above model optimization strategy,the garbage classification mobile terminal application is deployed on mobile phones and raspberry pies,realizing completing the garbage classification task more conveniently. 展开更多
关键词 MobileNet convolutional block attention module(CBAM) model pruning and quantization edge machine learning
在线阅读 下载PDF
Traffic Sign Recognition for Autonomous Vehicle Using Optimized YOLOv7 and Convolutional Block Attention Module 被引量:2
5
作者 P.Kuppusamy M.Sanjay +1 位作者 P.V.Deepashree C.Iwendi 《Computers, Materials & Continua》 SCIE EI 2023年第10期445-466,共22页
The infrastructure and construction of roads are crucial for the economic and social development of a region,but traffic-related challenges like accidents and congestion persist.Artificial Intelligence(AI)and Machine ... The infrastructure and construction of roads are crucial for the economic and social development of a region,but traffic-related challenges like accidents and congestion persist.Artificial Intelligence(AI)and Machine Learning(ML)have been used in road infrastructure and construction,particularly with the Internet of Things(IoT)devices.Object detection in Computer Vision also plays a key role in improving road infrastructure and addressing trafficrelated problems.This study aims to use You Only Look Once version 7(YOLOv7),Convolutional Block Attention Module(CBAM),the most optimized object-detection algorithm,to detect and identify traffic signs,and analyze effective combinations of adaptive optimizers like Adaptive Moment estimation(Adam),Root Mean Squared Propagation(RMSprop)and Stochastic Gradient Descent(SGD)with the YOLOv7.Using a portion of German traffic signs for training,the study investigates the feasibility of adopting smaller datasets while maintaining high accuracy.The model proposed in this study not only improves traffic safety by detecting traffic signs but also has the potential to contribute to the rapid development of autonomous vehicle systems.The study results showed an impressive accuracy of 99.7%when using a batch size of 8 and the Adam optimizer.This high level of accuracy demonstrates the effectiveness of the proposed model for the image classification task of traffic sign recognition. 展开更多
关键词 Object detection traffic sign detection YOLOv7 convolutional block attention module road sign detection ADAM
在线阅读 下载PDF
On fine-grained visual explanation in convolutional neural networks
6
作者 Xia Lei Yongkai Fan Xiong-Lin Luo 《Digital Communications and Networks》 SCIE CSCD 2023年第5期1141-1147,共7页
Existing explanation methods for Convolutional Neural Networks(CNNs)lack the pixel-level visualization explanations to generate the reliable fine-grained decision features.Since there are inconsistencies between the e... Existing explanation methods for Convolutional Neural Networks(CNNs)lack the pixel-level visualization explanations to generate the reliable fine-grained decision features.Since there are inconsistencies between the explanation and the actual behavior of the model to be interpreted,we propose a Fine-Grained Visual Explanation for CNN,namely F-GVE,which produces a fine-grained explanation with higher consistency to the decision of the original model.The exact backward class-specific gradients with respect to the input image is obtained to highlight the object-related pixels the model used to make prediction.In addition,for better visualization and less noise,F-GVE selects an appropriate threshold to filter the gradient during the calculation and the explanation map is obtained by element-wise multiplying the gradient and the input image to show fine-grained classification decision features.Experimental results demonstrate that F-GVE has good visual performances and highlights the importance of fine-grained decision features.Moreover,the faithfulness of the explanation in this paper is high and it is effective and practical on troubleshooting and debugging detection. 展开更多
关键词 convolutional neural network EXPLANATION Class-specific gradient fine-grained
在线阅读 下载PDF
A New Childhood Pneumonia Diagnosis Method Based on Fine-Grained Convolutional Neural Network
7
作者 Yang Zhang Liru Qiu +2 位作者 Yongkai Zhu Long Wen Xiaoping Luo 《Computer Modeling in Engineering & Sciences》 SCIE EI 2022年第12期873-894,共22页
Pneumonia is part of the main diseases causing the death of children.It is generally diagnosed through chest Xray images.With the development of Deep Learning(DL),the diagnosis of pneumonia based on DL has received ex... Pneumonia is part of the main diseases causing the death of children.It is generally diagnosed through chest Xray images.With the development of Deep Learning(DL),the diagnosis of pneumonia based on DL has received extensive attention.However,due to the small difference between pneumonia and normal images,the performance of DL methods could be improved.This research proposes a new fine-grained Convolutional Neural Network(CNN)for children’s pneumonia diagnosis(FG-CPD).Firstly,the fine-grainedCNNclassificationwhich can handle the slight difference in images is investigated.To obtain the raw images from the real-world chest X-ray data,the YOLOv4 algorithm is trained to detect and position the chest part in the raw images.Secondly,a novel attention network is proposed,named SGNet,which integrates the spatial information and channel information of the images to locate the discriminative parts in the chest image for expanding the difference between pneumonia and normal images.Thirdly,the automatic data augmentation method is adopted to increase the diversity of the images and avoid the overfitting of FG-CPD.The FG-CPD has been tested on the public Chest X-ray 2017 dataset,and the results show that it has achieved great effect.Then,the FG-CPD is tested on the real chest X-ray images from children aged 3–12 years ago from Tongji Hospital.The results show that FG-CPD has achieved up to 96.91%accuracy,which can validate the potential of the FG-CPD. 展开更多
关键词 Childhood pneumonia diagnosis fine-grained classification YOLOv4 attention network convolutional Neural Network(CNN)
在线阅读 下载PDF
A Convolutional and Transformer Based Deep Neural Network for Automatic Modulation Classification 被引量:5
8
作者 Shanchuan Ying Sai Huang +3 位作者 Shuo Chang Zheng Yang Zhiyong Feng Ningyan Guo 《China Communications》 SCIE CSCD 2023年第5期135-147,共13页
Automatic modulation classification(AMC)aims at identifying the modulation of the received signals,which is a significant approach to identifying the target in military and civil applications.In this paper,a novel dat... Automatic modulation classification(AMC)aims at identifying the modulation of the received signals,which is a significant approach to identifying the target in military and civil applications.In this paper,a novel data-driven framework named convolutional and transformer-based deep neural network(CTDNN)is proposed to improve the classification performance.CTDNN can be divided into four modules,i.e.,convolutional neural network(CNN)backbone,transition module,transformer module,and final classifier.In the CNN backbone,a wide and deep convolution structure is designed,which consists of 1×15 convolution kernels and intensive cross-layer connections instead of traditional 1×3 kernels and sequential connections.In the transition module,a 1×1 convolution layer is utilized to compress the channels of the previous multi-scale CNN features.In the transformer module,three self-attention layers are designed for extracting global features and generating the classification vector.In the classifier,the final decision is made based on the maximum a posterior probability.Extensive simulations are conducted,and the result shows that our proposed CTDNN can achieve superior classification performance than traditional deep models. 展开更多
关键词 automatic modulation classification deep neural network convolutional neural network TRANSFORMER
在线阅读 下载PDF
Text-Image Feature Fine-Grained Learning for Joint Multimodal Aspect-Based Sentiment Analysis
9
作者 Tianzhi Zhang Gang Zhou +4 位作者 Shuang Zhang Shunhang Li Yepeng Sun Qiankun Pi Shuo Liu 《Computers, Materials & Continua》 SCIE EI 2025年第1期279-305,共27页
Joint Multimodal Aspect-based Sentiment Analysis(JMASA)is a significant task in the research of multimodal fine-grained sentiment analysis,which combines two subtasks:Multimodal Aspect Term Extraction(MATE)and Multimo... Joint Multimodal Aspect-based Sentiment Analysis(JMASA)is a significant task in the research of multimodal fine-grained sentiment analysis,which combines two subtasks:Multimodal Aspect Term Extraction(MATE)and Multimodal Aspect-oriented Sentiment Classification(MASC).Currently,most existing models for JMASA only perform text and image feature encoding from a basic level,but often neglect the in-depth analysis of unimodal intrinsic features,which may lead to the low accuracy of aspect term extraction and the poor ability of sentiment prediction due to the insufficient learning of intra-modal features.Given this problem,we propose a Text-Image Feature Fine-grained Learning(TIFFL)model for JMASA.First,we construct an enhanced adjacency matrix of word dependencies and adopt graph convolutional network to learn the syntactic structure features for text,which addresses the context interference problem of identifying different aspect terms.Then,the adjective-noun pairs extracted from image are introduced to enable the semantic representation of visual features more intuitive,which addresses the ambiguous semantic extraction problem during image feature learning.Thereby,the model performance of aspect term extraction and sentiment polarity prediction can be further optimized and enhanced.Experiments on two Twitter benchmark datasets demonstrate that TIFFL achieves competitive results for JMASA,MATE and MASC,thus validating the effectiveness of our proposed methods. 展开更多
关键词 Multimodal sentiment analysis aspect-based sentiment analysis feature fine-grained learning graph convolutional network adjective-noun pairs
在线阅读 下载PDF
AG-GCN: Vehicle Re-Identification Based on Attention-Guided Graph Convolutional Network
10
作者 Ya-Jie Sun Li-Wei Qiao Sai Ji 《Computers, Materials & Continua》 2025年第7期1769-1785,共17页
Vehicle re-identification involves matching images of vehicles across varying camera views.The diversity of camera locations along different roadways leads to significant intra-class variation and only minimal inter-c... Vehicle re-identification involves matching images of vehicles across varying camera views.The diversity of camera locations along different roadways leads to significant intra-class variation and only minimal inter-class similarity in the collected vehicle images,which increases the complexity of re-identification tasks.To tackle these challenges,this study proposes AG-GCN(Attention-Guided Graph Convolutional Network),a novel framework integrating several pivotal components.Initially,AG-GCN embeds a lightweight attention module within the ResNet-50 structure to learn feature weights automatically,thereby improving the representation of vehicle features globally by highlighting salient features and suppressing extraneous ones.Moreover,AG-GCN adopts a graph-based structure to encapsulate deep local features.A graph convolutional network then amalgamates these features to understand the relationships among vehicle-related characteristics.Subsequently,we amalgamate feature maps from both the attention and graph-based branches for a more comprehensive representation of vehicle features.The framework then gauges feature similarities and ranks them,thus enhancing the accuracy of vehicle re-identification.Comprehensive qualitative and quantitative analyses on two publicly available datasets verify the efficacy of AG-GCN in addressing intra-class and inter-class variability issues. 展开更多
关键词 Vehicle re-identification a lightweight attention module global features local features graph convolution network
在线阅读 下载PDF
Yetter-Drinfel'd Module and Convolution Module
11
作者 张良云 王栓宏 《Northeastern Mathematical Journal》 CSCD 2002年第1期13-18,共6页
In this paper, we first give a sufficient and necessary condition for a Hopf algebra to be a Yetter-Drinfel'd module, and prove that the finite dual of a Yetter-Drinfel'd module is still a Yetter-Drinfel'd... In this paper, we first give a sufficient and necessary condition for a Hopf algebra to be a Yetter-Drinfel'd module, and prove that the finite dual of a Yetter-Drinfel'd module is still a Yetter-Drinfel'd module. Finally, we introduce a concept of convolution module. 展开更多
关键词 braided Hopf algebra convolution algebra convolution module Yetter-Drinfel'd module
在线阅读 下载PDF
A Survey on Deep Learning-based Fine-grained Object Classification and Semantic Segmentation 被引量:47
12
作者 Bo Zhao Jiashi Feng +1 位作者 Xiao Wu Shuicheng Yan 《International Journal of Automation and computing》 EI CSCD 2017年第2期119-135,共17页
The deep learning technology has shown impressive performance in various vision tasks such as image classification, object detection and semantic segmentation. In particular, recent advances of deep learning technique... The deep learning technology has shown impressive performance in various vision tasks such as image classification, object detection and semantic segmentation. In particular, recent advances of deep learning techniques bring encouraging performance to fine-grained image classification which aims to distinguish subordinate-level categories, such as bird species or dog breeds. This task is extremely challenging due to high intra-class and low inter-class variance. In this paper, we review four types of deep learning based fine-grained image classification approaches, including the general convolutional neural networks (CNNs), part detection based, ensemble of networks based and visual attention based fine-grained image classification approaches. Besides, the deep learning based semantic segmentation approaches are also covered in this paper. The region proposal based and fully convolutional networks based approaches for semantic segmentation are introduced respectively. 展开更多
关键词 Deep learning fine-grained image classification semantic segmentation convolutional neural network (CNN) recurrentneural network (RNN)
原文传递
Fine-Grained Action Recognition Based on Temporal Pyramid Excitation Network 被引量:1
13
作者 Xuan Zhou Jianping Yi 《Intelligent Automation & Soft Computing》 SCIE 2023年第8期2103-2116,共14页
Mining more discriminative temporal features to enrich temporal context representation is considered the key to fine-grained action recog-nition.Previous action recognition methods utilize a fixed spatiotemporal windo... Mining more discriminative temporal features to enrich temporal context representation is considered the key to fine-grained action recog-nition.Previous action recognition methods utilize a fixed spatiotemporal window to learn local video representation.However,these methods failed to capture complex motion patterns due to their limited receptive field.To solve the above problems,this paper proposes a lightweight Temporal Pyramid Excitation(TPE)module to capture the short,medium,and long-term temporal context.In this method,Temporal Pyramid(TP)module can effectively expand the temporal receptive field of the network by using the multi-temporal kernel decomposition without significantly increasing the computational cost.In addition,the Multi Excitation module can emphasize temporal importance to enhance the temporal feature representation learning.TPE can be integrated into ResNet50,and building a compact video learning framework-TPENet.Extensive validation experiments on several challenging benchmark(Something-Something V1,Something-Something V2,UCF-101,and HMDB51)datasets demonstrate that our method achieves a preferable balance between computation and accuracy. 展开更多
关键词 fine-grained action recognition temporal pyramid excitation module temporal receptive multi-excitation module
在线阅读 下载PDF
Learning Noise-Assisted Robust Image Features for Fine-Grained Image Retrieval 被引量:1
14
作者 Vidit Kumar Hemant Petwal +1 位作者 Ajay Krishan Gairola Pareshwar Prasad Barmola 《Computer Systems Science & Engineering》 SCIE EI 2023年第9期2711-2724,共14页
Fine-grained image search is one of the most challenging tasks in computer vision that aims to retrieve similar images at the fine-grained level for a given query image.The key objective is to learn discriminative fin... Fine-grained image search is one of the most challenging tasks in computer vision that aims to retrieve similar images at the fine-grained level for a given query image.The key objective is to learn discriminative fine-grained features by training deep models such that similar images are clustered,and dissimilar images are separated in the low embedding space.Previous works primarily focused on defining local structure loss functions like triplet loss,pairwise loss,etc.However,training via these approaches takes a long training time,and they have poor accuracy.Additionally,representations learned through it tend to tighten up in the embedded space and lose generalizability to unseen classes.This paper proposes a noise-assisted representation learning method for fine-grained image retrieval to mitigate these issues.In the proposed work,class manifold learning is performed in which positive pairs are created with noise insertion operation instead of tightening class clusters.And other instances are treated as negatives within the same cluster.Then a loss function is defined to penalize when the distance between instances of the same class becomes too small relative to the noise pair in that class in embedded space.The proposed approach is validated on CARS-196 and CUB-200 datasets and achieved better retrieval results(85.38%recall@1 for CARS-196%and 70.13%recall@1 for CUB-200)compared to other existing methods. 展开更多
关键词 convolutional network zero-shot learning fine-grained image retrieval image representation image retrieval intra-class diversity feature learning
在线阅读 下载PDF
A survey of fine-grained visual categorization based on deep learning
15
作者 XIE Yuxiang GONG Quanzhi +2 位作者 LUAN Xidao YAN Jie ZHANG Jiahui 《Journal of Systems Engineering and Electronics》 CSCD 2024年第6期1337-1356,共20页
Deep learning has achieved excellent results in various tasks in the field of computer vision,especially in fine-grained visual categorization.It aims to distinguish the subordinate categories of the label-level categ... Deep learning has achieved excellent results in various tasks in the field of computer vision,especially in fine-grained visual categorization.It aims to distinguish the subordinate categories of the label-level categories.Due to high intra-class variances and high inter-class similarity,the fine-grained visual categorization is extremely challenging.This paper first briefly introduces and analyzes the related public datasets.After that,some of the latest methods are reviewed.Based on the feature types,the feature processing methods,and the overall structure used in the model,we divide them into three types of methods:methods based on general convolutional neural network(CNN)and strong supervision of parts,methods based on single feature processing,and meth-ods based on multiple feature processing.Most methods of the first type have a relatively simple structure,which is the result of the initial research.The methods of the other two types include models that have special structures and training processes,which are helpful to obtain discriminative features.We conduct a specific analysis on several methods with high accuracy on pub-lic datasets.In addition,we support that the focus of the future research is to solve the demand of existing methods for the large amount of the data and the computing power.In terms of tech-nology,the extraction of the subtle feature information with the burgeoning vision transformer(ViT)network is also an important research direction. 展开更多
关键词 deep learning fine-grained visual categorization convolutional neural network(CNN) visual attention
在线阅读 下载PDF
Attention-Augmented YOLOv8 with Ghost Convolution for Real-Time Vehicle Detection in Intelligent Transportation Systems
16
作者 Syed Sajid Ullah Muhammad Zunair Zamir +1 位作者 Ahsan Ishfaq Salman Khan 《Journal on Artificial Intelligence》 2025年第1期255-274,共20页
Accurate vehicle detection is essential for autonomous driving,traffic monitoring,and intelligent transportation systems.This paper presents an enhanced YOLOv8n model that incorporates the Ghost Module,Convolutional B... Accurate vehicle detection is essential for autonomous driving,traffic monitoring,and intelligent transportation systems.This paper presents an enhanced YOLOv8n model that incorporates the Ghost Module,Convolutional Block Attention Module(CBAM),and Deformable Convolutional Networks v2(DCNv2).The Ghost Module streamlines feature generation to reduce redundancy,CBAM applies channel and spatial attention to improve feature focus,and DCNv2 enables adaptability to geometric variations in vehicle shapes.These components work together to improve both accuracy and computational efficiency.Evaluated on the KITTI dataset,the proposed model achieves 95.4%mAP@0.5—an 8.97% gain over standard YOLOv8n—along with 96.2% precision,93.7% recall,and a 94.93%F1-score.Comparative analysis with seven state-of-the-art detectors demonstrates consistent superiority in key performance metrics.An ablation study is also conducted to quantify the individual and combined contributions of GhostModule,CBAM,and DCNv2,highlighting their effectiveness in improving detection performance.By addressing feature redundancy,attention refinement,and spatial adaptability,the proposed model offers a robust and scalable solution for vehicle detection across diverse traffic scenarios. 展开更多
关键词 YOLOv8n vehicle detection deformable convolutional networks(DCNv2) ghost module convolutional block attention module(CBAM) attention mechanisms
在线阅读 下载PDF
A Multi-Task Learning Framework for Joint Sub-Nyquist Wideband Spectrum Sensing and Modulation Recognition
17
作者 Dong Xin Stefanos Bakirtzis +1 位作者 Zhang Jiliang Zhang Jie 《China Communications》 2025年第1期128-138,共11页
The utilization of millimeter-wave frequencies and cognitive radio(CR)are promising ways to increase the spectral efficiency of wireless communication systems.However,conventional CR spectrum sensing techniques entail... The utilization of millimeter-wave frequencies and cognitive radio(CR)are promising ways to increase the spectral efficiency of wireless communication systems.However,conventional CR spectrum sensing techniques entail sampling the received signal at a Nyquist rate,and they are not viable for wideband signals due to their high cost.This paper expounds on how sub-Nyquist sampling in conjunction with deep learning can be leveraged to remove this limitation.To this end,we propose a multi-task learning(MTL)framework using convolutional neural networks for the joint inference of the underlying narrowband signal number,their modulation scheme,and their location in a wideband spectrum.We demonstrate the effectiveness of the proposed framework for real-world millimeter-wave wideband signals collected by physical devices,exhibiting a 91.7% accuracy in the joint inference task when considering up to two narrowband signals over a wideband spectrum.Ultimately,the proposed data-driven approach enables on-the-fly wideband spectrum sensing,combining accuracy,and computational efficiency,which are indispensable for CR and opportunistic networking. 展开更多
关键词 automated modulation classification cognitive radio convolutional neural networks deep learning spectrum sensing sub-Nyquist sampling
在线阅读 下载PDF
Recognition of intrapulse modulation mode in radar signal with BRN-EST
18
作者 Yan Cheng Ke Mei Hao Zeng 《Journal of Electronic Science and Technology》 2025年第4期113-122,共10页
Neural network-based methods for intrapulse modulation recognition in radar signals have demonstrated significant improvements in classification accuracy.However,these approaches often rely on complex network structur... Neural network-based methods for intrapulse modulation recognition in radar signals have demonstrated significant improvements in classification accuracy.However,these approaches often rely on complex network structures,resulting in high computational resource requirements that limit their practical deployment in real-world settings.To address this issue,this paper proposes a bottleneck residual network with efficient soft-thresholding(BRN-EST)network,which integrates multiple lightweight design strategies and noise-reduction modules to maintain high recognition accuracy while significantly reducing computational complexity.Experimental results on the classical low-probability-of-intercept(LPI)radar signal dataset demonstrate that BRN-EST achieves comparable accuracy to state-of-the-art methods while reducing computational complexity by approximately 50%. 展开更多
关键词 Attention mechanism convolutional neural network Low probability of intercept radar Recognition of intrapulse modulation
在线阅读 下载PDF
基于轻量级卷积神经网络的岩石图像岩性识别方法
19
作者 刘善伟 马志伟 +1 位作者 魏世清 魏忠勇 《地质科技通报》 北大核心 2026年第1期360-370,共11页
岩性识别是油气勘探和开发过程中的重要环节,对于油气勘探定位、储层评价以及储层模型建立具有重要的指导意义。但传统的人工岩性识别方法耗时耗力,经典的深度学习模型虽然识别精度高,但模型的参数量较大,为了提高模型识别精度,同时降... 岩性识别是油气勘探和开发过程中的重要环节,对于油气勘探定位、储层评价以及储层模型建立具有重要的指导意义。但传统的人工岩性识别方法耗时耗力,经典的深度学习模型虽然识别精度高,但模型的参数量较大,为了提高模型识别精度,同时降低模型的参数量,使模型适用于岩性实时识别工作,首先收集了白云岩、砂岩等8种岩石共3016张岩石图像构建岩性识别数据集,然后以轻量型卷积神经网络ShuffleNetV2模型为基础网络,提出了一种Rock-ShuffleNetV2岩性识别模型(RSHFNet模型)。模型中将混合注意力机制模块(convolutional block attention module,简称CBAM)以及多尺度特征融合模块(multi-scale feature fusion module,简称MSF)融入基础网络中以加强模型的特征提取能力,提升模型识别性能,并优化模型中ShuffleNetV2单元的堆叠次数以减少模型参数量。结果表明:与基础模型相比,RSHFNet模型的准确率达到了87.21%,提高了4.98%;同时,模型参数量与浮点运算量分别降低到了869702个,0.93×108,分别是基础模型的0.67,0.63倍,模型参数量明显降低;并且RSHFNet模型的综合性能明显优于现有的卷积神经网络。RSHFNet岩性识别模型具有较高的识别精度和较好的泛化能力,同时更加的轻量化,为实现野外实时的岩性识别工作提供了新思路。 展开更多
关键词 岩性识别 ShuffleNetV2网络 混合注意力机制模块 多尺度特征融合模块 卷积神经网络
在线阅读 下载PDF
基于改进卷积神经网络的水体分割方法
20
作者 张永宏 孙岩 +2 位作者 田伟 马光义 朱灵龙 《计算机应用与软件》 北大核心 2026年第2期164-174,188,共12页
由于遥感图像中水体具有复杂的多尺度特征,传统方法在提取水体过程中容易产生误判和漏判现象。针对这一问题,提出一种融合局部和全局信息的新网络结构。该网络首先在编码端设计一个带有注意机制的残差模块,用于获取每个位置特征的全局... 由于遥感图像中水体具有复杂的多尺度特征,传统方法在提取水体过程中容易产生误判和漏判现象。针对这一问题,提出一种融合局部和全局信息的新网络结构。该网络首先在编码端设计一个带有注意机制的残差模块,用于获取每个位置特征的全局和局部信息,采用多路径扩张卷积实现多尺度水体特征提取。为了提高水体边界处的分割精度,在网络解码端设计细化注意力融合模块。实验结果显示该网络的召回率、精准率、F1-scores分别为95.78%、94.24%、93.75%,与传统卷积神经网络相比,评价指标分别提高1.56百分点、1.72百分点、1.62百分点。 展开更多
关键词 水体分割 全局注意力机制 多路径扩张卷积 局部和全局信息
在线阅读 下载PDF
上一页 1 2 63 下一页 到第
使用帮助 返回顶部