期刊文献+
共找到39,907篇文章
< 1 2 250 >
每页显示 20 50 100
Human Activity Recognition in a Realistic and Multiview Environment Based on Two-Dimensional Convolutional Neural Network 被引量:1
1
作者 Ashish KhareArati Kushwaha Om Prakash 《Journal of Artificial Intelligence and Technology》 2023年第3期100-107,共8页
Recognition of human activity based on convolutional neural network(CNN)has received the interest of researchers in recent years due to its significant improvement in accuracy.A large number of algorithms based on the... Recognition of human activity based on convolutional neural network(CNN)has received the interest of researchers in recent years due to its significant improvement in accuracy.A large number of algorithms based on the deep learning approach have been proposed for activity recognition purpose.However,with the increasing advancements in technologies having limited computational resources,it needs to design an efficient deep learning-based approaches with improved utilization of computational resources.This paper presents a simple and efficient 2-dimensional CNN(2-D CNN)architecture with very small-size convolutional kernel for human activity recognition.The merit of the proposed CNN architecture over standard deep learning architectures is fewer trainable parameters and lesser memory requirement which enables it to train the proposed CNN architecture on low GPU memory-based devices and also works well with smaller as well as larger size datasets.The proposed approach consists of mainly four stages:namely(1)creation of dataset and data augmentation,(2)designing 2-D CNN architecture,(3)the proposed 2-D CNN architecture trained from scratch up to optimum stage,and(4)evaluation of the trained 2-D CNN architecture.To illustrate the effectiveness of the proposed architecture several extensive experiments are conducted on three publicly available datasets,namely IXMAS,YouTube,and UCF101 dataset.The results of the proposed method and its comparison with other state-of-the-art methods demonstrate the usefulness of the proposed method. 展开更多
关键词 computational resources convolutional neural network GPU memory human activity recognition softmax classifier training parameters
在线阅读 下载PDF
Independent sampling and padding for Rayleigh-Sommerfeld diffraction based on scaled convolution approach
2
作者 YANG Chen FU Xi-hong +1 位作者 FU Xin-peng BAYANHESHIG 《中国光学(中英文)》 北大核心 2026年第2期367-381,共15页
We propose a novel fast numerical calculation method for the Rayleigh-Sommerfeld diffraction integral,which is developed based on the existing scaled convolution method.This approach enables fast cal-culations for gen... We propose a novel fast numerical calculation method for the Rayleigh-Sommerfeld diffraction integral,which is developed based on the existing scaled convolution method.This approach enables fast cal-culations for general cases of off-axis scenarios where the sampling intervals and numbers of the input and observation planes are unequal.Additionally,it allows for arbitrary adjustment of the sampling interval of the impulse response function,facilitating a manual trade-off between computational load and accuracy.The er-rors associated with this method,which is equivalent to interpolation,primarily arise from the discontinuities of the sampling matrix of the impulse response function on its boundaries of periodic extension.To address this issue,we propose the concept of the padding function and its construction method,and evaluate its ef-fectiveness in enhancing computational accuracy.The feasibility of the proposed method is verified by nu-merical simulation and compared with the direct integration DI-method in a simplified scenario.It shows that the proposed method has good computational accuracy for the general case where the sampling interval of the input and observation plane is not equal under non-near-field diffraction,and when the diffraction distance is large,although the computational accuracy of the proposed method cannot exceed that of the DI-method,the computational amount can be significantly reduced with almost no effect on the computational accuracy.This method provides a general numerical calculation scheme of diffraction in the non-near field case for areas such as computational holography. 展开更多
关键词 Rayleigh-Sommerfeld diffraction scaled convolution padding function
在线阅读 下载PDF
Underwater Image Enhancement Based on Depthwise Separable Convolution-Based Generative Adversarial Network
3
作者 ZENG Jun-yang SI Zhan-jun 《印刷与数字媒体技术研究》 北大核心 2026年第1期60-66,共7页
The existence of absorption and reflection of light underwater leads to problems such as color distortion and blue-green bias in underwater images.In this study,a depthwise separable convolution-based generative adver... The existence of absorption and reflection of light underwater leads to problems such as color distortion and blue-green bias in underwater images.In this study,a depthwise separable convolution-based generative adversarial network(GAN)algorithm was proposed.Taking GAN as the basic framework,it combined a depthwise separable convolution module,attention mechanism,and reconstructed convolution module to realize the enhancement of underwater degraded images.Multi-scale features were captured by the depthwise separable convolution module,and the attention mechanism was utilized to enhance attention to important features.The reconstructed convolution module further extracts and fuses local and global features.Experimental results showed that the algorithm performs well in improving the color bias and blurring of underwater images,with PSNR reaching 27.835,SSIM reaching 0.883,UIQM reaching 3.205,and UCIQE reaching 0.713.The enhanced image outperforms the comparison algorithm in both subjective and objective metrics. 展开更多
关键词 Underwater image enhancement Generating adversarial network Depthwise separable convolution
在线阅读 下载PDF
Two-dimensional grating line parameter calibration based on biaxial phase mapping
4
作者 TENG Hai-rui LIANG Xu +3 位作者 JIN Si-yu SUN Yu-jia LI Wen-hao LIU Zhao-wu 《中国光学(中英文)》 北大核心 2026年第2期407-420,共14页
The two-dimensional grating serves as a critical component in plane grating interferometers for achieving high-precision multidimensional displacement measurements.The calibration of grating groove density and orthogo... The two-dimensional grating serves as a critical component in plane grating interferometers for achieving high-precision multidimensional displacement measurements.The calibration of grating groove density and orthogonality error of grating grooves not only improves the positioning accuracy of grating interferometers but also provides essential feedback for optimizing two-dimensional grating fabrication.This study proposes a method for simultaneous calibration of these parameters using orthogonal heterodyne laser interferometry.A two-dimensional grating interferometer is built with the grating to be measured,and a biaxial laser interferometer provides a displacement reference for it.The phase mapping relationship between grating interference and laser interference is established.The interference phase information obtained by any two displacements can simultaneously solve the above three parameters and obtain the grating installation error.The feasibility of the proposed method is verified by using a 1200 gr/mm two-dimensional grating.The standard deviation of the grating groove density in the X and Y directions is 0.012 gr/mm and 0.014 gr/mm,respectively.The standard deviation of the orthogonality error of grating grooves is 0.004°,and the standard deviation of the installation error is 0.002°.Compared with the atomic force microscope method,the consistency of the grating groove density in the X and Y directions is better than 0.03 gr/mm and 0.06 gr/mm,and the orthogonality error of grating grooves is better than 0.008°.The experimental results show that the proposed method can be simply and efficiently applied to the calibration of the grating line parameters of the two-dimensional grating. 展开更多
关键词 two-dimensional grating grating line parameter calibration grating groove density orthogonality error of grating grooves
在线阅读 下载PDF
Dual Channel Graph Convolutional Networks via Personalized PageRank
5
作者 Longlong Lin Xin Luo 《IEEE/CAA Journal of Automatica Sinica》 2026年第1期221-223,共3页
Dear Editor,D2This letter presents a node feature similarity preserving graph convolutional framework P G.Graph neural networks(GNNs)have garnered significant attention for their efficacy in learning graph representat... Dear Editor,D2This letter presents a node feature similarity preserving graph convolutional framework P G.Graph neural networks(GNNs)have garnered significant attention for their efficacy in learning graph representations across diverse real-world applications. 展开更多
关键词 convolutional node feature similarity graph convolutional framework learning graph representations neural networks gnns NETWORKS GRAPH PERSONALIZED
在线阅读 下载PDF
Interactive Dynamic Graph Convolution with Temporal Attention for Traffic Flow Forecasting
6
作者 Zitong Zhao Zixuan Zhang Zhenxing Niu 《Computers, Materials & Continua》 2026年第1期1049-1064,共16页
Reliable traffic flow prediction is crucial for mitigating urban congestion.This paper proposes Attentionbased spatiotemporal Interactive Dynamic Graph Convolutional Network(AIDGCN),a novel architecture integrating In... Reliable traffic flow prediction is crucial for mitigating urban congestion.This paper proposes Attentionbased spatiotemporal Interactive Dynamic Graph Convolutional Network(AIDGCN),a novel architecture integrating Interactive Dynamic Graph Convolution Network(IDGCN)with Temporal Multi-Head Trend-Aware Attention.Its core innovation lies in IDGCN,which uniquely splits sequences into symmetric intervals for interactive feature sharing via dynamic graphs,and a novel attention mechanism incorporating convolutional operations to capture essential local traffic trends—addressing a critical gap in standard attention for continuous data.For 15-and 60-min forecasting on METR-LA,AIDGCN achieves MAEs of 0.75%and 0.39%,and RMSEs of 1.32%and 0.14%,respectively.In the 60-min long-term forecasting of the PEMS-BAY dataset,the AIDGCN out-performs the MRA-BGCN method by 6.28%,4.93%,and 7.17%in terms of MAE,RMSE,and MAPE,respectively.Experimental results demonstrate the superiority of our pro-posed model over state-of-the-art methods. 展开更多
关键词 Traffic flow prediction interactive dynamic graph convolution graph convolution temporal multi-head trend-aware attention self-attention mechanism
在线阅读 下载PDF
Two-Dimensional MXene-Based Advanced Sensors for Neuromorphic Computing Intelligent Application
7
作者 Lin Lu Bo Sun +2 位作者 Zheng Wang Jialin Meng Tianyu Wang 《Nano-Micro Letters》 2026年第2期664-691,共28页
As emerging two-dimensional(2D)materials,carbides and nitrides(MXenes)could be solid solutions or organized structures made up of multi-atomic layers.With remarkable and adjustable electrical,optical,mechanical,and el... As emerging two-dimensional(2D)materials,carbides and nitrides(MXenes)could be solid solutions or organized structures made up of multi-atomic layers.With remarkable and adjustable electrical,optical,mechanical,and electrochemical characteristics,MXenes have shown great potential in brain-inspired neuromorphic computing electronics,including neuromorphic gas sensors,pressure sensors and photodetectors.This paper provides a forward-looking review of the research progress regarding MXenes in the neuromorphic sensing domain and discussed the critical challenges that need to be resolved.Key bottlenecks such as insufficient long-term stability under environmental exposure,high costs,scalability limitations in large-scale production,and mechanical mismatch in wearable integration hinder their practical deployment.Furthermore,unresolved issues like interfacial compatibility in heterostructures and energy inefficiency in neu-romorphic signal conversion demand urgent attention.The review offers insights into future research directions enhance the fundamental understanding of MXene properties and promote further integration into neuromorphic computing applications through the convergence with various emerging technologies. 展开更多
关键词 two-dimensional MXenes SENSOR Neuromorphic computing Multimodal intelligent system Wearable electronics
在线阅读 下载PDF
Tensor Low-Rank Orthogonal Compression for Convolutional Neural Networks
8
作者 Yaping He Xin Luo 《IEEE/CAA Journal of Automatica Sinica》 2026年第1期227-229,共3页
Dear Editor,The letter proposes a tensor low-rank orthogonal compression(TLOC)model for a convolutional neural network(CNN),which facilitates its efficient and highly-accurate low-rank representation.Model compression... Dear Editor,The letter proposes a tensor low-rank orthogonal compression(TLOC)model for a convolutional neural network(CNN),which facilitates its efficient and highly-accurate low-rank representation.Model compression is crucial for deploying deep neural network(DNN)models on resource-constrained embedded devices. 展开更多
关键词 model compression convolutional neural network cnn which tensor low rank orthogonal compression deep neural network dnn models embedded devices convolutional neural networks
在线阅读 下载PDF
Two-dimensional kagome semiconductor Sc_(6)S_(5)X_(6)(X=Cl,Br,I)with trilayer kagome lattice
9
作者 Jin-Ling Yan Xing-Yu Wang +5 位作者 Gen-Ping Wu Hao Wang Ya-Jiao Ke Jiafu Wang Zhi-Hong Liu Jun-Hui Yuan 《Chinese Physics B》 2026年第2期519-528,共10页
Two-dimensional(2D)multilayer kagome materials hold significant research value for regulating kagome-related physical properties and exploring quantum effects.However,their development is hindered by the scarcity of a... Two-dimensional(2D)multilayer kagome materials hold significant research value for regulating kagome-related physical properties and exploring quantum effects.However,their development is hindered by the scarcity of available material systems,making the identification of novel 2D multilayer kagome candidates particularly important.In this work,three types of 2D materials with trilayer kagome lattices,namely Sc_(6)S_(5)X_(6)(X=Cl,Br,I),are predicted based on first-principles calculations.These 2D materials feature two kagome lattices composed of Sc atoms and one kagome lattice composed of S atoms.Stability analysis indicates that these materials can exist as free-standing 2D materials.Electronic structure calculations reveal that Sc_(6)S_(5)X_(6)are narrow-bandgap semiconductors(0.76–0.95 e V),with their band structures exhibiting flat bands contributed by Sc-based kagome lattices and Dirac band gaps resulting from symmetry breaking.The sulfur-based kagome lattice in the central layer contributes an independent flat band below the Fermi level.Additionally,Sc_(6)S_(5)X_(6)exhibit high carrier mobility,with hole and electron mobilities reaching up to 10^(3)cm^(2)·V^(-1)·s^(-1),indicating potential applications in low-dimensional electronic devices.This work provides an excellent example for the development of novel multilayer 2D kagome materials. 展开更多
关键词 multilayer kagome lattice two-dimensional materials carrier mobility first-principles calculations
原文传递
Brief application notes for vision transformer (ViT) and convolutional neural network (CNN) in medical imaging
10
作者 Wei Kitt Wong Melinda Melinda 《Medical Data Mining》 2026年第2期34-42,共9页
In contemporary computer vision,convolutional neural networks(CNNs)and vision transformers(ViTs)represent the two primary architectural paradigms for image recognition.While both approaches have been widely adopted in... In contemporary computer vision,convolutional neural networks(CNNs)and vision transformers(ViTs)represent the two primary architectural paradigms for image recognition.While both approaches have been widely adopted in medical imaging applications,they operate based on fundamentally different computational principles.This report attempts to provide brief application notes on ViTs and CNNs,particularly focusing on scenarios that guide the selection of one architecture over the other in practical medical implementations.Generally,CNNs rely on convolutional kernels,localized receptive fields,and weight sharing,enabling efficient hierarchical feature extraction.These properties contribute to strong performance in detecting spatially constrained patterns such as textures,edges,and anatomical boundaries,while maintaining relatively low computational requirements.ViTs,on the other hand,decompose images into smaller segments referred to as tokens and employ self-attention mechanisms to model relationships across the entire image.This global modeling capability allows ViTs to capture long-range dependencies that may be difficult for convolution-based architectures to learn.However,ViTs typically achieve optimal performance when trained on extremely large datasets or when supported by extensive pretraining,as their reduced inductive bias requires greater data exposure to learn robust representations.This report briefly examines the architectural structure,underlying mathematical foundations,and relative performance characteristics of CNNs and ViTs,drawing upon recent findings from contemporary research.Emphasis is placed on understanding how differences in data availability,computational resources,and task requirements influence model effectiveness across medical imaging domains.Most importantly,the report serves as a concise application guide for practitioners seeking informed implementation decisions between these two influential deep learning frameworks. 展开更多
关键词 convolutional neural network vision transformer comparative study medical imaging
在线阅读 下载PDF
Efficient Video Emotion Recognition via Multi-Scale Region-Aware Convolution and Temporal Interaction Sampling
11
作者 Xiaorui Zhang Chunlin Yuan +1 位作者 Wei Sun Ting Wang 《Computers, Materials & Continua》 2026年第2期2036-2054,共19页
Video emotion recognition is widely used due to its alignment with the temporal characteristics of human emotional expression,but existingmodels have significant shortcomings.On the one hand,Transformermultihead self-... Video emotion recognition is widely used due to its alignment with the temporal characteristics of human emotional expression,but existingmodels have significant shortcomings.On the one hand,Transformermultihead self-attention modeling of global temporal dependency has problems of high computational overhead and feature similarity.On the other hand,fixed-size convolution kernels are often used,which have weak perception ability for emotional regions of different scales.Therefore,this paper proposes a video emotion recognition model that combines multi-scale region-aware convolution with temporal interactive sampling.In terms of space,multi-branch large-kernel stripe convolution is used to perceive emotional region features at different scales,and attention weights are generated for each scale feature.In terms of time,multi-layer odd-even down-sampling is performed on the time series,and oddeven sub-sequence interaction is performed to solve the problem of feature similarity,while reducing computational costs due to the linear relationship between sampling and convolution overhead.This paper was tested on CMU-MOSI,CMU-MOSEI,and Hume Reaction.The Acc-2 reached 83.4%,85.2%,and 81.2%,respectively.The experimental results show that the model can significantly improve the accuracy of emotion recognition. 展开更多
关键词 MULTI-SCALE region-aware convolution temporal interaction sampling video emotion recognition
在线阅读 下载PDF
Enhanced Image Captioning via Integrated Wavelet Convolution and MobileNet V3 Architecture
12
作者 Mo Hou Bin Xu Wen Shang 《Computers, Materials & Continua》 2026年第2期897-915,共19页
Image captioning,a pivotal research area at the intersection of image understanding,artificial intelligence,and linguistics,aims to generate natural language descriptions for images.This paper proposes an efficient im... Image captioning,a pivotal research area at the intersection of image understanding,artificial intelligence,and linguistics,aims to generate natural language descriptions for images.This paper proposes an efficient image captioning model named Mob-IMWTC,which integrates improved wavelet convolution(IMWTC)with an enhanced MobileNet V3 architecture.The enhanced MobileNet V3 integrates a transformer encoder as its encoding module and a transformer decoder as its decoding module.This innovative neural network significantly reduces the memory space required and model training time,while maintaining a high level of accuracy in generating image descriptions.IMWTC facilitates large receptive fields without significantly increasing the number of parameters or computational overhead.The improvedMobileNet V3 model has its classifier removed,and simultaneously,it employs IMWTC layers to replace the original convolutional layers.This makes Mob-IMWTC exceptionally well-suited for deployment on lowresource devices.Experimental results,based on objective evaluation metrics such as BLEU,ROUGE,CIDEr,METEOR,and SPICE,demonstrate that Mob-IMWTC outperforms state-of-the-art models,including three CNN architectures(CNN-LSTM,CNN-Att-LSTM,CNN-Tran),two mainstream methods(LCM-Captioner,ClipCap),and our previous work(Mob-Tran).Subjective evaluations further validate the model’s superiority in terms of grammaticality,adequacy,logic,readability,and humanness.Mob-IMWTC offers a lightweight yet effective solution for image captioning,making it suitable for deployment on resource-constrained devices. 展开更多
关键词 Image caption wavelet convolution MobileNet V3 deep learning
在线阅读 下载PDF
Enhancing convolution for Transformer-based weakly supervised semantic segmentation
13
作者 LIU Yu TAN Diaoyin +1 位作者 ZHOU Wen XIAO Huaxin 《Journal of Systems Engineering and Electronics》 2026年第1期84-93,共10页
Weakly supervised semantic segmentation(WSSS)is a tricky task,which only provides category information for segmentation prediction.Thus,the key stage of WSSS is to generate the pseudo labels.For convolutional neural n... Weakly supervised semantic segmentation(WSSS)is a tricky task,which only provides category information for segmentation prediction.Thus,the key stage of WSSS is to generate the pseudo labels.For convolutional neural network(CNN)based methods,in which class activation mapping(CAM)is proposed to obtain the pseudo labels,and only concentrates on the most discriminative parts.Recently,transformer-based methods utilize attention map from the multi-headed self-attention(MHSA)module to predict pseudo labels,which usually contain obvious background noise and incoherent object area.To solve the above problems,we use the Conformer as our backbone,which is a parallel network based on convolutional neural network(CNN)and Transformer.The two branches generate pseudo labels and refine them independently,and can effectively combine the advantages of CNN and Transformer.However,the parallel structure is not close enough in the information communication.Thus,parallel structure can result in poor details about pseudo labels,and the background noise still exists.To alleviate this problem,we propose enhancing convolution CAM(ECCAM)model,which have three improved modules based on enhancing convolution,including deeper stem(DStem),convolutional feed-forward network(CFFN)and feature coupling unit with convolution(FCUConv).The ECCAM could make Conformer have tighter interaction between CNN and Transformer branches.After experimental verification,the improved modules we propose can help the network perceive more local information from images,making the final segmentation results more refined.Compared with similar architecture,our modules greatly improve the semantic segmentation performance and achieve70.2%mean intersection over union(mIoU)on the PASCAL VOC 2012 dataset. 展开更多
关键词 weakly supervised semantic segmentation TRANSFORMER convolutional neural network
在线阅读 下载PDF
Learning Laws for Deep Convolutional Neural Networks With Guaranteed Convergence
14
作者 Sitan Li Chien Chern Cheah 《IEEE/CAA Journal of Automatica Sinica》 2026年第1期170-185,共16页
Convolutional neural networks(CNNs)have shown remarkable success across numerous tasks such as image classification,yet the theoretical understanding of their convergence remains underdeveloped compared to their empir... Convolutional neural networks(CNNs)have shown remarkable success across numerous tasks such as image classification,yet the theoretical understanding of their convergence remains underdeveloped compared to their empirical achievements.In this paper,the first filter learning framework with convergence-guaranteed learning laws for end-to-end learning of deep CNNs is proposed.Novel update laws with convergence analysis are formulated based on the mathematical representation of each layer in convolutional neural networks.The proposed learning laws enable concurrent updates of weights across all layers of the deep convolutional neural network and the analysis shows that the training errors converge to certain bounds which are dependent on the approximation errors.Case studies are conducted on benchmark datasets and the results show that the proposed concurrent filter learning framework guarantees the convergence and offers more consistent and reliable results during training with a trade-off in performance compared to stochastic gradient descent methods.This framework represents a significant step towards enhancing the reliability and effectiveness of deep convolutional neural network by developing a theoretical analysis which allows practical implementation of the learning laws with automatic tuning of the learning rate to guarantee the convergence during training. 展开更多
关键词 CONVERGENCE convolution neural networks(CNNs) end-to-end learning online learning
在线阅读 下载PDF
Classification Method of Lower Limbs Motor Imagery Based on Functional Connectivity and Graph Convolutional Network
15
作者 Yang Liu Qi Lu +2 位作者 Junjie Wu Huaichang Yin Shiwei Cheng 《Computers, Materials & Continua》 2026年第3期1674-1689,共16页
The development of brain-computer interfaces(BCI)based on motor imagery(MI)has greatly improved patients’quality of life with movement disorders.The classification of upper limb MI has been widely studied and applied... The development of brain-computer interfaces(BCI)based on motor imagery(MI)has greatly improved patients’quality of life with movement disorders.The classification of upper limb MI has been widely studied and applied in many fields,including rehabilitation.However,the physiological representations of left and right lower limb movements are too close and activated deep in the cerebral cortex,making it difficult to distinguish their features.Therefore,classifying lower limbs motor imagery is more challenging.In this study,we propose a feature extraction method based on functional connectivity,which utilizes phase-locked values to construct a functional connectivity matrix as the features of the left and right legs,which can effectively avoid the problem of physiological representations of the left and right lower limbs being too close to each other during movement.In addition,considering the topology and the temporal characteristics of the electroencephalogram(EEG),we designed a temporal-spatial convolutional network(TSGCN)to capture the spatiotemporal information for classification.Experimental results show that the accuracy of the proposed method is higher than that of existing methods,achieving an average classification accuracy of 73.58%on the internal dataset.Finally,this study explains the network mechanism of left and right foot MI from the perspective of graph theoretic features and demonstrates the feasibility of decoding lower limb MI. 展开更多
关键词 Brain-computer interface lower limb motor imagery functional connectivity temporal-spatial convolutional network
在线阅读 下载PDF
HGS-ATD:A Hybrid Graph Convolutional Network-GraphSAGE Model for Anomaly Traffic Detection
16
作者 Zhian Cui Hailong Li Xieyang Shen 《Journal of Harbin Institute of Technology(New Series)》 2026年第1期33-50,共18页
With network attack technology continuing to develop,traditional anomaly traffic detection methods that rely on feature engineering are increasingly insufficient in efficiency and accuracy.Graph Neural Network(GNN),a ... With network attack technology continuing to develop,traditional anomaly traffic detection methods that rely on feature engineering are increasingly insufficient in efficiency and accuracy.Graph Neural Network(GNN),a promising Deep Learning(DL)approach,has proven to be highly effective in identifying intricate patterns in graph⁃structured data and has already found wide applications in the field of network security.In this paper,we propose a hybrid Graph Convolutional Network(GCN)⁃GraphSAGE model for Anomaly Traffic Detection,namely HGS⁃ATD,which aims to improve the accuracy of anomaly traffic detection by leveraging edge feature learning to better capture the relationships between network entities.We validate the HGS⁃ATD model on four publicly available datasets,including NF⁃UNSW⁃NB15⁃v2.The experimental results show that the enhanced hybrid model is 5.71%to 10.25%higher than the baseline model in terms of accuracy,and the F1⁃score is 5.53%to 11.63%higher than the baseline model,proving that the model can effectively distinguish normal traffic from attack traffic and accurately classify various types of attacks. 展开更多
关键词 anomaly traffic detection graph neural network deep learning graph convolutional network
在线阅读 下载PDF
Fuzzy C-Means Clustering-Driven Pooling for Robust and Generalizable Convolutional Neural Networks
17
作者 Seunggyu Byeon Jung-hun Lee Jong-Deok Kim 《Computers, Materials & Continua》 2026年第5期579-604,共26页
This paper introduces a fuzzy C-means-based pooling layer for convolutional neural networks that explicitly models local uncertainty and ambiguity.Conventional pooling operations,such as max and average,apply rigid ag... This paper introduces a fuzzy C-means-based pooling layer for convolutional neural networks that explicitly models local uncertainty and ambiguity.Conventional pooling operations,such as max and average,apply rigid aggregation and often discard fine-grained boundary information.In contrast,our method computes soft membershipswithin each receptive field and aggregates cluster-wise responses throughmembership-weighted pooling,thereby preserving informative structure while reducing dimensionality.Being differentiable,the proposed layer operates as standard two-dimensional pooling.We evaluate our approach across various CNN backbones and open datasets,including CIFAR-10/100,STL-10,LFW,and ImageNette,and further probe small training set restrictions on MNIST and Fashion-MNIST.In these settings,the proposed pooling consistently improves accuracy and weighted F1 over conventional baselines,with particularly strong gains when training data are scarce.Even with less than 1%of the training set,ourmethodmaintains reliable performance,indicating improved sample efficiency and robustness to noisy or ambiguous local patterns.Overall,integrating soft memberships into the pooling operator provides a practical and generalizable inductive bias that enhances robustness and generalization in modern CNN pipelines. 展开更多
关键词 Fuzzy logic fuzzy c-means clustering membership-based pooling convolutional neural networks downsampling feature extraction
在线阅读 下载PDF
Multi-Label Classification Model Using Graph Convolutional Neural Network for Social Network Nodes
18
作者 Junmin Lyu Guangyu Xu +4 位作者 Feng Bao Yu Zhou Yuxin Liu Siyu Lu Wenfeng Zheng 《Computer Modeling in Engineering & Sciences》 2026年第2期1235-1256,共22页
Graph neural networks(GNN)have shown strong performance in node classification tasks,yet most existing models rely on uniform or shared weight aggregation,lacking flexibility in modeling the varying strength of relati... Graph neural networks(GNN)have shown strong performance in node classification tasks,yet most existing models rely on uniform or shared weight aggregation,lacking flexibility in modeling the varying strength of relationships among nodes.This paper proposes a novel graph coupling convolutional model that introduces an adaptive weighting mechanism to assign distinct importance to neighboring nodes based on their similarity to the central node.Unlike traditional methods,the proposed coupling strategy enhances the interpretability of node interactions while maintaining competitive classification performance.The model operates in the spatial domain,utilizing adjacency list structures for efficient convolution and addressing the limitations of weight sharing through a coupling-based similarity computation.Extensive experiments are conducted on five graph-structured datasets,including Cora,Citeseer,PubMed,Reddit,and BlogCatalog,as well as a custom topology dataset constructed from the Open University Learning Analytics Dataset(OULAD)educational platform.Results demonstrate that the proposed model achieves good classification accuracy,while significantly reducing training time through direct second-order neighbor fusion and data preprocessing.Moreover,analysis of neighborhood order reveals that considering third-order neighbors offers limited accuracy gains but introduces considerable computational overhead,confirming the efficiency of first-and second-order convolution in practical applications.Overall,the proposed graph coupling model offers a lightweight,interpretable,and effective framework for multi-label node classification in complex networks. 展开更多
关键词 GNN social networks nodes multi-label classification model graphic convolution neural network coupling principle
在线阅读 下载PDF
A Privacy-Preserving Convolutional Neural Network Inference Framework for AIoT Applications
19
作者 Haoran Wang Shuhong Yang +2 位作者 Kuan Shao Tao Xiao Zhenyong Zhang 《Computers, Materials & Continua》 2026年第1期1354-1371,共18页
With the rapid development of the Artificial Intelligence of Things(AIoT),convolutional neural networks(CNNs)have demonstrated potential and remarkable performance in AIoT applications due to their excellent performan... With the rapid development of the Artificial Intelligence of Things(AIoT),convolutional neural networks(CNNs)have demonstrated potential and remarkable performance in AIoT applications due to their excellent performance in various inference tasks.However,the users have concerns about privacy leakage for the use of AI and the performance and efficiency of computing on resource-constrained IoT edge devices.Therefore,this paper proposes an efficient privacy-preserving CNN framework(i.e.,EPPA)based on the Fully Homomorphic Encryption(FHE)scheme for AIoT application scenarios.In the plaintext domain,we verify schemes with different activation structures to determine the actual activation functions applicable to the corresponding ciphertext domain.Within the encryption domain,we integrate batch normalization(BN)into the convolutional layers to simplify the computation process.For nonlinear activation functions,we use composite polynomials for approximate calculation.Regarding the noise accumulation caused by homomorphic multiplication operations,we realize the refreshment of ciphertext noise through minimal“decryption-encryption”interactions,instead of adopting bootstrapping operations.Additionally,in practical implementation,we convert three-dimensional convolution into two-dimensional convolution to reduce the amount of computation in the encryption domain.Finally,we conduct extensive experiments on four IoT datasets,different CNN architectures,and two platforms with different resource configurations to evaluate the performance of EPPA in detail. 展开更多
关键词 Artificial Intelligence of Things(AIoT) convolutional neural network PRIVACY-PRESERVING fully homomorphic encryption
在线阅读 下载PDF
Multi-scale simplified residual convolutional neural network model for predicting compositions of binary magnesium alloys
20
作者 Xu Qin Qinghang Wang +6 位作者 Xinqian Zhao Shouxin Xia Li Wang Jiabao Long Yuhui Zhang Yanfu Chai Daolun Chen 《Journal of Magnesium and Alloys》 2026年第1期117-123,共7页
This study proposes a multi-scale simplified residual convolutional neural network(MS-SRCNN)for the precise prediction of Mg-Nd binary alloy compositions from scanning electron microscope(SEM)images.A multi-scale data... This study proposes a multi-scale simplified residual convolutional neural network(MS-SRCNN)for the precise prediction of Mg-Nd binary alloy compositions from scanning electron microscope(SEM)images.A multi-scale data structure is established by spatially aligning and stacking SEM images at different magnifications.The MS-SRCNN significantly reduces computational runtime by over 90%compared to traditional architectures like ResNet50,VGG16,and VGG19,without compromising prediction accuracy.The model demonstrates more excellent predictive performance,achieving a>5%increase in R^(2) compared to single-scale models.Furthermore,the MS-SRCNN exhibits robust composition prediction capability across other Mg-based binary alloys,including Mg-La,Mg-Sn,Mg-Ce,Mg-Sm,Mg-Ag,and Mg-Y,thereby emphasizing its generalization and extrapolation potential.This research establishes a non-destructive,microstructure-informed composition analysis framework,reduces characterization time compared to traditional experiment methods and provides insights into the composition-microstructure relationship in diverse material systems. 展开更多
关键词 Magnesium alloys Composition prediction Scanning electron microscope images Multi-scale simplified residual convolutional neural network
在线阅读 下载PDF
上一页 1 2 250 下一页 到第
使用帮助 返回顶部