Near-infrared image sensors are widely used in fields such as material identification,machine vision,and autonomous driving.Lead sulfide colloidal quantum dot-based infrared photodiodes can be integrated with sil⁃icon...Near-infrared image sensors are widely used in fields such as material identification,machine vision,and autonomous driving.Lead sulfide colloidal quantum dot-based infrared photodiodes can be integrated with sil⁃icon-based readout circuits in a single step.Based on this,we propose a photodiode based on an n-i-p structure,which removes the buffer layer and further simplifies the manufacturing process of quantum dot image sensors,thus reducing manufacturing costs.Additionally,for the noise complexity in quantum dot image sensors when capturing images,traditional denoising and non-uniformity methods often do not achieve optimal denoising re⁃sults.For the noise and stripe-type non-uniformity commonly encountered in infrared quantum dot detector imag⁃es,a network architecture has been developed that incorporates multiple key modules.This network combines channel attention and spatial attention mechanisms,dynamically adjusting the importance of feature maps to en⁃hance the ability to distinguish between noise and details.Meanwhile,the residual dense feature fusion module further improves the network's ability to process complex image structures through hierarchical feature extraction and fusion.Furthermore,the pyramid pooling module effectively captures information at different scales,improv⁃ing the network's multi-scale feature representation ability.Through the collaborative effect of these modules,the network can better handle various mixed noise and image non-uniformity issues.Experimental results show that it outperforms the traditional U-Net network in denoising and image correction tasks.展开更多
Dear Editor,The letter proposes a tensor low-rank orthogonal compression(TLOC)model for a convolutional neural network(CNN),which facilitates its efficient and highly-accurate low-rank representation.Model compression...Dear Editor,The letter proposes a tensor low-rank orthogonal compression(TLOC)model for a convolutional neural network(CNN),which facilitates its efficient and highly-accurate low-rank representation.Model compression is crucial for deploying deep neural network(DNN)models on resource-constrained embedded devices.展开更多
Dear Editor,This letter addresses the critical challenge of preserving privacy in graph learning without compromising on data utility.Differential privacy(DP)is emerging as an effective method for privacy-preserving g...Dear Editor,This letter addresses the critical challenge of preserving privacy in graph learning without compromising on data utility.Differential privacy(DP)is emerging as an effective method for privacy-preserving graph learning.However,its application often diminishes data utility,especially for nodes with fewer neighbors in graph neural networks(GNNs).展开更多
Distributed Denial of Service(DDoS)attacks are one of the severe threats to network infrastructure,sometimes bypassing traditional diagnosis algorithms because of their evolving complexity.PresentMachine Learning(ML)t...Distributed Denial of Service(DDoS)attacks are one of the severe threats to network infrastructure,sometimes bypassing traditional diagnosis algorithms because of their evolving complexity.PresentMachine Learning(ML)techniques for DDoS attack diagnosis normally apply network traffic statistical features such as packet sizes and inter-arrival times.However,such techniques sometimes fail to capture complicated relations among various traffic flows.In this paper,we present a new multi-scale ensemble strategy given the Graph Neural Networks(GNNs)for improving DDoS detection.Our technique divides traffic into macro-and micro-level elements,letting various GNN models to get the two corase-scale anomalies and subtle,stealthy attack models.Through modeling network traffic as graph-structured data,GNNs efficiently learn intricate relations among network entities.The proposed ensemble learning algorithm combines the results of several GNNs to improve generalization,robustness,and scalability.Extensive experiments on three benchmark datasets—UNSW-NB15,CICIDS2017,and CICDDoS2019—show that our approach outperforms traditional machine learning and deep learning models in detecting both high-rate and low-rate(stealthy)DDoS attacks,with significant improvements in accuracy and recall.These findings demonstrate the suggested method’s applicability and robustness for real-world implementation in contexts where several DDoS patterns coexist.展开更多
Graph neural networks(GNN)have shown strong performance in node classification tasks,yet most existing models rely on uniform or shared weight aggregation,lacking flexibility in modeling the varying strength of relati...Graph neural networks(GNN)have shown strong performance in node classification tasks,yet most existing models rely on uniform or shared weight aggregation,lacking flexibility in modeling the varying strength of relationships among nodes.This paper proposes a novel graph coupling convolutional model that introduces an adaptive weighting mechanism to assign distinct importance to neighboring nodes based on their similarity to the central node.Unlike traditional methods,the proposed coupling strategy enhances the interpretability of node interactions while maintaining competitive classification performance.The model operates in the spatial domain,utilizing adjacency list structures for efficient convolution and addressing the limitations of weight sharing through a coupling-based similarity computation.Extensive experiments are conducted on five graph-structured datasets,including Cora,Citeseer,PubMed,Reddit,and BlogCatalog,as well as a custom topology dataset constructed from the Open University Learning Analytics Dataset(OULAD)educational platform.Results demonstrate that the proposed model achieves good classification accuracy,while significantly reducing training time through direct second-order neighbor fusion and data preprocessing.Moreover,analysis of neighborhood order reveals that considering third-order neighbors offers limited accuracy gains but introduces considerable computational overhead,confirming the efficiency of first-and second-order convolution in practical applications.Overall,the proposed graph coupling model offers a lightweight,interpretable,and effective framework for multi-label node classification in complex networks.展开更多
The thermal conductivity of nanofluids is an important property that influences the heat transfer capabilities of nanofluids.Researchers rely on experimental investigations to explore nanofluid properties,as it is a n...The thermal conductivity of nanofluids is an important property that influences the heat transfer capabilities of nanofluids.Researchers rely on experimental investigations to explore nanofluid properties,as it is a necessary step before their practical application.As these investigations are time and resource-consuming undertakings,an effective prediction model can significantly improve the efficiency of research operations.In this work,an Artificial Neural Network(ANN)model is developed to predict the thermal conductivity of metal oxide water-based nanofluid.For this,a comprehensive set of 691 data points was collected from the literature.This dataset is split into training(70%),validation(15%),and testing(15%)and used to train the ANN model.The developed model is a backpropagation artificial neural network with a 4–12–1 architecture.The performance of the developed model shows high accuracy with R values above 0.90 and rapid convergence.It shows that the developed ANN model accurately predicts the thermal conductivity of nanofluids.展开更多
In contemporary computer vision,convolutional neural networks(CNNs)and vision transformers(ViTs)represent the two primary architectural paradigms for image recognition.While both approaches have been widely adopted in...In contemporary computer vision,convolutional neural networks(CNNs)and vision transformers(ViTs)represent the two primary architectural paradigms for image recognition.While both approaches have been widely adopted in medical imaging applications,they operate based on fundamentally different computational principles.This report attempts to provide brief application notes on ViTs and CNNs,particularly focusing on scenarios that guide the selection of one architecture over the other in practical medical implementations.Generally,CNNs rely on convolutional kernels,localized receptive fields,and weight sharing,enabling efficient hierarchical feature extraction.These properties contribute to strong performance in detecting spatially constrained patterns such as textures,edges,and anatomical boundaries,while maintaining relatively low computational requirements.ViTs,on the other hand,decompose images into smaller segments referred to as tokens and employ self-attention mechanisms to model relationships across the entire image.This global modeling capability allows ViTs to capture long-range dependencies that may be difficult for convolution-based architectures to learn.However,ViTs typically achieve optimal performance when trained on extremely large datasets or when supported by extensive pretraining,as their reduced inductive bias requires greater data exposure to learn robust representations.This report briefly examines the architectural structure,underlying mathematical foundations,and relative performance characteristics of CNNs and ViTs,drawing upon recent findings from contemporary research.Emphasis is placed on understanding how differences in data availability,computational resources,and task requirements influence model effectiveness across medical imaging domains.Most importantly,the report serves as a concise application guide for practitioners seeking informed implementation decisions between these two influential deep learning frameworks.展开更多
Since the advent of smart contracts,security vulnerabilities have remained a persistent challenge,compromsing both the reliability of contract execution and the overall stability of the virtual currency market.Consequ...Since the advent of smart contracts,security vulnerabilities have remained a persistent challenge,compromsing both the reliability of contract execution and the overall stability of the virtual currency market.Consequently,the academic community has devoted increasing attention to these security risks.However,conventional approaches to vulnerability detection frequently exhibit limited accuracy.To address this limitation,the present study introduces a novel vulnerability detection framework called GNNSE that integrates symbolic execution with graph neural networks(GNNs).The proposedmethod first constructs semantic graphs to comprehensively capture the control flow and data flow dependencies within smart contracts.These graphs are subsequently processed using GNNs to efficiently identify contracts with a high likelihood of vulnerabilities.For these high-risk contracts,symbolic execution is employed to perform fine-grained,path-level analysis,thereby improving overall detection precision.Experimental results on a dataset comprising 10,079 contracts demonstrate that the proposed method achieves detection precisions of 93.58% for reentrancy vulnerabilities and 92.73% for timestamp-dependent vulnerabilities.展开更多
The development of β-titanium alloys with bone-mimicking elastic moduli remains a significant challenge.Although machine learning has the potential to accelerate alloy discovery,traditional methods often face data li...The development of β-titanium alloys with bone-mimicking elastic moduli remains a significant challenge.Although machine learning has the potential to accelerate alloy discovery,traditional methods often face data limitations such as sparsity,compositional discontinuity,and feature heterogeneity,leading to overfitting and restricting the exploration of novel compositional spaces.In this study,we introduce a domain-adversarial neural network framework that balances predictive accuracy with the generalization ability of unexplored composition space through integrated feature alignment and adversarial training.Using this approach,we successfully developed a non-intuitiveβ-Ti alloy with an ultra-low elastic modulus of 28±3 GPa,providing new insights beyond conventionally designed biomedical titanium alloys.This work establishes a screening framework for materials discovery in small-sample data spaces,with broad implications for the design of biomedical and other alloy systems.展开更多
Speckle-based optical cryptosystems are promising technologies for information security.However,existing techniques mostly rely on digital decryption,resulting in computational delay and undermining the high-speed adv...Speckle-based optical cryptosystems are promising technologies for information security.However,existing techniques mostly rely on digital decryption,resulting in computational delay and undermining the high-speed advantage of optical encryption.Moreover,conventional neural networks are typically effective only on images from the same distribution as the training datasets,limiting their general applicability.In this paper,we propose an all-optical high-speed decryption scheme for real-time recovery of speckle-encoded ciphertexts.By constructing a physics-informed diffractive neural network that approximates the inverse transmission matrix of the scattering medium,secret images can be directly reconstructed from speckle fields without optoelectronic conversion or post-processing.The network is trained with only 2048 samples from the MNIST dataset.Its transfer learning capability is validated across three out-of-distribution datasets,with decrypted images achieving a Pearson correlation coefficient of 0.82 and a structural similarity index measure of 0.75,demonstrating excellent transfer learning capability.For the first time,to our knowledge,this scheme simultaneously overcomes the bottlenecks of decryption delay and limited network generalizability in conventional speckle-based cryptosystems,achieving real-time image decryption with strong transferability.It provides a new pathway for developing low-power,real-time,and broadly applicable optical encryption systems,demonstrating significant potential for applications in high-speed security optical communications.展开更多
The flight envelope of Air Turbo Rocket(ATR)engines is broader compared to conventional aero-engines,and designing a full-envelope controller using traditional methods poses significant challenges due to a burdensome ...The flight envelope of Air Turbo Rocket(ATR)engines is broader compared to conventional aero-engines,and designing a full-envelope controller using traditional methods poses significant challenges due to a burdensome design process.To address this issue,this paper proposes a self-learning neural network controller design method based on Reinforcement Learning(RL).Additionally,a method for predictive compensation and stability rewards is proposed to reduce the system oscillation caused by actuator delay.This approach simplifies the actuator to a firstorder inertial element exhibiting pure delay.A simulation environment for the ATR engineactuator system is first established.Based on this environment,a self-learning neural network controller using a predictive compensator and the Proximal Policy Optimization(PPO)algorithm is then developed.Furthermore,the temporal difference signals from the controller output are integrated into the reward function to enhance system stability.The proposed method is validated through numerical simulations and semi-physical experiments.The numerical simulation results demonstrate that the proposed method increases the system's tolerance to delays from 20 ms to 400 ms.Under an actuator delay of 400 ms,the average steady-state error remains less than0.1%,the overshoot is limited to 1%,and the settling time does not exceed 3 s.Moreover,compared to the traditional method,the proposed method exhibits higher adaptability to model errors and variations in flight conditions.In the conducted semi-physical simulation experiments,the proposed method achieves stable control of a real electric pump.展开更多
Accurate state of health(SOH)estimation is essential for the safe and reliable operation of lithium-ion batteries.However,existing methods face significant challenges,primarily because they rely on complete charge–di...Accurate state of health(SOH)estimation is essential for the safe and reliable operation of lithium-ion batteries.However,existing methods face significant challenges,primarily because they rely on complete charge–discharge cycles and fixed-form physical constraints,which limit adaptability to different chemistries and real-world conditions.To address these issues,this study proposes an approach that extracts features from segmented state of charge(SOC)intervals and integrates them into an enhanced physics-informed neural network(PINN).Specifically,voltage data within the 25%–75%SOC range during charging are used to derive statistical,time–frequency,and mechanism-based features that capture degradation trends.A hybrid PINN-Lasso-Transformer-BiLSTM architecture is developed,where Lasso regression enables sparse feature selection,and a nonlinear empirical degradation model is embedded as a learnable physical term within a dynamically scaled composite loss.This design adaptively balances data-driven accuracy with physical consistency,thereby enhancing estimation precision,robustness,and generalization.The results show that the proposed method outperforms conventional neural networks across four battery chemistries,achieving root mean square error and mean absolute error below 1%.Notably,features from partial charging segments exhibit higher robustness than those from full cycles.Furthermore,the model maintains strong performance under high temperatures and demonstrates excellent generalization capacity in transfer learning across chemistries,temperatures,and C-rates.This work establishes a scalable and interpretable solution for accurate SOH estimation under diverse practical operating conditions.展开更多
The adjoint method is widely used in gradient-based optimization with high-dimensional design variables.However,the cost of solving the adjoint equations in each iteration is comparable to that of solving the flow fie...The adjoint method is widely used in gradient-based optimization with high-dimensional design variables.However,the cost of solving the adjoint equations in each iteration is comparable to that of solving the flow field,resulting in expensive computational costs.To improve the efficiency of solving adjoint equations,we propose a physics-constrained graph neural networks for solving adjoint equations,named ADJ-PCGN.ADJ-PCGN establishes a mapping relationship between flow characteristics and adjoint vector based on data,serving as a replacement for the computationally expensive numerical solution of adjoint equations.A physics-based graph structure and message-passing mechanism are designed to endow its strong fitting and generalization capabilities.Taking transonic drag reduction and maximum lift-drag ratio of the airfoil as examples,results indicate that ADJ-PCGN attains a similar optimal shape as the classical direct adjoint loop method.In addition,ADJ-PCGN demonstrates strong generalization capabilities across different mesh topologies,mesh densities,and out-of-distribution conditions.It holds the potential to become a universal model for aerodynamic shape optimization involving states,geometries,and meshes.展开更多
Deep learning has been recognized as an effective method for indoor positioning.However,most existing real-valued neural networks(RVNNs)treat the two constituent components of complex-valued channel state information(...Deep learning has been recognized as an effective method for indoor positioning.However,most existing real-valued neural networks(RVNNs)treat the two constituent components of complex-valued channel state information(CSI)as real-valued inputs,potentially discarding useful information embedded in the original CSI.In addition,existing positioning models generally face the contradiction between computational complexity and positioning accuracy.To address these issues,we combine graph neural network(GNN)with complex-valued neural network(CVNN)to construct a lightweight indoor positioning model named CGNet.CGNet employs complexvalued convolution operation to directly process the original CSI data,fully exploiting the correlation between real and imaginary parts of CSI while extracting local features.Subsequently,the feature values are treated as nodes,and conditional position encoding(CPE)module is applied to add positional information.To reduce the number of connections in the graph structure and lower themodel complexity,feature information is mapped to an efficient graph structure through a dynamic axial graph construction(DAGC)method,with global features extracted usingmaximum relative graph convolution(MRConv).Experimental results show that,on the CTW dataset,CGNet achieves a 10%improvement in positioning accuracy compared to existing methods,while the number of model parameters is only 0.8 M.CGNet achieves excellent positioning accuracy with very few parameters.展开更多
Convolutional neural networks(CNNs)have shown remarkable success across numerous tasks such as image classification,yet the theoretical understanding of their convergence remains underdeveloped compared to their empir...Convolutional neural networks(CNNs)have shown remarkable success across numerous tasks such as image classification,yet the theoretical understanding of their convergence remains underdeveloped compared to their empirical achievements.In this paper,the first filter learning framework with convergence-guaranteed learning laws for end-to-end learning of deep CNNs is proposed.Novel update laws with convergence analysis are formulated based on the mathematical representation of each layer in convolutional neural networks.The proposed learning laws enable concurrent updates of weights across all layers of the deep convolutional neural network and the analysis shows that the training errors converge to certain bounds which are dependent on the approximation errors.Case studies are conducted on benchmark datasets and the results show that the proposed concurrent filter learning framework guarantees the convergence and offers more consistent and reliable results during training with a trade-off in performance compared to stochastic gradient descent methods.This framework represents a significant step towards enhancing the reliability and effectiveness of deep convolutional neural network by developing a theoretical analysis which allows practical implementation of the learning laws with automatic tuning of the learning rate to guarantee the convergence during training.展开更多
Recently,the zeroing neural network(ZNN)has demonstrated remarkable effectiveness in tackling time-varying problems,delivering robust performance across both noise-free and noisy environments.However,existing ZNN mode...Recently,the zeroing neural network(ZNN)has demonstrated remarkable effectiveness in tackling time-varying problems,delivering robust performance across both noise-free and noisy environments.However,existing ZNN models are limited in their ability to actively suppress noise,which constrains their robustness and precision in solving time-varying problems.This paper introduces a novel active noise rejection ZNN(ANR-ZNN)design that enhances noise suppression by integrating computational error dynamics and harmonic behaviour.Through rigorous theoretical analysis,we demonstrate that the proposed ANR-ZNN maintains robust convergence in computational error performance under environmental noise.As a case study,the ANR-ZNN model is specifically applied to time-varying matrix inversion.Comprehensive computer simulations and robotic experiments further validate the ANR-ZNN's effectiveness,emphasising the proposed design's superiority and potential for solving time-varying problems.展开更多
Rapid quantification of seismic-induced damage immediately following an earthquake is critical for determining whether a structure is safe for continued occupation or requires evacuation.This study proposes a novel da...Rapid quantification of seismic-induced damage immediately following an earthquake is critical for determining whether a structure is safe for continued occupation or requires evacuation.This study proposes a novel damage identification method that utilizes limited strain data points,significantly reducing installation,maintenance,and data analysis costs compared to traditional distributed sensor networks.The approach integrates finite element(FE)modeling to generate capacity curves through pushover analysis,incorporates noise-augmented datasets for Artificial Neural Network(ANN)training,and classifies structural conditions into four damage levels:Operational(OP),Immediate Occupancy(IO),Life Safety(LS),and Collapse Prevention(CP).To evaluate the method’s accuracy and efficiency,it was applied to two reinforced concrete(RC)frames;a single-story frame tested experimentally under cyclic loading and a three-story frame analyzed under various lateral load patterns.Strain data from selected beam and column ends were used as ANN inputs,while the corresponding damage classes served as outputs.Confusion matrix results demonstrated high true positive rates(>85%for the single-story and>90%for the three-story frame),even with a reduced number of sensors.The model also exhibited strong robustness to White Gaussian Noise(SNR=2.5-5 dB)and generalized effectively to nonlinear time-history analyses under scaled ground motions(PGA=0.1-1.0 g).Feature selection using the MRMR and ANOVA algorithms further enhanced computational efficiency.Overall,the proposed ANN-based framework has strong potential for real-time structural health monitoring applications.展开更多
This paper introduces a fuzzy C-means-based pooling layer for convolutional neural networks that explicitly models local uncertainty and ambiguity.Conventional pooling operations,such as max and average,apply rigid ag...This paper introduces a fuzzy C-means-based pooling layer for convolutional neural networks that explicitly models local uncertainty and ambiguity.Conventional pooling operations,such as max and average,apply rigid aggregation and often discard fine-grained boundary information.In contrast,our method computes soft membershipswithin each receptive field and aggregates cluster-wise responses throughmembership-weighted pooling,thereby preserving informative structure while reducing dimensionality.Being differentiable,the proposed layer operates as standard two-dimensional pooling.We evaluate our approach across various CNN backbones and open datasets,including CIFAR-10/100,STL-10,LFW,and ImageNette,and further probe small training set restrictions on MNIST and Fashion-MNIST.In these settings,the proposed pooling consistently improves accuracy and weighted F1 over conventional baselines,with particularly strong gains when training data are scarce.Even with less than 1%of the training set,ourmethodmaintains reliable performance,indicating improved sample efficiency and robustness to noisy or ambiguous local patterns.Overall,integrating soft memberships into the pooling operator provides a practical and generalizable inductive bias that enhances robustness and generalization in modern CNN pipelines.展开更多
An efficient data-driven numerical framework is developed for transient heat conduction analysis in thin-walled structures.The proposed approach integrates spectral time discretization with neural network approximatio...An efficient data-driven numerical framework is developed for transient heat conduction analysis in thin-walled structures.The proposed approach integrates spectral time discretization with neural network approximation,forming a spectral-integrated neural network(SINN)scheme tailored for problems characterized by long-time evolution.Temporal derivatives are treated through a spectral integration strategy based on orthogonal polynomial expansions,which significantly alleviates stability constraints associated with conventional time-marching schemes.A fully connected neural network is employed to approximate the temperature-related variables,while governing equa-tions and boundary conditions are enforced through a physics-informed loss formulation.Numerical investigations demonstrate that the proposed method maintains high accuracy even when large time steps are adopted,where standard numerical solvers often suffer from instability or excessive computational cost.Moreover,the framework exhibits strong robustness for ultrathin configurations with extreme aspect ratios,achieving relative errors on the order of 10−5 or lower.These results indicate that the SINN framework provides a reliable and efficient alternative for transient thermal analysis of thin-walled structures under challenging computational conditions.展开更多
In this paper,we investigate data-driven bright soliton solutions of the nonlocal reverse-time nonlinear Schrodinger(NLS)equation and the parameter identification using the physically informed neural networks(PINNs)al...In this paper,we investigate data-driven bright soliton solutions of the nonlocal reverse-time nonlinear Schrodinger(NLS)equation and the parameter identification using the physically informed neural networks(PINNs)algorithm.Accurate simulations and comparative analyses of relative and absolute errors are performed for two-soliton and four-soliton solutions including linear solitary waves and periodic waves.In the training process,the standard PINNs scheme is employed for linear solitary wave solutions,while the prior information is added at local sharp regions for periodic wave solutions due to the complicated collision behaviors.For the parameter identification,we accurately recognize the nonlinear coefficients of the nonlocal NLS equation from known solutions with different noises.These results reinforce the application of deep learning with the PINNs framework to successfully study nonlocal integrable systems.展开更多
基金Supported by the National key research and development program in the 14th five year plan 2021YFA1200700)the National Natural Science Foundation of China(62535018,62431025,62561160113)the Natural Science Foundation of Shanghai(23ZR1473400).
文摘Near-infrared image sensors are widely used in fields such as material identification,machine vision,and autonomous driving.Lead sulfide colloidal quantum dot-based infrared photodiodes can be integrated with sil⁃icon-based readout circuits in a single step.Based on this,we propose a photodiode based on an n-i-p structure,which removes the buffer layer and further simplifies the manufacturing process of quantum dot image sensors,thus reducing manufacturing costs.Additionally,for the noise complexity in quantum dot image sensors when capturing images,traditional denoising and non-uniformity methods often do not achieve optimal denoising re⁃sults.For the noise and stripe-type non-uniformity commonly encountered in infrared quantum dot detector imag⁃es,a network architecture has been developed that incorporates multiple key modules.This network combines channel attention and spatial attention mechanisms,dynamically adjusting the importance of feature maps to en⁃hance the ability to distinguish between noise and details.Meanwhile,the residual dense feature fusion module further improves the network's ability to process complex image structures through hierarchical feature extraction and fusion.Furthermore,the pyramid pooling module effectively captures information at different scales,improv⁃ing the network's multi-scale feature representation ability.Through the collaborative effect of these modules,the network can better handle various mixed noise and image non-uniformity issues.Experimental results show that it outperforms the traditional U-Net network in denoising and image correction tasks.
基金supported by the Science and Technology Innovation Key R&D Program of Chongqing(CSTB2025TIAD-STX0032)National Key Research and Development Program of China(2024YFF0908200)+1 种基金the Chongqing Technology Innovation and Application Development Special Key Project(CSTB2024TIAD-KPX0018)the Southwest University Graduate Student Research Innovation(SWUB24051)。
文摘Dear Editor,The letter proposes a tensor low-rank orthogonal compression(TLOC)model for a convolutional neural network(CNN),which facilitates its efficient and highly-accurate low-rank representation.Model compression is crucial for deploying deep neural network(DNN)models on resource-constrained embedded devices.
基金supported by the National Key Research and Development Program of China(2023YFF0612900,2023YFF0612902)the Natural Science Foundation of Beijing,China(4254086)+3 种基金the National Natural Science Foundation of China(62472032)the Open Project Funding of Key Laboratory of Mobile Application Innovation and Governance Technology,Ministry of Industry and Information Technology(2023IFS080601-K)the Beijing Institute of Technology Research Fund Program for Young Scholarsthe Young Elite Scientists Sponsorship Program by CAST(2023QNRC001)。
文摘Dear Editor,This letter addresses the critical challenge of preserving privacy in graph learning without compromising on data utility.Differential privacy(DP)is emerging as an effective method for privacy-preserving graph learning.However,its application often diminishes data utility,especially for nodes with fewer neighbors in graph neural networks(GNNs).
文摘Distributed Denial of Service(DDoS)attacks are one of the severe threats to network infrastructure,sometimes bypassing traditional diagnosis algorithms because of their evolving complexity.PresentMachine Learning(ML)techniques for DDoS attack diagnosis normally apply network traffic statistical features such as packet sizes and inter-arrival times.However,such techniques sometimes fail to capture complicated relations among various traffic flows.In this paper,we present a new multi-scale ensemble strategy given the Graph Neural Networks(GNNs)for improving DDoS detection.Our technique divides traffic into macro-and micro-level elements,letting various GNN models to get the two corase-scale anomalies and subtle,stealthy attack models.Through modeling network traffic as graph-structured data,GNNs efficiently learn intricate relations among network entities.The proposed ensemble learning algorithm combines the results of several GNNs to improve generalization,robustness,and scalability.Extensive experiments on three benchmark datasets—UNSW-NB15,CICIDS2017,and CICDDoS2019—show that our approach outperforms traditional machine learning and deep learning models in detecting both high-rate and low-rate(stealthy)DDoS attacks,with significant improvements in accuracy and recall.These findings demonstrate the suggested method’s applicability and robustness for real-world implementation in contexts where several DDoS patterns coexist.
基金Support by Sichuan Science and Technology Program[2023YFSY0026,2023YFH0004]Guangzhou Huashang University[2024HSZD01,HS2023JYSZH01].
文摘Graph neural networks(GNN)have shown strong performance in node classification tasks,yet most existing models rely on uniform or shared weight aggregation,lacking flexibility in modeling the varying strength of relationships among nodes.This paper proposes a novel graph coupling convolutional model that introduces an adaptive weighting mechanism to assign distinct importance to neighboring nodes based on their similarity to the central node.Unlike traditional methods,the proposed coupling strategy enhances the interpretability of node interactions while maintaining competitive classification performance.The model operates in the spatial domain,utilizing adjacency list structures for efficient convolution and addressing the limitations of weight sharing through a coupling-based similarity computation.Extensive experiments are conducted on five graph-structured datasets,including Cora,Citeseer,PubMed,Reddit,and BlogCatalog,as well as a custom topology dataset constructed from the Open University Learning Analytics Dataset(OULAD)educational platform.Results demonstrate that the proposed model achieves good classification accuracy,while significantly reducing training time through direct second-order neighbor fusion and data preprocessing.Moreover,analysis of neighborhood order reveals that considering third-order neighbors offers limited accuracy gains but introduces considerable computational overhead,confirming the efficiency of first-and second-order convolution in practical applications.Overall,the proposed graph coupling model offers a lightweight,interpretable,and effective framework for multi-label node classification in complex networks.
基金supported by Basic Science Research Program through the National Research Foundation of Korea(NRF)funded by the Ministry of Education(2021R1A6A1A10044950).
文摘The thermal conductivity of nanofluids is an important property that influences the heat transfer capabilities of nanofluids.Researchers rely on experimental investigations to explore nanofluid properties,as it is a necessary step before their practical application.As these investigations are time and resource-consuming undertakings,an effective prediction model can significantly improve the efficiency of research operations.In this work,an Artificial Neural Network(ANN)model is developed to predict the thermal conductivity of metal oxide water-based nanofluid.For this,a comprehensive set of 691 data points was collected from the literature.This dataset is split into training(70%),validation(15%),and testing(15%)and used to train the ANN model.The developed model is a backpropagation artificial neural network with a 4–12–1 architecture.The performance of the developed model shows high accuracy with R values above 0.90 and rapid convergence.It shows that the developed ANN model accurately predicts the thermal conductivity of nanofluids.
文摘In contemporary computer vision,convolutional neural networks(CNNs)and vision transformers(ViTs)represent the two primary architectural paradigms for image recognition.While both approaches have been widely adopted in medical imaging applications,they operate based on fundamentally different computational principles.This report attempts to provide brief application notes on ViTs and CNNs,particularly focusing on scenarios that guide the selection of one architecture over the other in practical medical implementations.Generally,CNNs rely on convolutional kernels,localized receptive fields,and weight sharing,enabling efficient hierarchical feature extraction.These properties contribute to strong performance in detecting spatially constrained patterns such as textures,edges,and anatomical boundaries,while maintaining relatively low computational requirements.ViTs,on the other hand,decompose images into smaller segments referred to as tokens and employ self-attention mechanisms to model relationships across the entire image.This global modeling capability allows ViTs to capture long-range dependencies that may be difficult for convolution-based architectures to learn.However,ViTs typically achieve optimal performance when trained on extremely large datasets or when supported by extensive pretraining,as their reduced inductive bias requires greater data exposure to learn robust representations.This report briefly examines the architectural structure,underlying mathematical foundations,and relative performance characteristics of CNNs and ViTs,drawing upon recent findings from contemporary research.Emphasis is placed on understanding how differences in data availability,computational resources,and task requirements influence model effectiveness across medical imaging domains.Most importantly,the report serves as a concise application guide for practitioners seeking informed implementation decisions between these two influential deep learning frameworks.
基金supported by the National Key Research and Development Program of China(2020YFB1005704).
文摘Since the advent of smart contracts,security vulnerabilities have remained a persistent challenge,compromsing both the reliability of contract execution and the overall stability of the virtual currency market.Consequently,the academic community has devoted increasing attention to these security risks.However,conventional approaches to vulnerability detection frequently exhibit limited accuracy.To address this limitation,the present study introduces a novel vulnerability detection framework called GNNSE that integrates symbolic execution with graph neural networks(GNNs).The proposedmethod first constructs semantic graphs to comprehensively capture the control flow and data flow dependencies within smart contracts.These graphs are subsequently processed using GNNs to efficiently identify contracts with a high likelihood of vulnerabilities.For these high-risk contracts,symbolic execution is employed to perform fine-grained,path-level analysis,thereby improving overall detection precision.Experimental results on a dataset comprising 10,079 contracts demonstrate that the proposed method achieves detection precisions of 93.58% for reentrancy vulnerabilities and 92.73% for timestamp-dependent vulnerabilities.
基金supported by the Guangdong S&T Program(Grant No.2025B1111130003).
文摘The development of β-titanium alloys with bone-mimicking elastic moduli remains a significant challenge.Although machine learning has the potential to accelerate alloy discovery,traditional methods often face data limitations such as sparsity,compositional discontinuity,and feature heterogeneity,leading to overfitting and restricting the exploration of novel compositional spaces.In this study,we introduce a domain-adversarial neural network framework that balances predictive accuracy with the generalization ability of unexplored composition space through integrated feature alignment and adversarial training.Using this approach,we successfully developed a non-intuitiveβ-Ti alloy with an ultra-low elastic modulus of 28±3 GPa,providing new insights beyond conventionally designed biomedical titanium alloys.This work establishes a screening framework for materials discovery in small-sample data spaces,with broad implications for the design of biomedical and other alloy systems.
基金supported by the Guangdong Major Project of Basic Research(Grant No.2020B0301030009)the National Natural Science Foundation of China(Grant Nos.12174204,12174203,12074203,62335012,and 62435010)+5 种基金the Natural Science Foundation of Guangdong Province(Grant No.2023A1515012888)the Science and Technology Innovation Commission of Shenzhen(Grant Nos.JCYJ20220818101417039 and JCYJ20241202124428038)the Medical-Engineering Interdisciplinary Research Foundation of Shenzhen University(Grant No.86901/00000311)the Scientific Instrument Developing Project of Shenzhen University(Grant No.2023YQ001)the Shenzhen University 2035 Initiative(Grant No.2023B004)the Key R&D Program of Zhejiang(Grant No.30003AA240100)。
文摘Speckle-based optical cryptosystems are promising technologies for information security.However,existing techniques mostly rely on digital decryption,resulting in computational delay and undermining the high-speed advantage of optical encryption.Moreover,conventional neural networks are typically effective only on images from the same distribution as the training datasets,limiting their general applicability.In this paper,we propose an all-optical high-speed decryption scheme for real-time recovery of speckle-encoded ciphertexts.By constructing a physics-informed diffractive neural network that approximates the inverse transmission matrix of the scattering medium,secret images can be directly reconstructed from speckle fields without optoelectronic conversion or post-processing.The network is trained with only 2048 samples from the MNIST dataset.Its transfer learning capability is validated across three out-of-distribution datasets,with decrypted images achieving a Pearson correlation coefficient of 0.82 and a structural similarity index measure of 0.75,demonstrating excellent transfer learning capability.For the first time,to our knowledge,this scheme simultaneously overcomes the bottlenecks of decryption delay and limited network generalizability in conventional speckle-based cryptosystems,achieving real-time image decryption with strong transferability.It provides a new pathway for developing low-power,real-time,and broadly applicable optical encryption systems,demonstrating significant potential for applications in high-speed security optical communications.
基金co-supported by the National Science and Technology Major Project(No.J2019-Ⅲ-0010-0054)the National Natural Science Foundation of China(No.52336002)。
文摘The flight envelope of Air Turbo Rocket(ATR)engines is broader compared to conventional aero-engines,and designing a full-envelope controller using traditional methods poses significant challenges due to a burdensome design process.To address this issue,this paper proposes a self-learning neural network controller design method based on Reinforcement Learning(RL).Additionally,a method for predictive compensation and stability rewards is proposed to reduce the system oscillation caused by actuator delay.This approach simplifies the actuator to a firstorder inertial element exhibiting pure delay.A simulation environment for the ATR engineactuator system is first established.Based on this environment,a self-learning neural network controller using a predictive compensator and the Proximal Policy Optimization(PPO)algorithm is then developed.Furthermore,the temporal difference signals from the controller output are integrated into the reward function to enhance system stability.The proposed method is validated through numerical simulations and semi-physical experiments.The numerical simulation results demonstrate that the proposed method increases the system's tolerance to delays from 20 ms to 400 ms.Under an actuator delay of 400 ms,the average steady-state error remains less than0.1%,the overshoot is limited to 1%,and the settling time does not exceed 3 s.Moreover,compared to the traditional method,the proposed method exhibits higher adaptability to model errors and variations in flight conditions.In the conducted semi-physical simulation experiments,the proposed method achieves stable control of a real electric pump.
基金supported by the Shanghai Pilot Program for Basic Research(22T01400100-18)the National Natural Science Foundation of China(22278127 and 12447149)+1 种基金the Fundamental Research Funds for the Central Universities(2022ZFJH004)the Postdoctoral Fellowship Program of CPSF(GZB20250159).
文摘Accurate state of health(SOH)estimation is essential for the safe and reliable operation of lithium-ion batteries.However,existing methods face significant challenges,primarily because they rely on complete charge–discharge cycles and fixed-form physical constraints,which limit adaptability to different chemistries and real-world conditions.To address these issues,this study proposes an approach that extracts features from segmented state of charge(SOC)intervals and integrates them into an enhanced physics-informed neural network(PINN).Specifically,voltage data within the 25%–75%SOC range during charging are used to derive statistical,time–frequency,and mechanism-based features that capture degradation trends.A hybrid PINN-Lasso-Transformer-BiLSTM architecture is developed,where Lasso regression enables sparse feature selection,and a nonlinear empirical degradation model is embedded as a learnable physical term within a dynamically scaled composite loss.This design adaptively balances data-driven accuracy with physical consistency,thereby enhancing estimation precision,robustness,and generalization.The results show that the proposed method outperforms conventional neural networks across four battery chemistries,achieving root mean square error and mean absolute error below 1%.Notably,features from partial charging segments exhibit higher robustness than those from full cycles.Furthermore,the model maintains strong performance under high temperatures and demonstrates excellent generalization capacity in transfer learning across chemistries,temperatures,and C-rates.This work establishes a scalable and interpretable solution for accurate SOH estimation under diverse practical operating conditions.
基金supported by the National Natural Science Foundation of China(Grant No.12272316).
文摘The adjoint method is widely used in gradient-based optimization with high-dimensional design variables.However,the cost of solving the adjoint equations in each iteration is comparable to that of solving the flow field,resulting in expensive computational costs.To improve the efficiency of solving adjoint equations,we propose a physics-constrained graph neural networks for solving adjoint equations,named ADJ-PCGN.ADJ-PCGN establishes a mapping relationship between flow characteristics and adjoint vector based on data,serving as a replacement for the computationally expensive numerical solution of adjoint equations.A physics-based graph structure and message-passing mechanism are designed to endow its strong fitting and generalization capabilities.Taking transonic drag reduction and maximum lift-drag ratio of the airfoil as examples,results indicate that ADJ-PCGN attains a similar optimal shape as the classical direct adjoint loop method.In addition,ADJ-PCGN demonstrates strong generalization capabilities across different mesh topologies,mesh densities,and out-of-distribution conditions.It holds the potential to become a universal model for aerodynamic shape optimization involving states,geometries,and meshes.
文摘Deep learning has been recognized as an effective method for indoor positioning.However,most existing real-valued neural networks(RVNNs)treat the two constituent components of complex-valued channel state information(CSI)as real-valued inputs,potentially discarding useful information embedded in the original CSI.In addition,existing positioning models generally face the contradiction between computational complexity and positioning accuracy.To address these issues,we combine graph neural network(GNN)with complex-valued neural network(CVNN)to construct a lightweight indoor positioning model named CGNet.CGNet employs complexvalued convolution operation to directly process the original CSI data,fully exploiting the correlation between real and imaginary parts of CSI while extracting local features.Subsequently,the feature values are treated as nodes,and conditional position encoding(CPE)module is applied to add positional information.To reduce the number of connections in the graph structure and lower themodel complexity,feature information is mapped to an efficient graph structure through a dynamic axial graph construction(DAGC)method,with global features extracted usingmaximum relative graph convolution(MRConv).Experimental results show that,on the CTW dataset,CGNet achieves a 10%improvement in positioning accuracy compared to existing methods,while the number of model parameters is only 0.8 M.CGNet achieves excellent positioning accuracy with very few parameters.
基金supported by the Ministry of Education(MOE)Singapore,Academic Research Fund(AcRF)Tier 1(RG65/22)。
文摘Convolutional neural networks(CNNs)have shown remarkable success across numerous tasks such as image classification,yet the theoretical understanding of their convergence remains underdeveloped compared to their empirical achievements.In this paper,the first filter learning framework with convergence-guaranteed learning laws for end-to-end learning of deep CNNs is proposed.Novel update laws with convergence analysis are formulated based on the mathematical representation of each layer in convolutional neural networks.The proposed learning laws enable concurrent updates of weights across all layers of the deep convolutional neural network and the analysis shows that the training errors converge to certain bounds which are dependent on the approximation errors.Case studies are conducted on benchmark datasets and the results show that the proposed concurrent filter learning framework guarantees the convergence and offers more consistent and reliable results during training with a trade-off in performance compared to stochastic gradient descent methods.This framework represents a significant step towards enhancing the reliability and effectiveness of deep convolutional neural network by developing a theoretical analysis which allows practical implementation of the learning laws with automatic tuning of the learning rate to guarantee the convergence during training.
基金supported by the National Science and Technology Major Project(2022ZD0119901)the National Natural Science Foundation of China under Grant(U2141234,62463004 and U24A20260)+1 种基金the Hainan Province Science and Technology Special Fund(ZDYF2024GXJS003)the Scientific Research Fund of Hainan University(KYQD(ZR)23025).
文摘Recently,the zeroing neural network(ZNN)has demonstrated remarkable effectiveness in tackling time-varying problems,delivering robust performance across both noise-free and noisy environments.However,existing ZNN models are limited in their ability to actively suppress noise,which constrains their robustness and precision in solving time-varying problems.This paper introduces a novel active noise rejection ZNN(ANR-ZNN)design that enhances noise suppression by integrating computational error dynamics and harmonic behaviour.Through rigorous theoretical analysis,we demonstrate that the proposed ANR-ZNN maintains robust convergence in computational error performance under environmental noise.As a case study,the ANR-ZNN model is specifically applied to time-varying matrix inversion.Comprehensive computer simulations and robotic experiments further validate the ANR-ZNN's effectiveness,emphasising the proposed design's superiority and potential for solving time-varying problems.
基金funded by UTM Fundamental Research Grant(PY/2024/01221,Cost centre no.:Q.J130000.3822.23H73)HiCoE Grant Scheme(Cost centre no.:R.J130000.7822.4J738)。
文摘Rapid quantification of seismic-induced damage immediately following an earthquake is critical for determining whether a structure is safe for continued occupation or requires evacuation.This study proposes a novel damage identification method that utilizes limited strain data points,significantly reducing installation,maintenance,and data analysis costs compared to traditional distributed sensor networks.The approach integrates finite element(FE)modeling to generate capacity curves through pushover analysis,incorporates noise-augmented datasets for Artificial Neural Network(ANN)training,and classifies structural conditions into four damage levels:Operational(OP),Immediate Occupancy(IO),Life Safety(LS),and Collapse Prevention(CP).To evaluate the method’s accuracy and efficiency,it was applied to two reinforced concrete(RC)frames;a single-story frame tested experimentally under cyclic loading and a three-story frame analyzed under various lateral load patterns.Strain data from selected beam and column ends were used as ANN inputs,while the corresponding damage classes served as outputs.Confusion matrix results demonstrated high true positive rates(>85%for the single-story and>90%for the three-story frame),even with a reduced number of sensors.The model also exhibited strong robustness to White Gaussian Noise(SNR=2.5-5 dB)and generalized effectively to nonlinear time-history analyses under scaled ground motions(PGA=0.1-1.0 g).Feature selection using the MRMR and ANOVA algorithms further enhanced computational efficiency.Overall,the proposed ANN-based framework has strong potential for real-time structural health monitoring applications.
文摘This paper introduces a fuzzy C-means-based pooling layer for convolutional neural networks that explicitly models local uncertainty and ambiguity.Conventional pooling operations,such as max and average,apply rigid aggregation and often discard fine-grained boundary information.In contrast,our method computes soft membershipswithin each receptive field and aggregates cluster-wise responses throughmembership-weighted pooling,thereby preserving informative structure while reducing dimensionality.Being differentiable,the proposed layer operates as standard two-dimensional pooling.We evaluate our approach across various CNN backbones and open datasets,including CIFAR-10/100,STL-10,LFW,and ImageNette,and further probe small training set restrictions on MNIST and Fashion-MNIST.In these settings,the proposed pooling consistently improves accuracy and weighted F1 over conventional baselines,with particularly strong gains when training data are scarce.Even with less than 1%of the training set,ourmethodmaintains reliable performance,indicating improved sample efficiency and robustness to noisy or ambiguous local patterns.Overall,integrating soft memberships into the pooling operator provides a practical and generalizable inductive bias that enhances robustness and generalization in modern CNN pipelines.
基金supported by the National Natural Science Foundation of China(Nos.12422207 and 12372199).
文摘An efficient data-driven numerical framework is developed for transient heat conduction analysis in thin-walled structures.The proposed approach integrates spectral time discretization with neural network approximation,forming a spectral-integrated neural network(SINN)scheme tailored for problems characterized by long-time evolution.Temporal derivatives are treated through a spectral integration strategy based on orthogonal polynomial expansions,which significantly alleviates stability constraints associated with conventional time-marching schemes.A fully connected neural network is employed to approximate the temperature-related variables,while governing equa-tions and boundary conditions are enforced through a physics-informed loss formulation.Numerical investigations demonstrate that the proposed method maintains high accuracy even when large time steps are adopted,where standard numerical solvers often suffer from instability or excessive computational cost.Moreover,the framework exhibits strong robustness for ultrathin configurations with extreme aspect ratios,achieving relative errors on the order of 10−5 or lower.These results indicate that the SINN framework provides a reliable and efficient alternative for transient thermal analysis of thin-walled structures under challenging computational conditions.
基金supported by the National Natural Science Foundation of China(Grant Nos.12171217 and 12375003)the Zhejiang Provincial Natural Science Foundation of China(Grant No.LMS 25A010013)。
文摘In this paper,we investigate data-driven bright soliton solutions of the nonlocal reverse-time nonlinear Schrodinger(NLS)equation and the parameter identification using the physically informed neural networks(PINNs)algorithm.Accurate simulations and comparative analyses of relative and absolute errors are performed for two-soliton and four-soliton solutions including linear solitary waves and periodic waves.In the training process,the standard PINNs scheme is employed for linear solitary wave solutions,while the prior information is added at local sharp regions for periodic wave solutions due to the complicated collision behaviors.For the parameter identification,we accurately recognize the nonlinear coefficients of the nonlocal NLS equation from known solutions with different noises.These results reinforce the application of deep learning with the PINNs framework to successfully study nonlocal integrable systems.