This paper introduces a fuzzy C-means-based pooling layer for convolutional neural networks that explicitly models local uncertainty and ambiguity.Conventional pooling operations,such as max and average,apply rigid ag...This paper introduces a fuzzy C-means-based pooling layer for convolutional neural networks that explicitly models local uncertainty and ambiguity.Conventional pooling operations,such as max and average,apply rigid aggregation and often discard fine-grained boundary information.In contrast,our method computes soft membershipswithin each receptive field and aggregates cluster-wise responses throughmembership-weighted pooling,thereby preserving informative structure while reducing dimensionality.Being differentiable,the proposed layer operates as standard two-dimensional pooling.We evaluate our approach across various CNN backbones and open datasets,including CIFAR-10/100,STL-10,LFW,and ImageNette,and further probe small training set restrictions on MNIST and Fashion-MNIST.In these settings,the proposed pooling consistently improves accuracy and weighted F1 over conventional baselines,with particularly strong gains when training data are scarce.Even with less than 1%of the training set,ourmethodmaintains reliable performance,indicating improved sample efficiency and robustness to noisy or ambiguous local patterns.Overall,integrating soft memberships into the pooling operator provides a practical and generalizable inductive bias that enhances robustness and generalization in modern CNN pipelines.展开更多
Edge deployment solutions based on convolutional neural networks(CNNs)have garnered significant attention because of their potential applications.However,traditional CNNs rely on pooling to reduce the feature size,lea...Edge deployment solutions based on convolutional neural networks(CNNs)have garnered significant attention because of their potential applications.However,traditional CNNs rely on pooling to reduce the feature size,leading to substantial information loss and reduced network robustness.Herein,we propose a more robust adaptive pooling network(APN)method implemented using memristor technology.Our method introduces an improved pooling layer that reduces input features to an arbitrary scale without compromising their importance.Different coupling coefficients of the pooling layer are stored as conductance values in arrays.We validate the proposed APN on generic datasets,demonstrating significant performance improvements over previously reported CNN architectures.Additionally,we evaluate the APN on a CAPTCHA recognition task with perturbations to assess network robustness.The results show that the APN achieves 92.6% accuracy in 4-digit CAPTCHA recognition and exhibits higher robustness.This brief presents a highly robust and novel scheme for edge computing using memristor technology.展开更多
With the rapid development of intelligent video surveillance technology,pedestrian re-identification has become increasingly important inmulti-camera surveillance systems.This technology plays a critical role in enhan...With the rapid development of intelligent video surveillance technology,pedestrian re-identification has become increasingly important inmulti-camera surveillance systems.This technology plays a critical role in enhancing public safety.However,traditional methods typically process images and text separately,applying upstream models directly to downstream tasks.This approach significantly increases the complexity ofmodel training and computational costs.Furthermore,the common class imbalance in existing training datasets limitsmodel performance improvement.To address these challenges,we propose an innovative framework named Person Re-ID Network Based on Visual Prompt Technology andMulti-Instance Negative Pooling(VPM-Net).First,we incorporate the Contrastive Language-Image Pre-training(CLIP)pre-trained model to accurately map visual and textual features into a unified embedding space,effectively mitigating inconsistencies in data distribution and the training process.To enhancemodel adaptability and generalization,we introduce an efficient and task-specific Visual Prompt Tuning(VPT)technique,which improves the model’s relevance to specific tasks.Additionally,we design two key modules:the Knowledge-Aware Network(KAN)and theMulti-Instance Negative Pooling(MINP)module.The KAN module significantly enhances the model’s understanding of complex scenarios through deep contextual semantic modeling.MINP module handles samples,effectively improving the model’s ability to distinguish fine-grained features.The experimental outcomes across diverse datasets underscore the remarkable performance of VPM-Net.These results vividly demonstrate the unique advantages and robust reliability of VPM-Net in fine-grained retrieval tasks.展开更多
This study focuses on tool condition recognition through data-driven approaches to enhance the intelligence level of computerized numerical control(CNC)machining processes and improve tool utilization efficiency.Tradi...This study focuses on tool condition recognition through data-driven approaches to enhance the intelligence level of computerized numerical control(CNC)machining processes and improve tool utilization efficiency.Traditional tool monitoring methods that rely on empirical knowledge or limited mathematical models struggle to adapt to complex and dynamic machining environments.To address this,we implement real-time tool condition recognition by introducing deep learning technology.Aiming to the insufficient recognition accuracy,we propose a pyramid pooling-based vision Transformer network(P2ViT-Net)method for tool condition recognition.Using images as input effectively mitigates the issue of low-dimensional signal features.We enhance the vision Transformer(ViT)framework for image classification by developing the P2ViT model and adapt it to tool condition recognition.Experimental results demonstrate that our improved P2ViT model achieves 94.4%recognition accuracy,showing a 10%improvement over conventional ViT and outperforming all comparative convolutional neural network models.展开更多
The influence of oscillation amplitude on molten pool thermal history,weld morphology characteristics,microstructural evolution,and mechanical properties during laser oscillating welding of QP980 steel was systematica...The influence of oscillation amplitude on molten pool thermal history,weld morphology characteristics,microstructural evolution,and mechanical properties during laser oscillating welding of QP980 steel was systematically investigated.Results show that laser beam oscillation significantly regulates molten pool thermomechanical behavior through optimized spatial energy distribution,thereby enabling microstructural reconstruction and joint performance enhancement.As the oscillation amplitude increases from 0 to 0.8 mm,the molten pool duration extends to 1.7 times the original value,while peak temperature and average cooling rate decrease by 19%and 39%,respectively.This thermal regulation promotes weld surface width expansion from 0.72 to 1.07 mm.The welding mode undergoes a progressive transition from keyhole mode→transitional mode→conduction mode.This transformation effectively suppresses porosity defects,substantially reducing porosity from 1.8%to 0.15%.Microstructural analysis indicates that oscillation modifies the maximum temperature gradient direction within the molten pool,facilitating preferential growth of coarse columnar grains along the welding centerline to establish load-transfer-favorable crystallographic orientations.The synergistic effects of these factors substantially improve joint mechanical properties:lap joint shear load increases by 81.5%(7.6→13.8 kN),and fracture elongation is enhanced by 135%(0.98→2.3 mm).The operational principles of laser oscillation parameters on the welding quality of QP980 steel were elucidated,providing theoretical foundations for joining process optimization.展开更多
The adult subventricular zone of the lateral ventricles and the subgranular zone in the hippocampal dentate gyrus(DG)are the two brain regions where neurogenesis occurs throughout life in the adult mammalian brain(Min...The adult subventricular zone of the lateral ventricles and the subgranular zone in the hippocampal dentate gyrus(DG)are the two brain regions where neurogenesis occurs throughout life in the adult mammalian brain(Ming and Song,2011).Adult quiescent hippocampal neural stem cells(NSCs)are bona fide stem cells and,when activated,give rise to newborn granule neurons in the adult brain,which play vital roles in learning,memory,mood,and affective cognition(Bonaguidi et al.,2011;Ming and Song,2011).展开更多
文摘This paper introduces a fuzzy C-means-based pooling layer for convolutional neural networks that explicitly models local uncertainty and ambiguity.Conventional pooling operations,such as max and average,apply rigid aggregation and often discard fine-grained boundary information.In contrast,our method computes soft membershipswithin each receptive field and aggregates cluster-wise responses throughmembership-weighted pooling,thereby preserving informative structure while reducing dimensionality.Being differentiable,the proposed layer operates as standard two-dimensional pooling.We evaluate our approach across various CNN backbones and open datasets,including CIFAR-10/100,STL-10,LFW,and ImageNette,and further probe small training set restrictions on MNIST and Fashion-MNIST.In these settings,the proposed pooling consistently improves accuracy and weighted F1 over conventional baselines,with particularly strong gains when training data are scarce.Even with less than 1%of the training set,ourmethodmaintains reliable performance,indicating improved sample efficiency and robustness to noisy or ambiguous local patterns.Overall,integrating soft memberships into the pooling operator provides a practical and generalizable inductive bias that enhances robustness and generalization in modern CNN pipelines.
基金supported by the National Natural Science Foundation of China(Grant Nos.62274002,62304001,and 62201005)the Anhui Provincial Natural Science Foundation(Grant Nos.2308085QF213 and 2408085QF211)the Natural Science Research Project of the Anhui Educational Committee(Grant No.2023AH050072)。
文摘Edge deployment solutions based on convolutional neural networks(CNNs)have garnered significant attention because of their potential applications.However,traditional CNNs rely on pooling to reduce the feature size,leading to substantial information loss and reduced network robustness.Herein,we propose a more robust adaptive pooling network(APN)method implemented using memristor technology.Our method introduces an improved pooling layer that reduces input features to an arbitrary scale without compromising their importance.Different coupling coefficients of the pooling layer are stored as conductance values in arrays.We validate the proposed APN on generic datasets,demonstrating significant performance improvements over previously reported CNN architectures.Additionally,we evaluate the APN on a CAPTCHA recognition task with perturbations to assess network robustness.The results show that the APN achieves 92.6% accuracy in 4-digit CAPTCHA recognition and exhibits higher robustness.This brief presents a highly robust and novel scheme for edge computing using memristor technology.
基金funded by the Key Research and Development Program of Hubei Province,China(Grant No.2023BEB024)the Young and Middle-aged Scientific and Technological Innova-tion Team Plan in Higher Education Institutions inHubei Province,China(GrantNo.T2023007)the key projects ofHubei Provincial Department of Education(No.D20161403).
文摘With the rapid development of intelligent video surveillance technology,pedestrian re-identification has become increasingly important inmulti-camera surveillance systems.This technology plays a critical role in enhancing public safety.However,traditional methods typically process images and text separately,applying upstream models directly to downstream tasks.This approach significantly increases the complexity ofmodel training and computational costs.Furthermore,the common class imbalance in existing training datasets limitsmodel performance improvement.To address these challenges,we propose an innovative framework named Person Re-ID Network Based on Visual Prompt Technology andMulti-Instance Negative Pooling(VPM-Net).First,we incorporate the Contrastive Language-Image Pre-training(CLIP)pre-trained model to accurately map visual and textual features into a unified embedding space,effectively mitigating inconsistencies in data distribution and the training process.To enhancemodel adaptability and generalization,we introduce an efficient and task-specific Visual Prompt Tuning(VPT)technique,which improves the model’s relevance to specific tasks.Additionally,we design two key modules:the Knowledge-Aware Network(KAN)and theMulti-Instance Negative Pooling(MINP)module.The KAN module significantly enhances the model’s understanding of complex scenarios through deep contextual semantic modeling.MINP module handles samples,effectively improving the model’s ability to distinguish fine-grained features.The experimental outcomes across diverse datasets underscore the remarkable performance of VPM-Net.These results vividly demonstrate the unique advantages and robust reliability of VPM-Net in fine-grained retrieval tasks.
基金supported by China Postdoctoral Science Foundation(No.2024M754122)the Postdoctoral Fellowship Programof CPSF(No.GZB20240972)+3 种基金the Jiangsu Funding Program for Excellent Postdoctoral Talent(No.2024ZB194)Natural Science Foundation of Jiangsu Province(No.BK20241389)Basic Science ResearchFund of China(No.JCKY2023203C026)2024 Jiangsu Province Talent Programme Qinglan Project.
文摘This study focuses on tool condition recognition through data-driven approaches to enhance the intelligence level of computerized numerical control(CNC)machining processes and improve tool utilization efficiency.Traditional tool monitoring methods that rely on empirical knowledge or limited mathematical models struggle to adapt to complex and dynamic machining environments.To address this,we implement real-time tool condition recognition by introducing deep learning technology.Aiming to the insufficient recognition accuracy,we propose a pyramid pooling-based vision Transformer network(P2ViT-Net)method for tool condition recognition.Using images as input effectively mitigates the issue of low-dimensional signal features.We enhance the vision Transformer(ViT)framework for image classification by developing the P2ViT model and adapt it to tool condition recognition.Experimental results demonstrate that our improved P2ViT model achieves 94.4%recognition accuracy,showing a 10%improvement over conventional ViT and outperforming all comparative convolutional neural network models.
基金supported by the National Natural Science Foundation of China(Grant Nos.51805084 and 52474401)Guangdong Basic and Applied Basic Research Foundation(Grant Nos.2023B1515120086 and 2025A1515012873).
文摘The influence of oscillation amplitude on molten pool thermal history,weld morphology characteristics,microstructural evolution,and mechanical properties during laser oscillating welding of QP980 steel was systematically investigated.Results show that laser beam oscillation significantly regulates molten pool thermomechanical behavior through optimized spatial energy distribution,thereby enabling microstructural reconstruction and joint performance enhancement.As the oscillation amplitude increases from 0 to 0.8 mm,the molten pool duration extends to 1.7 times the original value,while peak temperature and average cooling rate decrease by 19%and 39%,respectively.This thermal regulation promotes weld surface width expansion from 0.72 to 1.07 mm.The welding mode undergoes a progressive transition from keyhole mode→transitional mode→conduction mode.This transformation effectively suppresses porosity defects,substantially reducing porosity from 1.8%to 0.15%.Microstructural analysis indicates that oscillation modifies the maximum temperature gradient direction within the molten pool,facilitating preferential growth of coarse columnar grains along the welding centerline to establish load-transfer-favorable crystallographic orientations.The synergistic effects of these factors substantially improve joint mechanical properties:lap joint shear load increases by 81.5%(7.6→13.8 kN),and fracture elongation is enhanced by 135%(0.98→2.3 mm).The operational principles of laser oscillation parameters on the welding quality of QP980 steel were elucidated,providing theoretical foundations for joining process optimization.
基金supported by National Institutes of Health(R35NS137480,R35NS116843,and RF1AG079557)by Dr.Miriam and Sheldon G.Adelson Medical Research Foundation.
文摘The adult subventricular zone of the lateral ventricles and the subgranular zone in the hippocampal dentate gyrus(DG)are the two brain regions where neurogenesis occurs throughout life in the adult mammalian brain(Ming and Song,2011).Adult quiescent hippocampal neural stem cells(NSCs)are bona fide stem cells and,when activated,give rise to newborn granule neurons in the adult brain,which play vital roles in learning,memory,mood,and affective cognition(Bonaguidi et al.,2011;Ming and Song,2011).