期刊文献+
共找到510篇文章
< 1 2 26 >
每页显示 20 50 100
A novel automated neural network architecture search method of air target intent recognition
1
作者 Ke WANG Yafei SONG +5 位作者 Yunfei XU Wen QUAN Peng NI Peng WANG Chenghai LI Xinyan ZHI 《Chinese Journal of Aeronautics》 2025年第6期26-40,共15页
Modern air battlefield operations are characterized by flexibility and change, and the battlefield evolves rapidly and intricately. However, traditional air target intent recognition methods, which mainly rely on manu... Modern air battlefield operations are characterized by flexibility and change, and the battlefield evolves rapidly and intricately. However, traditional air target intent recognition methods, which mainly rely on manually designed neural network models, find it difficult to maintain sustained and excellent performance in such a complex and changing environment. To address the problem of the adaptability of neural network models in complex environments, we propose a lightweight Transformer model(TransATIR) with a strong adaptive adjustment capability, based on the characteristics of air target intent recognition and the neural network architecture search technique. After conducting extensive experiments, it has been proved that TransATIR can efficiently extract the deep feature information from battlefield situation data by utilizing the neural architecture search algorithm, in order to quickly and accurately identify the real intention of the target. The experimental results indicate that TransATIR significantly improves recognition accuracy compared to the existing state-of-the-art methods, and also effectively reduces the computational complexity of the model. 展开更多
关键词 Intentrecognition TRANSFORMER neural architecture search Deep learning Situationalawareness
原文传递
Graph-Embedded Neural Architecture Search: A Variational Approach for Optimized Model Design
2
作者 Kazuki Hemmi Yuki Tanigaki +1 位作者 Kaisei Hara Masaki Onishi 《Computers, Materials & Continua》 2025年第8期2245-2271,共27页
Neural architecture search(NAS)optimizes neural network architectures to align with specific data and objectives,thereby enabling the design of high-performance models without specialized expertise.However,a significa... Neural architecture search(NAS)optimizes neural network architectures to align with specific data and objectives,thereby enabling the design of high-performance models without specialized expertise.However,a significant limitation of NAS is that it requires extensive computational resources and time.Consequently,performing a comprehensive architectural search for each new dataset is inefficient.Given the continuous expansion of available datasets,there is an urgent need to predict the optimal architecture for the previously unknown datasets.This study proposes a novel framework that generates architectures tailored to unknown datasets by mapping architectures that have demonstrated effectiveness on the existing dataset into a latent feature space.As NAS is inherently represented as graph structures,we employed an encoder-decoder transformation model based on variational graph auto-encoders to perform this latent feature mapping.The encoder-decoder transformation model demonstrates strong capability in extracting features from graph structures,making it particularly well-suited for mapping NAS architectures.By training variational graph auto-encoders on existing high-quality architectures,the proposed method constructs a latent space and facilitates the design of optimal architectures for diverse datasets.Furthermore,to effectively define similarity amongarchitectures,wepropose constructing the latent spaceby incorporatingbothdataset andtaskfeatures.Experimental results indicate that our approach significantly enhances search efficiency and outperforms conventional methods in terms of model performance. 展开更多
关键词 neural architecture search automated machine learning artificial intelligence deep learning graph neural network
在线阅读 下载PDF
Neural Architecture Search via Hierarchical Evaluation of Surrogate Models
3
作者 Xiaofeng Liu Yubin Bao Fangling Leng 《Computers, Materials & Continua》 2025年第8期3503-3517,共15页
The rapid development of evolutionary deep learning has led to the emergence of various Neural Architecture Search(NAS)algorithms designed to optimize neural network structures.However,these algorithms often face sign... The rapid development of evolutionary deep learning has led to the emergence of various Neural Architecture Search(NAS)algorithms designed to optimize neural network structures.However,these algorithms often face significant computational costs due to the time-consuming process of training neural networks and evaluating their performance.Traditional NAS approaches,which rely on exhaustive evaluations and large training datasets,are inefficient for solving complex image classification tasks within limited time frames.To address these challenges,this paper proposes a novel NAS algorithm that integrates a hierarchical evaluation strategy based on Surrogate models,specifically using supernet to pre-trainweights and randomforests as performance predictors.This hierarchical framework combines rapid Surrogate model evaluations with traditional,precise evaluations to balance the trade-off between performance accuracy and computational efficiency.The algorithm significantly reduces the time required for model evaluation by predicting the fitness of candidate architectures using a random forest Surrogate model,thus alleviating the need for full training cycles for each architecture.The proposed method also incorporates evolutionary operations such as mutation and crossover to refine the search process and improve the accuracy of the resulting architectures.Experimental evaluations on the CIFAR-10 and CIFAR-100 datasets demonstrate that the proposed hierarchical evaluation strategy reduces the search time and costs compared to traditional methods,while achieving comparable or even superior model performance.The results suggest that this approach can efficiently handle resourceconstrained tasks,providing a promising solution for accelerating the NAS process without compromising the quality of the generated architectures. 展开更多
关键词 neural architecture search hierarchical evaluation image classification Surrogate model
在线阅读 下载PDF
Expo-GAN:A Style Transfer Generative Adversarial Network for Exhibition Hall Design Based on Optimized Cyclic and Neural Architecture Search
4
作者 Qing Xie Ruiyun Yu 《Computers, Materials & Continua》 2025年第6期4757-4774,共18页
This study presents a groundbreaking method named Expo-GAN(Exposition-Generative Adversarial Network)for style transfer in exhibition hall design,using a refined version of the Cycle Generative Adversarial Network(Cyc... This study presents a groundbreaking method named Expo-GAN(Exposition-Generative Adversarial Network)for style transfer in exhibition hall design,using a refined version of the Cycle Generative Adversarial Network(CycleGAN).The primary goal is to enhance the transformation of image styles while maintaining visual consistency,an areawhere current CycleGAN models often fall short.These traditionalmodels typically face difficulties in accurately capturing expansive features as well as the intricate stylistic details necessary for high-quality image transformation.To address these limitations,the research introduces several key modifications to the CycleGAN architecture.Enhancements to the generator involve integrating U-net with SpecTransformer modules.This integration incorporates the use of Fourier transform techniques coupled with multi-head self-attention mechanisms,which collectively improve the generator’s ability to depict both large-scale structural patterns and minute elements meticulously in the generated images.This enhancement allows the generator to achieve a more detailed and coherent fusion of styles,essential for exhibition hall designs where both broad aesthetic strokes and detailed nuances matter significantly.The study also proposes innovative changes to the discriminator by employing dilated convolution and global attention mechanisms.These are derived using the Differentiable Architecture Search(DARTS)Neural Architecture Search framework to expand the receptive field,which is crucial for recognizing comprehensive artistically styled images.By broadening the ability to discern complex artistic features,the model avoids previous pitfalls associated with style inconsistency and missing detailed features.Moreover,the traditional cyde-consistency loss function is replaced with the Learned Perceptual Image Patch Similarity(LPIPS)metric.This shift aims to significantly enhance the perceptual quality of the resultant images by prioritizing human-perceived similarities,which aligns better with user expectations and professional standards in design aesthetics.The experimental phase of this research demonstrates that this novel approach consistently outperforms the conventional CycleGAN across a broad range of datasets.Complementary ablation studies and qualitative assessments underscore its superiority,particularly in maintaining detail fidelity and style continuity.This is critical for creating a visually harmonious exhibitionhall designwhere everydetail contributes to the overall aesthetic appeal.The results illustrate that this refined approach effectively bridges the gap between technical capability and artistic necessity,marking a significant advancement in computational design methodologies. 展开更多
关键词 Exhibition hall design CycleGAN SpecTransformer DARTS neural architecture search LPIPS loss function
在线阅读 下载PDF
Evolutionary neural architecture search for traffic sign recognition
5
作者 SONG Changwei MA Yongjie +1 位作者 PING Haoyu SUN Lisheng 《Optoelectronics Letters》 2025年第7期434-440,共7页
Convolutional neural networks(CNNs)exhibit superior performance in image feature extraction,making them extensively used in the area of traffic sign recognition.However,the design of existing traffic sign recognition ... Convolutional neural networks(CNNs)exhibit superior performance in image feature extraction,making them extensively used in the area of traffic sign recognition.However,the design of existing traffic sign recognition algorithms often relies on expert knowledge to enhance the image feature extraction networks,necessitating image preprocessing and model parameter tuning.This increases the complexity of the model design process.This study introduces an evolutionary neural architecture search(ENAS)algorithm for the automatic design of neural network models tailored for traffic sign recognition.By integrating the construction parameters of residual network(ResNet)into evolutionary algorithms(EAs),we automatically generate lightweight networks for traffic sign recognition,utilizing blocks as the fundamental building units.Experimental evaluations on the German traffic sign recognition benchmark(GTSRB)dataset reveal that the algorithm attains a recognition accuracy of 99.32%,with a mere 2.8×10^(6)parameters.Experimental results comparing the proposed method with other traffic sign recognition algorithms demonstrate that the method can more efficiently discover neural network architectures,significantly reducing the number of network parameters while maintaining recognition accuracy. 展开更多
关键词 traffic sign recognitionhoweverthe expert knowledge image feature extraction model parameter tuningthis evolutionary neural architecture search enas algorithm traffic sign recognition model design image preprocessing
原文传递
A Survey of Accelerator Architectures for Deep Neural Networks 被引量:10
6
作者 Yiran Chen Yuan Xie +2 位作者 Linghao Song Fan Chen Tianqi Tang 《Engineering》 SCIE EI 2020年第3期264-274,共11页
Recently,due to the availability of big data and the rapid growth of computing power,artificial intelligence(AI)has regained tremendous attention and investment.Machine learning(ML)approaches have been successfully ap... Recently,due to the availability of big data and the rapid growth of computing power,artificial intelligence(AI)has regained tremendous attention and investment.Machine learning(ML)approaches have been successfully applied to solve many problems in academia and in industry.Although the explosion of big data applications is driving the development of ML,it also imposes severe challenges of data processing speed and scalability on conventional computer systems.Computing platforms that are dedicatedly designed for AI applications have been considered,ranging from a complement to von Neumann platforms to a“must-have”and stand-alone technical solution.These platforms,which belong to a larger category named“domain-specific computing,”focus on specific customization for AI.In this article,we focus on summarizing the recent advances in accelerator designs for deep neural networks(DNNs)-that is,DNN accelerators.We discuss various architectures that support DNN executions in terms of computing units,dataflow optimization,targeted network topologies,architectures on emerging technologies,and accelerators for emerging applications.We also provide our visions on the future trend of AI chip designs. 展开更多
关键词 Deep neural network Domain-specific architecture ACCELERATOR
在线阅读 下载PDF
Effective distributed convolutional neural network architecture for remote sensing images target classification with a pre-training approach 被引量:3
7
作者 LI Binquan HU Xiaohui 《Journal of Systems Engineering and Electronics》 SCIE EI CSCD 2019年第2期238-244,共7页
How to recognize targets with similar appearances from remote sensing images(RSIs) effectively and efficiently has become a big challenge. Recently, convolutional neural network(CNN) is preferred in the target classif... How to recognize targets with similar appearances from remote sensing images(RSIs) effectively and efficiently has become a big challenge. Recently, convolutional neural network(CNN) is preferred in the target classification due to the powerful feature representation ability and better performance. However,the training and testing of CNN mainly rely on single machine.Single machine has its natural limitation and bottleneck in processing RSIs due to limited hardware resources and huge time consuming. Besides, overfitting is a challenge for the CNN model due to the unbalance between RSIs data and the model structure.When a model is complex or the training data is relatively small,overfitting occurs and leads to a poor predictive performance. To address these problems, a distributed CNN architecture for RSIs target classification is proposed, which dramatically increases the training speed of CNN and system scalability. It improves the storage ability and processing efficiency of RSIs. Furthermore,Bayesian regularization approach is utilized in order to initialize the weights of the CNN extractor, which increases the robustness and flexibility of the CNN model. It helps prevent the overfitting and avoid the local optima caused by limited RSI training images or the inappropriate CNN structure. In addition, considering the efficiency of the Na¨?ve Bayes classifier, a distributed Na¨?ve Bayes classifier is designed to reduce the training cost. Compared with other algorithms, the proposed system and method perform the best and increase the recognition accuracy. The results show that the distributed system framework and the proposed algorithms are suitable for RSIs target classification tasks. 展开更多
关键词 convolutional neural network (CNN) DISTRIBUTED architecture REMOTE SENSING images (RSIs) TARGET classification pre-training
在线阅读 下载PDF
2D and 3D Palmprint and Palm Vein Recognition Based on Neural Architecture Search 被引量:8
8
作者 Wei Jia Wei Xia +2 位作者 Yang Zhao Hai Min Yan-Xiang Chen 《International Journal of Automation and computing》 EI CSCD 2021年第3期377-409,共33页
Palmprint recognition and palm vein recognition are two emerging biometrics technologies.In the past two decades,many traditional methods have been proposed for palmprint recognition and palm vein recognition and have... Palmprint recognition and palm vein recognition are two emerging biometrics technologies.In the past two decades,many traditional methods have been proposed for palmprint recognition and palm vein recognition and have achieved impressive results.In recent years,in the field of artificial intelligence,deep learning has gradually become the mainstream recognition technology because of its excellent recognition performance.Some researchers have tried to use convolutional neural networks(CNNs)for palmprint recognition and palm vein recognition.However,the architectures of these CNNs have mostly been developed manually by human experts,which is a time-consuming and error-prone process.In order to overcome some shortcomings of manually designed CNN,neural architecture search(NAS)technology has become an important research direction of deep learning.The significance of NAS is to solve the deep learning model's parameter adjustment problem,which is a cross-study combining optimization and machine learning.NAS technology represents the future development direction of deep learning.However,up to now,NAS technology has not been well studied for palmprint recognition and palm vein recognition.In this paper,in order to investigate the problem of NAS-based 2D and 3D palmprint recognition and palm vein recognition in-depth,we conduct a performance evaluation of twenty representative NAS methods on five 2D palmprint databases,two palm vein databases,and one 3D palmprint database.Experimental results show that some NAS methods can achieve promising recognition results.Remarkably,among different evaluated NAS methods,Proxyless NAS achieves the best recognition performance. 展开更多
关键词 Performance evaluation neural architecture search BIOMETRICS PALMPRINT palm vein deep learning
原文传递
Ultra-lightweight CNN design based on neural architecture search and knowledge distillation: A novel method to build the automatic recognition model of space target ISAR images 被引量:7
9
作者 Hong Yang Ya-sheng Zhang +1 位作者 Can-bin Yin Wen-zhe Ding 《Defence Technology(防务技术)》 SCIE EI CAS CSCD 2022年第6期1073-1095,共23页
In this paper,a novel method of ultra-lightweight convolution neural network(CNN)design based on neural architecture search(NAS)and knowledge distillation(KD)is proposed.It can realize the automatic construction of th... In this paper,a novel method of ultra-lightweight convolution neural network(CNN)design based on neural architecture search(NAS)and knowledge distillation(KD)is proposed.It can realize the automatic construction of the space target inverse synthetic aperture radar(ISAR)image recognition model with ultra-lightweight and high accuracy.This method introduces the NAS method into the radar image recognition for the first time,which solves the time-consuming and labor-consuming problems in the artificial design of the space target ISAR image automatic recognition model(STIIARM).On this basis,the NAS model’s knowledge is transferred to the student model with lower computational complexity by the flow of the solution procedure(FSP)distillation method.Thus,the decline of recognition accuracy caused by the direct compression of model structural parameters can be effectively avoided,and the ultralightweight STIIARM can be obtained.In the method,the Inverted Linear Bottleneck(ILB)and Inverted Residual Block(IRB)are firstly taken as each block’s basic structure in CNN.And the expansion ratio,output filter size,number of IRBs,and convolution kernel size are set as the search parameters to construct a hierarchical decomposition search space.Then,the recognition accuracy and computational complexity are taken as the objective function and constraint conditions,respectively,and the global optimization model of the CNN architecture search is established.Next,the simulated annealing(SA)algorithm is used as the search strategy to search out the lightweight and high accuracy STIIARM directly.After that,based on the three principles of similar block structure,the same corresponding channel number,and the minimum computational complexity,the more lightweight student model is designed,and the FSP matrix pairing between the NAS model and student model is completed.Finally,by minimizing the loss between the FSP matrix pairs of the NAS model and student model,the student model’s weight adjustment is completed.Thus the ultra-lightweight and high accuracy STIIARM is obtained.The proposed method’s effectiveness is verified by the simulation experiments on the ISAR image dataset of five types of space targets. 展开更多
关键词 Space target ISAR image neural architecture search Knowledge distillation Lightweight model
在线阅读 下载PDF
An Optimized Convolution Neural Network Architecture for Paddy Disease Classification 被引量:2
10
作者 Muhammad Asif Saleem Muhammad Aamir +2 位作者 Rosziati Ibrahim Norhalina Senan Tahir Alyas 《Computers, Materials & Continua》 SCIE EI 2022年第6期6053-6067,共15页
Plant disease classification based on digital pictures is challenging.Machine learning approaches and plant image categorization technologies such as deep learning have been utilized to recognize,identify,and diagnose... Plant disease classification based on digital pictures is challenging.Machine learning approaches and plant image categorization technologies such as deep learning have been utilized to recognize,identify,and diagnose plant diseases in the previous decade.Increasing the yield quantity and quality of rice forming is an important cause for the paddy production countries.However,some diseases that are blocking the improvement in paddy production are considered as an ominous threat.Convolution Neural Network(CNN)has shown a remarkable performance in solving the early detection of paddy leaf diseases based on its images in the fast-growing era of science and technology.Nevertheless,the significant CNN architectures construction is dependent on expertise in a neural network and domain knowledge.This approach is time-consuming,and high computational resources are mandatory.In this research,we propose a novel method based on Mutant Particle swarm optimization(MUT-PSO)Algorithms to search for an optimum CNN architecture for Paddy leaf disease classification.Experimentation results show that Mutant Particle swarm optimization Convolution Neural Network(MUTPSO-CNN)can find optimumCNNarchitecture that offers better performance than existing hand-crafted CNN architectures in terms of accuracy,precision/recall,and execution time. 展开更多
关键词 Deep learning optimum CNN architecture particle swarm optimization convolutional neural network parameter optimization
在线阅读 下载PDF
Scale adaptive fitness evaluation‐based particle swarm optimisation for hyperparameter and architecture optimisation in neural networks and deep learning 被引量:2
11
作者 Ye‐Qun Wang Jian‐Yu Li +2 位作者 Chun‐Hua Chen Jun Zhang Zhi‐Hui Zhan 《CAAI Transactions on Intelligence Technology》 SCIE EI 2023年第3期849-862,共14页
Research into automatically searching for an optimal neural network(NN)by optimi-sation algorithms is a significant research topic in deep learning and artificial intelligence.However,this is still challenging due to ... Research into automatically searching for an optimal neural network(NN)by optimi-sation algorithms is a significant research topic in deep learning and artificial intelligence.However,this is still challenging due to two issues:Both the hyperparameter and ar-chitecture should be optimised and the optimisation process is computationally expen-sive.To tackle these two issues,this paper focusses on solving the hyperparameter and architecture optimization problem for the NN and proposes a novel light‐weight scale‐adaptive fitness evaluation‐based particle swarm optimisation(SAFE‐PSO)approach.Firstly,the SAFE‐PSO algorithm considers the hyperparameters and architectures together in the optimisation problem and therefore can find their optimal combination for the globally best NN.Secondly,the computational cost can be reduced by using multi‐scale accuracy evaluation methods to evaluate candidates.Thirdly,a stagnation‐based switch strategy is proposed to adaptively switch different evaluation methods to better balance the search performance and computational cost.The SAFE‐PSO algorithm is tested on two widely used datasets:The 10‐category(i.e.,CIFAR10)and the 100−cate-gory(i.e.,CIFAR100).The experimental results show that SAFE‐PSO is very effective and efficient,which can not only find a promising NN automatically but also find a better NN than compared algorithms at the same computational cost. 展开更多
关键词 deep learning evolutionary computation hyperparameter and architecture optimisation neural networks particle swarm optimisation scale‐adaptive fitness evaluation
在线阅读 下载PDF
Side channel attacks for architecture extraction of neural networks 被引量:1
12
作者 HervéChabanne Jean-Luc Danger +1 位作者 Linda Guiga Ulrich Kühne 《CAAI Transactions on Intelligence Technology》 EI 2021年第1期3-16,共14页
Side channel attacks(SCAs)on neural networks(NNs)are particularly efficient for retrieving secret information from NNs.We differentiate multiple types of threat scenarios regarding what kind of information is availabl... Side channel attacks(SCAs)on neural networks(NNs)are particularly efficient for retrieving secret information from NNs.We differentiate multiple types of threat scenarios regarding what kind of information is available before the attack and its purpose:recovering hyperparameters(the architecture)of the targeted NN,its weights(parameters),or its inputs.In this survey article,we consider the most relevant attacks to extract the architecture of CNNs.We also categorize SCAs,depending on access with respect to the victim:physical,local,or remote.Attacks targeting the architecture via local SCAs are most common.As of today,physical access seems necessary to retrieve the weights of an NN.We notably describe cache attacks,which are local SCAs aiming to extract the NN's underlying architecture.Few countermeasures have emerged;these are presented at the end of the survey. 展开更多
关键词 architecture NETWORKS neural
在线阅读 下载PDF
Evolutionary Neural Architecture Search and Its Applications in Healthcare 被引量:1
13
作者 Xin Liu Jie Li +3 位作者 Jianwei Zhao Bin Cao Rongge Yan Zhihan Lyu 《Computer Modeling in Engineering & Sciences》 SCIE EI 2024年第4期143-185,共43页
Most of the neural network architectures are based on human experience,which requires a long and tedious trial-and-error process.Neural architecture search(NAS)attempts to detect effective architectures without human ... Most of the neural network architectures are based on human experience,which requires a long and tedious trial-and-error process.Neural architecture search(NAS)attempts to detect effective architectures without human intervention.Evolutionary algorithms(EAs)for NAS can find better solutions than human-designed architectures by exploring a large search space for possible architectures.Using multiobjective EAs for NAS,optimal neural architectures that meet various performance criteria can be explored and discovered efficiently.Furthermore,hardware-accelerated NAS methods can improve the efficiency of the NAS.While existing reviews have mainly focused on different strategies to complete NAS,a few studies have explored the use of EAs for NAS.In this paper,we summarize and explore the use of EAs for NAS,as well as large-scale multiobjective optimization strategies and hardware-accelerated NAS methods.NAS performs well in healthcare applications,such as medical image analysis,classification of disease diagnosis,and health monitoring.EAs for NAS can automate the search process and optimize multiple objectives simultaneously in a given healthcare task.Deep neural network has been successfully used in healthcare,but it lacks interpretability.Medical data is highly sensitive,and privacy leaks are frequently reported in the healthcare industry.To solve these problems,in healthcare,we propose an interpretable neuroevolution framework based on federated learning to address search efficiency and privacy protection.Moreover,we also point out future research directions for evolutionary NAS.Overall,for researchers who want to use EAs to optimize NNs in healthcare,we analyze the advantages and disadvantages of doing so to provide detailed guidance,and propose an interpretable privacy-preserving framework for healthcare applications. 展开更多
关键词 neural architecture search evolutionary computation large-scale multiobjective optimization distributed parallelism healthcare
在线阅读 下载PDF
A survey on computationally efficient neural architecture search 被引量:2
14
作者 Shiqing Liu Haoyu Zhang Yaochu Jin 《Journal of Automation and Intelligence》 2022年第1期8-22,共15页
Neural architecture search(NAS)has become increasingly popular in the deep learning community recently,mainly because it can provide an opportunity to allow interested users without rich expertise to benefit from the ... Neural architecture search(NAS)has become increasingly popular in the deep learning community recently,mainly because it can provide an opportunity to allow interested users without rich expertise to benefit from the success of deep neural networks(DNNs).However,NAS is still laborious and time-consuming because a large number of performance estimations are required during the search process of NAS,and training DNNs is computationally intensive.To solve this major limitation of NAS,improving the computational efficiency is essential in the design of NAS.However,a systematic overview of computationally efficient NAS(CE-NAS)methods still lacks.To fill this gap,we provide a comprehensive survey of the state-of-the-art on CE-NAS by categorizing the existing work into proxy-based and surrogate-assisted NAS methods,together with a thorough discussion of their design principles and a quantitative comparison of their performances and computational complexities.The remaining challenges and open research questions are also discussed,and promising research topics in this emerging field are suggested. 展开更多
关键词 neural architecture search(NAS) One-shot NAS Surrogate model Bayesian optimization Performance predictor
在线阅读 下载PDF
Deep Neural Network Architecture Search via Decomposition-Based Multi-Objective Stochastic Fractal Search 被引量:1
15
作者 Hongshang Xu Bei Dong +1 位作者 Xiaochang Liu Xiaojun Wu 《Intelligent Automation & Soft Computing》 2023年第11期185-202,共18页
Deep neural networks often outperform classical machine learning algorithms in solving real-world problems.However,designing better networks usually requires domain expertise and consumes significant time and com-puti... Deep neural networks often outperform classical machine learning algorithms in solving real-world problems.However,designing better networks usually requires domain expertise and consumes significant time and com-puting resources.Moreover,when the task changes,the original network architecture becomes outdated and requires redesigning.Thus,Neural Architecture Search(NAS)has gained attention as an effective approach to automatically generate optimal network architectures.Most NAS methods mainly focus on achieving high performance while ignoring architectural complexity.A myriad of research has revealed that network performance and structural complexity are often positively correlated.Nevertheless,complex network structures will bring enormous computing resources.To cope with this,we formulate the neural architecture search task as a multi-objective optimization problem,where an optimal architecture is learned by minimizing the classification error rate and the number of network parameters simultaneously.And then a decomposition-based multi-objective stochastic fractal search method is proposed to solve it.In view of the discrete property of the NAS problem,we discretize the stochastic fractal search step size so that the network architecture can be optimized more effectively.Additionally,two distinct update methods are employed in step size update stage to enhance the global and local search abilities adaptively.Furthermore,an information exchange mechanism between architectures is raised to accelerate the convergence process and improve the efficiency of the algorithm.Experimental studies show that the proposed algorithm has competitive performance comparable to many existing manual and automatic deep neural network generation approaches,which achieved a parameter-less and high-precision architecture with low-cost on each of the six benchmark datasets. 展开更多
关键词 Deep neural network neural architecture search multi-objective optimization stochastic fractal search DECOMPOSITION
在线阅读 下载PDF
APPLICATION OF ARCHITECTURE- BASED NEURAL NETWORKS IN MODELING AND PARAMETER OPTIMIZATION OF HYDRAULIC BUMPER 被引量:1
16
作者 Yang Haiwei Zhan Yongqi Qiao Junwei Shi GuanglinSchool of Mechanical Engineering,Shanghai Jiaotong University,Shanghai 200030, China 《Chinese Journal of Mechanical Engineering》 SCIE EI CAS CSCD 2003年第3期313-316,共4页
The dynamic working process of 52SFZ-140-207B type of hydraulic bumper isanalyzed. The modeling method using architecture-based neural networks is introduced. Using thismodeling method, the dynamic model of the hydrau... The dynamic working process of 52SFZ-140-207B type of hydraulic bumper isanalyzed. The modeling method using architecture-based neural networks is introduced. Using thismodeling method, the dynamic model of the hydraulic bumper is established; Based on this model thestructural parameters of the hydraulic bumper are optimized with Genetic algorithm. The result showsthat the performance of the dynamic model is close to that of the hydraulic bumper, and the dynamicperformance of the hydraulic bumper is improved through parameter optimization. 展开更多
关键词 architecture-based neural networks MODELING Parameter optimization Hydraulic bumper
在线阅读 下载PDF
A Double-Branch Xception Architecture for Acute Hemorrhage Detection and Subtype Classification
17
作者 Muhammad Naeem Akram Muhammad Usman Yaseen +2 位作者 Muhammad Waqar Muhammad Imran Aftab Hussain 《Computers, Materials & Continua》 SCIE EI 2023年第9期3727-3744,共18页
This study presents a deep learning model for efficient intracranial hemorrhage(ICH)detection and subtype classification on non-contrast head computed tomography(CT)images.ICH refers to bleeding in the skull,leading t... This study presents a deep learning model for efficient intracranial hemorrhage(ICH)detection and subtype classification on non-contrast head computed tomography(CT)images.ICH refers to bleeding in the skull,leading to the most critical life-threatening health condition requiring rapid and accurate diagnosis.It is classified as intra-axial hemorrhage(intraventricular,intraparenchymal)and extra-axial hemorrhage(subdural,epidural,subarachnoid)based on the bleeding location inside the skull.Many computer-aided diagnoses(CAD)-based schemes have been proposed for ICH detection and classification at both slice and scan levels.However,these approaches performonly binary classification and suffer from a large number of parameters,which increase storage costs.Further,the accuracy of brain hemorrhage detection in existing models is significantly low for medically critical applications.To overcome these problems,a fast and efficient system for the automatic detection of ICH is needed.We designed a double-branch model based on xception architecture that extracts spatial and instant features,concatenates them,and creates the 3D spatial context(common feature vectors)fed to a decision tree classifier for final predictions.The data employed for the experimentation was gathered during the 2019 Radiologist Society of North America(RSNA)brain hemorrhage detection challenge.Our model outperformed benchmark models and achieved better accuracy in intraventricular(99.49%),subarachnoid(99.49%),intraparenchymal(99.10%),and subdural(98.09%)categories,thereby justifying the performance of the proposed double-branch xception architecture for ICH detection and classification. 展开更多
关键词 Computed tomography convolutional neural networks intracranial hemorrhage xception architecture
在线阅读 下载PDF
A leap forward in compute-in-memory system for neural network inference
18
作者 Liang Chu Wenjun Li 《Journal of Semiconductors》 2025年第4期5-7,共3页
Developing efficient neural network(NN)computing systems is crucial in the era of artificial intelligence(AI).Traditional von Neumann architectures have both the issues of"memory wall"and"power wall&quo... Developing efficient neural network(NN)computing systems is crucial in the era of artificial intelligence(AI).Traditional von Neumann architectures have both the issues of"memory wall"and"power wall",limiting the data transfer between memory and processing units[1,2].Compute-in-memory(CIM)technologies,particularly analogue CIM with memristor crossbars,are promising because of their high energy efficiency,computational parallelism,and integration density for NN computations[3].In practical applications,analogue CIM excels in tasks like speech recognition and image classification,revealing its unique advantages.For instance,it efficiently processes vast amounts of audio data in speech recognition,achieving high accuracy with minimal power consumption.In image classification,the high parallelism of analogue CIM significantly speeds up feature extraction and reduces processing time.With the boosting development of AI applications,the demands for computational accuracy and task complexity are rising continually.However,analogue CIM systems are limited in handling complex regression tasks with needs of precise floating-point(FP)calculations.They are primarily suited for the classification tasks with low data precision and a limited dynamic range[4]. 展开更多
关键词 neural network von neumann architectures compute memory INFERENCE MEMRISTOR artificial intelligence ai traditional memristor crossbarsare analogue cim
在线阅读 下载PDF
面向分布式计算的类脑智能处理器指令集架构设计
19
作者 冯烁 路冬冬 +6 位作者 尹飞 杨剑新 班冬松 何军 颜世云 李媛 雎浩宇 《计算机研究与发展》 北大核心 2026年第1期1-14,共14页
作为分布式计算的典型体现之一,端边云协同计算系统能够有效推动物联网、大模型、数字孪生等人工智能技术的垂直落地应用。类脑计算是一种受大脑工作方式启发而提出的智能计算技术,具有能效高、速度快、容错度高、可扩展性强等优点。通... 作为分布式计算的典型体现之一,端边云协同计算系统能够有效推动物联网、大模型、数字孪生等人工智能技术的垂直落地应用。类脑计算是一种受大脑工作方式启发而提出的智能计算技术,具有能效高、速度快、容错度高、可扩展性强等优点。通过利用脉冲神经网络的事件驱动机制和脉冲稀疏发放等特性,类脑计算能够极大地提升分布式端边云系统的实时处理能力和能量效率。针对分布式终端设备的高实时、低功耗、强异构等特点,聚焦于指令集架构这一软硬件的交互界面,给出了一种立足现有系统、易于部署升级、安全自主可控、异构融合兼容的硬件设计方案,一共提出了12条类脑计算指令,完成了基于某国产指令系统的类脑指令集和对应微结构的定制化设计,为类脑计算赋能分布式计算系统奠定了技术基础。 展开更多
关键词 分布式计算 类脑智能 脉冲神经网络 指令集架构 处理器微结构 神经拟态芯片
在线阅读 下载PDF
GPT-NAS:Neural Architecture Search Meets Generative Pre-Trained Transformer Model
20
作者 Caiyang Yu Xianggen Liu +5 位作者 Yifan Wang Yun Liu Wentao Feng Xiong Deng Chenwei Tang Jiancheng Lv 《Big Data Mining and Analytics》 2025年第1期45-64,共20页
The pursuit of optimal neural network architectures is foundational to the progression of Neural Architecture Search (NAS). However, the existing NAS methods suffer from the following problem using traditional search ... The pursuit of optimal neural network architectures is foundational to the progression of Neural Architecture Search (NAS). However, the existing NAS methods suffer from the following problem using traditional search strategies, i.e., when facing a large and complex search space, it is difficult to mine more effective architectures within a reasonable time, resulting in inferior search results. This research introduces the Generative Pre-trained Transformer NAS (GPT-NAS), an innovative approach designed to overcome the limitations which are inherent in traditional NAS strategies. This approach improves search efficiency and obtains better architectures by integrating GPT model into the search process. Specifically, we design a reconstruction strategy that utilizes the trained GPT to reorganize the architectures obtained from the search. In addition, to equip the GPT model with the design capabilities of neural architecture, we propose the use of the GPT model for training on a neural architecture dataset. For each architecture, the structural information of its previous layers is utilized to predict the next layer of structure, iteratively traversing the entire architecture. In this way, the GPT model can efficiently learn the key features required for neural architectures. Extensive experimental validation shows that our GPT-NAS approach beats both manually constructed neural architectures and automatically generated architectures by NAS. In addition, we validate the superiority of introducing the GPT model in several ways, and find that the accuracy of the neural architecture on the image dataset obtained from the search after introducing the GPT model is improved by up to about 9%. 展开更多
关键词 neural architecture Search(NAS) Generative Pre-trained Transformer(GPT)model evolutionary algorithm image classification
原文传递
上一页 1 2 26 下一页 到第
使用帮助 返回顶部