期刊文献+
共找到2篇文章
< 1 >
每页显示 20 50 100
ArchSim:A System-Level Parallel Simulation Platform for the Architecture Design of High Performance Computer 被引量:4
1
作者 黄永勤 李宏亮 +4 位作者 谢向辉 钱磊 郝子宇 过锋 张昆 《Journal of Computer Science & Technology》 SCIE EI CSCD 2009年第5期901-912,共12页
High performance computer (HPC) is a complex huge system, of which the architecture design meets increasing difficulties and risks. Traditional methods, such as theoretical analysis, component-level simulation and s... High performance computer (HPC) is a complex huge system, of which the architecture design meets increasing difficulties and risks. Traditional methods, such as theoretical analysis, component-level simulation and sequential simulation, are not applicable to system-level simulations of HPC systems. Even the parallel simulation using large-scale parallel machines also have many difficulties in scalability, reliability, generality, as well as efficiency. According to the current needs of HPC architecture design, this paper proposes a system-level parallel simulation platform: ArchSim. We first introduce the architecture of ArchSim simulation platform which is composed of a global server (GS), local server agents (LSA) and entities. Secondly, we emphasize some key techniques of ArchSim, including the synchronization protocol, the communication mechanism and the distributed checkpointing/restart mechanism. We then make a synthesized test of some main performance indices of ArchSim with the phold benchmark and analyze the extra overhead generated by ArchSim. Finally, based on ArchSim, we construct a parallel event-driven interconnection network simulator and a system-level simulator for a small scale HPC system with 256 processors. The results of the performance test and HPC system simulations demonstrate that ArchSim can achieve high speedup ratio and high scalability on parallel host machine and support system-level simulations for the architecture design of HPC systems. 展开更多
关键词 high performance computer architecture system-level parallel simulation synchronization protocol message communication distributed checkpointing/restart
原文传递
Distributed Heterogeneous Spiking Neural Network Simulator Using Sunway Accelerators
2
作者 Xuelei Li Zhichao Wang +3 位作者 Yi Pan Jintao Meng Shengzhong Feng Yanjie Wei 《Big Data Mining and Analytics》 CSCD 2024年第4期1301-1320,共20页
Spiking Neural Network(SNN)simulation is very important for studying brain function and validating the hypotheses for neuroscience,and it can also be used in artificial intelligence.Recently,GPU-based simulators have ... Spiking Neural Network(SNN)simulation is very important for studying brain function and validating the hypotheses for neuroscience,and it can also be used in artificial intelligence.Recently,GPU-based simulators have been developed to support the real-time simulation of SNN.However,these simulators’simulating performance and scale are severely limited,due to the random memory access pattern and the global communication between devices.Therefore,we propose an efficient distributed heterogeneous SNN simulator based on the Sunway accelerators(including SW26010 and SW26010pro),named SWsnn,which supports accurate simulation with small time step(1/16 ms),randomly delay sizes for synapses,and larger scale network computing.Compared with existing GPUs,the Local Dynamic Memory(LDM)(similar to cache)in Sunway is much bigger(4 MB or 16 MB in each core group).To improve the simulation performance,we redesign the network data storage structure and the synaptic plasticity flow to make most random accesses occur in LDM.SWsnn hides Message Passing Interface(MPI)-related operations to reduce communication costs by separating SNN general workflow.Besides,SWsnn relies on parallel Compute Processing Elements(CPEs)rather than serial Manage Processing Element(MPE)to control the communicating buffers,using Register-Level Communication(RLC)and Direct Memory Access(DMA).In addition,SWsnn is further optimized using vectorization and DMA hiding techniques.Experimental results show that SWsnn runs 1.4−2.2 times faster than state-of-the-art GPU-based SNN simulator GPU-enhanced Neuronal Networks(GeNN),and supports much larger scale real-time simulation. 展开更多
关键词 Spiking Neural Network(SNN)simulation Sunway accelerator random access message Passing Interface(MPI)communication real-time simulation
原文传递
上一页 1 下一页 到第
使用帮助 返回顶部