期刊文献+
共找到2篇文章
< 1 >
每页显示 20 50 100
LACC:a hardware and software co-design accelerator for deep neural networks
1
作者 Yu Yong Zhi Tian Zhou Shengyuan 《High Technology Letters》 EI CAS 2021年第1期62-67,共6页
With the increasing of data size and model size,deep neural networks(DNNs)show outstanding performance in many artificial intelligence(AI)applications.But the big model size makes it a challenge for high-performance a... With the increasing of data size and model size,deep neural networks(DNNs)show outstanding performance in many artificial intelligence(AI)applications.But the big model size makes it a challenge for high-performance and low-power running DNN on processors,such as central processing unit(CPU),graphics processing unit(GPU),and tensor processing unit(TPU).This paper proposes a LOGNN data representation of 8 bits and a hardware and software co-design deep neural network accelerator LACC to meet the challenge.LOGNN data representation replaces multiply operations to add and shift operations in running DNN.LACC accelerator achieves higher efficiency than the state-of-the-art DNN accelerators by domain specific arithmetic computing units.Finally,LACC speeds up the performance per watt by 1.5 times,compared to the state-of-the-art DNN accelerators on average. 展开更多
关键词 deep neural network(DNN) domain specific accelerator domain specific data type
在线阅读 下载PDF
Editorial for the special issue on operating systems and programming systems for HPC
2
作者 Xiaobing Feng Minyi Guo 《CCF Transactions on High Performance Computing》 2020年第4期307-308,共2页
With the coming of exascale computing era,programming systems and operating systems(including runtime systems)are facing several challenges.In aspect of architecture,increasing deeper level of parallelism,heterogeneit... With the coming of exascale computing era,programming systems and operating systems(including runtime systems)are facing several challenges.In aspect of architecture,increasing deeper level of parallelism,heterogeneity,and the adoption of diverse domain specific accelerators raise the urgent need for programmability,performance optimization and portability.On the other side,big data analytics and machine learning applications demand to be ported and optimized on modern HPC systems.This issue focuses on the novel ideas,methods,as well as efforts of system software development for resolving the above challenges,and to fill the gap between applications and the underlying hardware systems. 展开更多
关键词 data analytics operating systems including ported optimized runtime systems machine learning operating systems domain specific accelerators exascale computing
在线阅读 下载PDF
上一页 1 下一页 到第
使用帮助 返回顶部