期刊文献+

LayCO:Achieving Least Lossy Accuracy for Most EfficientRRAM-Based Deep Neural Network Accelerator via Layer-Centric Co-Optimization

原文传递
导出
摘要 Resistive random access memory(RRAM)enables the functionality of operating massively parallel dot products and accumulations.RRAM-based accelerator is such an effective approach to bridging the gap between Internet of Things devices’constrained resources and deep neural networks’tremendous cost.Due to the huge overhead of Analog to Digital(A/D)and digital accumulations,analog RRAM buffer is introduced to extend the processing in analog and in approximation.Although analog RRAM buffer offers potential solutions to A/D conversion issues,the energy consumption is still challenging in resource-constrained environments,especially with enormous intermediate data volume.Besides,critical concerns over endurance must also be resolved before the RRAM buffer could be frequently used in reality for DNN inference tasks.Then we propose LayCO,a layer-centric co-optimizing scheme to address the energy and endurance concerns altogether while strictly providing an inference accuracy guarantee.LayCO relies on two key ideas:1)co-optimizing with reduced supply voltage and reduced bit-width of accelerator architectures to increase the DNN’s error tolerance and achieve the accelerator’s energy efficiency,and 2)efficiently mapping and swapping individual DNN data to a corresponding RRAM partition in a way that meets the endurance requirements.The evaluation with representative DNN models demonstrates that LayCO outperforms the baseline RRAM buffer based accelerator by 27x improvement in energy efficiency(over TIMELY-like configuration),308x in lifetime prolongation and 6x in area reduction(over RAQ)while maintaining the DNN accuracy loss less than 1%.
作者 赵少锋 王芳 刘博 冯丹 刘洋 Shao-Feng Zhao;Fang Wang;Bo Liu;Dan Feng;Yang Liu(Wuhan National Laboratory for Optoelectronics,Huazhong University of Science and Technology,Wuhan 430074,China;School of Computer Science and Technology,Huazhong University of Science and Technology,Wuhan 430074,China;Cloud Computing and Big Data Institute,Henan University of Economics and Law,Zhengzhou 450001,China;Research Institute of Huazhong University of Science and Technology in Shenzhen,Shenzhen 518057,China;School of Computer and Artificial Intelligence,Zhengzhou University,Zhengzhou 450001,China)
出处 《Journal of Computer Science & Technology》 SCIE EI CSCD 2023年第2期328-347,共20页 计算机科学技术学报(英文版)
基金 supported by the National Natural Science Foundation of China under Grant Nos.U22A2027,61832020,61832007,61821003 the Science Technology and Innovation Commission of Shenzhen Municipality under Grant No.JCYJ20210324141601005 the Henan Provincial Science and Technology Key Project Foundation under Grant Nos.212102310085,222102210054,222102210154,222102210252.
  • 相关文献

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部