期刊文献+

嵌入式处理器中的寄存器堆延迟写回技术 被引量:1

Register File Delay-Writeback in Embedded Microprocessors
在线阅读 下载PDF
导出
摘要 为了降低嵌入式处理器中寄存器堆的功耗,提出一种基于限制取指的寄存器堆延迟写回技术.对于嵌入式处理器,传统的寄存器堆延迟写回技术带来的效果并不明显,文中根据处理器前端比后端快的特点,采用限制取指技术提高寄存器堆延迟写回的效果,不仅大幅度地消除了对寄存器堆不必要的写操作,同时也降低了处理器前端的功耗.FPGA平台上的实验结果表明:在不影响程序性能的情况下,应用该技术后,EEMBC程序对定点寄存器堆的写操作减少了35%,对ICache的访问减少了15%,且没有额外的开销. Abstract To reduce the power consumption of register file in embedded microprocessors, a register file delay-writeback technique based on fetch-throttling is proposed. Due to the fact that the front-end is faster than the back-end in microprocessors, traditional register file delay-writeback techniques are not effective. Our method improves the efficiency greatly through fetch-throttling. Not only the redundant writes to register file are reduced, but also the power of processor front-end is reduced. The experimental results on FPGA show that the proposed technique reduced 35% writes to register file and 15% accesses to ICache for EEMBC programs with little performance loss and no additional overhead.
出处 《计算机辅助设计与图形学学报》 EI CSCD 北大核心 2009年第8期1182-1188,共7页 Journal of Computer-Aided Design & Computer Graphics
基金 国家"九七三"重点基础研究发展计划项目(2005CB321603) 国家自然科学基金(60673146 60703017 60736102 60803029) 国家"八六三"高技术研究发展计划(2007AA01Z114 2009AA01Z125)
关键词 嵌入式处理器 寄存器堆 限制取指 延迟写回 embedded processor register file fetch throttling delay-writeback
  • 相关文献

参考文献11

  • 1Zeng H, Ghose K. Register file caching for energy efficiency [C] //Proceedings of the International Symposium on Low Power Electronics and Design, Tegernsee, 2006:244-249.
  • 2Kondo M, Nakamura H. A small, fast and low-power register file by bit-partitioning[C] //Proceedings of the 11th International Symposium on High Performance Computer Architecture, San Francisco, 2005:40-49.
  • 3Ergin O, Balkan D, Ponomarev D, et al. Early register deallocation mechanisms using checkpointed register files [J]. IEEE Transactions on Computers, 2006, 55(9): 1153- 1166.
  • 4Jones T M, O'Boyle M F R, Abella J, et al. Compiler directed early register release [C] //Proceedings of the 14th International Conference on Parallel Architectures and Compilation Techniques, Krasnoyarsk, 2005:110-119.
  • 5Savransky G, Ronen R, Gonzalez A. Lazy retirement: a power aware register management mechanism [OL]. (2002- 05-25) [2008- 08- 27]. http://www, ece. rochester, edu/-albonesi/wced02/papers/savransky, pdf.
  • 6Balkan D, Sharkey J, Ponomarev D, et al. Selective writeback: exploiting transient values for energy-efficiency and performance [C] //Proceedings of the International Symposium on Low Power Electronics and Design, Tegernsee, 2006:37-42.
  • 7Balkan D, Sharkey J, Ponomarev D, et al. Predicting and exploiting transient values for reduced register file pressure and energy consumption [J]. IEEE Transactions on Computers, 2008, 57(1): 82-95.
  • 8Smith J E, Sohi G S. The microarchitecture of superscalar processors [J]. Proceedings of the IEEE, 1995, 83 (12) : 1609-1624.
  • 9胡伟武,唐志敏.龙芯1号处理器结构设计[J].计算机学报,2003,26(4):385-396. 被引量:53
  • 10Aragon J L, Gonzalez J, Gonzhlez A. Power aware control speculation through selective throttling [C]//Proceedings of the 9th International Symposium on High-Performance Computer Architecture, Anaheim, 2003:103-112.

二级参考文献8

  • 1[1]Divid Patterson,John Hennessy. Computer Architecture: A Quantitative Approach. Morgan Kaufmann Publishers, 1996
  • 2[2]Kessler R. The Alpha 21264 Microprocessor. IEEE Micro, 1999,19(2): 24~36
  • 3[3]Kenneth Yeager. The MIPS R10000 Superscalar Microprocessor. IEEE Micro, 1996,16(2): 28~41
  • 4[4]Tim Horel, Gary Lauterbach. UntraSparc-III: Designing Third-Generation 64-bit Performance. IEEE Micro, 1999,19(3): 73~85
  • 5[5]Ashok Kumar. The HP PA-8000 RISC CPU. IEEE Micro, 1997,17(2): 27~32
  • 6[6]Joel Tendler, Steve Dodson, Steve Fields, Hung Le, Balaram Sinharoy. Power4 System Microarchitecture. IBM Technical White Paper, 2001
  • 7[7]Huck J et al. Introducing the IA-64 Architecture. IEEE Micro, 2000,20(5): 12~23
  • 8[8]Glenn Hinton, Dave Sager, Mike Upton, Darrell Boggs, Doug Carmean, Alan Kyker, Patrice Roussel. The Microarchitecture of the Pentium 4 Processor. Intel Technology, 2001

共引文献52

同被引文献5

  • 1Ujval J Kapasi, William J Dally, Scott Rixner, et al. The imagine stream processor[C]//Proceedings of the 2002 IEEE International Conference on Computer De- sign: VLSI in Computers and Processors (ICCD' 02). Germany: Freiburg, 2002.
  • 2Scott Rixner, William J Dally, Brucek Khailany, et al. Register organization for media processing high- per- formance computer architecture[C]/// HPCA- 6 Pro- ceedings Sixth International Symposium on Digital Ob- ject Identifier. France:Fouluse,2000.
  • 3Tay Jyi Lin, Chen chia Lee, Chih wei Liu,et al. A no- vel register organization for VL1W digital signal pro- cessors[C]// IEEE VLSI-TSA International Sympo- sium on VLSI DesigrL Tai Wan, Hisinehu,2005.
  • 4Yingchao Zhao, Chun Jason Xue, Miming Li, et al. Energy-aware register file repartitioning for clustered VLIW architectures[C]// Design Automation Confer- ence. Japan: Yokohama, 2009.
  • 5唐骞,杨小雪.VLIW处理器的设计与实现[J].微型机与应用,2010,29(11):27-29. 被引量:1

引证文献1

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部