期刊文献+

一种面向写穿透Cache的写合并设计及验证 被引量:2

A Write Coalescing Design and Verification for Write-Through Cache
在线阅读 下载PDF
导出
摘要 为了利用片上缓冲技术来提高处理器应用性能,提出一种面向写穿透Cache的写合并设计方法.使用同步动态随机存储器(SDRAM)的单个写方式和片上写缓冲器,对SDRAM一行内的局部数据采用写合并策略,由此提高了外部存储的访问效率,同时给出了连续和单个Cache读写的缓存与内存的数据一致性策略.在寄存器传输语言(RTL)仿真环境下使用mp3解码对Leon2处理器进行数据测试,结果表明:在缓冲区优化为3行8列的参数下,SDRAM每次行开启平均进行7.8个字的写入操作,外存的读写效率由12%提高到19%;在TSMC0.18μm工艺下,综合后面积为0.263mm2,流片后工作主频为100MHz. A write coalescing design for write-through cache is proposed to promote the performance of application processors using on-chip buffer. The scheme of write coalition for the local data in the same row of SDRAM (synchronous dynamic random access memory) is designed to improve accessing efficiency by employing the single write mode of SDRAM and on-chip buffer. A coherence scheme for the single or multiple cache data reading or writing is also presented. Simulation for mp3 decoding data is implemented in the RTL (register transmit language) simulation environment for Leon2 processor. The simulation results show that when the on-chip buffers are optimized at 3 rows and 8 columns, average 7.8 words are written into the SDRAM after every row pre-charge, and the accessing efficiency increases from 12% to 19% at 100 MHz, and that the area of the proposed design is 0. 263 mm^2 under TSMC 0. 18μm process.
出处 《西安交通大学学报》 EI CAS CSCD 北大核心 2010年第4期1-4,共4页 Journal of Xi'an Jiaotong University
基金 国家自然科学青年基金资助项目(60905007) 国家高技术研究发展计划资助项目(2009AA01Z307 2009AA011709)
关键词 写穿透 写合并 处理器 同步读写存储器 读写效率 write through write coalescing memory access efficiency processor synchronous dynamic random access
  • 相关文献

参考文献5

  • 1Intel Inc.Intel XScale technology overview[EB/OL].[2009-07-11].http://www.intel.com/design/intelx-scale/.
  • 2Gaisler.Leon SPARC V8 processor[EB/OL].[2009-07-11].http://www.gaisler.com.
  • 3林伟,叶笑春,宋风龙,张浩.众核处理器中使用写掩码实现混合写回/写穿透策略[J].计算机学报,2008,31(11):1918-1928. 被引量:5
  • 4KANG S Y,PARK S,JUNG H Y,et al.Perform-ance tradeoffs in using NVRAM write buffer for flash memory-based storage devices[J].IEEE Transactions on Computers,2009,58(6):44-758.
  • 5KANG W Y,SON S H,STANKOVIC J A.Power-aware data buffer cache management in real-time em-bedded databases[C] //The 14th IEEE International Conference on Embedded and Real-Time Computing Systems and Applications.Piscataway.NJ,USA:IEEE,2008:35-44.

二级参考文献21

  • 1Huang He, Yuan Nan, Lin Wei et al. Architecture supported synchronization-based cache coherence protocol for manycore Processors//Proceedings of the ISCA Workshop on Chip Multiprocessor Memory Systems and Interconnects. Beijing, China, 2008:51-53
  • 2Asanovic Krste, Bodik Ras et al. The landscape of parallel computing research: A view from Berkeley. University of California, California, USA: Technical Report UCB/EECS- 2006-183, 2006
  • 3Culler David E, Singh Jaswinder Pal, Gupta Anoop. Parallel Computer Architecture: A Hardware/Software Approach. San Fransisco, USA: Morgan Kaufmann, 1998
  • 4Iftode Liviu, Singh Jaswinder Pal, Li Kai. Scope consistency: A bridge between release consistency and entry consistency//Proceedings of the 8th Annual ACM Symposium on Par allel Algorithms and Architectures. Padua, Italy, 1996: 277-287
  • 5Karp A H, Sarkar Vivek. Data merging for shared-memory multiprocessors//Proceedings of the 26th Hawaii International Conference on System Sciences. Hawaii, USA, 1993: 244-256
  • 6Lenoski Daniel, Laudon James, Joe Truman et al. The dash prototype: Implementation and performance//Proceedings of the 19th International Symposium on Computer Architecture. Queensland, Australia, 1992; 92-103
  • 7Lamport Leslie. How to make a multiprocessor computer that correctly executes multiproeess program. IEEE Transactions on Computers, 1979, 28(9): 690-691
  • 8Adve Sarita V, Hill Mark D. Weak ordering-A new definition//Proeeedings of the 17th International Symposium on Computer Architecture. Seattle, USA, 1990:2-14
  • 9Gharachorloo Kourosh, Lenoski Daniel et al. Memory consistency and event ordering in scalable shared-memory multiprocessors//Proceedings of the 17th International Symposium on Computer Architecture. Seattle, USA, 1990:15-26
  • 10del Cuvillo Juan, Zhu Wei-Rong et al. Toward a software infrastructure for the cyclops-64 cellular architecture//Proceedings of the 20th International Symposium on High-Performance Computing in an Advanced Collaborative Environment. St. John's, Canada, 2006:9-20

共引文献4

同被引文献15

引证文献2

二级引证文献2

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部