期刊文献+

消除VLIW结构上的循环体间冗余流相关 被引量:1

Eliminate Redundant Loop-Carried Flow Dependences for VLIW Architectures
在线阅读 下载PDF
导出
摘要 数据相关是并行处理的基本依据 .该文指出 ,VLIW(very long instruction word)特有的锁步性质使其数据相关性分析具有与众不同的特点 .同一体差上的流相关形成一个线序集合 ,多体差上的特征流相关之间也存在包含关系 .据此 ,提出一种用于 VL IW的消除循环体间冗余流相关的方法 .该方法是完备的 ,可以去除所有冗余的体间流相关 ,从而减轻循环调度的负担 .文章给出判定单体差和多体差存在冗余的充分必要条件 ,以及消除冗余的线性复杂度的算法 .这种方法具有普遍意义 ,可作为 VL IW上软件流水和多指令流调度的基础 . Data dependence is the basic guideline and constraint for parallel processing. The structural distinctiveness of VLIW (very long instruction word), i.e., the lockstep property, leads to remarkable uniqueness in its data dependence analysis, as is shown in this paper. Flow dependences with the same iteration distance comprise a linear ordered set, while characteristic flow dependences with different iteration distances may also have inclusion relationship between each other. From such discovery, a novel method is presented to eliminate redundant loop carried flow dependences for VLIW architectures, so as to facilitate loop scheduling. The method is complete in that it gets rid of all redundancies. The sufficient and necessary preconditions to identify redundancy for single and multiple iteration distances are given, and a linear ordered algorithm is described. The method is generic, and can serve as a foundation for research on VLIW, such as software pipelining and multi instruction stream scheduling.
出处 《软件学报》 EI CSCD 北大核心 2000年第1期126-132,共7页 Journal of Software
基金 国家自然科学基金! (No.6 97730 2 8)资助
关键词 VLIW 循环调度 数据相关 冗余 并行处理 VLIW (very long instruction word), loop scheduling, data dependence, loop, redundancy.
  • 相关文献

参考文献2

  • 1Kai Hwang,Advanced Computer Architecture.Parallelism,Scalability,Programmability,1993年,457页
  • 2Psarris K,IEEE Trans Parallel Distributed Systems,1993年,4卷,11期,1280页

同被引文献8

  • 1SCHUTTEN R J, de HAAN G. Real-time 2-3 pull-down elimination applying motion estimation/compensation in a programmable device[J]. IEEE Trans on Consumer Electronics, 1998, 44(3): 930-938.
  • 2MOSCHETTI F, KUNT M, DEBES E. A statistical adaptive block-matching motion estimation[J]. IEEE Transactions on Circuits and Systems for Video Technology, 2003, 13(5): 417-431.
  • 3PETERS H, SETHURAMAN R, BERIC A, et al. Application specific instruction-set processor template for motion estimation in video applications[J]. IEEE Transactions on Circuits and Systems for Video Technology, 2005, 15(4): 508-527.
  • 4SUHYUN Kim, SOO-MOOK Moon, JINPYO Park, et al. Unroll-based copy elimination for enhanced pipeline scheduling[J]. IEEE Transactions on Computers, 2002, 51(9): 977-994.
  • 5SHYH-KWEI Chen, FUCHS W K. Compiler-assisted multiple instruction word retry for VLIW architectures[J]. IEEE Transactions on Parallel and Distributed Systems, 2001, 12(12): 1293-1304.
  • 6SOO-MOOK Moon, CARSON S D. Generalized multiway branch unit for VLIW microprocessors[J]. IEEE Transactions on Parallel and Distributed Systems, 1995, 6(8): 850-862.
  • 7王元元,张嗣元,刘又诚,王雷.基于VLIW的指令格式的研究与设计[J].计算机工程与应用,2003,39(3):72-74. 被引量:5
  • 8李学明,李继.用超长指令实现DCT的新算法[J].电子学报,2003,31(7):1074-1077. 被引量:11

引证文献1

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部