期刊文献+

基于编译指示的向量化方法 被引量:3

Vectorization Method Based on Compiler Directive
在线阅读 下载PDF
导出
摘要 编译器由于程序分析能力不足,无法自动实现循环向量化或者会造成盲目自动向量化。为此,提出一种基于编译指示的向量化方法。通过在代码中插入向量化编译指示语句,指导自动向量化编译工具的处理过程,自动生成高效的向量化代码。测试结果表明,该方法能够有效提高目标代码的运行性能。 Aiming at the problems that vectorization compiler tool can not perform auto-vectorization or does blind auto-vectorization because of the insufficient of program analysis ability,this paper proposes a vectorization method based on the compiler directive.The compiler directive is used to guide vectorization compiler tool to perform some special actions and produce high efficiency vectorization code automaticly when it is inserted into the program.Test results show that the method can effectively improve the performance of the object code.
作者 姚远 赵荣彩
出处 《计算机工程》 CAS CSCD 2012年第12期272-275,共4页 Computer Engineering
基金 "核高基"重大专项"支持国产CPU的编译系统及工具链"分课题"自动并行化与二进制翻译系统"(2009ZX01036-001-001-2)
关键词 单指令多数据流 自动向量化 数据依赖关系 编译指示 循环不变量 向量寄存器 Single Instruction Multiple Data(SIMD) automatic vectorization data dependence relationship compiler directive loop invariant vector register
  • 相关文献

参考文献8

二级参考文献45

  • 1朱传琪,臧斌宇,陈彤.程序自动并行化系统[J].软件学报,1996,7(3):180-186. 被引量:34
  • 2Allen R., Kennedy K.. Automatic translation of Fortran programs to vector form. ACM Transactions on Programming Languages and Systems, 1987, 9(4): 491~542.
  • 3Padua D., Wolfe M.. Advanced compiler optimizations for supercomputers. Communications of the ACM, 1986, 29(12): 1184~1201.
  • 4Sreraman N., Govindarajan R.. A vectorizing compiler for multimedia extensions. International Journal on Parallel Processing, 2000, 28(4):363~400.
  • 5Cheong G., Lam M.S.. An optimizer for multimedia instruction sets. In: Proceedings of the 2nd SUIF Compiler Workshop, Stanford, 1997.
  • 6Krall A., Lelait S.. Compilation techniques for multimedia processors. International Journal of Parallel Programming, 2000, 28(4): 347~361.
  • 7Larsen S., Amarasinghe S.. Exploiting superword level parallelism with multimedia instruction sets. ACM Sigplan Notices, 2000, 35(5): 145~156.
  • 8Bik A.J.C., Girkar M., Grey P.M., Tian X.. Automatic detection of saturation and clipping idioms. In: Proceedings of the 15th International Workshop on Languages and Compilers for Parallel Computers, 2002.
  • 9Boekhold M., Karkowski I., Corporaal H.. Transforming and parallelizing ANSI C programs using pattern recognition. In: Lecture Notes in Computer Science 1593, 1999, 673.
  • 10Fisher R.J., Dietz H.G.. Compiling for SIMD within a register. In: Proceedings of Workshop on Languages and Compilers for Parallel Computing, University of North Carolina, Chapel Hill, 1998, 290~304.

共引文献21

同被引文献69

  • 1董庆华.数据可视化技术在农业领域的应用探讨[J].山西农业大学学报(自然科学版),2007,27(5):52-53. 被引量:2
  • 2陶雪娇,胡晓峰,刘洋.大数据研究综述[J].系统仿真学报,2013,25(S1):142-146. 被引量:344
  • 3我国水产种质资源共享平台服务社会化[J].江西饲料,2006(4):45-45. 被引量:1
  • 4左雄.我国农业信息化研究综述[J].安徽农业科学,2007,35(28):9092-9094. 被引量:35
  • 5Khronos OpenCL Working Group. The OpenCL Speci? cation [R]. [2009]. http://www, khronos, org/registry/c|/.
  • 6Tian X, Saito H, Preis S V. Compiling C/Cq-q-SIMD Exten- sions for Function and Loop Vectorization on Multicore-SIMD Processors [C]//Multicore and GPU Programming Models, Lan- guages and Compilers Workshop. 2012:2349-2358.
  • 7Klemm M,et al. Extending OpenMP" with vector constructs for modern multicore SIMD architectures[C]//OpenMP in a Heter- ogeneous World, 2012 .. 59-72.
  • 8黄娟娟,李春江,徐颖.GCC中自动向量化代价模型剖析[C]//第17届计算机工程与工艺年会暨第三届微处理器技术论坛论文集.长沙:国防科技大学出版社,2013:259-268.
  • 9OpenMP Architecture Review Board=OpenMP Application Pro- gram Interface[M]. Version 4. 0(July 2013).
  • 10Free Software Foundation Inc. GCC 4. 9 Release Series http:// gcc. gnu. org/gcc4. 9/.

引证文献3

二级引证文献17

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部