期刊文献+

GPU-S2S:面向GPU的源到源翻译转化

GPU-S2S: a source to source compiler for GPU
在线阅读 下载PDF
导出
摘要 针对图形处理器(GPU)架构下的软件可移植性、可编程性差的问题,为了便于在GPU上开发并行程序,通过自动映射与静态编译相结合,提出了一种新的基于制导语句控制的编译优化方法,实现了一个源到源的自动转化工具GPU-S2S,它能够将插入了制导语句的串行C程序转化为统一计算架构(CUDA)程序。实验结果表明,经GPU-S2S转化生成的代码和英伟达(NVIDIA)提供的基准测试代码具有相当的性能;与原串行程序在CPU上执行相比,转换后的并行程序在GPU上能够获取显著的性能提升。 To address the problem of poor software portability and programmability of a graphic processing unit ( GPU), and to facilitate the development of parallel programs on GPU, this study proposed a novel directive based compiler guided approach, and then the GPU-S2S, a prototypic tool for automatic source-to-source translation, was implemented through combining automatic mapping with static compilation configuration, which is capable of translating a C sequential program with directives into a compute unified device architecture (CUDA) program. The experimental results show that CUDA codes generated by the GPU-S2S can achieve comparable performance to that of CUDA benchmarks provided by NVIDIA CUDA SDK, and have significant performance improvements compared to its original C sequential codes.
出处 《高技术通讯》 CAS CSCD 北大核心 2012年第4期388-394,共7页 Chinese High Technology Letters
基金 863计划(2009AA012108,2009AA01A135,2006AA01A109)和中央高校基本科研业务费专项资金(08142007)资助项目.
关键词 图形处理器(GPU) 制导语句控制 源到源转化 graphic processing unit (GPU), compiler directive, source to source translation
  • 相关文献

参考文献8

  • 1Moazeni M, Bui A, Sarrafzadeh M. A memory optimization technique for software-managed scratchpad memory in GPUs. In: Proceedings of the IEEE 7th Symposium on Ap- plication Specific Processors, San Francisco, USA, 2009. 43-49.
  • 2Govindaraju N K, Larsen S, Gray J, et al. A memory model for scientific algorithms on graphics processors. In: Proceedings of the ACM/IEEE Conference on Supercom- puting, Tampa, USA, 2006. 6-15.
  • 3Ryoo S, Rodrigues S S, Baghsorkhi C I, et al. Optimiza- tion principles and application performance evaluation of a muhithreaded GPU using CUDA. In: Proceedings of Sympium on Principles and Practice of Parallel Program- ming, New York, USA, 2008. 73-82.
  • 4Govett M, Middlecoff J, Henderson T. Running the NIM next-generation weather model on GPUs. In: Proceedings of the 10th IEEE/ACM International Conference on Clus- ter, Cloud and Grid Computing, Melbourne, Australia, 2010. 792-796.
  • 5McCool M D, Qin Z, Popa T S. Shader metaprogramming. In: Proceedings of AMC SIGGRAPH/EUROGRAPHICS Conference on Graphics Hardware , Aire-la-Ville, Switzer- land, 2002. 57-68.
  • 6Lee S, Min S J, Eigenmann R. OpenMP to GPGPU: a compiler framework for automatic translation and optimiza- tion. In : Proceedings of the ACM SIGPLAN Symposium on Principles and Practice of Parallel Programming, New York, USA, 2009. 101-110.
  • 7Ueng S Z, Lathara M, Baghsorkhi S S, et al. CUDA-lite: reducing GPU programming complexity. In: Proceedings of the 21st Annum Workshop on Languages and Compilers for Parallel Computing (LCPC), 2008, LNCS 5335. 1-15, DOI : 10. 1007/978-3-540-89740-8 _ 1.
  • 8NVIDIA Corporation. NVIDIA CUDA Compute Unified De- vice Architecture Programming Guide (Version 2. 0 ). NVIDIA Corporation, July 2008.

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部