GPU-S2S：面向GPU的源到源翻译转化

GPU-S2S： a source to source compiler for GPU

下载PDF

导出

摘要针对图形处理器（GPU）架构下的软件可移植性、可编程性差的问题，为了便于在GPU上开发并行程序，通过自动映射与静态编译相结合，提出了一种新的基于制导语句控制的编译优化方法，实现了一个源到源的自动转化工具GPU-S2S，它能够将插入了制导语句的串行C程序转化为统一计算架构（CUDA）程序。实验结果表明，经GPU-S2S转化生成的代码和英伟达（NVIDIA）提供的基准测试代码具有相当的性能；与原串行程序在CPU上执行相比，转换后的并行程序在GPU上能够获取显著的性能提升。 To address the problem of poor software portability and programmability of a graphic processing unit （ GPU）, and to facilitate the development of parallel programs on GPU, this study proposed a novel directive based compiler guided approach, and then the GPU-S2S, a prototypic tool for automatic source-to-source translation, was implemented through combining automatic mapping with static compilation configuration, which is capable of translating a C sequential program with directives into a compute unified device architecture （CUDA） program. The experimental results show that CUDA codes generated by the GPU-S2S can achieve comparable performance to that of CUDA benchmarks provided by NVIDIA CUDA SDK, and have significant performance improvements compared to its original C sequential codes.

作者李丹曹海军董小社张保

机构地区西安交通大学电子与信息工程学院

出处《高技术通讯》 CAS CSCD 北大核心 2012年第4期388-394,共7页 Chinese High Technology Letters

基金 863计划（2009AA012108,2009AA01A135,2006AA01A109）和中央高校基本科研业务费专项资金（08142007）资助项目.

关键词图形处理器(GPU) 制导语句控制源到源转化 graphic processing unit （GPU）, compiler directive, source to source translation

分类号 TP334.7 [自动化与计算机技术—计算机系统结构]

引文网络
相关文献

参考文献8

1Moazeni M, Bui A, Sarrafzadeh M. A memory optimization technique for software-managed scratchpad memory in GPUs. In: Proceedings of the IEEE 7th Symposium on Ap- plication Specific Processors, San Francisco, USA, 2009. 43-49.
2Govindaraju N K, Larsen S, Gray J, et al. A memory model for scientific algorithms on graphics processors. In: Proceedings of the ACM/IEEE Conference on Supercom- puting, Tampa, USA, 2006. 6-15.
3Ryoo S, Rodrigues S S, Baghsorkhi C I, et al. Optimiza- tion principles and application performance evaluation of a muhithreaded GPU using CUDA. In: Proceedings of Sympium on Principles and Practice of Parallel Program- ming, New York, USA, 2008. 73-82.
4Govett M, Middlecoff J, Henderson T. Running the NIM next-generation weather model on GPUs. In: Proceedings of the 10th IEEE/ACM International Conference on Clus- ter, Cloud and Grid Computing, Melbourne, Australia, 2010. 792-796.
5McCool M D, Qin Z, Popa T S. Shader metaprogramming. In: Proceedings of AMC SIGGRAPH/EUROGRAPHICS Conference on Graphics Hardware , Aire-la-Ville, Switzer- land, 2002. 57-68.
6Lee S, Min S J, Eigenmann R. OpenMP to GPGPU: a compiler framework for automatic translation and optimiza- tion. In : Proceedings of the ACM SIGPLAN Symposium on Principles and Practice of Parallel Programming, New York, USA, 2009. 101-110.
7Ueng S Z, Lathara M, Baghsorkhi S S, et al. CUDA-lite: reducing GPU programming complexity. In: Proceedings of the 21st Annum Workshop on Languages and Compilers for Parallel Computing (LCPC), 2008, LNCS 5335. 1-15, DOI : 10. 1007/978-3-540-89740-8 _ 1.
8NVIDIA Corporation. NVIDIA CUDA Compute Unified De- vice Architecture Programming Guide (Version 2. 0 ). NVIDIA Corporation, July 2008.

1陈麟.考务绩效管理系统设计——基于S2SH+jQuery[J].电子测试,2014,25(5):150-151.
2李洋.云计算应用对测评工具的影响[J].信息网络安全,2011(8):91-92. 被引量：1
3郭广军,苏希,黎梅,王剑波.基于S2SH+ExtJS的企业人力资源管理系统开发[J].计算机技术与发展,2012,22(4):187-190.
4董蕾,黄方,卜栓栓,冯杰,周纪.基于CUDA的压缩感知重构算法并行化研究[J].信息技术,2016,40(4):32-36. 被引量：1
5文延华,何王全,尉红梅.OpenCL的动态执行模式在静态编译支持下的实现[J].计算机应用与软件,2014,31(10):16-19. 被引量：1
6杨茂林.可移植的256色图形用户界面的设计[J].计算机世界月刊,1995(10):65-67.
7陈彬,陈和平,李晓卉.基于GPU的高效图像协方差矩阵算法与实现[J].计算机工程与设计,2014,35(12):4238-4242. 被引量：2
8邓楠.Ready？Go!语言开发背景、语法和类型[J].程序员,2012(5):122-125.
9闫嘉欣,王军,谢凯,赵玉川.基于云计算的校园服务云平台研究[J].科技传播,2013,5(13):207-208.
10乔丽萍,田军.软件可移植性的度量[J].微电子学与计算机,2003,20(B12):51-54. 被引量：4

高技术通讯

2012年第4期

浏览历史

内容加载中请稍等...

GPU-S2S：面向GPU的源到源翻译转化

参考文献8

相关作者

相关机构

相关主题

浏览历史