A Self-Adaptive Compilation Framework for Heterogeneous Multi-Core Architecture (面向异构多核架构的自适应编译框架)

Cited by: 2

Abstract: Applications ported to heterogeneous multi-core high-performance computing systems suffer from poor portability and are hard to optimize. To address both problems, this paper proposes a self-adaptive compilation framework for heterogeneous multi-core architectures. The framework uses source-to-source compilation to map applications written in traditional parallel programming models onto a heterogeneous multi-core architecture. It also analyzes dynamic profiling information to adaptively adjust instrumentation and configure the optimization strategy, which yields an iterative, automatic optimization process. By combining the software-to-hardware mapping mechanism with the optimization strategy, the framework supports the migration of homogeneous parallel applications to heterogeneous multi-core architectures and improves their overall performance. A prototype was implemented on the Cell architecture and tested with a set of example applications; the experimental results show that it resolves the portability problems on this architecture and that application performance improves.
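The iterative process described in the abstract (profile, adjust instrumentation, configure the optimization strategy, repeat) can be illustrated with a minimal sketch. This is not the authors' implementation: the region names, the strategy names, the `hot_share` threshold, and the cost table standing in for real instrumented runs are all hypothetical and chosen only to make the loop runnable.

```python
# Hypothetical cost table: run time of each code region under each candidate
# optimization strategy. A real framework would measure these by running
# instrumented code; the numbers here are invented for illustration.
COSTS = {
    "load":    {"none": 4.0,  "double_buffer": 2.5, "simd": 4.0},
    "compute": {"none": 10.0, "double_buffer": 9.0, "simd": 3.0},
    "store":   {"none": 1.0,  "double_buffer": 0.9, "simd": 1.0},
}
STRATEGIES = ("none", "double_buffer", "simd")


def profile(config, probes):
    """Stand-in for an instrumented run: report times for probed regions only."""
    return {region: COSTS[region][config[region]] for region in probes}


def total_time(config):
    """Total run time of the application under the given configuration."""
    return sum(COSTS[region][strategy] for region, strategy in config.items())


def tune(max_iters=10, hot_share=0.25):
    """Iteratively narrow instrumentation to hot regions and tune them."""
    config = {region: "none" for region in COSTS}
    probes = set(COSTS)  # the first iteration instruments every region
    history = [total_time(config)]
    for _ in range(max_iters):
        timings = profile(config, probes)
        measured = sum(timings.values())
        # Adapt instrumentation: keep probes only on regions that account
        # for at least hot_share of the measured time.
        hot = {r for r, t in timings.items() if t / measured >= hot_share}
        improved = False
        for region in sorted(hot):
            for strategy in STRATEGIES:
                trial = dict(config, **{region: strategy})
                # Accept a strategy only if the profiled region gets faster.
                if profile(trial, {region})[region] < profile(config, {region})[region]:
                    config = trial
                    improved = True
        probes = hot
        history.append(total_time(config))
        if not improved:
            break  # converged: no strategy change helped in this iteration
    return config, history
```

Calling `tune()` returns the final per-region configuration and the total time after each iteration. In this toy setup the cheap `store` region drops out of the probe set after the first pass and is never tuned, which shows the trade-off that adaptive instrumentation makes between profiling overhead and coverage.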

Authors: Bai Xiuxiu (白秀秀), Dong Xiaoshe (董小社), Liu Chao (刘超), Cao Haijun (曹海军), Li Liang (李亮)

Affiliation: Department of Computer Science and Technology, Xi'an Jiaotong University

Source: Chinese Journal of Computers (《计算机学报》; indexed in EI, CSCD, and the Peking University Core Journals list), 2014, No. 7, pp. 1548-1559 (12 pages)

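Funding: National Natural Science Foundation of China (61173039); National High Technology Research and Development Program of China ("863" Program; 2012AA010904, 2012AA01A306); National Key Technology R&D Program of China (2011BAH04B03)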

Keywords: heterogeneous multi-core; source-to-source compilation; instrumentation; iterative optimization

Classification: TP399 [Automation and Computer Technology: Computer Application Technology]


