LU Loop Tiling Method Based on Improved PSO (基于优化PSO的LU循环分块方法)
Abstract: Loop tiling can improve the cache hit rate, and the tile size is the decisive factor in the technique. Particle swarm optimization (PSO) can be used to optimize the tile size, yielding a more suitable tile size than the classical tiling algorithm and improving the efficiency of loop tiling. The paper first studies the LU decomposition algorithm, then discusses the traditional PSO algorithm and improves it to address its shortcomings when applied to loop tiling, and finally applies the improved PSO to LU decomposition to obtain a PSO-LU loop tiling algorithm. Simulation results indicate that, compared with the original benchmark program, the proposed loop tiling algorithm improves performance and makes more effective use of the cache.
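Source: Science Technology and Engineering (《科学技术与工程》), PKU Core Journal, 2013, No. 20, pp. 5960-5963, 5969 (5 pages)
Funding: Supported by the National Natural Science Foundation of China (61273042) and the Henan Province Key Science and Technology Research Project (122102210487)
Keywords: loop tiling; particle swarm optimization (PSO); LU decomposition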
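
To make the abstract's central idea concrete, the following is a minimal sketch of a tiled (blocked) LU factorization in which the tile size is an explicit parameter. It is not the paper's PSO-LU algorithm, which this record does not reproduce. The function name `lu_tiled`, the parameter `tile`, the absence of pivoting, and the packed in-place storage are all assumptions made for illustration. In the paper's setting, `tile` is the quantity the improved PSO would search over.

```python
def lu_tiled(a, tile):
    """In-place LU factorization without pivoting, processed tile by tile.

    Hypothetical illustration: `a` is a square list of lists, `tile` is the
    tile (block) size. On return, the strict lower triangle of `a` holds L
    (unit diagonal implied) and the upper triangle holds U.
    """
    n = len(a)
    for k0 in range(0, n, tile):
        k1 = min(k0 + tile, n)

        # Step 1: factor the current panel (columns k0..k1-1) with plain LU.
        for k in range(k0, k1):
            pivot = a[k][k]  # assumed nonzero, since no pivoting is done
            for i in range(k + 1, n):
                a[i][k] /= pivot
                for j in range(k + 1, k1):
                    a[i][j] -= a[i][k] * a[k][j]

        # Step 2: forward-substitute to get the U block to the right
        # of the diagonal tile (solve L11 * U12 = A12).
        for k in range(k0, k1):
            for i in range(k + 1, k1):
                for j in range(k1, n):
                    a[i][j] -= a[i][k] * a[k][j]

        # Step 3: update the trailing submatrix (A22 -= L21 * U12),
        # visiting it in tile x tile blocks so each block is reused
        # while it is still likely to be in cache.
        for i0 in range(k1, n, tile):
            for j0 in range(k1, n, tile):
                for i in range(i0, min(i0 + tile, n)):
                    row = a[i]
                    for k in range(k0, k1):
                        lik = row[k]
                        rk = a[k]
                        for j in range(j0, min(j0 + tile, n)):
                            row[j] -= lik * rk[j]
    return a
```

Every tile size produces the same factors; only the memory access order changes, which is why the tile size can be tuned freely for cache behavior. A tuner such as PSO would time calls like `lu_tiled(matrix_copy, tile)` for candidate values of `tile` and keep the fastest.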