期刊文献+

LAPACK的自动并行化工具研究 被引量:3

RESEARCH ON THE AUTOMATIC PARALLELIZING TOOLS OF LAPACK
原文传递
导出
摘要 LAPACK (Linear Algebra PACKage) is a subroutine library for solving the most common problems in numerical linear algebra, designed to run efficiently on shared-memory vector and parallel processors. Only the general sequential code of LAPACK is available on INTERNET, the optimization of it on a special machine is very burdensome. To solve this problem, we develop an automatic parallelizing tool on SGI POWER Challenge, and it shows good results. LAPACK (Linear Algebra PACKage) is a subroutine library for solving the most common problems in numerical linear algebra, designed to run efficiently on shared-memory vector and parallel processors. Only the general sequential code of LAPACK is available on INTERNET, the optimization of it on a special machine is very burdensome. To solve this problem, we develop an automatic parallelizing tool on SGI POWER Challenge, and it shows good results.
作者 谢幸 李玉成
出处 《数值计算与计算机应用》 CSCD 北大核心 2001年第2期130-133,共4页 Journal on Numerical Methods and Computer Applications
关键词 LAPACK 自动并行化 并行化工具 源代码分析 相关性分析 数据私有化 基本技术 LAPACK, automatic parallelization, parallelizing tool
  • 相关文献

参考文献3

  • 1谢幸 李玉成.LAPACK在共享主存并行机上的自动并行化.中国科学院软件所并行软件研究开发中心1998年工作年报[M].,1999,1.220-225.
  • 2谢幸,中国科学院软件所并行软件研究开发中心1998年工作年报,1999年,220页
  • 3Kai Hwang,高等计算机系统结构,并行性,可扩展性,可编和性,1995年

同被引文献26

  • 1胡荣贵,陈意云,郭帆.机器语言的类型化及代码的安全检查[J].计算机研究与发展,2004,41(6):965-971. 被引量:1
  • 2朱传琪,臧斌宇,陈彤.程序自动并行化系统[J].软件学报,1996,7(3):180-186. 被引量:34
  • 3郭克榕,唐新春.面向MPP Fortran的程序自动并行化初探[J].国防科技大学学报,1996,18(1):92-97. 被引量:3
  • 4索红光,刘玉树,曹淑英.一种基于词汇链的关键词抽取方法[J].中文信息学报,2006,20(6):25-30. 被引量:88
  • 5董峰,付宇卓.基于LLVM架构的ARM后端移植[J].信息技术,2007,31(7):38-41. 被引量:5
  • 6Barua R,Lee W,Amarasinghe S, et al. Maps: a compilermanaged memory system for raw machines [C]//In: Proceedings of the 26th Annual International Symposium on Computer Architecture, 1999(5) :4-15.
  • 7Kar K, Lakshman T V, Stiliadis D,et al. Reduced complexity input buffered switches [C]// In: Proceedings of the Hot Interconnects VIII, 2000 : 145-152.
  • 8Keeton K, Arpaci-Dusseau R, Patterson D A. IRAM and SmartSIMM: overcoming the I/O bus bottleneck[C]// In: Proc. ISCA (International Symposium on Computer Architecture) Workshop on Mixing Logic and DRAM, 2005, 103-112.
  • 9Veidenbaum A V, Tang W, Gupta R, et al. Adapting cache line size to application behavior[C]// In: Proceedings of Supercomputing_99, 2006(6) :145-152.
  • 10黄哲煌,王美清.分布计算技术及其在分形视频压缩中的应用[D].福州:福州大学,2004.

引证文献3

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部