摘要
This paper shows two approaches to improve the performance of numeral al- gebra software by describing block algorithms in LAPACK. The block algorithms can make up higher level and more effcient BLAS programs. This paper further presents the relations between the effciency of the block algorithm and the size of block, and shows the relations relates to not only scale of algorithms and problems but also architectures and Characters of destination machines. Finally The paper gives the test results on Hitachi SR2201& SR8000.
This paper shows two approaches to improve the performance of numeral al- gebra software by describing block algorithms in LAPACK. The block algorithms can make up higher level and more effcient BLAS programs. This paper further presents the relations between the effciency of the block algorithm and the size of block, and shows the relations relates to not only scale of algorithms and problems but also architectures and Characters of destination machines. Finally The paper gives the test results on Hitachi SR2201& SR8000.
出处
《数值计算与计算机应用》
CSCD
北大核心
2001年第3期172-180,共9页
Journal on Numerical Methods and Computer Applications