期刊文献+

龙芯3A上三个自适应FFT包的对比与分析

Comparison and Analysis of Three Types of FFT Adaptive Libraries on Loongson 3A
在线阅读 下载PDF
导出
摘要 FFT算法在计算机科学中具有广泛的应用,自适应FFT软件包以其良好的可移植性而备受研究人员和用户的青睐,龙芯3A是中科院计算所自主研发的四核CPU,采用RISC架构,兼容MIPS指令。主要对FFTW,UHFFT,SPIRAL这3类FFT自适应软件包进行研究。首先从搜索框架和代码产生器两方面总结了FFTW和UHFFT的异同,接着阐述了SPIRAL自动产生优化代码的三层架构实现原理,之后在国产CPU龙芯3A上对这3个软件包进行了性能测试,并结合龙芯的体系结构特点对结果作了分析对比。在最后总结了目前自适应FFT软件包的一般方法,为下一步开发自适应FFT软件包提供了思路。 FFT algorithm has a wide range of applications in computer science.Adaptive FFT software package with its excellent portability has been interested by many researchers and users.Loongson 3A is developed by institute of computing technology,Chinese academy of sciences.It is a quad-core CPU and compatible with MIPS instructions using RISC architecture.The article focused on three types of FFT adaptive libraries which are FFTW,UHFFT and SPIRAL.Firstly,we compared the difference between FFTW and UHFFT from two aspects of search framework and code generator.Then we elaborated SPIRAL's three layers schema which is used to produce optimized code automatically.Furthermore,we evaluated these libraries on the Loonson 3A platform and analyzed the results.Finally,we concluded the general method of current FFT adaptive software packages and provided a guideline for further development of adaptive FFT software package.
出处 《计算机科学》 CSCD 北大核心 2012年第12期281-285,共5页 Computer Science
基金 国家自然科学基金(61133005) 国家高技术研究发展项目(863)(2009AA01A129 2009AA01A134) 国家重大专项核高基项目(2009ZX01036-001-002)资助
关键词 FFTW UHFFT SPIRAL LOONGSON 3A FFT FFTW UHFFT SPIRAL Loongson 3A FFT
  • 相关文献

参考文献22

  • 1Cooley J W. The re-discovery of the fast Fourier transform algo- rithm[J]. Mikrochimica Acta, 1987,3 : 33-45.
  • 2Cooley J W,Tukey J W. An algorithm for the machine calcula- tion of complex Fourier series[J]. Math. Comp. , 1965,19: 297- 301.
  • 3Frigo M. A fast Fourier transform eompiler[C]//Proceedings of the ACM SIGPLAN 1999conference on Programming Language Design and Implementation, PLDI ' 99. New York, NY, USA: ACMPress, 1999 : 169-180.
  • 4Frigo M, Johnson S G. The design andimplementation of FF- TW3[C]//Proceedings of the IEEE Special Issue on Program- Generation, Optimization, and Platform AdaptatiorL 2005 : 216-231.
  • 5Frigo M,Johnson S G. The fastest fourier transform in the west [M]. technical report technical reportMIT-LCS-TR-728, Sep.1997.
  • 6Frigo M,Johnson S G. Fftw.. An adaptive software architecture for the fft[C]//Proc. ICASSP 3. 1998.. 1381-1384.
  • 7Loan C V. Computational frameworks for the fastFourier trans- form. Society for Industrial and AppliedMathernatics[M]. Phila- delphia, PA, USA, 1992.
  • 8Mirkovic D, Johnsson S L. AutomaticPerformance Tuning in the UHFFT Library[C]// Proceedings of the International Confer- ence onComputational Sciences-Part I, ICCS' 01. London, UK: Springer-Verlag, 2001 : 71-80.
  • 9Franchetti F, Voronenko Y, Piischel M. FFT program genera- tionfor shared memory: SMP and multicore[C]//Proe. Super- computing (SC). 2006.
  • 10Spiral Web site[OL]. http://www, spiral, net.

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部