期刊文献+

基于MPI+CUDA环境的静电相互作用能并行求解 被引量:1

PARALLEL SOLVING ELECTROSTATIC INTERACTION ENERGY BASED ON MPI+CUDA ENVIRONMENT
在线阅读 下载PDF
导出
摘要 ABEEMσπ(Atom-Bond Electronegativity EqualizationσπModel)模型中,原串行程序求静电相互作用能的方法非常耗时,致使研究问题的效率降低。针对原程序中多个循环相互嵌套的求解部分,采用带状卷帘存储迭代分配的MPI(Message Passing Inter-face)并行化处理;对体系中所有原子、σ键、孤对电子、π键位点之间的静电相互作用能采用多线程CUDA(Computer Unified DeviceArchitecture)并行化处理。传统MPI+CUDA环境中,GPU和CPU之间的数据传输开销大,导致整体性能下降以及各种粒子间计算串行调用CUDA,致使时间浪费。针对上述情况,使用GPU核心的缓存机制解决传输开销大的问题,并利用多CUDA流技术实现多个循环异步进行计算,从而缩短了运行时间。然后选取多个不同类型的大分子体系进行测试,结果表明,利用改进的MPI+CUDA并行模型进行动力学模拟,并行加速比显著提高,大幅度缩减了求解静电相互作用能的时间,并得到与串行一致的结果。 In ABEEMσπ model, original serial program consumes much time in seeking electrostatic interaction energy, which caused the research inefficient. In solution part of the original program, as the multiple loops are nested each other, MPI parallel processing of strip rolling storage iterative distribution is adopted to resolve this problem; and, multi-threaded CUDA parallel processing is used to deal with the static sites interactions among all the atoms, σ bond, lone pair electrons and ,π bond in the system. In traditional environment of MPI + CUDA, there is huge spending when data transferring between GPU and CPU, which results in overall performance decrease and the calculation of serial called CUDA between a variety of particles, therefore leads to time wasted. For these above, this paper proposes that applying the mechanism of the GPU core caching to solve the problem of huge transmission cost, and making use of multi-stream technology of CUDA to realise multiple cycles asynchronous for calculation, so that the running time will be reduced. Then, several systems of different types of macromolecular are selected to test, the result shows, by applying modified MPI + CUDA parallel model in dynamics simulation, the parallel speedup improves significantly, the time of solving the electrostatic interaction energy reduces substantially, while results are identical to the serial program.
出处 《计算机应用与软件》 CSCD 北大核心 2012年第11期35-38,共4页 Computer Applications and Software
基金 国家自然科学基金项目(21133005 20703022 21011120087)
关键词 原子-键电负性均衡σπ模型 静电子相互作用能 并行计算 消息传递接口 统一计算设备架构 ABEEMσπ model Electrostatic interaction energy Parallel computing MPI CUDA
  • 相关文献

参考文献8

二级参考文献22

  • 1盛跃宾,宋晓秋,刘德贵.带状线性方程组的一种有效分布式并行算法[J].系统工程与电子技术,2004,26(7):967-969. 被引量:8
  • 2秦岭,王煜坚,李东新,吴镇扬.视频编码标准H.264的主要技术特点及其应用前景[J].微计算机应用,2004,25(4):449-455. 被引量:25
  • 3Iain E. G. Richardson. H. 264 and MPEG-4 Video Compression: Video Coding for Next Generation Multimedia[R].The Robert Gordon University, Aberdeen, UK
  • 4Joint Video Team of ITU-t and ISO/IEC JTC 1. Draft ITU-T Recommendation and Final Draft International Standard of Joint Video Specification(ITU-T Rec H. 2641 ISO/IEC 14496-10 AVC)[S]. JVT G050, 2003
  • 5JVT-G050. ITU-T Recommendation and Final Draft International Standard of Joint Video Specification[S]. 2003,3
  • 6JVT Official Site[DB/OL]. http://ftp3.itu. ch/ av-arch/jvt-site/ draft_conformance/
  • 7Nvidia Official Site[DB/OL]. http://www. nvidia.com/
  • 8Nvidia, NVIDIA_CUDA_Programming Guide_ 1. 1. pdf , September, 2007
  • 9Richard Gerber, The Software Optimization Cookbook[M].Copyright@Intel Corporation, 2002
  • 10Intel软件说明书(Intel Software manuals)[EB/OL]

共引文献67

同被引文献2

引证文献1

二级引证文献3

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部