3 articles found
PARALLEL REGION PRESERVING MULTISECTION METHOD FOR SOLVING GENERALIZED EIGENPROBLEM (Cited by: 1)
1
Authors: Zeng Lan, Zhou Shuquan. Transactions of Nanjing University of Aeronautics and Astronautics (EI), 1996, No. 2, pp. 51+46-50, 6 pages
The parallel multisection method for solving algebraic eigenproblems has been developed in recent years alongside parallel computers, but the research so far has been limited to standard eigenproblems of symmetric tridiagonal matrices. The multisection method for the generalized eigenproblem, which is widely applied in many science and engineering domains, has not been studied. This paper presents the parallel region preserving multisection method (PRM for short) for solving generalized eigenproblems of large, sparse, real symmetric matrices. The method not only retains the advantages of the conventional determinant search method (DS for short), but also overcomes its disadvantages, such as missed roots and failure to converge. We tested the method on the YH-1 vector computer and compared it with the parallel region preserving determinant search method (parallel region preserving bisection method, PRB for short). The numerical results show that PRM achieves a higher speedup: for instance, it attains a speedup of 7.7 when the problem size is 2114 and three eigenpairs are computed. PRM is also superior to PRB when the problem is large.
Keywords: parallel processing; structural analysis; numerical algebra; generalized eigenproblem; parallel multisection method
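The abstract above does not give the details of PRM, so the following is only a minimal sketch of a generic multisection step for K x = lambda M x, under loud assumptions: K and M are symmetric tridiagonal, M is positive definite, and eigenvalues are counted with the inertia of K - sigma*M (Sylvester's law of inertia). The names `count_below` and `multisect` are hypothetical and are not taken from the paper.

```python
def count_below(k_diag, k_off, m_diag, m_off, sigma):
    """Count eigenvalues of K x = lambda M x that are smaller than sigma.

    Assumption: K and M are symmetric tridiagonal and M is positive
    definite. The count equals the number of negative pivots in the
    LDL^T factorization of K - sigma*M.
    """
    count = 0
    d = 1.0
    for i in range(len(k_diag)):
        a = k_diag[i] - sigma * m_diag[i]
        if i == 0:
            d = a
        else:
            b = k_off[i - 1] - sigma * m_off[i - 1]
            d = a - b * b / d
        if d == 0.0:
            d = 1e-30  # nudge an exactly zero pivot to avoid dividing by zero
        if d < 0.0:
            count += 1
    return count


def multisect(count, lo, hi, k, sections=8, tol=1e-10):
    """Locate the k-th smallest eigenvalue (k = 1, 2, ...) inside [lo, hi].

    Requires count(lo) < k <= count(hi). Each pass evaluates `count` at
    sections - 1 interior points. These evaluations are independent of
    each other, which is the part a parallel machine would distribute.
    """
    while hi - lo > tol:
        step = (hi - lo) / sections
        points = [lo + j * step for j in range(1, sections)]
        counts = [count(p) for p in points]  # independent evaluations
        new_lo, new_hi = lo, hi
        for p, c in zip(points, counts):
            if c >= k:
                new_hi = p  # first point with at least k eigenvalues below it
                break
            new_lo = p
        lo, hi = new_lo, new_hi
    return 0.5 * (lo + hi)


# Hypothetical test pencil: K = tridiag(-1, 2, -1), M = tridiag(1, 4, 1) / 6.
n = 5
k_diag, k_off = [2.0] * n, [-1.0] * (n - 1)
m_diag, m_off = [4.0 / 6.0] * n, [1.0 / 6.0] * (n - 1)
lam2 = multisect(lambda s: count_below(k_diag, k_off, m_diag, m_off, s),
                 0.0, 12.0, k=2)
```

For this test pencil the eigenvalues have the closed form 6(1 - c_j)/(2 + c_j) with c_j = cos(j*pi/(n+1)), so `lam2` should approximate 1.2. Setting `sections=2` reduces the loop to plain bisection.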
Parallel Region-Preserving Multisection Method for Solving Generalized Eigenproblem
2
Authors: Lan Zeng, Shuquan Zhou (Jiangsu Certified Public Accountants, Nanjing 210005, P.R. China; College of Science, NUAA, Nanjing 210016, P.R. China). Wuhan University Journal of Natural Sciences (CAS), 1996, No. Z1, pp. 561-565, 5 pages
The parallel multisection method for solving algebraic eigenproblems has been developed in recent years alongside parallel computers, but the research so far has been limited to the standard eigenproblem of symmetric tridiagonal matrices. The multisection method for the generalized eigenproblem, which is widely applied in many science and engineering domains, has not been studied. This paper presents the parallel region-preserving multisection method (PRM for short) for solving the generalized eigenproblem of large, sparse, real symmetric matrices. The method not only retains the advantages of the conventional determinant search method (DS for short), but also overcomes its disadvantages, such as missed roots and failure to converge. We tested the method on the YH-1 vector computer and compared it with the parallel region-preserving determinant search method (parallel region-preserving bisection method, PRB for short). The numerical results show that PRM achieves a higher speed-up: for instance, it attains a speed-up of 7.7 when the problem size is 2114 and three eigenpairs are computed. PRM is also superior to PRB when the problem is large.
Keywords: multisection method; generalized eigenproblem; parallel algorithm
Optimization of Generalized Eigensolver for Dense Symmetric Matrices on AMD GPU
3
Authors: Chong Zhang, Zi-Tong Su, Min Li, Hui-Yuan Li, Wen-Jing Ma, Lei-Sheng Li. Journal of Computer Science & Technology, 2025, No. 3, pp. 855-869, 15 pages
Accelerating eigensolvers on GPUs is attracting increasing attention because of their ubiquitous use in scientific and engineering fields. However, it is very challenging to achieve high performance with eigensolvers because their intricate computational patterns cause inefficient memory access and workload imbalance on GPUs. In this work, we propose a series of optimizations for generalized dense symmetric eigenvalue problems from both the system and operator perspectives on AMD GPUs. First, we adjust the workload assignment between CPUs and GPUs and find the computational performance balance between different levels of computation. In addition, we propose a multi-level pre-aggregation strategy for the symmetric matrix-vector multiplication (SYMV) and general matrix-vector multiplication (GEMV) operators to tackle the performance issue caused by the lack of hardware support for atomic operations. Furthermore, we optimize Cholesky decomposition and SYR2K by adopting a better overlapping method and exploiting symmetry to reduce computation. Experiments on AMD MI60 GPUs show that our optimized eigensolver outperforms the previous state of the art with speedups of roughly 1.8x–3.8x.
Keywords: AMD GPU; dense symmetric matrix; generalized eigenproblem; performance optimization
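The abstract names Cholesky decomposition as one of the optimized kernels but does not spell out the solver pipeline. A common role for it in dense generalized symmetric eigensolvers is to reduce A x = lambda B x to a standard problem C y = lambda y with B = L L^T, C = L^{-1} A L^{-T} and y = L^T x. The sketch below illustrates only that textbook reduction in plain Python, assuming B is symmetric positive definite. The helper names are hypothetical, and nothing here reflects the paper's GPU implementation.

```python
import math


def cholesky(b):
    """Return lower-triangular L with B = L L^T (B symmetric positive definite)."""
    n = len(b)
    l = [[0.0] * n for _ in range(n)]
    for i in range(n):
        for j in range(i + 1):
            s = b[i][j] - sum(l[i][k] * l[j][k] for k in range(j))
            l[i][j] = math.sqrt(s) if i == j else s / l[j][j]
    return l


def forward_solve(l, rhs):
    """Solve L X = RHS by forward substitution (RHS is an n x m matrix)."""
    n, m = len(l), len(rhs[0])
    x = [[0.0] * m for _ in range(n)]
    for i in range(n):
        for c in range(m):
            s = rhs[i][c] - sum(l[i][k] * x[k][c] for k in range(i))
            x[i][c] = s / l[i][i]
    return x


def transpose(a):
    return [list(row) for row in zip(*a)]


def reduce_to_standard(a, b):
    """Return (C, L) with C = L^{-1} A L^{-T}, which has the eigenvalues of (A, B)."""
    l = cholesky(b)
    w = forward_solve(l, a)             # W = L^{-1} A
    c = forward_solve(l, transpose(w))  # L^{-1} W^T = L^{-1} A L^{-T}, since A = A^T
    return c, l


# Small hypothetical example chosen so the arithmetic can be checked by hand.
a = [[2.0, 1.0], [1.0, 3.0]]
b = [[4.0, 2.0], [2.0, 5.0]]
c, l = reduce_to_standard(a, b)
```

In this example C comes out diagonal, so its diagonal entries 0.5 and 0.625 are the generalized eigenvalues. They match the roots of det(A - lambda B) = 16 lambda^2 - 18 lambda + 5. An eigenvector y of C maps back to the original problem by solving L^T x = y.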