An efficient MPI/OpenMP hybrid parallel Radial Basis Function (RBF) strategy for both continuous and discontinuous large-scale mesh deformation is proposed to reduce the computational cost and memory consumption.Unlik...An efficient MPI/OpenMP hybrid parallel Radial Basis Function (RBF) strategy for both continuous and discontinuous large-scale mesh deformation is proposed to reduce the computational cost and memory consumption.Unlike the conventional parallel methods in which all processors use the same surface displacement and implement the same operation,the present method employs different surface points sets and influence radius for each volume point movement,accompanied with efficient geometry searching strategy.The deformed surface points,also called Control Points (CPs),are stored in each processor.The displacement of spatial points is interpolated by using only 20-50 nearest control points,and the local influence radius is set to 5-20 times the maximum displacement of control points.To shorten the searching time for the nearest control point clouds,an Alternating Digital Tree (ADT) algorithm for 3D complex geometry is designed based on an iterative bisection technique.Besides,an MPI/OpenMP hybrid parallel approach is developed to reduce the memory cost in each High-Performance Computing (HPC) node for large-scale applications.Three 3D cases,including the ONERA-M6 wing and a commercial transport airplane standard model with up to 2.5 billion hybrid elements,are used to test the present mesh deformation method.The robustness and high parallel efficiency are demonstrated by a wing deflection case with a maximum bending angle of 450 and more than 80% parallel efficiency with 1024 MPI processors.In addition,the availability for both continuous and discontinuous surface deformation is verified by interpolating the projecting displacement with opposite directions surface points to the spatial points.展开更多
Too high energy consumption is widely recognized to be a critical problem in large-scale parallel computing systems.The LogP-based energy-saving model and the frequency scaling method were proposed to reduce energy co...Too high energy consumption is widely recognized to be a critical problem in large-scale parallel computing systems.The LogP-based energy-saving model and the frequency scaling method were proposed to reduce energy consumption analytically and systematically for other two representative barrier algorithms:tournament barrier and central counter barrier.Furthermore,energy optimization methods of these two barrier algorithms were implemented on parallel computing platform.The experimental results validate the effectiveness of the energy optimization methods.67.12% and 70.95% energy savings are obtained respectively for tournament barrier and central counter barrier on platforms with 2048 processes with 1.55%?8.80% performance loss.Furthermore,LogP-based energy-saving analytical model for these two barrier algorithms is highly accurate as the predicted energy savings are within 9.67% of the results obtained by simulation.展开更多
基金supported by the National Key Research and Development Program of China (No.2016YFB0200701)the National Natural Science Foundation of China (Nos. 11532016 and 91530325)
文摘An efficient MPI/OpenMP hybrid parallel Radial Basis Function (RBF) strategy for both continuous and discontinuous large-scale mesh deformation is proposed to reduce the computational cost and memory consumption.Unlike the conventional parallel methods in which all processors use the same surface displacement and implement the same operation,the present method employs different surface points sets and influence radius for each volume point movement,accompanied with efficient geometry searching strategy.The deformed surface points,also called Control Points (CPs),are stored in each processor.The displacement of spatial points is interpolated by using only 20-50 nearest control points,and the local influence radius is set to 5-20 times the maximum displacement of control points.To shorten the searching time for the nearest control point clouds,an Alternating Digital Tree (ADT) algorithm for 3D complex geometry is designed based on an iterative bisection technique.Besides,an MPI/OpenMP hybrid parallel approach is developed to reduce the memory cost in each High-Performance Computing (HPC) node for large-scale applications.Three 3D cases,including the ONERA-M6 wing and a commercial transport airplane standard model with up to 2.5 billion hybrid elements,are used to test the present mesh deformation method.The robustness and high parallel efficiency are demonstrated by a wing deflection case with a maximum bending angle of 450 and more than 80% parallel efficiency with 1024 MPI processors.In addition,the availability for both continuous and discontinuous surface deformation is verified by interpolating the projecting displacement with opposite directions surface points to the spatial points.
基金Projects(60903044,61170049) supported by National Natural Science Foundation of China
文摘Too high energy consumption is widely recognized to be a critical problem in large-scale parallel computing systems.The LogP-based energy-saving model and the frequency scaling method were proposed to reduce energy consumption analytically and systematically for other two representative barrier algorithms:tournament barrier and central counter barrier.Furthermore,energy optimization methods of these two barrier algorithms were implemented on parallel computing platform.The experimental results validate the effectiveness of the energy optimization methods.67.12% and 70.95% energy savings are obtained respectively for tournament barrier and central counter barrier on platforms with 2048 processes with 1.55%?8.80% performance loss.Furthermore,LogP-based energy-saving analytical model for these two barrier algorithms is highly accurate as the predicted energy savings are within 9.67% of the results obtained by simulation.