期刊文献+
共找到12,615篇文章
< 1 2 250 >
每页显示 20 50 100
基于最优路径相似度度量的MPI程序路径覆盖测试方法
1
作者 袁剑锋 刘佳 郭建卫 《电脑与信息技术》 2025年第1期41-47,共7页
针对消息传递接口(Message Passing Interface,MPI)程序中,路径覆盖路径相似度度量方法在测试数据生成方面效率较低的问题,提出一种高效且高可靠性的测试路径度量方法。该方法首先基于不同的相似性度量方法,度量MPI程序路径间的相似度,... 针对消息传递接口(Message Passing Interface,MPI)程序中,路径覆盖路径相似度度量方法在测试数据生成方面效率较低的问题,提出一种高效且高可靠性的测试路径度量方法。该方法首先基于不同的相似性度量方法,度量MPI程序路径间的相似度,并运用协同进化算法产生测试数据;然后,对比在不同的相似性度量方法下,产生覆盖MPI程序目标路径测试数据的有效性和效率;最后,确定对应最高有效性和效率的相似度度量方法为最优路径相似度公式。所提出的验证方法被应用于7个并行程序上,实验结果表明,所提方法在生成测试数据方面具有最高的效率和有效性。 展开更多
关键词 相似度度量方法 测试数据生成 路径覆盖测试 mpi程序 协同进化算法
在线阅读 下载PDF
基于“天河二号”聚合通信卸载特性的MPI_Barrier优化
2
作者 朱琦 戴艺 +5 位作者 彭晋韬 谢旻 梁崇山 刘鹏 杨博 刘杰 《计算机工程与科学》 北大核心 2025年第3期400-411,共12页
Barrier作为消息传递接口MPI程序的基本操作,是确保程序正确执行的重要机制之一。目前已有的Barrier实现方案主要存在2个缺陷:首先,节点间同步存在大量冗余的数据路径传输开销;其次,节点内同步存在大量缓存失效的情况。为解决这些性能限... Barrier作为消息传递接口MPI程序的基本操作,是确保程序正确执行的重要机制之一。目前已有的Barrier实现方案主要存在2个缺陷:首先,节点间同步存在大量冗余的数据路径传输开销;其次,节点内同步存在大量缓存失效的情况。为解决这些性能限制,针对“天河二号”定制网络TH-Express聚合通信卸载特性,提出了基于GLEX NIC的Barrier加速和共享内存标志位重排列2种优化技术,有效减少了节点间同步开销,提高了节点内基于共享内存的同步效率。基于上述优化方法,重新设计了MPI_Barrier算法,并将其集成到MPI通信库中,并在国家超级计算长沙中心通过运行微基准测试程序和实际应用程序对所提优化方法进行性能测试,规模达到7168个节点。实验结果表明,优化后的MPI_Barrier集合操作获得了1.3~14.5倍的加速,并在应用级真实负载评测中,性能提升高达54%。 展开更多
关键词 mpi BARRIER 大规模并行应用 NIC聚合通信卸载
在线阅读 下载PDF
电大涂覆目标SBR算法与MPI并行加速技术
3
作者 吴扬 王思凡 +5 位作者 申子昂 贾浩文 祝强强 徐若锋 郭卿超 赵雷 《电波科学学报》 北大核心 2025年第3期407-414,共8页
为满足涂覆雷达吸波材料(radar absorbing material,RAM)的复杂目标电磁散射快速计算需求,提出了一种基于弹跳射线(shooting and bouncing ray,SBR)的高效计算方法。该方法利用广义传播矩阵法理论推导了金属衬底多层介质的反射系数,并... 为满足涂覆雷达吸波材料(radar absorbing material,RAM)的复杂目标电磁散射快速计算需求,提出了一种基于弹跳射线(shooting and bouncing ray,SBR)的高效计算方法。该方法利用广义传播矩阵法理论推导了金属衬底多层介质的反射系数,并将其与SBR法耦合,精确计算了多层介质涂覆目标的雷达散射截面(radar cross section,RCS);为进一步提高计算效率,采用基于CPU平台的MPI并行加速技术,实现了SBR算法的高效并行。数值结果表明:所计算的二面角反射器模型和舰船模型RCS结果与商业软件FEKO结果之间吻合良好,其中二面角反射器的均方根误差小于3 dBsm;针对电大涂覆飞行器目标,各计算进程的并行效率均达到80%以上。该方法有效解决了电大涂覆目标电磁散射计算的精度和速度,为电大复杂目标隐身性能的评估计算提供了高效的解决方案。 展开更多
关键词 弹跳射线(SBR)法 广义传播矩阵法 mpi并行 雷达散射截面(RCS) 雷达吸波材料(RAM)
在线阅读 下载PDF
Impact of message fatigue and individual behavioral responses on epidemiological spread in temporal simplicial networks 被引量:1
4
作者 Xiao-Nan Fan Xuemei You 《Chinese Physics B》 2025年第3期32-43,共12页
Health information spreads rapidly,which can effectively control epidemics.However,the swift dissemination of information also has potential negative impacts,which increasingly attracts attention.Message fatigue refer... Health information spreads rapidly,which can effectively control epidemics.However,the swift dissemination of information also has potential negative impacts,which increasingly attracts attention.Message fatigue refers to the psychological response characterized by feelings of boredom and anxiety that occur after receiving an excessive amount of similar information.This phenomenon can alter individual behaviors related to epidemic prevention.Additionally,recent studies indicate that pairwise interactions alone are insufficient to describe complex social transmission processes,and higher-order structures representing group interactions are crucial.To address this,we develop a novel epidemic model that investigates the interactions between information,behavioral responses,and epidemics.Our model incorporates the impact of message fatigue on the entire transmission system.The information layer is modeled using a static simplicial network to capture group interactions,while the disease layer uses a time-varying network based on activity-driven model with attractiveness to represent the self-protection behaviors of susceptible individuals and self-isolation behaviors of infected individuals.We theoretically describe the co-evolution equations using the microscopic Markov chain approach(MMCA)and get the epidemic threshold.Experimental results show that while the negative impact of message fatigue on epidemic transmission is limited,it significantly weakens the group interactions depicted by higher-order structures.Individual behavioral responses strongly inhibit the epidemic.Our simulations using the Monte Carlo(MC)method demonstrate that greater intensity in these responses leads to clustering of susceptible individuals in the disease layer.Finally,we apply the proposed model to real networks to verify its reliability.In summary,our research results enhance the understanding of the information-epidemic coupling dynamics,and we expect to provide valuable guidance for managing future emerging epidemics. 展开更多
关键词 Monte Carlo simulation microscopic Markov chain approach message fatigue information-epidemic coupled spreading simplicial complex
原文传递
ITU Year in Review 2024:Secretary-General’s message
5
作者 《China Standardization》 2025年第1期50-50,共1页
If 2024 has taught me anything,it’s that digital is an irrefutable force for unity—a much-needed catalyst for global cooperation in an increasingly fragmented world.This truth has been on display all year long,somet... If 2024 has taught me anything,it’s that digital is an irrefutable force for unity—a much-needed catalyst for global cooperation in an increasingly fragmented world.This truth has been on display all year long,sometimes against the odds.And it’s evident in the adoption of the Pact for the Future and Global Digital Compact at the United Nations General Assembly,in the outcomes of the World Telecommunication Standardization Assembly(WTSA-24),and in the wide endorsement of the COP29 Declaration on Green Digital Action. 展开更多
关键词 TELECOMMUNICATION COMPACT message
原文传递
Efficient Data Aggregation and Message Transmission for Information Processing Model in the CPS-WSN
6
作者 Chao-Hsien Hsieh Qingqing Yang +2 位作者 Dehong Kong Fengya Xu Hongmei Wang 《Computers, Materials & Continua》 2025年第2期2869-2891,共23页
The Cyber-Physical Systems (CPS) supported by Wireless Sensor Networks (WSN) helps factories collect data and achieve seamless communication between physical and virtual components. Sensor nodes are energy-constrained... The Cyber-Physical Systems (CPS) supported by Wireless Sensor Networks (WSN) helps factories collect data and achieve seamless communication between physical and virtual components. Sensor nodes are energy-constrained devices. Their energy consumption is typically correlated with the amount of data collection. The purpose of data aggregation is to reduce data transmission, lower energy consumption, and reduce network congestion. For large-scale WSN, data aggregation can greatly improve network efficiency. However, as many heterogeneous data is poured into a specific area at the same time, it sometimes causes data loss and then results in incompleteness and irregularity of production data. This paper proposes an information processing model that encompasses the Energy-Conserving Data Aggregation Algorithm (ECDA) and the Efficient Message Reception Algorithm (EMRA). ECDA is divided into two stages, Energy conservation based on the global cost and Data aggregation based on ant colony optimization. The EMRA comprises the Polling Message Reception Algorithm (PMRA), the Shortest Time Message Reception Algorithm (STMRA), and the Specific Condition Message Reception Algorithm (SCMRA). These algorithms are not only available for the regularity and directionality of sensor information transmission, but also satisfy the different requirements in small factory environments. To compare with the recent HPSO-ILEACH and E-PEGASIS, DCDA can effectively reduce energy consumption. Experimental results show that STMRA consumes 1.3 times the time of SCMRA. Both optimization algorithms exhibit higher time efficiency than PMRA. Furthermore, this paper also evaluates these three algorithms using AHP. 展开更多
关键词 WSN-CPS assembly line message transmission data aggregation energy conservation
在线阅读 下载PDF
Partners in Progress Congratulatory message from Nguyen Thi Huong, Vietnam’s Consul General in Nanning for the 22nd CAEXPO
7
《China Report ASEAN》 2025年第9期22-23,共2页
The China-ASEAN Expo(CAEXPO),held annually in Nanning City of Guangxi Zhuang Autonomous Region since 2004,has become a pivotal platform for economic and trade exchange between China,Vietnam,and other ASEAN member stat... The China-ASEAN Expo(CAEXPO),held annually in Nanning City of Guangxi Zhuang Autonomous Region since 2004,has become a pivotal platform for economic and trade exchange between China,Vietnam,and other ASEAN member states.Over the years,CAEXPO has proven to be a highly effective mechanism for fostering international cooperation,playing a vital role in establishing ASEAN as China’s largest trading partner and positioning China as the foremost trade partner of many ASEAN countries,including Vietnam. 展开更多
关键词 huong PARTNERS thi message trade partner nguyen congratulatory fostering international cooperationplaying
在线阅读 下载PDF
Learning-Based Turbo Message Passing for Channel Estimation in Rich-Scattering MIMO-OFDM
8
作者 Huang Zhouyang Jiang Wenjun +2 位作者 Yuan Xiaojun Wang Li Zuo Yong 《China Communications》 2025年第6期154-167,共14页
In this paper,we focus on the channel estimation for multi-user MIMO-OFDM systems in rich scattering environments.We find that channel sparsity in the delay-angle domain is severely compromised in rich scattering envi... In this paper,we focus on the channel estimation for multi-user MIMO-OFDM systems in rich scattering environments.We find that channel sparsity in the delay-angle domain is severely compromised in rich scattering environments,so that most existing compressed sensing(CS)based techniques can harvest a very limited gain(if any)in reducing the channel estimation overhead.To address the problem,we propose the learning-based turbo message passing(LTMP)algorithm.Instead of exploiting the channel sparsity,LTMP is able to efficiently extract the channel feature via deep learning as well as to exploit the channel continuity in the frequency domain via block-wise linear modelling.More specifically,as a component of LTMP,we develop a multi-scale parallel dilated convolutional neural network(MPDCNN),which leverages frequency-space channel correlation in different scales for channel denoising.We evaluate the LTMP’s performance in MIMO-OFDM channels using the 3rd generation partnership project(3GPP)clustered delay line(CDL)channel models.Simulation results show that the proposed channel estimation method has more than 5 dB power gain than the existing algorithms when the normalized mean-square error of the channel estimation is-20 dB.The proposed algorithm also exhibits strong robustness in various environments. 展开更多
关键词 channel estimation deep learning dilated CNN message passing MIMO-OFDM rich scattering environments
在线阅读 下载PDF
Dual Self-attention Fusion Message Neural Network for Virtual Screening in Drug Discovery by Molecular Property Prediction
9
作者 Jingjing Wang Kangming Hou +2 位作者 Hao Chen Jing Fang Hongzhen Li 《Journal of Bionic Engineering》 2025年第1期354-369,共16页
The development of deep learning has made non-biochemical methods for molecular property prediction screening a reality,which can increase the experimental speed and reduce the experimental cost of relevant experiment... The development of deep learning has made non-biochemical methods for molecular property prediction screening a reality,which can increase the experimental speed and reduce the experimental cost of relevant experiments.There are currently two main approaches to representing molecules:(a)representing molecules by fixing molecular descriptors,and(b)representing molecules by graph convolutional neural networks.Currently,both of these Representative methods have achieved some results in their respective experiments.Based on past efforts,we propose a Dual Self-attention Fusion Message Neural Network(DSFMNN).DSFMNN uses a combination of dual self-attention mechanism and graph convolutional neural network.Advantages of DSFMNN:(1)The dual self-attention mechanism focuses not only on the relationship between individual subunits in a molecule but also on the relationship between the atoms and chemical bonds contained in each subunit.(2)On the directed molecular graph,a message delivery approach centered on directed molecular bonds is used.We test the performance of the model on eight publicly available datasets and compare the performance with several models.Based on the current experimental results,DSFMNN has superior performance compared to previous models on the datasets applied in this paper. 展开更多
关键词 Directed message passing network Deep learning Molecular property prediction Self-attention mechanism
暂未订购
基于MPI和OpenMP混合编程的高分三号数据分布式并行转换算法 被引量:5
10
作者 陈云 《测绘与空间地理信息》 2024年第2期43-45,49,共4页
高分三号是我国C波段多极化合成孔径雷达卫星。PolSARpro是欧空局支持下的一款极化SAR影像处理的开源软件,为了便于利用该软件处理高分三号数据,本文提出了一种基于MPI和OpenMP并以PolSARpro软件的数据格式要求进行分布式并行转换算法,... 高分三号是我国C波段多极化合成孔径雷达卫星。PolSARpro是欧空局支持下的一款极化SAR影像处理的开源软件,为了便于利用该软件处理高分三号数据,本文提出了一种基于MPI和OpenMP并以PolSARpro软件的数据格式要求进行分布式并行转换算法,实现将高分三号极化数据快速精确转化为复数散射矩阵S2数据格式,通过KingMap V8.0平台实现了算法并在实际数据中进行测试,验证了算法的可行性、正确性和高效性。 展开更多
关键词 高分三号 合成孔径雷达 复数散射矩阵 OPENMP mpi KingMap
在线阅读 下载PDF
基于CGA的MPI程序分支覆盖测试套件生成
11
作者 袁剑锋 刘佳 郭建卫 《计算机技术与发展》 2024年第7期78-86,共9页
针对程序的分支覆盖测试,元启发式搜索技术已经被广泛应用于测试数据生成中。然而,当前的研究成果主要适用于串行程序。因此,为覆盖消息传递接口(Message Passing Interface,MPI)程序的分支,该文研究基于协同进化遗传算法(Co-evolutiona... 针对程序的分支覆盖测试,元启发式搜索技术已经被广泛应用于测试数据生成中。然而,当前的研究成果主要适用于串行程序。因此,为覆盖消息传递接口(Message Passing Interface,MPI)程序的分支,该文研究基于协同进化遗传算法(Co-evolutionary Genetic Algorithm,CGA)的测试套件生成方法(简称为:CGA生成法),该方法具有不受不可行分支影响的优势。首先,基于收集覆盖信息的探针,定义最小归一化分支距离,并以此设计出相应的适应度值函数;然后,使用CGA生成进化个体,并基于设计的适应度值函数,计算这些个体的适应值;最后,基于计算的适应值,选择子种群中代表个体,以构成合作种群。所提CGA生成法应用于7个基准MPI程序,并与其他多种方法进行比较。实验结果表明,CGA生成法的覆盖率通常高于其他搜索算法。 展开更多
关键词 消息传递接口程序 协同进化遗传算法 分支覆盖测试 测试套件生成 适应度值函数
在线阅读 下载PDF
基于MPI+CUDA的DSMC/PIC耦合模拟异构并行及性能优化研究 被引量:1
12
作者 林拥真 徐传福 +4 位作者 邱昊中 汪青松 王正华 杨富翔 李洁 《计算机科学》 CSCD 北大核心 2024年第9期31-39,共9页
DSMC/PIC耦合模拟是一类重要的高性能计算应用,大规模DSMC/PIC耦合模拟计算量巨大,需要实现高效并行计算。由于粒子动态注入、迁移等操作,基于MPI并行的DSMC/PIC耦合模拟往往通信开销较大且难以实现负载均衡。针对自主研发的DSMC/PIC耦... DSMC/PIC耦合模拟是一类重要的高性能计算应用,大规模DSMC/PIC耦合模拟计算量巨大,需要实现高效并行计算。由于粒子动态注入、迁移等操作,基于MPI并行的DSMC/PIC耦合模拟往往通信开销较大且难以实现负载均衡。针对自主研发的DSMC/PIC耦合模拟软件,在原有MPI并行优化版本上设计实现了高效的MPI+CUDA异构并行算法,结合GPU体系结构和DSMC/PIC计算特点,开展了GPU访存优化、GPU线程工作负载优化、CPU-GPU数据传输优化及DSMC/PIC数据冲突优化等一系列性能优化。在北京北龙超级云HPC系统的NVIDIA V100和A100 GPU上,针对数亿粒子规模的脉冲真空弧等离子体羽流应用,开展了大规模DSMC/PIC耦合异构并行模拟,相比原有纯MPI并行,GPU异构并行大幅缩短了模拟时间,两块GPU卡较192核的CPU加速比达到550%,同时具有更好的强可扩展性。 展开更多
关键词 DSMC/PIC耦合 粒子模拟 异构并行 mpi+CUDA
在线阅读 下载PDF
压电陶瓷作动器的MPI动态迟滞建模与控制 被引量:3
13
作者 周子希 王贞艳 《振动与冲击》 EI CSCD 北大核心 2024年第18期131-136,共6页
压电陶瓷是一种具有迟滞非线性的智能材料。为了实现系统的精密跟踪控制,提出一种基于MPI(modified Prandtl-Ishlinskii)的Hammerstein动态迟滞模型,并基于该模型设计了滑模跟踪控制方案。在play算子的上升边沿和下降边沿阈值处引入了... 压电陶瓷是一种具有迟滞非线性的智能材料。为了实现系统的精密跟踪控制,提出一种基于MPI(modified Prandtl-Ishlinskii)的Hammerstein动态迟滞模型,并基于该模型设计了滑模跟踪控制方案。在play算子的上升边沿和下降边沿阈值处引入了延时系数,并串联死区算子构成改进的非对称PI(Prandtl-Ishlinskii)模型,基于MPI的Hammerstein动态迟滞非线性模型可以描述压电陶瓷作动器的率相关迟滞特性。通过采集在单频率10 Hz,40 Hz,80 Hz和复合频率10~90 Hz正弦输入电压信号下的压电陶瓷作动器的位移数据,并采用粒子群算法和最小二乘递推方法辨识MPI模型参数和ARX(auto regressive model with exogenous input)模型参数,验证了模型的可行性,相较于基于经典PI的Hammerstein动态迟滞模型,模型误差分别降低了37%,42%,35%和24%。最后,构建迟滞补偿器,利用Hammerstein模型的模块化特点,提出一种可以实现对系统动态跟踪控制的滑模控制方案,并搭建了滑模控制压电系统试验平台,对单频率1 Hz,40 Hz,80 Hz和复合频率10~90 Hz的正弦输入电压信号进行了微位移实时跟踪控制试验,试验中的相对误差在7.62%以内,均方根最大误差为1.8573μm,表明所提出的滑模控制器有较强的跟踪性能。 展开更多
关键词 压电陶瓷作动器 迟滞非线性 Hammerstein动态迟滞模型 mpi模型 滑模跟踪控制
在线阅读 下载PDF
Empirical Analysis on the Human Dynamics of a Large-Scale Short Message Communication System 被引量:6
14
作者 ZHAO Zhi-Dan XIA Hu +1 位作者 SHANG Ming-Sheng ZHOU Tao 《Chinese Physics Letters》 SCIE CAS CSCD 2011年第6期352-355,共4页
Research on human behavior has attracted increasing attention recently because of its scientific significance and potential applications.Some empirical results have indicated that the timing of many human activities f... Research on human behavior has attracted increasing attention recently because of its scientific significance and potential applications.Some empirical results have indicated that the timing of many human activities follows non-Poisson statistics.We analyze a real-life huge dataset of short message communication with 6326713 users and 37577781 records during the 2006 Chinese New Year.The results show that the number of short message sendings,the interevent time between two consecutive short message sendings and the response time all follow heavy-tailed distribution.We further observe a strongly positive correlation between the activity and the power-law exponent of the interevent time distribution.In addition,the short message communication system displays a bursty property yet no memory effects,which is in particular different from some well-studied human-activited systems such as email-sending,library-loaning and file printing. 展开更多
关键词 EMAIL message PRINTING
原文传递
The Use of Dynamic Message Signs (DMSs) on the Freeways: An Empirical Analysis of DMSs Logs and Survey Data 被引量:1
15
作者 Boniphace Kutela Hualiang Teng 《Journal of Transportation Technologies》 2021年第1期90-107,共18页
This study evaluates the Dynamic Message Signs (DMSs) use to dissipate incident information on the freeways in Las Vegas, Nevada. It focuses on the DMSs message timing, extent, and content, from the operators’ and dr... This study evaluates the Dynamic Message Signs (DMSs) use to dissipate incident information on the freeways in Las Vegas, Nevada. It focuses on the DMSs message timing, extent, and content, from the operators’ and drivers’ perspectives, considering the variability in drivers’ freeway experience. Two-week incidents data with fifty-nine incidents, DMS log data, and responses from a survey questionnaire were used. The descriptive analysis of the incidents revealed that about 54% of the incidents had their information posted on the DMSs;however, information of only 18.6% of the incidents was posted on time. The posted information covered the incident type (54.2%), location (49.2%), and lane blockage (45.8%), while the expected delay or the time the incident has lasted are rarely posted. Further, the standard DMSs are the most preferred sources of traffic information on the freeway compared to the travel time only DMSs, and the graphical map boards. The logistic regression applied to the survey responses revealed that regular freeway users are less likely to take an alternative route when they run into congestion, given no other </span><span style="font-family:Verdana;">information is available. Conversely, when given accurate information</span><span style="font-family:Verdana;"> through DMSs, regular freeway users are about 2.9 times more likely to detour. Furthermore, regular freeway users perceive that the DMSs show clear information about the incident location. Upon improving the DMSs usage, 73% of respondents suggested that the information be provided earlier, and 54% requested improvements on congestion duration and length information. These findings can be used by the DMSs operators in Nevada and worldwide to improve freeway operations. 展开更多
关键词 Dynamic message Signs Dynamic Traffic Display Driver Behaviors Freeways DETOUR
在线阅读 下载PDF
利用MPI实现点云SAC-IA并行配准
16
作者 崔家武 曾波 +2 位作者 李海军 甄兆聪 梁建青 《工程勘察》 2024年第4期61-67,共7页
采样一致性初始配准算法(SAC-IA)是点云的一种粗配准算法。针对大规模点云SAC-IA配准效率低、实时性差等问题,本文提出利用消息传递接口MPI实现点云SAC-IA多进程并行配准,主要包括法向量并行估计、SPFH特征及FPFH特征并行计算和SAC-IA... 采样一致性初始配准算法(SAC-IA)是点云的一种粗配准算法。针对大规模点云SAC-IA配准效率低、实时性差等问题,本文提出利用消息传递接口MPI实现点云SAC-IA多进程并行配准,主要包括法向量并行估计、SPFH特征及FPFH特征并行计算和SAC-IA并行配准。实验结果表明,MPI多进程并行算法可显著提高点云SAC-IA配准速度。 展开更多
关键词 SAC-IA mpi 法向量 SPFH特征 FPFH特征
原文传递
An MPI parallel DEM-IMB-LBM framework for simulating fluid-solid interaction problems 被引量:2
17
作者 Ming Xia Liuhong Deng +3 位作者 Fengqiang Gong Tongming Qu Y.T.Feng Jin Yu 《Journal of Rock Mechanics and Geotechnical Engineering》 SCIE CSCD 2024年第6期2219-2231,共13页
The high-resolution DEM-IMB-LBM model can accurately describe pore-scale fluid-solid interactions,but its potential for use in geotechnical engineering analysis has not been fully unleashed due to its prohibitive comp... The high-resolution DEM-IMB-LBM model can accurately describe pore-scale fluid-solid interactions,but its potential for use in geotechnical engineering analysis has not been fully unleashed due to its prohibitive computational costs.To overcome this limitation,a message passing interface(MPI)parallel DEM-IMB-LBM framework is proposed aimed at enhancing computation efficiency.This framework utilises a static domain decomposition scheme,with the entire computation domain being decomposed into multiple subdomains according to predefined processors.A detailed parallel strategy is employed for both contact detection and hydrodynamic force calculation.In particular,a particle ID re-numbering scheme is proposed to handle particle transitions across sub-domain interfaces.Two benchmarks are conducted to validate the accuracy and overall performance of the proposed framework.Subsequently,the framework is applied to simulate scenarios involving multi-particle sedimentation and submarine landslides.The numerical examples effectively demonstrate the robustness and applicability of the MPI parallel DEM-IMB-LBM framework. 展开更多
关键词 Discrete element method(DEM) Lattice Boltzmann method(LBM) Immersed moving boundary(IMB) Multi-cores parallelization message passing interface(mpi) CPU Submarine landslides
在线阅读 下载PDF
MPI+CUDA联合加速重力场反演的并行算法 被引量:1
18
作者 赵锴坤 朱炬波 +1 位作者 谷德峰 韦春博 《大地测量与地球动力学》 CSCD 北大核心 2024年第4期423-428,共6页
针对重力场解算过程中数据量巨大的问题,联合MPI(massage passing interface)与CUDA(compute unified device architecture)提出基于最小二乘法的重力场解算过程的并行加速算法。使用MPI完成复杂过程的任务分配,实现全局层面的并行加速... 针对重力场解算过程中数据量巨大的问题,联合MPI(massage passing interface)与CUDA(compute unified device architecture)提出基于最小二乘法的重力场解算过程的并行加速算法。使用MPI完成复杂过程的任务分配,实现全局层面的并行加速;基于CUDA编写大规模矩阵相乘的并行加速程序,并针对不同类型的矩阵进行适配,同时联合MPI将法矩阵的计算过程进一步细分,实现对分进程内存峰值的压缩。在单机上完成30阶与120阶重力场仿真解算任务,结果表明,反演30阶重力场时加速比可达180;反演120阶重力场时,并行计算单次迭代仅耗时2 h,而串行模式下无法计算。 展开更多
关键词 重力场 并行计算 CUDA mpi
在线阅读 下载PDF
一种基于HDFS的分布式文件系统MPIFS 被引量:4
19
作者 陈卓航 陈雅琴 郭志勇 《黑龙江工程学院学报》 CAS 2024年第1期9-14,共6页
传统的MPI(Message Passing Interface)计算特点是数据向计算迁移,对于数据量庞大的计算任务具有先天的不足。文中提出一种支持MPI的分布式文件系统MPIFS的架构及实现。该文件系统基于HDFS(Hadoop Distributed File System),使得MPI在MP... 传统的MPI(Message Passing Interface)计算特点是数据向计算迁移,对于数据量庞大的计算任务具有先天的不足。文中提出一种支持MPI的分布式文件系统MPIFS的架构及实现。该文件系统基于HDFS(Hadoop Distributed File System),使得MPI在MPIFS上能同时支持计算密集型和数据密集型计算,设置两个类型的批处理词频统计实验,所需数据都分布式存储在MPIFS分布式文件系统中,通过调用系统提供的统一数据接口实现数据访问。1个计算节点在本地计算大小为m的文件,n个计算节点分布式并行计算大小为n×m的文件,两者计算时间相同,MPIFS中文件总量不变,计算节点数量减少,计算时间t变长,可得出MPIFS文件系统架构可行,能够支持MPI实现计算向数据迁移的并行计算。 展开更多
关键词 mpi 分布式文件系统 分布式并行计算 计算迁移
在线阅读 下载PDF
基于MPI的鲲鹏CPU核间通信研究
20
作者 周岩 王鹏 王琨予 《西南民族大学学报(自然科学版)》 CAS 2024年第3期328-335,共8页
核间通信延时是影响高性能计算系统整体运行效率的重要因素.国产鲲鹏CPU在高性能计算领域应用日益广泛,针对鲲鹏CPU的缓存架构及多核间接口互联进行分析,研究影响鲲鹏CPU核间通信延时的因素.在消息传递接口(MPI)环境下进行节点内核间通... 核间通信延时是影响高性能计算系统整体运行效率的重要因素.国产鲲鹏CPU在高性能计算领域应用日益广泛,针对鲲鹏CPU的缓存架构及多核间接口互联进行分析,研究影响鲲鹏CPU核间通信延时的因素.在消息传递接口(MPI)环境下进行节点内核间通信实验,对包括跨三级缓存、跨物理CPU通信等不同模式下通信延时进行对比,发现通信数据包大于500 KB后,跨L3 Cache TAG的通信延时反优于共享L3 Cache TAG的通信延时.针对通信数据包在64 KB大小时的通信延迟异常,分析得出是MPI的Eager模式和Rendezvous模式的默认切换阈值所造成.对这两种模式进行实验对比,验证不同大小的通信数据包在不同模式下和跨核通信时的延时特征,Eager模式更适合低延时的小消息发送.在实际应用中可根据通信数据包大小调整两种模式的默认切换阈值,以达到更好的传输效果.实验结果表明由于鲲鹏CPU存在复杂的多核结构,在并行计算程序设计时可以进行针对性优化,以提升程序的运行效率. 展开更多
关键词 鲲鹏CPU 核间通信 消息传递接口 高性能计算 共享缓存
在线阅读 下载PDF
上一页 1 2 250 下一页 到第
使用帮助 返回顶部