期刊文献+
共找到1篇文章
< 1 >
每页显示 20 50 100
Large-scale UAV swarm path planning based on mean-field reinforcement learning
1
作者 Yaozhong ZHANG Meiyan DING +4 位作者 Yao YUAN Jiandong ZHANG Qiming YANG Guoqing SHI Jianming JIANG 《Chinese Journal of Aeronautics》 2025年第9期336-349,共14页
In this paper,a deep deterministic policy gradient algorithm based on Partially Observable Weighted Mean Field Reinforcement Learning(PO-WMFRL)framework is designed to solve the problem of path planning in large-scale... In this paper,a deep deterministic policy gradient algorithm based on Partially Observable Weighted Mean Field Reinforcement Learning(PO-WMFRL)framework is designed to solve the problem of path planning in large-scale Unmanned Aerial Vehicle(UAV)swarm operations.We establish a motion control and detection communication model of UAVs.A simulation environment is carried out with No-Fly Zone(NFZ),the task assembly point is established,and the long-term reward and immediate reward functions are designed for large-scale UAV swarm path planning problem.Considering the combat characteristics of large-scale UAV swarm,we improve the traditional Deep Deterministic Policy Gradient(DDPG)algorithm and propose a Partially Observable Weighted Mean Field Deep Deterministic Policy Gradient(PO-WMFDDPG)algorithm.The effectiveness of the PO-WMFDDPG algorithm is verified through simulation,and through the comparative analysis with the DDPG and MFDDPG algorithms,it is verified that the PO-WMFDDPG algorithm has a higher task success rate and convergence speed. 展开更多
关键词 DDPG Mean field games Partially observable po-wmfddpg UAV swarm
原文传递
上一页 1 下一页 到第
使用帮助 返回顶部