Path Planning Algorithm of Amphibious Unmanned Platform Based on Proximal Policy Optimization (cited by: 2)
Abstract To address path planning for amphibious unmanned platforms in complex environments, where traditional methods struggle with dynamic obstacles and changing surroundings, this paper proposes a path planning algorithm based on proximal policy optimization (PPO), aimed at improving training speed and stability in local path planning. The algorithm includes four perceptual-information input schemes and a speed-enhanced reward function, and it adapts to both dynamic and static environments. Batch regularization, a policy-entropy term, and an adaptive clipping factor markedly improve its convergence speed and stability. The study uses the ROS simulation platform with the Flatland physics engine and the PedSim plugin to simulate a variety of complex scenarios containing dynamic obstacles. Experimental results show that an amphibious unmanned platform using the BEV+V state-space input structure and a discrete action space achieves a high success rate and a low timeout rate in path planning, outperforming traditional methods and the other schemes tested. Simulation and comparative experiments show that combining the bird's-eye-view plus velocity (BEV+V) state-space data structure with the speed-enhanced reward function improves performance: convergence speed increases by 25.58%, the path planning success rate rises by 25.54%, and the timeout rate falls by 13.73%.
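As a rough illustration of the objective the abstract refers to, the following is a minimal sketch of the standard PPO clipped surrogate with an entropy bonus. It is not the authors' implementation. The function name `ppo_clip_objective` and all parameter names and default values are assumptions made for this example. The abstract does not specify the adaptive clipping schedule or the speed-enhanced reward, so the clipping factor appears here as a plain parameter `clip_eps` that a caller could vary during training.

```python
import math


def ppo_clip_objective(new_logp, old_logp, advantages, entropies,
                       clip_eps=0.2, entropy_coef=0.01):
    """Mean PPO clipped surrogate plus entropy bonus (a quantity to maximize).

    All names and defaults are hypothetical; inputs are equal-length lists
    of per-sample log-probabilities, advantage estimates, and policy entropies.
    """
    total = 0.0
    for lp_new, lp_old, adv, ent in zip(new_logp, old_logp, advantages, entropies):
        # Probability ratio between the updated policy and the data-collecting policy.
        ratio = math.exp(lp_new - lp_old)
        # Restrict the ratio to [1 - clip_eps, 1 + clip_eps].
        clipped = max(1.0 - clip_eps, min(ratio, 1.0 + clip_eps))
        # Pessimistic bound: take the smaller of the unclipped and clipped terms.
        surrogate = min(ratio * adv, clipped * adv)
        # The entropy bonus encourages exploration.
        total += surrogate + entropy_coef * ent
    return total / len(advantages)
```

With a positive advantage, the gain from raising an action's probability is capped once the ratio exceeds `1 + clip_eps`. With a negative advantage the unclipped term is kept, so overly large policy changes are still penalized.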
Authors ZUO Zhe (左哲); QIN Wei (覃卫); XU Ziyang (徐梓洋); LI Yu'an (李寓安); CHEN Tairan (陈泰然) (School of Mechanical Engineering, Beijing Institute of Technology, Beijing 100081, China)
Source Transactions of Beijing Institute of Technology (《北京理工大学学报》), indexed in EI, CAS, and the Peking University Core Journals list; 2025, No. 1, pp. 19-25 (7 pages)
Funding Key Laboratory Project of Beijing Institute of Technology (2022-CXPT-LC-003-01).
Keywords path planning; amphibious; unmanned platform; proximal policy optimization (PPO)