模糊神经网络下基于强化学习的自主式地面车辆路径规划研究被引量：2

ALV Path Planning Based on Reinforcement Learning in Fuzzy Neural-networks

下载PDF

导出

摘要通过引入一种启发式学习算法,部分改进了MAXQ递阶强化学习方法,并结合模糊神经网络开发了一种自主式地面车辆(ALV)全局路径规划Agent。该智能Agent充分融合了人类操作经验和机器学习能力,为强化学习明确了搜索方向,缩减了计算量,具有较强的自适应能力,满足了系统的实时性要求。仿真结果表明:在庞大状态空间和动态变化环境中,全局路径规划Agent能够有效、实时地进行最优行为的策略学习。 By introducing FMQ（frequency maximum Q） heuristic learning algorithm, a hierarchical method of reinforcement learning was improved, through the combination of this method and fuzzy neural--networks, a global path planning Agent was developed. This Agent integrated the human op-eration experience and the capacity of machine learning, so that it ensured the search direction, reduced the amount of computation, strengthened the ments. The simulation results show that the global effective and real--time in the large state space and adaptive capacity and met the real--time requirepath planning Agent can find the optimal strategy the dynamic changing environment.

作者王文玺肖世德孟祥印张卫华

机构地区西南交通大学牵引动力国家重点实验室

出处《中国机械工程》 EI CAS CSCD 北大核心 2009年第21期2536-2541,共6页 China Mechanical Engineering

基金国家重点基础研究发展计划资助项目(2007CB714701)

关键词模糊神经网络 AGENT 强化学习路径规划自主式地面车辆 fuzzy neural-network Agent reinforcement learning path planning automated land vehicle（ALV）

分类号 TP391.41 [自动化与计算机技术—计算机应用技术]

引文网络
相关文献

参考文献10

1蔡自兴,贺汉根,陈虹.未知环境中移动机器人导航控制研究的若干问题[J].控制与决策,2002,17(4):385-390. 被引量：119
2Cai Zixing, Peng Zhihong. Cooperative Coevoluationary Adaptive Genetic Algorithm in Path Planning of Cooperative Multi -- mobile Robot System [J]. Intelligent & Robotic System(S0921-0296), 2002, 33(1): 61-67.
3席裕庚.动态不确定环境下广义控制问题的预测控制[J].控制理论与应用,2000,17(5):665-670. 被引量：71
4Xu W L, Tso S K. Real--time Self--reaction of a Mobile Robot in Unstructured Environments Using Fuzzy Reasoning [J]. Engineering Applications of Artificial Intelligence, 1996, 9(5): 475-485.
5陈卫东,席裕庚,顾冬雷.自主机器人的强化学习研究进展[J].机器人,2001,23(4):379-384. 被引量：16
6Dietterich T. The MAXQ Method for Hierarchical Reinforcement Learning[C]// Proc. of the 15th ICML. San Francisco: Morgan Kaufmann,1998:118-126.
7Spiros K, Daniel K. Reinforcement Learning of Coordination in Cooperative MAS [C]// Eighteenth National Conference on AI. Alberta: ACM Press, 2002: 326-331.
8孟伟,洪炳镕,韩学东.多月球车定位/决策网络[J].机器人,2004,26(2):102-106. 被引量：1
9Kaelbling L P. Associative Reinforcement Learning: Function in K--DNF[J]. Machine Learning, 1994, 15(2): 279-298.
10肖本贤,刘海霞,张松灿,赵明阳,齐东流.基于多传感器行为融合基础上的AGV导航研究[J].系统仿真学报,2005,17(8):1939-1943. 被引量：4

二级参考文献21

1朱淼良,吴春明,张友军,金毅,李捷.基于多智能体的实时并发式智能机器人结构[J].高技术通讯,1995,5(10):20-24. 被引量：4
2贺汉根徐昕.增强学习在移动机器人导航控制中的应用[J].中南工业大学学报,2000,31:170-173.
3徐昕.增强学习及其在移动机器人导航与控制中的应用[M].长沙:国防科技大学,2002..
4席裕庚，预测控制，1993年
5Lin L J，Proc AAAI'91，1991年，781页
6Lin L J，From Animals to Animates:Int Conference on Simulation of Adaptive Behavior，1991年
7Xiaochuan Wang, Simon X.Yang. Intelligent Obstacle Avoidance for an Autonomous Mobile robot [R]. Proceedings of 5 World Congress on Intelligent Control and Automation, June15-19, 2004, Hangzhou, P.R.China, 4656-4660.
8Qiang Fang, Cunxi Xie. A Study on Intelligent Path Following and Control for Vision-based Automated Guided Vehicl [R]. Proceedings of 5 World Congress on Intelligent Control and Automation, June 15-19, 2004, Hangzhou, P.R.China, 4811-4815.
9Petru Rusu, Emil M.petriu, Thom E.Whalen, etal. Behavior-Based Neuro-fuzzy Controller for Mobile Robot Navigation [J]. IEEE Transactions on Instrumentation and Measurement, 2003, 52(4): 1335-1340.
10Xiaochuan Wang, Simon X.Yang. A Neuro-Fuzzy Approach to Obstacle Avoidance of a Nonholonomic Mobile Robot [R]. Proceedings of the 2003 IEEE/ASM. Advanced Intelligent Mechatronics, 29-34.

共引文献203

1邬再新,李艳宏,刘涛.多移动机器人路径规划技术的研究现状与展望[J].机械,2008,35(1):1-3. 被引量：11
2于飞,吕冬梅,杨宗尧,刘喜梅.改进的人工势能场在足球机器人避障中的应用[J].仪器仪表学报,2006,27(z1):508-509. 被引量：3
3席裕庚.注重控制科学的方法论研究[J].自动化学报,2002,28(S1):85-91. 被引量：4
4刘满禄,张华,胡天链.改进的人工势场法用于移动机器人导航[J].华中科技大学学报（自然科学版）,2008,36(S1):177-180. 被引量：11
5孙大勇,苏庆宇.井下机器人路径规划中的模糊逻辑控制算法[J].电气技术,2007,8(3):47-49. 被引量：1
6康亮,赵春霞,郭剑辉.未知环境下基于三次螺线Bug算法的移动机器人路径规划[J].工程图学学报,2010,31(1):30-38. 被引量：6
7张纯刚,席裕庚.Robot path planning in globally unknown environments based on rolling windows[J].Science China(Technological Sciences),2001,44(2):131-139. 被引量：12
8李一波,张庆涛.室内未知环境遍历路径规划算法综述[J].计算机科学,2012,39(S3):334-338. 被引量：7
9胡咏梅,贾磊.一种基于粗集的广义控制滚动调度策略[J].计算机工程与应用,2004,40(24):7-8. 被引量：1
10谢云,杨宜民.全自主机器人足球系统的研究综述[J].机器人,2004,26(5):474-480. 被引量：21

同被引文献17

1李清泉,郑年波,徐敬海,宋莺.一种基于道路网络层次拓扑结构的分层路径规划算法[J].中国图象图形学报,2007,12(7):1280-1285. 被引量：24
2Tsai C, Huang H, Chan C. Parallel elite genetic algorithm and its application to global path planning for autonomous robot navigation [J]. IEEE Transactions on Industrial Electronics, 2011, 58(10) : 4813--4823.
3Hsu C, Chen Y, Lu M, et al. Optimal path planning incorporating global and local search for mobile robots [A]. 2012 IEEE 1st Global Conference on Consumer Electronics (GCCE)[C], Tokyo, 2012.
4Gomez J V, Lumbier A, Garrido S, et al. Planning robot formations with fast marching square including uncertainty conditions[J]. Robotics and Autonomous Systems, 2013, 61(2): 137-152.
5Yilmaz N, Evangelinos C, Lermusiaux P, et al. Path planning of autonomous underwater vehicles for adaptive sampling using mixed integer linear programming [J]. IEEE Journal of Oceanic Enginering, 2008, 33 (4) : 522-537.
6True Ronggang, Xiao Jizhong, Wang Shaoping, et al. Modeling and path planning of the city-climber robot Part Ⅱ: 3D path planning using mixed integer linear programming[A]. Proceedings of the 2009 IEEE International Conference on Robotics and Biomimetics [C], Guilin, China, 2009.
7Xu Huali, Su Shoubao, Yang Yang. An ant optimization method for path planning on a euboid [A]. Second Pacific-Asia Conference on Circuits, Communications and System[C], Beijing, China, 2010.
8Zhu Yongjie, Chang Jiang, Wang Shuguo. A new path- planning algorithm for mobile robot based on neural network[A]. 2002 IEEE Region 10 Conference on Computers, Communications, Control and Power Engineering[C], Beijing, China, 2002.
9Dan Simon. The application of neural networks to optimal robot trajectory planning[J]. Robotics and Autonomous Systems, 1993, 11 ( 1 ) : 23-24.
10Duguleana M, Barbuceanu F, Teirelbar A, et al. Obstacle avoidance of redundant manipulators using neural networks based reinforcement learning[J]. Robotics and Computer Integrated Manufacturing. 2012, 28(2): 132-146.

引证文献2

1张照生,杨殿阁,张德鑫,连小珉.车辆导航系统中基于街区分块的分层路网路径规划[J].中国机械工程,2013,24(23):3255-3260. 被引量：5
2王斌锐,骆浩华,金英连,冯伟博.爬壁机器人风电叶片曲面路径规划设计与仿真[J].太阳能学报,2015,36(8):1806-1811. 被引量：4

二级引证文献9

1林娜,郑亚男.基于出租车轨迹数据的路径规划方法[J].计算机应用与软件,2016,33(1):68-72. 被引量：9
2张照生,杨殿阁,高利,连小珉.受限大规模路网中货车国道模式路径规划[J].长安大学学报（自然科学版）,2016,36(1):85-91. 被引量：1
3林娜,李建明.浮动车数据挖掘及其在路径规划中的应用[J].计算机工程与设计,2016,37(7):1952-1957. 被引量：2
4林娜,李建明.基于浮动车数据的公交车路线规划研究与实现[J].计算机应用与软件,2016,33(10):270-274. 被引量：1
5宗陈,于家斌,王小艺,许继平.巡航船污染水质采集路径规划仿真研究[J].计算机仿真,2018,35(9):338-342. 被引量：3
6肖鹏,王海鹏,许玮,文艳,李建祥.车载配网巡检数据采集系统设计[J].机床与液压,2020,48(8):102-108. 被引量：7
7李翠明,吴新民,龚俊.光伏组件清洁机器人真实地形的路径规划[J].太阳能学报,2022,43(1):341-347. 被引量：2
8王兵霞,丁志斌,孙乐乐.基于单片机与视觉技术的智能巡检机器人系统设计[J].科技与创新,2025(8):56-58.
9Junchao Kong,Aihong Ji,Qingfei Han,Huan Shen,Shijia Liu,Wenrui Xiang,Qiangqiang Zhang.Advances in Research of Wall-climbing Robots:from Biology to Bionics-A Review[J].Journal of Bionic Engineering,2025,22(3):945-981.

1唐振民,陆建峰,杨静宇.一种自主车全局速度的控制方法[J].机器人,1997,19(2):97-101.
2秦童.基于CMAC的Q算法在机器人足球中的应用[J].电子测试,2012,23(4):76-80.
3安岭丽,彭志平,李铁鹰.MAXQ方法在出租车问题中的应用[J].茂名学院学报,2007,17(1):56-59.
4蔡自兴,王耀南.机器人逆模神经控制及其应用[J].中南工业大学学报,1998,29(1):80-84. 被引量：2
5王耀南,付夏龙.神经网络与PID结合的机器人自适应控制[J].湖南大学学报（自然科学版）,1997,24(6):54-61. 被引量：6
6邵俊.连人的表情都能伪造了机器学习能力真强大[J].计算机与网络,2017,43(1):42-43.
7张清俐.机器学习能力能否媲美人类？[J].党政干部参考,2016,0(9):54-54.
8庞士焕,朱相冰,张琦,汤萍萍.基于MAXQ方法的分层强化学习[J].计算机技术与发展,2009,19(4):154-156. 被引量：1
9姚毓林,张逸敏,洪进.基于Hopfield神经网络模型的启发式学习算法及其在数字模式处理中的应用[J].机器人,1990,12(4):21-24.
10邢宇明,白振兴.分层强化学习在足球机器人中的应用[J].微计算机信息,2008,24(32):231-233. 被引量：2

中国机械工程

2009年第21期

浏览历史

内容加载中请稍等...

模糊神经网络下基于强化学习的自主式地面车辆路径规划研究被引量：2

参考文献10

二级参考文献21

共引文献203

同被引文献17

引证文献2

二级引证文献9

相关作者

相关机构

相关主题

浏览历史

模糊神经网络下基于强化学习的自主式地面车辆路径规划研究 被引量：2

参考文献10

二级参考文献21

共引文献203

同被引文献17

引证文献2

二级引证文献9

相关作者

相关机构

相关主题

浏览历史

模糊神经网络下基于强化学习的自主式地面车辆路径规划研究被引量：2