期刊文献+

MAXQ方法在出租车问题中的应用

Application of MAXQ Method in Taxi Problem
在线阅读 下载PDF
导出
摘要 分层强化学习方法可用于解决维数灾难问题,MAXQ方法通过分层地分解值函效,将任务分解为不同层次上的子任务,从而只需在低维空间中解决问题。针对MAXQ方法。首先介绍其基本原理,然后介绍MAXQ方法在出租车问题中的应用,包括任务分解以及类的设计,最后用实验验证了MAXQ方法比Q-学习算法收敛快。 Hierarchical reinforcement learning can be used to solve curse of dimensionality problem. MAXQ method decomposes the task into gubtasks in different levels through decomposing value function hierarchically, so it can be realized in low dimension space. Aiming at the method MAXQ, we firstly introduce the basal principle. Then we introduce the application of the MAXQ method in the taxi problem, including the task decomposition and the class design. Finally,as is testified in practice,MAXQ method converges more faster than Q - learning algorithm.
出处 《茂名学院学报》 2007年第1期56-59,共4页 Journal of Maoming College
关键词 分层强化学习 MAXQ 任务分解 hierarchical reinforcement learning MAXQ task decomposition
  • 相关文献

参考文献6

  • 1Tom M Mitchell.曾华军 张银奎译.机器学习[M].北京:机械工业出版社,2003..
  • 2高阳,陈世福,陆鑫.强化学习研究综述[J].自动化学报,2004,30(1):86-100. 被引量:295
  • 3Richard S S,Doina P,Satinder S.Between MDPs and Semi-MDPs:A framework for temporal abstraction in reinforcement learning[J].Artificial Intelligence,1999,112:181-211.
  • 4Ronald P,Stuart R.Reinforcement Learning with Hierarchies of Machines.Advances in Neural Information Processing Systems[EB/OL].(1997)[2006-10].http://citeseer.ist.psu.edu/parr97reinforeement.html.
  • 5Thomas G D.Hierarchical reinforcement learning with the MAXQ value function decomposition[J].Journal of Artificial Intelligence Research,2000,13:227-303.
  • 6Mare P,Andrew G B.PolicyBlocks:An Algorithm for Greating Useful Macro-Actions in Reinforcement Learning[EB/OL].(2002)[2006-10].http://citeseer.ist.psu.edu/pickett02policyblocks.html.

二级参考文献4

共引文献314

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部