期刊文献+

POMDPs算法复杂度对比分析研究

Algorithm Complexity for POMDPs: A Comparative Study
在线阅读 下载PDF
导出
摘要 部分可观察马尔可夫决策过程(Partially Observable Markov Decision Processes,POMDPs)是动态不确定环境下序贯决策的理想模型,但是现有算法都陷入"维数灾"和"历史灾"问题,造成理想的POMDPs模型无法在实际工程中得到应用.本文首先详细分析了POMDPs精确算法的复杂度,阐述问题求解的难点;然后比较分析现有基于点的离线算法和在线算法两类算法的算法思想和时间复杂度,指出两类算法的优缺点;最后简介POMDPs实际应用情况和未来的研究方向. Partially Observable Markov Decision Processes (POMDPs) offers a framework for sequential decision-making under uncertainty in stochastic domains. However, the conventional algorithms are plagued with two curses, dimensionality and history, which makes the ideal POMDPs model inapplicable in practical projects. This paper analyzes the complexity of exact algorithm of POMDPs, and presents the key points in solving this problem. Besides, the ideas and complexity of point-based offiine algorithms and online algorithms were analyzed respectively, and their advantages and disadvantages discussed. Finally, applications of POMDPs and research trends of POMDPs are pointed out.
出处 《深圳职业技术学院学报》 CAS 2013年第1期3-10,共8页 Journal of Shenzhen Polytechnic
基金 广东省自然科学基金资助项目(S2011040004769)
关键词 部分可观察马尔可夫决策过程 序贯决策 信念状态空间 在线算法 维数灾 POMDPs sequential decision-making belief states space online algorithms dimensionalitycurses
  • 相关文献

参考文献35

  • 1Littman M L. A tutorial on Partially Observable Markov Decision Processes [J]. Journal of Mathema- ticalPsychology, 2009, 53 (3): 119-125.
  • 2Cao Xi-Ren, Guo Xian-ping. Partially Observable Markov Decision Processes With Reward Information: Basic Ideas and Models [J]. IEEE Transactions on Automatic Control, 2007, 52 (4): 677-681.
  • 3Ross S, Pineau J, Chaibdraa B, et al. A Bayesian Approach for Learning and Planning in Partially Observable Markov Decision Processes [J]. Journal of Machine Learning Research, 2011, 12 : 1729-1770.
  • 4Paquet S. Distributed Decision-Making and Task Coordination in Dynamic Uncertain and Real-Time Multiagent Environments [D]. Qu6bec: Laval Univer- sity, 2006: 56-92.
  • 5Ross S, Pineau J, Paquet S, Algorithms for POMDPs [J] Intelligence Research, 2008, et al. Online Planning Journal of Artificial 32 (6): 663-704.
  • 6Robert K. Point-Based POMDP Solvers; Survey and Comparative Analysis [D]. Montreal: McGill Univer- sity, 2010: 34-78.
  • 7Roy N, Gordon G. Finding Approximate POMDPs Solutions Through Belief Compression [J. Journal of Artificiallntelligence Research, 2005, 23 (9): 1-40.
  • 8Sondik E. The Optimal Control of Partially Observ- able Markov Decision Processes ED]. California: Stanford University, 1971 : 56-68.
  • 9Pineau J, Gordon G, Thrun S. Anytime point-based approximations for large POMDPs[J]. Journal of Artifieial Intelligence Research, 2006, 27: 335-380.
  • 10Washington, R. BI-POMDPs: bounded incremental partially observable Markov model planning [C] // Steel S, Alami R. Proceedings of the 4th European Conference on Planning. Toulouse: Springer, 1997. 440-451.

二级参考文献7

共引文献20

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部