Performance sensitivities for parameterized Markov systems
Authors: Xiren CAO, Junyu ZHANG. Journal of Control Theory and Applications (English edition), EI-indexed, 2004, No. 1, pp. 65-68 (4 pages)
It is known that performance potentials (or, equivalently, perturbation realization factors) can be used as building blocks for the performance sensitivities of Markov systems. In parameterized systems, changes in the parameters may affect only some states, and the explicit transition probability matrix may not be known. In this paper, we use an example to show that potentials can be used to construct performance sensitivities in a more flexible way: only the potentials at the affected states need to be estimated, and the transition probability matrix need not be known. Policy iteration algorithms that are simpler than the standard one can then be established.
Keywords: perturbation analysis; Markov decision processes; policy iteration; reinforcement learning; perturbation realization
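
The abstract's claim that potentials act as building blocks for sensitivities can be illustrated with a small numerical sketch. This is not the paper's example or algorithm: the three-state chain, the reward vector `f`, the perturbed matrix `P_new`, and all function names below are assumptions made for illustration. It relies on the standard result for ergodic chains that the derivative of the average reward along the direction Q = P_new - P equals pi Q g, where pi is the stationary distribution and g is a vector of potentials.

```python
# Illustrative sketch only: the chain, rewards, and function names are assumed.

def mat_vec(P, v):
    """Return the matrix-vector product P v."""
    return [sum(p * x for p, x in zip(row, v)) for row in P]

def vec_mat(v, P):
    """Return the row-vector times matrix product v P."""
    n = len(P)
    return [sum(v[i] * P[i][j] for i in range(n)) for j in range(n)]

def stationary(P, iters=500):
    """Approximate the stationary distribution by power iteration."""
    pi = [1.0 / len(P)] * len(P)
    for _ in range(iters):
        pi = vec_mat(pi, P)
    return pi

def average_reward(P, f):
    """Long-run average reward eta = pi f."""
    return sum(p * r for p, r in zip(stationary(P), f))

def potentials(P, f, terms=500):
    """Potentials via the truncated series g = sum_n (P^n f - eta)."""
    eta = average_reward(P, f)
    g = [0.0] * len(P)
    v = list(f)
    for _ in range(terms):
        g = [gi + vi - eta for gi, vi in zip(g, v)]
        v = mat_vec(P, v)
    return g

def sensitivity(P, P_new, f):
    """Derivative of eta along Q = P_new - P, computed as pi Q g."""
    pi = stationary(P)
    g = potentials(P, f)
    Q = [[b - a for a, b in zip(ra, rb)] for ra, rb in zip(P, P_new)]
    return sum(p * x for p, x in zip(pi, mat_vec(Q, g)))

# Assumed example: the parameter change alters transitions out of state 0 only.
P = [[0.5, 0.3, 0.2], [0.2, 0.6, 0.2], [0.3, 0.3, 0.4]]
P_new = [[0.2, 0.3, 0.5], [0.2, 0.6, 0.2], [0.3, 0.3, 0.4]]
f = [1.0, 2.0, 4.0]
derivative = sensitivity(P, P_new, f)
```

Because Q is nonzero only in the row for state 0 in this assumed example, the product pi Q g reduces to a single term weighted by the stationary probability of state 0. The sketch still uses the full matrix P to compute pi and g; the estimation of these quantities without knowing P, which the abstract refers to, is not attempted here.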