On-Policy and Off-Policy Value Iteration Algorithms for Stochastic Zero-Sum Dynamic Games
Authors: GUO Liangyuan, WANG Bing-Chang, ZHANG Ji-Feng. Journal of Systems Science & Complexity, 2025, Issue 1, pp. 421-435 (15 pages)
This paper considers value iteration algorithms for stochastic zero-sum linear quadratic games with unknown dynamics. On-policy and off-policy learning algorithms are developed to solve the stochastic zero-sum games without requiring knowledge of the system dynamics. By analyzing the value function iterations, the convergence of the model-based algorithm is shown. The equivalence of several types of value iteration algorithms is established. The effectiveness of the model-free algorithms is demonstrated by a numerical example.
Keywords: approximate dynamic programming; on-policy; off-policy; stochastic zero-sum games; value iteration