期刊文献+
共找到1篇文章
< 1 >
每页显示 20 50 100
Multi-constraint reinforcement learning in complex robot environments
1
作者 Sheng HAN Hengrui ZHANG +2 位作者 Hao WU Youfang LIN Kai LV 《Frontiers of Computer Science》 2025年第8期105-107,共3页
1 Introduction Constrained Reinforcement Learning(CRL),modeled as a Constrained Markov Decision Process(CMDP)[1,2],is commonly used to address applications with security restrictions.Previous works[3]primarily focused... 1 Introduction Constrained Reinforcement Learning(CRL),modeled as a Constrained Markov Decision Process(CMDP)[1,2],is commonly used to address applications with security restrictions.Previous works[3]primarily focused on the single-constraint issue,overlooking the more common multi-constraint setting which involves extensive computations and combinatorial optimization of multiple Lagrange multipliers. 展开更多
关键词 constrained reinforcement learning combinatorial optimization multiple lagrange multipliers constrained markov decision process complex robot environments constrained reinforcement learning crl modeled constrained markov decision process cmdp multi constraint lagrange multipliers
原文传递
上一页 1 下一页 到第
使用帮助 返回顶部