期刊文献+

连续时间Markov控制过程的平均代价最优鲁棒控制策略 被引量:4

Optimal Robust Control Policy for Continuous-time Markov Control Processes With Average-Cost Criteria
在线阅读 下载PDF
导出
摘要 在Markov性能势基础上 ,研究了一类转移速率不确定但受紧集约束的遍历连续时间Markov控制过程 (CTMCP)的鲁棒控制问题 .根据系统的遍历性 ,平均代价Poisson方程的解可被看作是性能势的一种定义 .在平均代价准则下 ,优化控制的目标是选择一个平稳策略使得系统在参数最坏取值下能获得最小无穷水平平均代价 ,据此论文给出了求解最优鲁棒控制策略的策略迭代 (PI)算法 ,并详细讨论了算法的收敛性 . Motivated by the needs of optimization and control of practical engineering systems with uncertain parameters, we considered, through the Markov performance potential theory, the robust control problems for a class of continuous time Markov control processes with uncertain transition rates that are constrained on compact sets. By ergodic property of the processes, the solution of the average cost Poisson equation can be viewed as a definition for the concept of Markov performance potential. Under average cost criteria, our goal is to obtain a stationary policy that generates the minimal infinite horizon average cost under the worst choice of the system parameters. Therefore, we developed a policy iteration algorithm for generating an optimal robust control policy, and discussed in detail the convergence of the proposed algorithm.
出处 《中国科学技术大学学报》 CAS CSCD 北大核心 2004年第2期219-225,共7页 JUSTC
基金 合肥工业大学中青年科技创新群体计划 安徽省优秀青年科技基金 (0 4 0 4 2 0 4 4 )
关键词 Markov性能势 连续时间Markov控制过程 鲁棒控制策略 策略迭代 最优控制 Markov performance potentials continuous time Markov control processes robust control policy policy iteration
  • 相关文献

参考文献3

二级参考文献2

共引文献15

同被引文献14

  • 1殷保群,李衍杰,奚宏生,周亚平.一类可数Markov控制过程的最优平稳策略[J].控制理论与应用,2005,22(1):43-46. 被引量:1
  • 2韩江洪,郑淑丽,陆阳,魏振春,于筑国.离散事件控制系统规则化描述方法的研究[J].合肥工业大学学报(自然科学版),2005,28(9):1081-1084. 被引量:11
  • 3Cao X R.Semi-Markov decision problems and performance sensitivity analysis[J].IEEE Trans on Automatic Control,2003,48(5): 758-769.
  • 4Ramadge P J,Wonham W M.The control of discret event system[C]//Proc IEEE on Discrete Event Systems, 1989,77(1 ):81-98.
  • 5Cao X R. Semi-Markov decision problems and performance sensitivity analysis[J]. IEEE Trans. on Automatic Control, 2003,48(5): 758-769.
  • 6Kalyanasundaram S, Chong E K P, Shroff N B. Markov decision processes with uncertain transition rates: sensitivity and robust control[A]. Proceedings of the 41th IEEE Conference on Decision and Control[C]. Las Vegas, Nevada USA: 2002, 4: 3799-3804.
  • 7Tang H, Liang X J, Gao J, and Liu C. Robust control policy for semi-Markov decision processes with dependent uncertain parameters[A]. The 5th World Congress on Intelligent Control and AutomationfC]. Hangzhou. China: June, 2004, 1: 515-518.
  • 8Tang H, Yuan J B, Lu Y, Cheng W J. Performance potential-based Neuro-dynamic programming for SMDPs. Acta Automatic Sinica (toapocar) [Z].
  • 9王兴富,谷红伟,戴学丰.马尔可夫链在离散事件系统中的应用[J].自动化技术与应用,2000,19(3):35-36. 被引量:2
  • 10唐昊,奚宏生,殷保群.Markov控制过程基于单个样本轨道的在线优化算法[J].控制理论与应用,2002,19(6):865-871. 被引量:5

引证文献4

二级引证文献15

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部