期刊文献+
共找到3篇文章
< 1 >
每页显示 20 50 100
Stability Estimation for Markov Control Processes with Discounted Cost 被引量:1
1
作者 Jaime Eduardo Martínez-Sánchez 《Applied Mathematics》 2020年第6期491-509,共19页
This article explores controllable Borel spaces, stationary, homogeneous Markov processes, discrete time with infinite horizon, with bounded cost functions and using the expected total discounted cost criterion. The p... This article explores controllable Borel spaces, stationary, homogeneous Markov processes, discrete time with infinite horizon, with bounded cost functions and using the expected total discounted cost criterion. The problem of the estimation of stability for this type of process is set. The central objective is to obtain a bounded stability index expressed in terms of the Lévy-Prokhorov metric;likewise, sufficient conditions are provided for the existence of such inequalities. 展开更多
关键词 Discrete-Time markov control Process Expected Total Discounted Cost Stability Index Probabilistic Metric Lévy-Prokhorov Metric
在线阅读 下载PDF
Asymptotic Evaluations of the Stability Index for a Markov Control Process with the Expected Total Discounted Reward Criterion
2
作者 Jaime Eduardo Martínez-Sánchez 《American Journal of Operations Research》 2021年第1期62-85,共24页
In this work, for a control consumption-investment process with the discounted reward optimization criteria, a numerical estimate of the stability index is made. Using explicit formulas for the optimal stationary poli... In this work, for a control consumption-investment process with the discounted reward optimization criteria, a numerical estimate of the stability index is made. Using explicit formulas for the optimal stationary policies and for the value functions, the stability index is explicitly calculated and through statistical techniques its asymptotic behavior is investigated (using numerical experiments) when the discount coefficient approaches 1. The results obtained define the conditions under which an approximate optimal stationary policy can be used to control the original process. 展开更多
关键词 control Consumption-Investment Process Discrete-Time markov control Process Expected Total Discounted Reward Probabilistic Metrics Stability Index Estimation
在线阅读 下载PDF
Strategy optimization for controlled Markov process with descriptive complexity constraint 被引量:1
3
作者 JIA QingShan ZHAO QianChuan 《Science in China(Series F)》 2009年第11期1993-2005,共13页
Due to various advantages in storage and implementation, simple strategies are usually preferred than complex strategies when the performances are close. Strategy optimization for controlled Markov process with descri... Due to various advantages in storage and implementation, simple strategies are usually preferred than complex strategies when the performances are close. Strategy optimization for controlled Markov process with descriptive complexity constraint provides a general framework for many such problems. In this paper, we first show by examples that the descriptive complexity and the performance of a strategy could be independent, and use the F-matrix in the No-Free-Lunch Theorem to show the risk that approximating complex strategies may lead to simple strategies that are unboundedly worse in cardinal performance than the original complex strategies. We then develop a method that handles the descriptive complexity constraint directly, which describes simple strategies exactly and only approximates complex strategies during the optimization. The ordinal performance difference between the resulting strategies of this selective approximation method and the global optimum is quantified. Numerical examples on an engine maintenance problem show how this method improves the solution quality. We hope this work sheds some insights to solving general strategy optimization for controlled Markov process with descriptive complexity constraint. 展开更多
关键词 strategy optimization controlled markov process descriptive complexity
原文传递
上一页 1 下一页 到第
使用帮助 返回顶部