期刊文献+

连续型动态规划的新算法研究 被引量:2

New Algorithm for the Continuing Dynamic Programming
在线阅读 下载PDF
导出
摘要 提出了求解一维连续型动态规划问题的自创算法——离散近似迭代法,并结合双收敛方法求解多维连续型动态规划问题.该算法的基本思路为:在给定其它状态向量序列的基础上,每次对一个状态变量序列进行离散近似迭代,并找出该状态变量的最优序列,直到所有状态向量序列都检查完.当模型为非凸非凹动态规划时,证明了该算法的收敛性.当模型为凸动态规划时,证明了该算法的线性收敛性.最后,以一个具体算例验证了该模型和算法的有效性. The paper proposes the discrete approximate iteration method to solve single-dimensional continuing dynamic programming model. At the same time, multidimensional continuing dynamic programming model is solved by the discrete approximate iteration method and bi-convergent method. The algorithm is as following: Firstly, let state value of one of state equations be unknown and the others be known. Secondly, use the discrete approximate iteration method to find the optimal value of the unknown state values and then continue iterating until all state equations have found optimal values. If the objective function is non-concave and non-convex, the convergence of the algorithm is proved. If the objective function is convex, the linear convergence of the algorithm is proved. At last, the effectiveness of the formation and the algorithm is proved by an example.
作者 张鹏
出处 《运筹学学报》 CSCD 北大核心 2012年第1期97-105,共9页 Operations Research Transactions
基金 教育部人文社科研究项目(08JC630062) 湖北省自然科学基金项目(2010CDB03304 2010CDB02103) 湖北省科技厅软科学项目(2010DHA018)
关键词 动态规划问题 多维 离散近似迭代方法 双收敛法 dynamic programming, dimension, discrete approximate iteration, biconvergent method
  • 相关文献

参考文献22

  • 1Bellman R.Dynamic programming[M].Princeton University Press,Princeton,1957.
  • 2Shepherd W, Zakikhani P. Suggested definition of reactive power innon-sinusoidal systems[C]. IEE Proceedings, 1972, 119 (9): 1361-1362.
  • 3Nowoniejski Z J, Sowa E. Teoriamocy ukladow elektrycznych. Gliwice: DzialW ydaw nictw Politechiki Slaskiej[Z]. 1977.
  • 4Kusters N I, Moore W J M. On definition of reactive power under non-sinusoidal conditions[J]. IEEE Transactions on Power Apparatus and Systems, 1980, PAS-99(5): 1845-1854.
  • 5Czarnecki L S. Considerations on the reactive power in non-sinusoidal situations[J]. IEEE Transactions on Instrumentation and Measurement, 1985, 34(3): 399-404.
  • 6Czarnecki L S. What is wrong with the Budeanu concept of reactive and distortion power and why it should be abandoned[J]. IEEE Transactions on Instrumentation and Measurement, 1987, 36(3): 834-837.
  • 7Czarnecki L S. An orthogonal decomposition of current of non sinusoidal voltage source applied to nonlinear loads[J]. International Journal of Circuit Theory Application, 1983, (11): 235-239.
  • 8Villarreal B,Karwan M H.Multicriteria integer programming:a(hybrid) dynamic programming recursive approach[J].Mathematical programming,1981,21:204-223.
  • 9Philbrick C R,Jr Kitanidis P K.Improved dynamic programming methods for optimal control of lumped-paramter stochastic system[J].Operations Research,2001,49(3):398-412.
  • 10Bertsimas D,Demir R.An approximate dynamic programming approach to multidimensional knapsack problems[J].Management Science,2002,48(2):550-565.

二级参考文献46

共引文献55

同被引文献24

  • 1段鹰,段文泽.大规模时滞系统的动态规划模型与优化算法[J].机械工程学报,2007,43(4):217-223. 被引量:3
  • 2李端,钱富才,李力,高建军.动态规划问题研究[J].系统工程理论与实践,2007,27(8):56-64. 被引量:30
  • 3Smith D R. The design of divide and conquer algorithms [J]. Science o] Computer Programming, 1985, 5: 37-58.
  • 4Dreyfus S. Richard Bellman on the birth of dynamic programming [J]. Operations Research, 2002, 50(1): 48-51.
  • 5Eddy S R. What is dynamic programming? [J]. Nature Biotechnology, 2004, 22(7): 909-910.
  • 6Viterbi A J. Error bounds for convolutional codes and an asymptotically optimum decoding algorithm [J]. IEEE Transactions on Information Theory, 1967, 13(2): 260-269.
  • 7Omura J K. On the Viterbi decoding algorithm [J]. IEEE Transactions on Information Theory, 1969, 15(1): 177-179.
  • 8Viterbi A J. Convolutional codes and their performance in communication systems [J]. IEEE Transactions on Communications Technology, 1971, 19(5): 751-772.
  • 9Forney G D. Convolutional codes II. Maximum-likelihood decoding [J]. Information and Con- trol, 1974, 25(3): 222-266.
  • 10Levinson S E, Rabiner L R, Sondhi M M. An introduction to the application of the theory of probabilistic functions of Markov process to automatic speech recognition [J]. The Bell System Technical Journal, 1983, 62(4): 1035-1074.

引证文献2

二级引证文献8

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部