New Algorithm for the Continuing Dynamic Programming (original Chinese title: 连续型动态规划的新算法研究)

Cited by: 2

Abstract: This paper proposes an original algorithm, the discrete approximate iteration method, for solving one-dimensional continuous dynamic programming problems, and combines it with the bi-convergent method to solve multidimensional continuous dynamic programming problems. The basic idea is as follows: with the sequences of all other state variables held fixed, discrete approximate iteration is applied to one state-variable sequence at a time to find the optimal sequence for that variable, and this is repeated until every state-variable sequence has been examined. Convergence of the algorithm is proved when the model is a non-convex, non-concave dynamic program, and linear convergence is proved when the model is a convex dynamic program. Finally, a numerical example verifies the effectiveness of the model and the algorithm.
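The abstract does not spell out the steps of the discrete approximate iteration method, so the following is only a minimal sketch of one plausible reading for the one-dimensional case, not the author's algorithm. It assumes that each pass places a small grid of candidate states around the current state sequence, solves the resulting discrete dynamic program by a forward Bellman recursion, and shrinks the grid spacing whenever the pass brings no improvement. The names `discrete_approx_iteration`, `stage_cost`, `points`, `shrink`, and `tol`, and the formulation of the stage cost in terms of consecutive states, are all hypothetical.

```python
from typing import Callable, List, Tuple

# Hypothetical stage cost c(t, x_t, x_{t+1}); the control is implied by the transition.
StageCost = Callable[[int, float, float], float]


def path_cost(stage_cost: StageCost, path: List[float]) -> float:
    """Total cost of a state sequence x_0, ..., x_T."""
    return sum(stage_cost(t, path[t], path[t + 1]) for t in range(len(path) - 1))


def discrete_approx_iteration(
    stage_cost: StageCost,
    x0: float,
    horizon: int,
    lo: float,
    hi: float,
    points: int = 5,
    shrink: float = 0.5,
    tol: float = 1e-6,
    max_iter: int = 10_000,
) -> Tuple[List[float], float]:
    """Minimise the total stage cost over x_1..x_T in [lo, hi], with x_0 fixed.

    Assumes lo <= x0 <= hi. Returns the best state sequence found and its cost.
    """
    path = [x0] * (horizon + 1)  # initial guess: stay at x0
    cost = path_cost(stage_cost, path)
    step = (hi - lo) / 4.0       # initial grid spacing (an arbitrary choice)
    half = points // 2
    for _ in range(max_iter):
        if step <= tol:
            break
        # Candidate states per stage: a small grid centred on the current path.
        grids = [[x0]] + [
            sorted({min(hi, max(lo, path[t] + k * step)) for k in range(-half, half + 1)})
            for t in range(1, horizon + 1)
        ]
        # Forward Bellman recursion restricted to the candidate grids.
        best = [0.0]
        back: List[List[int]] = []
        for t in range(1, horizon + 1):
            new_best, new_back = [], []
            for x in grids[t]:
                costs = [best[i] + stage_cost(t - 1, xp, x)
                         for i, xp in enumerate(grids[t - 1])]
                i_min = min(range(len(costs)), key=costs.__getitem__)
                new_best.append(costs[i_min])
                new_back.append(i_min)
            best = new_best
            back.append(new_back)
        # Backtrack the cheapest discrete path.
        j = min(range(len(best)), key=best.__getitem__)
        new_cost = best[j]
        new_path = [x0] * (horizon + 1)
        for t in range(horizon, 0, -1):
            new_path[t] = grids[t][j]
            j = back[t - 1][j]
        if new_cost < cost - 1e-12:
            path, cost = new_path, new_cost  # strict improvement: re-centre the grid
        else:
            step *= shrink                   # no improvement: refine the grid
    return path, cost
```

As a usage example, `discrete_approx_iteration(lambda t, x, y: (y - x - 1.0) ** 2, 0.0, 3, 0.0, 10.0)` should drive the state sequence toward 0, 1, 2, 3. Because the current path is always one of the candidates, the cost never increases from one pass to the next. For the multidimensional case described in the abstract, a routine of this kind would presumably be applied to one state variable at a time with the others held fixed.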

Author: Zhang Peng (张鹏)

Affiliation: School of Management, Wuhan University of Science and Technology (武汉科技大学管理学院)

Source: Operations Research Transactions (《运筹学学报》), indexed in CSCD and the Peking University Core Journals list (北大核心), 2012, No. 1, pp. 97-105 (9 pages)

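Funding: Humanities and Social Sciences Research Project of the Ministry of Education (08JC630062); Natural Science Foundation of Hubei Province (2010CDB03304, 2010CDB02103); Soft Science Project of the Hubei Provincial Department of Science and Technology (2010DHA018)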

Keywords: dynamic programming; multidimensional; discrete approximate iteration method; bi-convergent method

Classification number: O221.3 [Science: Operations Research and Cybernetics]
