期刊文献+

基于平均报酬模型全过程R(λ)学习的互联电网CPS最优控制 被引量:10

An Average Reward Model Based Whole Process R(λ)-learning for Optimal CPS Control
在线阅读 下载PDF
导出
摘要 提出了一种新颖的基于平均报酬模型的全过程R(λ)学习互联电力系统CPS最优控制方法。该方法与电网自动发电控制(AGC)追求较高的考核时间段内的10min平均控制性能标准(CPS)指标合格率的目标相吻合,且所提出的基于平均报酬模型的R(λ)学习算法与基于折扣报酬模型的Q(λ)学习算法相比,在线学习收敛速度更快,可获得更佳的CPS指标。此外,所提出的改进的R(λ)控制器具有全过程在线学习的特点,其预学习过程被一种新型的在线"模仿学习"所代替,克服了以往强化学习控制需要另外搭建仿真模型来进行预学习收敛的严重缺陷,提高了R(λ)控制器的学习效率及其在实际电力系统中的应用性。 The R(λ)-learning algorithm is based on the average reward model.A novel optimal CPS control methodology for interconnected power systems based on the whole process R(λ)-learning algorithm is presented.The objective of the presented CPS control methodology coincides with that of AGC which pursues the high CPS compliance in every ten minutes.Moreover, the R(λ)-learning algorithm can converge faster and gain higher value of the CPS index than the Q(λ)-learning algorithm which is based on a discounted reward model.In addition,the improved controller based on the novel R(λ)-learning algorithm holds the advantage of learning on-line in the whole process and the pre-learning process of the controller is substituted by the imitation-learning process.The improved controller overcomes the serious defect of the conventional reinforcement learning controller which needs to build an accurate simulating model for converging in the pre-learning process,and it can enhance the learning efficiency and applicability in power systems.
作者 余涛 袁野
出处 《电力系统自动化》 EI CSCD 北大核心 2010年第21期27-33,共7页 Automation of Electric Power Systems
基金 国家自然科学基金资助项目(50807016) 广东省自然科学基金资助项目(9151064101000049) 中央高校基本科研业务费专项资金资助项目(2009ZM0251)~~
关键词 控制性能标准(CPS) 自动发电控制(AGC) 平均报酬模型 R(λ)学习 模仿学习 control performance standard(CPS) automatic generation control(AGC) average reward model R(λ)-learning imitation-learning
  • 相关文献

参考文献13

二级参考文献68

  • 1张小白,高宗和,钱玉妹,徐田.用AGC实现稳定断面越限的预防和校正控制[J].电网技术,2005,29(19):85-89. 被引量:44
  • 2唐跃中,张王俊,张健,陈明.基于CPS的AGC控制策略研究[J].电网技术,2004,28(21):75-79. 被引量:62
  • 3张健,唐跃中,章渊.OPEN2000AGC系统在上海电网的应用[J].电力系统自动化,2004,28(19):96-99. 被引量:2
  • 4高宗和,滕贤亮,张小白.互联电网CPS标准下的自动发电控制策略[J].电力系统自动化,2005,29(19):40-44. 被引量:76
  • 5张洪铖,王青.最优控制理论与应用[M].北京:高等教育出版社,2006.
  • 6Jaleeli N, Vanslyck L S. NERC's Mew Control Performance Standards[J]. IEEE Trans on Power Systems, 1999, 14(3): 1091-1099.
  • 7Yao M, Shoults R R, Kelm R. AGC Logic Based on NERC's New Control Performance Standard and Disturbance Control Standard[J]. IEEE Trans on Power Systems, 2000, 15(2): 855-857.
  • 8Feliachi A, Rerkpreedapong D. NERC Compliant Load Frequency Control Design Using Fuzzy Rules[J]. Electric Power Systems Research, 2005, 73(1): 101-106.
  • 9Makarov Y, Hawkins D. New AGC Algorithms[A]. In:Erican EPRI Infrastructure Integration & Markets Product Line Council Meeting[C].California(USA): 2002.
  • 10Sutton R S, Barto A G. Reinforcement Learning. an Introduction[M]. Cambridge.- MIT Press, 1998.

共引文献161

同被引文献180

引证文献10

二级引证文献204

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部