摘要
提出一种基于连续时间Markov决策过程的动态电源管理策略优化方法.通过建立动态电源管理系统的随机切换模型,将动态电源管理问题转化为带约束的策略优化问题,并给出一种基于矢量合成的策略梯度优化算法.随机切换模型对动态电源管理系统的描述精确,策略优化算法简便有效,既能离线计算,也适用于在线优化.仿真实验验证了该方法的有效性.
Based on continuous-time Markov decision processes, a power management policy optimization approach is proposed. First an event-driven stochastic switching model is introduced for power-managed systems. Under this model, the problem of dynamic power management is formulated as a constrained policy optimization problem. Then an efficient synthetic gradient-based policy optimization algorithm is presented. This algorithm can be employed to on-line optimization as well as off-line numerical computation.Finally, a simulation example is given to illustrate the effectiveness of the proposed approach.
出处
《计算机辅助设计与图形学学报》
EI
CSCD
北大核心
2006年第5期680-686,共7页
Journal of Computer-Aided Design & Computer Graphics
基金
国家自然科学基金(60274012
60574065)
安徽省自然科学基金(050420301)
关键词
动态电源管理
MARKOV决策过程
随机切换模型
策略优化
梯度算法
dynamic power management
Markov decision processes
stochastic switching model
policy optimization
gradient algorithm