期刊文献+

基于性能预测的遗传强化学习动态调度方法 被引量:7

Genetic Reinforcement Learning Approach to Dynamic Scheduling Based on Performance Prediction
原文传递
导出
摘要 针对作业车间动态调度问题,在模式驱动调度的框架下,提出遗传强化学习动态调度方法。首先,采用优先规则编码的染色体表达问题的解,将染色体分割成基因模式作为分阶段调度算法的状态模式;其次,设计性能预测变量,构建启发式立即回报函数,引导和加快遗传强化学习算法的搜索进程;再次,设置遗传算子、强化学习及其相关参数以实现搜索过程"开采"与"探索"之间的平衡;最后,仿真实验结果验证了遗传强化学习调度方法的有效性。 In the framework of pattern driven scheduling,a genetic reinforcement learning (GRL) approach to schedule the job in the dynamical job-shop was proposed.First,the chromosome was coded by preference rules-based representation for the problem.The chromosome was divided into gene schema as state patterns for the multi-phase scheduling system.Secondly,a performance predictive variable to construct instant reward function was designed which was used to guide the learning system to progress rapidly.Thirdly,genetic operators,RL and controlling parameters carried out the search strategy for the balance of "exploration" and "exploitation".Finally,the simulation results verify the efficiency of GRL scheduling approach.
出处 《系统仿真学报》 CAS CSCD 北大核心 2010年第12期2809-2812,2820,共5页 Journal of System Simulation
基金 辽宁省自然科学基金项目(20092060)
关键词 强化学习 遗传算法 预测 生产周期 作业车间动态调度 reinforcement learning genetic algorithm prediction makespan dynamic job-shop scheduling
  • 相关文献

参考文献11

  • 1Sun D, Lin L. A dynamic job shop scheduling framework: A backward approach [J]. International Journal of Production Research (S0020-7543), 1994, 32(4): 967-985.
  • 2K M Mohanasundaram, K Natarajan, G Viswanathkumar, P Radhakrishnan, C Rajendran. Scheduling rules for dynamic shops that manufacture multi-level jobs [J]. Computers & Industrial Engineering (S0360-8352), 2003, 44(1): 119-131.
  • 3孙容磊,熊有伦,杜润生,丁汉.规则调度的迭代优化[J].计算机集成制造系统-CIMS,2002,8(7):546-550. 被引量:10
  • 4Sutton R S, A Barto G. Reinforcement Learning: An Introduction [M]. Cambridge, MA, USA: MIT Press, 1998.
  • 5Zhang Wei. Reinforcement Learning for Job-Shop Scheduling [D]. USA: Oregon State University, 1996.
  • 6Aydin M E., Oztemel E. Dynamic Job-Shop Scheduling using Reinforcement Learning Agents [J]. Robotics and Autonomous Systems (S0921-8890), 2000, 33(2): 169-178.
  • 7Wang Y-C, Usher J M. Application of reinforcement learning for agent-based production scheduling [J]. Engineering Applications of Artificial Intelligence (S0952-197), 2005, 18(1): 73-82.
  • 8Pettinger J E, Everson R M. Controlling genetic algorithms with reinforcement learning [C]// Proceedings of the Genetic and Evolutionary Computation Conference. San Francisco, CA, USA: Morgan Kaufmann, 2002: 692-697.
  • 9王本年,高阳,陈兆乾,谢俊元,陈世福.RLGA:一种基于强化学习机制的遗传算法[J].电子学报,2006,34(5):856-860. 被引量:9
  • 10潘燕春,周泓,冯允成,魏佳呈.同顺序Flow-shop问题的一种遗传强化学习算法[J].系统工程理论与实践,2007,27(9):115-122. 被引量:4

二级参考文献28

  • 1吕赐兴,朱云龙,尹朝万,于海斌.基于多Agent的敏捷生产调度中的协商策略[J].计算机集成制造系统,2006,12(4):579-584. 被引量:5
  • 2赵新宇,林作铨.合同网协议中的Agent可信度模型[J].计算机科学,2006,33(6):150-153. 被引量:15
  • 3Nix A E,Vose M D.Modeling genetic algorithms with Markov chains[J].Annals Mathematics and Artificial Intelligence,1992,5(1):77-88.
  • 4Sutton R S,Butro A G.Reinforcement Learning:Introduction[M].Cambridge,MA:MIT Press,1998.
  • 5Holland J H.A mathematical framework for studying learning in classifier systems[J].Physica,1986,22D(1-3):307-317.
  • 6Dorigo M,Bersini H.A comparison of Q-learning and classifier systems[A].Proceedings of From Animals to Animats,Third International Conference on SIMULATION of Adaptive Behavior[C].MA:MIT Press,1994.248-255.
  • 7Qi D,Sun R.A multi-agent system integrating reinforcement learning,bidding and genetic algorithms[J].Web Intelligence and Agent Systems,2003,1(3-4):187-202.
  • 8Eriksson A,Capi G,Doya K.Evolution of metaparamters in reinforcement learning algorithm[A].Proceedings of IEEE/RSJ International Conference on Intelligent Robots and Systems[C].Piscataway NJ:IEEE Press,2003.412-417.
  • 9Pettinger J E,Everson R M.Controlling genetic algorithms with reinforcement learning[A].Proceedings of the Genetic and Evolutionary Computation Conference[C].San Francisco,CA:Morgan Kaufmann,2002.692-692.
  • 10Watkins C J C H,Dayan P.Technical note:Q-learning[J].Machine Learning,1992,8(3-4):279-292.

共引文献23

同被引文献61

引证文献7

二级引证文献86

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部