期刊文献+

基于半自治agent的profit-sharing增强学习方法研究 被引量:3

Research of profit-sharing reinforcement learning method based on semi-autonomous agent
在线阅读 下载PDF
导出
摘要 在基于半自治agent的系统中应用profit-sharing增强学习方法,并与基于动态规划的Q-learning增强学习方法进行比较,在不确定因素较多的动态环境中,当系统状态变化不是一个马尔科夫过程时profit-sharing方法具有很大优势。根据半自治agent中半自治的特性——受制性,提出了一种面向基于半自治agent的增强学习模型,以战场仿真中安全隐蔽的寻找模型为实例对基于半自治agent的profit-sharing增强学习模型进行了试验分析。 We exert the profit-sharing reinforcement learning method into the semi-autonomous agent system,and compare it with the other reinforce learning method Q-learning.Profit-sharing method is more robust and fit for the dynamic environment which includes many uncertain factors,especially in the partial MDPs(Markov Decision Processes) environment.Facing the semi - autonomous property of the agent,we propose an improving learning method of profit-sharing in the semi-autonomous agent system and test it in a combat simulation environment that finds the safety hidden space in battlefield.At last we contract and analyze these methods to the others.
出处 《计算机工程与应用》 CSCD 北大核心 2007年第15期72-75,97,共5页 Computer Engineering and Applications
基金 国家部委"十五"预研项目(the Pre- Research Project of the "Tenth Five- Year- Plan"of China) 。
关键词 增强学习 半自治agent PROFIT-SHARING Q-LEARNING reinforcement learning semi-autonomous agent profit-sharing Q-learning
  • 相关文献

参考文献11

  • 1蔡庆生,张波.一种基于Agent团队的强化学习模型与应用研究[J].计算机研究与发展,2000,37(9):1087-1093. 被引量:31
  • 2Sutton R S.Learning to predict by the methods of temporal differences[J].Machine learning,1988,3:9-44.
  • 3Watkins D J H,Dayan P.Technical notes:Q-learning[J].Machine Learning,1992,8:55 -68.
  • 4Grefenstette J J.Credit assignment in rule discovery systems based on genetic algorithms[J].Machine Learning,1988,3:225-245.
  • 5杨克巍,王正元,谭跃进.基于DEVS形式化描述的半自治Agent建模研究[C]//CAAI-10:北京:北京邮电大学出版社,2003:177-182.
  • 6Kaelbling L,Littman M L,Moore A W.Reinforcement learning:a survey[J].Journal of Artificial Intelligence Research,1996,4:237-285.
  • 7Arai S,Sycara K,Payne T R.Experience-based reinforcement learning to acquire effective behavior in a multiagent domain[C]//Proceedings of the 6th Pacific Rim International Conference on Artificial Intelligence.
  • 8Whitehead S D,Balland D H.Active perception and reinforcement learning[C]//Proceedings of the 7th International Conference on Machine Learning,1990:162-169.
  • 9Yang Ke-wei,Wang Zheng-yuan,Tan Yue-jin.Study and application of semiautonomous agent communication model[C]//The Fourth International Conference on System Science and System Engineering.Hong Kong:Global-Link Publisher,2003:288-294.
  • 10李宁,高阳,陆鑫,陈世福.一种基于强化学习的学习Agent[J].计算机研究与发展,2001,38(9):1051-1056. 被引量:26

二级参考文献3

共引文献51

同被引文献15

引证文献3

二级引证文献11

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部