期刊文献+

基于神经网络集成的强化学习算法系统设计 被引量:2

Design on a Reinforcement Learning Algorithm Based on Neural Network Ensemble
在线阅读 下载PDF
导出
摘要 BP神经网络在目前的非线性系统中应用广泛,但是作为有导师的学习系统,BP神经网络必须要求提供相关的经验数据才能正常运行,这对一般系统来说是非常麻烦和不现实的。对此文章提出了一种基于神经网络集成的强化学习BP算法,通过强化学习体系来实现体统的自学习,通过网络集成来达到初始数据的预处理,提高系统的泛化能力,并在实际应用中取得较好的效果。 BP neural network has been used in nonlinear system controller widely.But as a supervised training algorithm,it requires experiential data to be trained.But in some system such data cannot be got.So this paper provides the optimization on a reinforcement leaming algorithm based on neural network ensemble. Reinforcement leaming is unsupervised and on-line.Neural network ensemble can significantly improve the generalization ability of leaming system. The method is tested and the expected results are obtained.
出处 《计算机工程与应用》 CSCD 北大核心 2006年第12期97-99,共3页 Computer Engineering and Applications
基金 燕山大学博士基金资助项目(编号:2004013)
关键词 神经网络集成 BP神经网络 强化学习 RBP模型 Neural Network ensemble,BP Neural Network,reinforcement learning, Reinforcement Baek-Propagation model
  • 相关文献

参考文献6

二级参考文献23

  • 1文新辉,陈开周.一种基于神经网络的非线性时间序列模型[J].西安电子科技大学学报,1994,21(1):73-78. 被引量:10
  • 2叶中行,顾立庭.混合认知系统及其在股市分析上的应用[J].上海交通大学学报,1995,29(2):92-99. 被引量:2
  • 3从爽.面向MATLAB工具箱的神经网络理论与应用[M].合肥:中国科技大学出版社,1998.59-60.
  • 4S Muggleton. Inductive logic programrnmg. In: S Muggleton ed. Inductive Logic Programming, London: Academic Press, 1992. 3-27.
  • 5Hong J. AEI: An extension matrix approximate method for the general covering problem. International Journal of Computer and Information Sciences, 1985, 14(6): 421-437.
  • 6J R Quinlan. CA. 5 : Programs for Machine Learning. San Mateo, CA: Morgan Kaufmarm, 1993.
  • 7M W Craven, J W Shavlik. Extracting tree-structured representations of trained neural networks. In: D Touretzky, M Mazer, M Hasselmo ecls. Advances in Neural Information Processing Systems 8, Cambridge, MA.. MIT Press, 1996.24 - 30.
  • 8R Setiono. Extracting rules from neural networks by pruning and hidden-unlt splitting. Neural Computation, 1997, 9 ( 1 ) : 205 -225.
  • 9R Kerber. Chi-Merge: Diseretization of numerie attributes. In: Proe of the 10th National Conf on Artifieisl Intelligence, Menlo Park, CA: AAAI Press, 1992. 123-128.
  • 10C Blake, E Keogh, C J Merz.UCI regository of machine learming databases.1998.http://www. its. uci. edu/- mlearn/MLRepository.html.

共引文献314

同被引文献33

引证文献2

二级引证文献1

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部