期刊文献+

异构无线网络中基于强化学习的频谱管理算法 被引量:1

Dynamic spectrum allocation algorithm for heterogeneous radio networks based on reinforcement learning
在线阅读 下载PDF
导出
摘要 提出了一种基于归一化径向基函数的自适应启发评价强化学习算法,用于异构无线网络系统中自主的动态频谱分配.该算法利用归一化径向基函数自适应构建状态空间,加快学习速度;利用自适应启发评价机制减少不必要的探索,提高学习效率.通过与无线环境交互,算法学会为不同接入网内的各个会话动态分配合适的频段.仿真结果表明,在同等网络条件下,该算法能获取更好的频谱利用率和服务质量,性能优于确定性频谱分配策略和一般的动态频谱分配策略. An adaptive heuristic critic(AHC) Reinforcement Learning algorithm is presented for the dynamic spectrum allocation in an autonomously deciding mode in heterogeneous radio networks based on the normalized radial basis function(NRBF).The algorithm accelerates the learning speed by utilizing the NRBF when constructing the state space,and improves the learning efficiency by using the AHC scheme to reduce the unnecessary exploration.Through interactions with the radio environment,it learns to allocate the proper frequency band for each session in multiple radio access networks.Simulation results show that the proposed algorithm can lead to a better spectrum efficiency and quality of service compared with to the fixed frequency planning scheme or general dynamic spectrum allocation policy.
出处 《西安电子科技大学学报》 EI CAS CSCD 北大核心 2011年第4期32-37,共6页 Journal of Xidian University
基金 国家杰出青年科学基金资助项目(60725105) 国家重点基础研究发展计划(973计划)课题资助项目(2009CB320404) 长江学者和创新团队发展计划资助项目(IRT0852) 国家自然科学基金资助项目(61072068 60872045) 中央高校基本科研业务费专项资助项目(JY10000901031)
关键词 异构无线网络 动态频谱分配 强化学习 归一化径向基函数 heterogeneous radio networks dynamic spectrum allocation reinforcement learning normalized radial basis function
  • 相关文献

参考文献11

  • 1Murata Y, Hasegawa M. The Architecture and a Business Model for the Open Heterogeneous Mobile Network [ J]. IEEE Communications Magazine, 2009, 47(5): 95-101.
  • 2邱晶,周正.认知无线电网络中的分布式动态频谱共享[J].北京邮电大学学报,2009,32(1):69-72. 被引量:18
  • 3Liu Yutao, Xu Guisen. A Novel Spectrum Allocation Mechanism Based on Graph Coloring and Bidding Theory [ C]// International Conference on CINC. Wuhan: IEEE Press, 2009: 155-158.
  • 4Niyato D, Hossain E. Dynamic Spectrum Access in IEEE 802. 22-based Cognitive Wireless Networks: a Game Theoretic Model for Competitive Spectrum Bidding and Pricing [ J]. IEEE Wireless Communications, 2009, 16(2): 16-23.
  • 5Versele C, Deblecker O. Multiobjective Optimal Design of High Frequency Transformers Using Genetic Algorithm [ C]//13th European Conference on Power Electronics and Applications. Barcelona: IEEE Press, 2009: 1-10.
  • 6Tsamis D, Alpcan T. Game Theoretic Rate Control for Mobile Devices [ C]//International Conference on Game Theory for Networks. Istanbul: IEEE Press, 2009: 646-652.
  • 7Williams R J. Simple Statistical Gradient-following Algorithms for Connectionist Reinforcement Learning [ J]. Machine Learning, 1992, 8(3): 229-256.
  • 8Bugmann G. Normalized Gaussian Radial Basis Function Networks [J]. Neurocomputing, 1995, 20( 1): 97-110.
  • 9黄炳强,Cao Guangyi,Fei Yanqiong,Li Jianhua.A new adaptive state space construction method for the mobile robot navigation[J].High Technology Letters,2008,14(2):182-186. 被引量:1
  • 10Sutton R S. Temporal Credit Assignment in Reinforcement Learning [ D]. Amherst: University of Massachusetts, 1984.

二级参考文献17

  • 1Shamik S, Mainak C. An economic framework for spectrum for spectrum allocation and service pricing with competitive wireless service providers [ C] // IEEE Proceeding of DySPAN. Dublin: [s. n. ], 2007: 89-98.
  • 2Huang Jianwei, Berry R A, Honig M L. Distributed interference compensation for wireless networks[J]. IEEE Journal on Selected Areas in Communications, 2006, 24 (5) : 1074-1084.
  • 3Nie Nie, Comaniciu C. Adaptive channel allocation spectrum etiquette for cognitive radio networks[C]//IEEE Proceeding of DySPAN. Baltimore: [s. n. ], 2005: 269- 278.
  • 4Neel J, Reed J, Gilles R. Game models for cognitive radio analysis [ C ] // SDR Forum Teehnical Conference. Phoenix: [s.n. ], 2004: 15-18.
  • 5Zhu Ji, Ray K J. Dynamic spectrum sharing: a game theoretical overview [ J ]. IEEE Communication Magazine, 2007, 45(5): 88-94.
  • 6Velagic J,Lacevic B,Penmicic B.A 3-level autonomous mobile robot navigation system designed by using reasoning/search approaches[].Robotics and Autonomous Systems.2006
  • 7Awad H A,Al-zorkany M A.Mobile robot navigation using local model networks[].International conference on computational intelhgence.2004
  • 8Barto AG,Sutton RS,Anderson CW.Neuronlike adaptive elements that can solve difficult learning control problems[].IEEE Transactions on Systems Man and Cybernetics.1983
  • 9H.Hagras,V.Callaghn,M.Colley.Learning and adaption of an intelligent mobile robot navigator operating in unstructured environment based on a novel online fuzzy-genetic system[].Fuzzy Sets and Systems.2004
  • 10A.E.Gaweda,M.K.Muezzinoglu,G.R.Aronoff.Individualization of pharmacological anemia management using reinforcement learning[].Neural Networks.2005

共引文献17

同被引文献1

引证文献1

二级引证文献1

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部