异构无线网络中基于强化学习的频谱管理算法被引量：1

Dynamic spectrum allocation algorithm for heterogeneous radio networks based on reinforcement learning

下载PDF

导出

摘要提出了一种基于归一化径向基函数的自适应启发评价强化学习算法,用于异构无线网络系统中自主的动态频谱分配.该算法利用归一化径向基函数自适应构建状态空间,加快学习速度;利用自适应启发评价机制减少不必要的探索,提高学习效率.通过与无线环境交互,算法学会为不同接入网内的各个会话动态分配合适的频段.仿真结果表明,在同等网络条件下,该算法能获取更好的频谱利用率和服务质量,性能优于确定性频谱分配策略和一般的动态频谱分配策略. An adaptive heuristic critic（AHC） Reinforcement Learning algorithm is presented for the dynamic spectrum allocation in an autonomously deciding mode in heterogeneous radio networks based on the normalized radial basis function（NRBF）.The algorithm accelerates the learning speed by utilizing the NRBF when constructing the state space,and improves the learning efficiency by using the AHC scheme to reduce the unnecessary exploration.Through interactions with the radio environment,it learns to allocate the proper frequency band for each session in multiple radio access networks.Simulation results show that the proposed algorithm can lead to a better spectrum efficiency and quality of service compared with to the fixed frequency planning scheme or general dynamic spectrum allocation policy.

作者张文柱邵丽娜

机构地区西安电子科技大学综合业务网理论及关键技术国家重点实验室

出处《西安电子科技大学学报》 EI CAS CSCD 北大核心 2011年第4期32-37,共6页 Journal of Xidian University

基金国家杰出青年科学基金资助项目(60725105) 国家重点基础研究发展计划(973计划)课题资助项目(2009CB320404) 长江学者和创新团队发展计划资助项目(IRT0852) 国家自然科学基金资助项目(61072068 60872045) 中央高校基本科研业务费专项资助项目(JY10000901031)

关键词异构无线网络动态频谱分配强化学习归一化径向基函数 heterogeneous radio networks dynamic spectrum allocation reinforcement learning normalized radial basis function

分类号 TP393 [自动化与计算机技术—计算机应用技术]

引文网络
相关文献

参考文献11

1Murata Y, Hasegawa M. The Architecture and a Business Model for the Open Heterogeneous Mobile Network [ J]. IEEE Communications Magazine, 2009, 47(5): 95-101.
2邱晶,周正.认知无线电网络中的分布式动态频谱共享[J].北京邮电大学学报,2009,32(1):69-72. 被引量：18
3Liu Yutao, Xu Guisen. A Novel Spectrum Allocation Mechanism Based on Graph Coloring and Bidding Theory [ C]// International Conference on CINC. Wuhan: IEEE Press, 2009: 155-158.
4Niyato D, Hossain E. Dynamic Spectrum Access in IEEE 802. 22-based Cognitive Wireless Networks: a Game Theoretic Model for Competitive Spectrum Bidding and Pricing [ J]. IEEE Wireless Communications, 2009, 16(2): 16-23.
5Versele C, Deblecker O. Multiobjective Optimal Design of High Frequency Transformers Using Genetic Algorithm [ C]//13th European Conference on Power Electronics and Applications. Barcelona: IEEE Press, 2009: 1-10.
6Tsamis D, Alpcan T. Game Theoretic Rate Control for Mobile Devices [ C]//International Conference on Game Theory for Networks. Istanbul: IEEE Press, 2009: 646-652.
7Williams R J. Simple Statistical Gradient-following Algorithms for Connectionist Reinforcement Learning [ J]. Machine Learning, 1992, 8(3): 229-256.
8Bugmann G. Normalized Gaussian Radial Basis Function Networks [J]. Neurocomputing, 1995, 20( 1): 97-110.
9黄炳强,Cao Guangyi,Fei Yanqiong,Li Jianhua.A new adaptive state space construction method for the mobile robot navigation[J].High Technology Letters,2008,14(2):182-186. 被引量：1
10Sutton R S. Temporal Credit Assignment in Reinforcement Learning [ D]. Amherst: University of Massachusetts, 1984.

二级参考文献17

1Shamik S, Mainak C. An economic framework for spectrum for spectrum allocation and service pricing with competitive wireless service providers [ C] // IEEE Proceeding of DySPAN. Dublin: [s. n. ], 2007: 89-98.
2Huang Jianwei, Berry R A, Honig M L. Distributed interference compensation for wireless networks[J]. IEEE Journal on Selected Areas in Communications, 2006, 24 (5) : 1074-1084.
3Nie Nie, Comaniciu C. Adaptive channel allocation spectrum etiquette for cognitive radio networks[C]//IEEE Proceeding of DySPAN. Baltimore: [s. n. ], 2005: 269- 278.
4Neel J, Reed J, Gilles R. Game models for cognitive radio analysis [ C ] // SDR Forum Teehnical Conference. Phoenix: [s.n. ], 2004: 15-18.
5Zhu Ji, Ray K J. Dynamic spectrum sharing: a game theoretical overview [ J ]. IEEE Communication Magazine, 2007, 45(5): 88-94.
6Velagic J,Lacevic B,Penmicic B.A 3-level autonomous mobile robot navigation system designed by using reasoning/search approaches[].Robotics and Autonomous Systems.2006
7Awad H A,Al-zorkany M A.Mobile robot navigation using local model networks[].International conference on computational intelhgence.2004
8Barto AG,Sutton RS,Anderson CW.Neuronlike adaptive elements that can solve difficult learning control problems[].IEEE Transactions on Systems Man and Cybernetics.1983
9H.Hagras,V.Callaghn,M.Colley.Learning and adaption of an intelligent mobile robot navigator operating in unstructured environment based on a novel online fuzzy-genetic system[].Fuzzy Sets and Systems.2004
10A.E.Gaweda,M.K.Muezzinoglu,G.R.Aronoff.Individualization of pharmacological anemia management using reinforcement learning[].Neural Networks.2005

共引文献17

1唐伦,陈前斌,曾孝平.基于POMDP强化学习的动态频谱分配算法[J].北京邮电大学学报,2009,32(6):125-129. 被引量：3
2刘全,高俊,关建新,郭云玮.认知无线电网络链路层关键技术的研究进展[J].电讯技术,2010,50(3):90-98. 被引量：6
3马良,朱琦.认知无线电系统中的协作频谱共享博弈[J].应用科学学报,2011,29(1):1-8. 被引量：1
4惠蕾放,李建东,肖丽媛,丁汉清.无线网络中兼顾业务类型及公平性的无线资源共享问题研究[J].通信学报,2011,32(4):39-46. 被引量：5
5陈宏滨,赵峰,邓小芳.基于社交网络的认知无线电频谱共享模型[J].计算机应用研究,2011,28(8):3083-3085.
6谢显中,赵晖.感知无线电系统中主用户的中断概率分析[J].重庆邮电大学学报（自然科学版）,2011,23(4):400-405.
7王首峰,李凡,王卫东,张英海.认知无线电快速控制信道选择算法[J].北京邮电大学学报,2011,34(5):80-85. 被引量：1
8李仙茂,张东屹,刘晓东.认知无线网络区域内集中方式频谱分配[J].舰船电子工程,2012,32(1):10-11. 被引量：2
9薛建彬.一种基于剩余能量的无线网络资源分配算法研究[J].系统仿真学报,2012,24(5):1021-1025.
10曾德睦,张琳.基于Logistic回归法信用预测的频谱交易定价策略[J].数据通信,2012(5):5-8.

同被引文献1

1马卓然,马建峰,苗银宾,孙聪.无人机网络中基于状态迁移的访问控制模型[J].西安电子科技大学学报,2018,45(6):44-50. 被引量：4

引证文献1

1张英,韦闽峰,王世会,陶磊岩,曹健,张兴.飞行器强化学习多模在轨控制[J].西安电子科技大学学报,2020,47(2):75-82. 被引量：1

二级引证文献1

1王旭,商尔科,苗启广,戴斌,刘泱.HCPP:一种数据高效的分层车辆跟随方法[J].西安电子科技大学学报,2023,50(6):161-171.

1彭鑫,王东,李仁发,曾凡仔,付彬.多接口车载自组织网络频谱分配算法研究[J].计算机研究与发展,2013,50(4):750-757. 被引量：2
2王东,彭鑫,李仁发,谢勇.车载自组网动态频谱分配技术研究进展[J].计算机工程与应用,2012,48(4):9-12. 被引量：4
3贾杰,王闯,张朝阳,陈剑.认知无线电网络中基于图着色的动态频谱分配[J].东北大学学报（自然科学版）,2012,33(3):336-339. 被引量：11
4周来秀,邓曙光,杨冰.无线传感器网络动态频谱分配方案[J].计算机工程,2010,36(14):99-101. 被引量：2
5侯炜,张林,山秀明,王耀希.多用户NC-OFDM动态频谱分配策略研究[J].计算机应用研究,2010,27(11):4201-4204.
6李晓月,戴冬.基于满意度的动态频谱分配问题研究[J].河南机电高等专科学校学报,2015,23(1):26-32.
7刘觉夫,杨将,朱丙虎,胡静.基于图型博弈的动态频谱分配算法[J].计算机工程与设计,2016,37(6):1464-1470.
8张文柱,王凌云.基于单频段多赢家拍卖的动态频谱分配[J].通信学报,2012,33(2):1-6. 被引量：9
9徐昌彪,刘雪亮,鲜永菊.基于博弈论的动态频谱分配技术研究[J].电子技术应用,2012,38(4):102-105. 被引量：4
10朱翠涛,徐昭利.基于信道选择和自适应功率控制的动态频谱分配[J].中南民族大学学报（自然科学版）,2011,30(2):80-83.

西安电子科技大学学报

2011年第4期

浏览历史

内容加载中请稍等...

异构无线网络中基于强化学习的频谱管理算法被引量：1

参考文献11

二级参考文献17

共引文献17

同被引文献1

引证文献1

二级引证文献1

相关作者

相关机构

相关主题

浏览历史

异构无线网络中基于强化学习的频谱管理算法 被引量：1

参考文献11

二级参考文献17

共引文献17

同被引文献1

引证文献1

二级引证文献1

相关作者

相关机构

相关主题

浏览历史

异构无线网络中基于强化学习的频谱管理算法被引量：1