Abstract
This paper studies how secondary users in opportunistic spectrum access should optimize their policies for sensing and accessing available spectrum. By introducing the concept of an event, the original problem, which has countably infinitely many states, is converted into a decision problem over a finite number of events. From a performance-sensitivity viewpoint, the difference in average transmission rate under different policies is analyzed, and a performance difference formula for event-based policies is derived. Based on this formula and a suitable approximation, an event-based policy iteration algorithm driven by sample paths is designed. A simulation example verifies the effectiveness of the proposed algorithm and the reasonableness of the approximation.
Source
Control and Decision (《控制与决策》)
EI
CSCD
Peking University Core Journals
2013, No. 11, pp. 1643-1649 (7 pages)
Funding
National Natural Science Foundation of China (61203039)
Programme of Introducing Talents of Discipline to Universities (B06002)
National Laboratory for Information Science and Technology (preparatory) (华信息科学与技术国家实验室(筹))
Keywords
opportunistic spectrum access
Markov decision process
event-based optimization
sensitivity-based approach
policy iteration