摘要
随着卫星对地观测进入多卫星、高分辨率、实时响应、全球观测的时代,卫星在轨数据处理已成为提高遥感数据处理实时性的主流手段之一。在卫星资源受限、数传链路信道资源受限、随遇观测任务具有不可预测性的场景下,进行数据处理任务实时调度具有较大挑战。首先,构建以最大化系统平均数据处理吞吐率为目标的优化问题模型。然后,提出一种在线的结合深度强化学习(deep reinforcement learning,DRL)的任务调度算法,采用DRL算法能够实时计算任务调度策略,选取拉格朗日对偶优化算法能够准确计算最优资源分配量。最后,通过仿真实验对算法有效性和数据处理吞吐率进行评价,结果表明算法能够收敛并接近最优解,相比于已有算法将数据处理吞吐率提高了约8%,且在卫星数据到达速率及卫星计算节点数量增大时具有一定扩展性。所提算法能够在最大化系统平均数据处理吞吐率的同时,保障高动态环境下任务队列长度及平均能耗稳定收敛。
As satellite earth observation enters an era of multiple satellites,high resolution,real-time response,and global observation,satellite on-orbit data processing has become one of the main methods to improve the real-time characteristic of remote sensing data processing.In scenarios where satellite resources are limited,data transmission link channels are constrained,and opportunistic observation tasks are unpredictable,real-time scheduling of data processing tasks faces significant challenges.An optimization problem model with the goal of maximizing the system’s average data processing throughput rate is firstly constructed.Secondly,an online task scheduling algorithm that combines deep reinforcement learning(DRL)is proposed.DRL algorithm enables real-time calculation of task scheduling strategies,and Lagrangian dual optimization algorithm can accurately computes the optimal resource allocation.Finally,simulation experiments are conducted to evaluate the effectiveness and data processing throughput rate of the proposed algorithm.Results show that the proposed algorithm can converge and approach the optimal solution,improving data processing throughput rate by approximately 8%compared to existing algorithms,and demonstrating scalability as the satellite data arrival speed and the number of satellite computing nodes increase.The proposed algorithm can maximize the average data processing throughput rate of the system while ensuring the stability and convergence of task queue length and average energy consumption in a high-dynamic environment.
作者
孟麟芝
孙小涓
胡玉新
高斌
孙国庆
牟文浩
MENG Linzhi;SUN Xiaojuan;HU Yuxin;GAO Bin;SUN Guoqing;MU Wenhao(Aerospace Information Research Institute,Chinese Academy of Sciences,Beijing 100190,China;Key Laboratory of Technology in Geo-spatial Information Processing and Application System,Beijing 100190,China;School of Electronic,Electrical and Communication Engineering,University of Chinese Academy of Sciences,Beijing 100049,China)
出处
《系统工程与电子技术》
北大核心
2025年第6期1917-1929,共13页
Systems Engineering and Electronics
关键词
卫星在轨处理
任务调度
资源分配
深度强化学习
李雅普诺夫优化
satellite on-orbit processing
task scheduling
resource allocation
deep reinforcement learning(DRL)
Lyapunov optimization