期刊文献+
共找到48篇文章
< 1 2 3 >
每页显示 20 50 100
Game-theoretic maneuvering strategies for orbital inspection of non-cooperative spacecraft in cislunar space
1
作者 Hongyu HAN Zhaohui DANG 《Chinese Journal of Aeronautics》 2026年第1期385-397,共13页
The problem of maneuvering for a servicing spacecraft(inspector)to inspect a noncooperative spacecraft(evader)in cislunar space is investigated in this paper.The evader,which may be a malfunctioning or uncontrolled sa... The problem of maneuvering for a servicing spacecraft(inspector)to inspect a noncooperative spacecraft(evader)in cislunar space is investigated in this paper.The evader,which may be a malfunctioning or uncontrolled satellite,introduces uncertainties due to its potential maneuvering capabilities.To address this challenge,the scenario is modeled as a special orbital game,incorporating the unique complexities of the cislunar environment.A variable-duration,turn-based inspection and anti-inspection game model is designed.The model defines both players'rules,constraints,and victory conditions,providing a framework for non-cooperative inspection.Strategies for both players are developed and validated based on their dynamical properties.The inspector's strategy integrates two-body Lambert transfers with shooting methods,while the evader's strategy aims to maximize the inspector's fuel consumption.Simulation results show that the evader's optimal strategy involves deliberate fluctuations in its lunar periapsis altitude,with the inspector's requiredΔV up to eight times greater than the evader's.The impact of game constraints is evaluated,and the effectiveness of deploying the inspector in low lunar orbit is compared with the inspector at the Earth-Moon Lagrange point L1.The strengths and weaknesses of both are shown.These findings provide valuable insights for future orbital servicing and orbital games. 展开更多
关键词 Cislunar Impulsive maneuver Non-cooperative target orbital game orbital transfer
原文传递
A model predictive Stackelberg solution to orbital pursuit-evasion game 被引量:1
2
作者 Yuchen LIU Chaoyong LI +1 位作者 Jun JIANG Yonghe ZHANG 《Chinese Journal of Aeronautics》 2025年第2期244-255,共12页
In this paper,we investigate analytical numerical iterative strategies for the pursuit-evasion game involving spacecraft with leader–follower information.In the proposed problem,the interplay between two spacecraft g... In this paper,we investigate analytical numerical iterative strategies for the pursuit-evasion game involving spacecraft with leader–follower information.In the proposed problem,the interplay between two spacecraft gives rise to a dynamic and real-time game,complicated further by the presence of perturbation.The primary challenge lies in crafting control strategies that are both efficient and applicable to real-time game problems within a nonlinear system.To overcome this challenge,we introduce the model prediction and iterative correction technique proposed in model predictive static programming,enabling the generation of strategies in analytical iterative form for nonlinear systems.Subsequently,we proceed by integrating this model predictive framework into a simplified Stackelberg equilibrium formulation,tailored to address the practical complexities of leader–follower pursuit-evasion scenarios.Simulation results validate the effectiveness and exceptional efficiency of the proposed solution within a receding horizon framework. 展开更多
关键词 Model predictive control Pursuit-evasion problem Leader-follower game Stackelberg game orbital game
原文传递
Impulsive thrust strategy for orbital pursuit-evasion games based on impulse-like constraint 被引量:1
3
作者 Hongbo WANG Yao ZHANG +1 位作者 Hao LIU Kunpeng ZHANG 《Chinese Journal of Aeronautics》 2025年第1期520-536,共17页
This paper proposes a novel impulsive thrust strategy guided by optimal continuous thrust strategy to address two-player orbital pursuit-evasion game under impulsive thrust control.The strategy seeks to enhance the in... This paper proposes a novel impulsive thrust strategy guided by optimal continuous thrust strategy to address two-player orbital pursuit-evasion game under impulsive thrust control.The strategy seeks to enhance the interpretability of impulsive thrust strategy by integrating it within the framework of differential game in traditional continuous systems.First,this paper introduces an impulse-like constraint,with periodical changes in thrust amplitude,to characterize the impulsive thrust control.Then,the game with the impulse-like constraint is converted into the two-point boundary value problem,which is solved by the combined shooting and deep learning method proposed in this paper.Deep learning and numerical optimization are employed to obtain the guesses for unknown terminal adjoint variables and the game terminal time.Subsequently,the accurate values are solved by the shooting method to yield the optimal continuous thrust strategy with the impulse-like constraint.Finally,the shooting method is iteratively employed at each impulse decision moment to derive the impulsive thrust strategy guided by the optimal continuous thrust strategy.Numerical examples demonstrate the convergence of the combined shooting and deep learning method,even if the strongly nonlinear impulse-like constraint is introduced.The effect of the impulsive thrust strategy guided by the optimal continuous thrust strategy is also discussed. 展开更多
关键词 orbital pursuit-evasion game Differential game Impulsive thrust Deep learning Shooting method
原文传递
Spacecraft intelligent orbital game technology:A review
4
作者 Xuyang CAO Xin NING +7 位作者 Suyi LIU Xiaobin LIAN Hongyan WANG Gaopeng ZHANG Feng CHEN Jiye ZHANG Bingzan LIU Zhansheng CHEN 《Chinese Journal of Aeronautics》 2025年第6期6-25,共20页
In recent years,the availability of space orbital resources has been declining,and the increasing frequency of spacecraft close approach events has heightened the urgency for enhanced space security measures.This pape... In recent years,the availability of space orbital resources has been declining,and the increasing frequency of spacecraft close approach events has heightened the urgency for enhanced space security measures.This paper establishes a comprehensive framework for intelligent orbital game technology in space,encompassing four core technologies:threat perception of noncooperative targets,intent recognition,situation assessment,and intelligent orbital game countermeasures.The concepts of multi-turn,multi-round and multi-match in space orbital games are defined,clarifying the core technological requirements for intelligent space orbital games and establishing a cohesive technological framework.Subsequently,the current status of research on these four core technologies is investigated.The challenges faced in the existing research are analyzed,and potential solutions for future studies are proposed.This paper aims to provide readers with a thorough understanding of the latest advancements in space intelligent orbital game technology.along with insights into the future directions and challenges in this field. 展开更多
关键词 Intelligent orbital game Threat perception Situation assessment Intent recognition game confrontation
原文传递
Mode-switching cooperative defense strategy for the orbit pursuit-evasion-defense game
5
作者 Yongshang Wei Tianxi Liu Cheng Wei 《Defence Technology(防务技术)》 2025年第2期272-286,共15页
This paper presents a mode-switching collaborative defense strategy for spacecraft pursuit-evasiondefense scenarios.In these scenarios,the pursuer tries to avoid the defender while capturing the evader,while the evade... This paper presents a mode-switching collaborative defense strategy for spacecraft pursuit-evasiondefense scenarios.In these scenarios,the pursuer tries to avoid the defender while capturing the evader,while the evader and defender form an alliance to prevent the pursuer from achieving its goal.First,the behavioral modes of the pursuer,including attack and avoidance modes,were established using differential game theory.These modes are then recognized by an interactive multiple model-matching algorithm(IMM),that uses several smooth variable structure filters to match the modes of the pursuer and update their probabilities in real time.Based on the linear-quadratic optimization theory,combined with the results of strategy identification,a two-way cooperative optimal strategy for the defender and evader is proposed,where the evader aids the defender to intercept the pursuer by performing luring maneuvers.Simulation results show that the interactive multi-model algorithm based on several smooth variable structure filters perform well in the strategy identification of the pursuer,and the cooperative defense strategy based on strategy identification has good interception performance when facing pursuers,who are able to flexibly adjust their game objectives. 展开更多
关键词 Cooperative policy Differential games orbit pursuit-evasion-defense game Mod recognition
在线阅读 下载PDF
Variable reward function-driven strategies for impulsive orbital attack-defense games under multiple constraints and victory conditions
6
作者 Liran Zhao Sihan Xu +1 位作者 Qinbo Sun Zhaohui Dang 《Defence Technology(防务技术)》 2025年第9期159-183,共25页
This paper investigates impulsive orbital attack-defense(AD)games under multiple constraints and victory conditions,involving three spacecraft:attacker,target,and defender.In the AD scenario,the attacker aims to breac... This paper investigates impulsive orbital attack-defense(AD)games under multiple constraints and victory conditions,involving three spacecraft:attacker,target,and defender.In the AD scenario,the attacker aims to breach the defender's interception to rendezvous with the target,while the defender seeks to protect the target by blocking or actively pursuing the attacker.Four different maneuvering constraints and five potential game outcomes are incorporated to more accurately model AD game problems and increase complexity,thereby reducing the effectiveness of traditional methods such as differential games and game-tree searches.To address these challenges,this study proposes a multiagent deep reinforcement learning solution with variable reward functions.Two attack strategies,Direct attack(DA)and Bypass attack(BA),are developed for the attacker,each focusing on different mission priorities.Similarly,two defense strategies,Direct interdiction(DI)and Collinear interdiction(CI),are designed for the defender,each optimizing specific defensive actions through tailored reward functions.Each reward function incorporates both process rewards(e.g.,distance and angle)and outcome rewards,derived from physical principles and validated via geometric analysis.Extensive simulations of four strategy confrontations demonstrate average defensive success rates of 75%for DI vs.DA,40%for DI vs.BA,80%for CI vs.DA,and 70%for CI vs.BA.Results indicate that CI outperforms DI for defenders,while BA outperforms DA for attackers.Moreover,defenders achieve their objectives more effectively under identical maneuvering capabilities.Trajectory evolution analyses further illustrate the effectiveness of the proposed variable reward function-driven strategies.These strategies and analyses offer valuable guidance for practical orbital defense scenarios and lay a foundation for future multi-agent game research. 展开更多
关键词 orbital attack-defense game Impulsive maneuver Multi-agent deep reinforcement learning Reward function design
在线阅读 下载PDF
Nash-equilibrium strategies of orbital TargetAttacker-Defender game with a non-maneuvering target 被引量:6
7
作者 Yifeng LI Xi LIANG Zhaohui DANG 《Chinese Journal of Aeronautics》 SCIE EI CAS CSCD 2024年第10期365-379,共15页
This study investigates the orbital Target-Attacker-Defender(TAD)game problem in the context of space missions.In this game,the Attacker and the Defender compete for a Target that is unable to maneuver due to its orig... This study investigates the orbital Target-Attacker-Defender(TAD)game problem in the context of space missions.In this game,the Attacker and the Defender compete for a Target that is unable to maneuver due to its original mission constraints.This paper establishes three TAD game models based on the thrust output capabilities:unconstrained thrust output,thrust constrained by an upper bound,and fixed thrust magnitude.These models are then solved using differential game theory to obtain Nash equilibrium solutions for the game problems,and the correctness and effectiveness of the solution methods are verified through simulations.Furthermore,an analysis of the winning mechanisms of the game is conducted,identifying key factors that influence the game’s outcomes,including weight coefficients in payoffs,the maximum thrust acceleration limit,and the initial game state.Considering the unique characteristics of space missions,a specific focus is given to the analysis of the Defender’s initial states in the hovering formation and in-plane circling formation,revealing overall success patterns for defense strategies from these two formations.In summary,this study provides valuable insights into the control strategies and winning mechanisms of orbital TAD games,deepening our understanding of these games and offering practical guidance to improve success rates in real-world scenarios. 展开更多
关键词 Target-Attacker-Defender problem Differential game TPBVP Continuous thrust orbital game
原文传递
Self-play training and analysis for GEO inspection game with modular actions
8
作者 ZHOU Rui ZHONG Weichao +1 位作者 LI Wenlong ZHANG Hao 《Journal of Systems Engineering and Electronics》 2025年第5期1353-1373,共21页
This paper comprehensively explores the impulsive on-orbit inspection game problem utilizing reinforcement learning and game training methods.The purpose of the spacecraft is to inspect the entire surface of a non-coo... This paper comprehensively explores the impulsive on-orbit inspection game problem utilizing reinforcement learning and game training methods.The purpose of the spacecraft is to inspect the entire surface of a non-cooperative target with active maneuverability in front lighting.First,the impulsive orbital game problem is formulated as a turn-based sequential game problem.Second,several typical relative orbit transfers are encapsulated into modules to construct a parameterized action space containing discrete modules and continuous parameters,and multi-pass deep Q-networks(MPDQN)algorithm is used to implement autonomous decision-making.Then,a curriculum learning method is used to gradually increase the difficulty of the training scenario.The backtracking proportional self-play training framework is used to enhance the agent’s ability to defeat inconsistent strategies by building a pool of opponents.The behavior variations of the agents during training indicate that the intelligent game system gradually evolves towards an equilibrium situation.The restraint relations between the agents show that the agents steadily improve the strategy.The influence of various factors on game results is tested. 展开更多
关键词 impulsive orbital game inspection mission turnbased reinforcement learning modular action self-play
在线阅读 下载PDF
基于最优脉冲的空间抵近行为模式可达性分析
9
作者 方嘉卉 黎克波 梁彦刚 《系统工程与电子技术》 北大核心 2026年第2期652-659,共8页
在轨道博弈中,若能实现对空间抵近的非合作目标的行为模式可达性预测,则可在博弈中占据有利形势。针对空间抵近的非合作目标的行为模式可达性问题,提出基于最优脉冲的空间抵近行为模式可达性分析方法。根据目标的机动能力,通过最优脉冲... 在轨道博弈中,若能实现对空间抵近的非合作目标的行为模式可达性预测,则可在博弈中占据有利形势。针对空间抵近的非合作目标的行为模式可达性问题,提出基于最优脉冲的空间抵近行为模式可达性分析方法。根据目标的机动能力,通过最优脉冲估计,判断目标是否具备完成各类行为模式的能力,实现对空间抵近行为模式可达性分析。结合更多情报信息,航天器即可提前进行行动决策,占领博弈先机。仿真实验结果表明,对于拦截、掠飞、绕飞、悬停4种行为模式,本文提出的空间抵近行为模式可达性分析方法能够比较准确地分析出抵近目标未来实现各类行为模式的可能性。 展开更多
关键词 轨道博弈 非合作目标 行为模式可达性分析 最优脉冲估计
在线阅读 下载PDF
基于证据理论的多航天器轨道博弈意图识别方法
10
作者 王鸿博 张尧 +1 位作者 李谋 张昆鹏 《宇航学报》 北大核心 2026年第1期61-72,共12页
针对多航天器轨道博弈中的意图识别问题,结合深度学习与证据理论提出“参数预测-博弈行为-博弈意图”的意图识别方法。首先,基于专家经验构建目标航天器集群的多种博弈行为与意图;其次,设计用于提取时序轨道状态特征的深度神经网络,以... 针对多航天器轨道博弈中的意图识别问题,结合深度学习与证据理论提出“参数预测-博弈行为-博弈意图”的意图识别方法。首先,基于专家经验构建目标航天器集群的多种博弈行为与意图;其次,设计用于提取时序轨道状态特征的深度神经网络,以预测目标航天器集群的关键行为参数;随后,基于证据理论,建立行为参数表征下的航天器博弈行为推理过程;然后,设计基于条件概率表的博弈意图推理策略,通过博弈行为推理目标航天器集群的博弈意图;最后,通过对比实验证明,该意图识别方法在具备较高意图识别准确率的同时,可有效提升意图识别过程的可解释性。 展开更多
关键词 航天器集群 轨道博弈 意图识别 证据理论 深度学习
在线阅读 下载PDF
基于行为预测和策略融合的轨道博弈决策方法
11
作者 王英杰 袁利 +1 位作者 黄煌 耿远卓 《自动化学报》 北大核心 2026年第3期451-462,共12页
轨道追逃博弈中逃逸策略的高度未知性与行为多样性,给追踪策略的泛化能力带来严峻挑战.深度强化学习虽可提升追踪星的博弈效能,但当逃逸策略偏离训练分布时,策略网络易产生次优甚至失效的决策.为此,提出一种基于行为预测和策略融合的轨... 轨道追逃博弈中逃逸策略的高度未知性与行为多样性,给追踪策略的泛化能力带来严峻挑战.深度强化学习虽可提升追踪星的博弈效能,但当逃逸策略偏离训练分布时,策略网络易产生次优甚至失效的决策.为此,提出一种基于行为预测和策略融合的轨道博弈决策方法.在训练阶段,首先采用“预测制导+人工势场法”构建多样化逃逸策略集.随后在传统演员-评论家训练框架的基础上,通过引入预测网络构建预测器-演员-评论家算法,针对每类逃逸策略分别训练以获得对应的追踪子策略.其中预测网络用于估计逃逸星动作,并通过预测结果与真实动作的相似性衡量子策略与未知逃逸策略的匹配度.在执行阶段,策略融合器以逃逸星历史动作与各追踪子策略的预测结果为输入,动态计算匹配度并选择最优子策略进行博弈决策.实验结果表明,预测网络能有效评估追踪子策略对未知逃逸策略的适应性,策略融合器可显著提升追踪星面对多样化逃逸策略的泛化能力与可靠性. 展开更多
关键词 轨道追逃博弈 深度强化学习 行为预测 策略融合
在线阅读 下载PDF
计算轨道追逃闭环均衡的有限差分方法
12
作者 杨傅云翔 杨乐平 柴华 《国防科技大学学报》 北大核心 2026年第1期247-261,共15页
针对近距离轨道追逃闭环均衡构造问题,提出一种综合运用Bellman最优性原理、有限差分法和插值技术的计算方法。推导视线坐标系下的博弈系统降维动力学模型,建立近距离轨道追逃博弈模型,降低系统状态空间维度;基于Bellman最优性原理,重... 针对近距离轨道追逃闭环均衡构造问题,提出一种综合运用Bellman最优性原理、有限差分法和插值技术的计算方法。推导视线坐标系下的博弈系统降维动力学模型,建立近距离轨道追逃博弈模型,降低系统状态空间维度;基于Bellman最优性原理,重构原问题为哈密顿-雅可比-艾萨克偏微分方程终值问题,通过逆向分析实现同时处理多组博弈场景;利用Cartesian网格离散状态空间,使用有限差分法计算均衡受动力学驱动的动态演化过程,分析博弈态势;基于控制与均衡空间梯度的关系,使用数值插值构造闭环控制函数;通过数值仿真验证了方法的有效性。 展开更多
关键词 轨道追逃 Bellman最优性原理 有限差分法 微分对策
在线阅读 下载PDF
基于博弈论的观测卫星集群轨道布设
13
作者 陈丽媛 冉德超 +1 位作者 季秋桐 秦同 《控制工程》 北大核心 2026年第3期420-426,共7页
为提高卫星集群的应用效能,针对卫星集群在进行轨道博弈的同时需执行观测任务的问题,提出一种考虑任务协同的轨道博弈方法。首先,通过分析非合作关系的双卫星集群的观测收益、干扰收益和损失代价,利用非合作博弈的思想构建了卫星集群轨... 为提高卫星集群的应用效能,针对卫星集群在进行轨道博弈的同时需执行观测任务的问题,提出一种考虑任务协同的轨道博弈方法。首先,通过分析非合作关系的双卫星集群的观测收益、干扰收益和损失代价,利用非合作博弈的思想构建了卫星集群轨道博弈模型。其次,基于三个指标函数,建立非合作双卫星集群的收益矩阵,进而求解该双矩阵的博弈问题,得到其纳什均衡,并据此确定双方集群在均衡意义下的最优轨道布设分布。此时,非合作双卫星集群都无法通过改变己方卫星集群的轨道位置来获得更大收益。最后,仿真实验表明,与一般策略相比,利用博弈思想选取的轨道布设策略能够提高卫星集群的任务执行能力和应用效能。 展开更多
关键词 卫星集群 区域观测 轨道博弈 纳什均衡
原文传递
一种基于人工势场的多航天器轨道博弈方法
14
作者 李子晗 卢山 +1 位作者 侯月阳 刘春阳 《上海航天(中英文)》 2026年第1期149-158,共10页
针对近地轨道(LEO)空间中的多航天器轨道博弈问题,提出了一种基于人工势场的博弈方法。首先,博弈双方均采用势函数策略进行博弈,针对逃逸器设计基于多源斥力场和速度保持势的综合规避算法,采用追踪器设计预测势场与协同斥力场结合的分... 针对近地轨道(LEO)空间中的多航天器轨道博弈问题,提出了一种基于人工势场的博弈方法。首先,博弈双方均采用势函数策略进行博弈,针对逃逸器设计基于多源斥力场和速度保持势的综合规避算法,采用追踪器设计预测势场与协同斥力场结合的分布式包围策略;然后其逃逸器策略不变,为追踪器设计融合比例导引(PNG)的增强型接近策略,强化末段追击精度。仿真验证表明:势函数法可实现多追踪器对逃逸器的初步包围,但因缺乏末段指向性导致“飞越”,未能捕获逃逸器;增强型追击策略可以提高追踪器末端接近能力,使得追踪器在1236 s成功捕获逃逸器。 展开更多
关键词 追逃博弈 人工势场(APF) 制导策略 多星轨道博弈
在线阅读 下载PDF
GEO航天器轨道机动控制研究进展
15
作者 薛锦妍 张雅声 +2 位作者 陶雪峰 杨茗棋 赵帅龙 《系统工程与电子技术》 北大核心 2026年第1期290-300,共11页
随着卫星机动能力的不断提升,地球静止轨道(geostationary Earth orbit,GEO)航天器执行空间任务时的安全问题不容忽视。首先,针对目前编队航天器轨道机动中常用的脉冲推力模型和连续推力模型进行综述,并按照机动过程中的航天器数量区分... 随着卫星机动能力的不断提升,地球静止轨道(geostationary Earth orbit,GEO)航天器执行空间任务时的安全问题不容忽视。首先,针对目前编队航天器轨道机动中常用的脉冲推力模型和连续推力模型进行综述,并按照机动过程中的航天器数量区分“一对一航天器轨道机动”和“多航天器轨道机动”;其次,分析了微分对策理论、人工智能算法和生物群体智能算法在解决编队航天器轨道机动问题中的异同优劣;最后,从动力学模型、航天器数量类型和求解方法的视角就编队航天器轨道机动问题的特点进行对比分析。未来的研究重点在于提高算法效率及鲁棒性、增强模型适应性,以实现更加精确和高效的太空管理,保障GEO航天器在轨运行的稳定性和安全性。 展开更多
关键词 编队航天器 轨道机动 纳什均衡 微分对策 深度强化学习
在线阅读 下载PDF
Numerical method for Nash equilibrium strategies of spacecraft orbit pursuit-evasion game based on continuous thrust reachable domain analysis
16
作者 Weizhi XU Xiaokun LIU +2 位作者 Zhenkun LU Bing HUA Yunhua WU 《Science China(Technological Sciences)》 2026年第4期354-371,共18页
A numerical method for computing Nash equilibrium strategies(NES)of the spacecraft time-optimal orbit pursuitevasion game(TOOPEG)with continuous thrust reachable domain(RD)analysis is proposed.Through theoretical deri... A numerical method for computing Nash equilibrium strategies(NES)of the spacecraft time-optimal orbit pursuitevasion game(TOOPEG)with continuous thrust reachable domain(RD)analysis is proposed.Through theoretical derivation and Monte Carlo validation,the equivalence among the minimum time of the TOOPEG problem with NES,the minimum time of a virtual single spacecraft for a time-optimal approach to the origin,and the minimum time required for the envelope of the pursuer's RD to enclose that of the evader is established.First,the necessary conditions for NES are derived using Pontryagin's maximum principle(PMP),converting the original bilateral optimal control problem into a 7-dimensional two-point boundary value problem(TPBVP).Then,the TOOPEG is transformed into a virtual single-spacecraft time-optimal approach problem,with the above necessary conditions.By exploiting the evolutionary characteristics of the continuous-thrust RD,the problem is further reduced to a 3-dimensional nonlinear differential equation.An improved Broyden quasi-Newton iterative(IBQNI)algorithm is employed to obtain high-precision numerical solutions,and an iterative initial value construction method based on a linearized orbit dynamic model is proposed.Furthermore,a set of criteria is developed to assess the relative spatial configuration between the RD of different spacecraft.Numerical simulations demonstrate that the proposed method achieves excellent convergence and remarkable computational efficiency. 展开更多
关键词 orbit pursuit-evasion game Nash equilibrium strategies continuous thrust reachable domain Hamilton function optimal control
原文传递
Game Strategy Prediction for Spacecraft Orbital Pursuit–Evasion Game Based on Long Short-Term Memory
17
作者 Hongbo Wang Yao Zhang Sifeng Bi 《Space(Science & Technology)》 2025年第1期282-299,共18页
This paper presents a strategy prediction frame for multi-player orbital pursuit–evasion game that is based on discount receding horizon coevolution(DRH-CE).The proposed frame aims to enable spacecraft to indirectly ... This paper presents a strategy prediction frame for multi-player orbital pursuit–evasion game that is based on discount receding horizon coevolution(DRH-CE).The proposed frame aims to enable spacecraft to indirectly characterize the target’s possible future states by predicting strategy parameters.The authors establish a game strategy model and a strategy solution model based on DRH-CE.The payoff function parameters of the DRH-CE are utilized as strategy parameters to construct the dataset by combining the strategy solutions and parameters.Furthermore,the authors establish a strategy parameter prediction model based on long short-term memory and multi-head self-attention,and combining this model with the strategy solution model allows for the prediction of the future states of targets.The numerical examples illustrate the efficacy of the proposed frame in predicting strategy parameters and the effectiveness of the future state prediction against targets. 展开更多
关键词 construct dataset spacecraft combining st orbital pursuit evasion game strategy solution model spacecraft indirectly characterize target s possible future states strategy prediction game strategy model
原文传递
Intelligent strategy resolution methods and mechanism analysis in two-on-one impulsive orbital pursuit–evasion games
18
作者 Liran Zhao Qinbo Sun Zhaohui Dang 《Astrodynamics》 2025年第5期727-751,共25页
This paper conducts a comprehensive study on the multi-constrained two-on-one impulsive orbital pursuit–evasion game(OPEG).Firstly,considering constraints such as maneuverability,fuel reserves,and mission duration,a ... This paper conducts a comprehensive study on the multi-constrained two-on-one impulsive orbital pursuit–evasion game(OPEG).Firstly,considering constraints such as maneuverability,fuel reserves,and mission duration,a mathematical game model for the two-on-one impulsive OPEG is established,which transforms the two-on-one impulsive OPEG,where cooperation and competition coexist,into a multi-constrained three-party optimization problem suitable for solving with multi-agent deep reinforcement learning.Then,an intelligent solution method for cooperative game strategies based on the Multi-Agent Deep Deterministic Policy Gradient(MADDPG)algorithm is proposed.In the reward function design section,a reward function based on fixed-time triggering is introduced to address the information loss problem caused by long impulse intervals.To ensure good convergence of the algorithm and guide the spacecraft to learn effective cooperative strategies during training,an immediate reward function is designed,incorporating outcome rewards,guidance rewards,and cooperative rewards.Numerical simulations validate the feasibility and effectiveness of the proposed method.To further analyze the cooperative mechanisms learned by the spacecraft during algorithm training,a comparative experiment with the one-on-one impulsive OPEG is designed.The experimental results demonstrate that the two pursuers in the two-on-one impulsive OPEG not only develop various strategies such as“pre-emptive interception”,“pincer interception”,and“trailing pursuit”during training,but also improve mission success rates and reduce mission durations through coordinated efforts.Additionally,this paper reveals the impact of the relative initial state distribution between the two pursuing spacecraft and the evading spacecraft on the effectiveness of cooperation. 展开更多
关键词 two-pursuers one-evader orbital pursuit–evasion game(OPEG) impulsive maneuver multi-agent deep reinforcement learning
原文传递
考虑阳光角约束的“追-防-逃”博弈控制方法
19
作者 刘鹏轩 余建慧 +2 位作者 杨彬 朱永生 李爽 《宇航学报》 北大核心 2025年第8期1579-1590,共12页
针对考虑阳光角约束的“追-防-逃”在轨三方博弈控制问题,提出一种基于降维求解与阳光角修正的联合优化方法。针对三方博弈最优控制高维求解复杂、光学约束建模困难的问题,首先通过将原问题拆分为“追-逃”与“防-追”两个低维子博弈问... 针对考虑阳光角约束的“追-防-逃”在轨三方博弈控制问题,提出一种基于降维求解与阳光角修正的联合优化方法。针对三方博弈最优控制高维求解复杂、光学约束建模困难的问题,首先通过将原问题拆分为“追-逃”与“防-追”两个低维子博弈问题,结合权值系数协调子问题控制策略,降低了36维两点边值问题的求解维度;其次,设计阳光角修正策略,通过动态调整控制矢量方向,使追击方逆光抵近目标,削弱敌方光学观测条件。仿真结果表明,该方法可实现三方博弈场景下的过程阳光角博弈,且相较于直接求解三方博弈问题,所提方法可实现在轨博弈策略快速求解,为实现复杂约束下的多方在轨博弈在线校正控制提供了解决方案。 展开更多
关键词 在轨博弈 三方博弈 微分对策 最优控制
在线阅读 下载PDF
航天器轨道追逃博弈问题的同伦解法
20
作者 张衷韬 郑毅 +2 位作者 张亚坤 张雅声 王斌 《宇航学报》 北大核心 2025年第7期1345-1354,共10页
针对连续推力航天器轨道追逃博弈问题,基于轨道动力学方程与庞特里亚金极小值原理,系统推导了J2摄动下笛卡尔坐标中追逃双方的协态方程及双边最优控制律显式表达式。创新地提出一种基于协态映射的同伦解法,通过构建辅助问题与原始问题... 针对连续推力航天器轨道追逃博弈问题,基于轨道动力学方程与庞特里亚金极小值原理,系统推导了J2摄动下笛卡尔坐标中追逃双方的协态方程及双边最优控制律显式表达式。创新地提出一种基于协态映射的同伦解法,通过构建辅助问题与原始问题的关联,分析了协态变量在同伦过程中的演化规律。该方法首先采用Radau伪谱法求解逃逸航天器无机动时的单边最优拦截问题,并通过协态映射获取追踪航天器初始协态量;随后逐步增加逃逸航天器推力加速度,运用序列二次规划(SQP)算法迭代求解追逃双方的初始时刻协态量,实现从辅助问题到原始问题的渐进过渡。数值仿真表明,所提方法有效克服了传统打靶法的收敛性缺陷,在J2摄动场景下成功获得轨道追逃博弈的鞍点解,解的最优性相比球模型提升4~5个数量级,且计算效率与打靶法相当。研究验证了该同伦方法在处理复杂轨道博弈问题中的有效性和优越性,为航天器追逃策略设计提供了新的理论工具。 展开更多
关键词 微分对策 同伦方法 航天器追逃博弈 最优控制 伪谱法
在线阅读 下载PDF
上一页 1 2 3 下一页 到第
使用帮助 返回顶部