期刊文献+
共找到7,608篇文章
< 1 2 250 >
每页显示 20 50 100
Segment-Conditioned Latent-Intent Framework for Cooperative Multi-UAV Search
1
作者 Gang Hou Aifeng Liu +4 位作者 Tao Zhao Wenyuan Wei Bo Li Jiancheng Liu Siwen Wei 《Computers, Materials & Continua》 2026年第4期2286-2301,共16页
Cooperative multi-UAV search requires jointly optimizing wide-area coverage,rapid target discovery,and endurance under sensing and motion constraints.Resolving this coupling enables scalable coordination with high dat... Cooperative multi-UAV search requires jointly optimizing wide-area coverage,rapid target discovery,and endurance under sensing and motion constraints.Resolving this coupling enables scalable coordination with high data efficiency and mission reliability.We formulate this problem as a discounted Markov decision process on an occupancy grid with a cellwise Bayesian belief update,yielding a Markov state that couples agent poses with a probabilistic target field.On this belief–MDP we introduce a segment-conditioned latent-intent framework,in which a discrete intent head selects a latent skill every K steps and an intra-segment GRU policy generates per-step control conditioned on the fixed intent;both components are trained end-to-end with proximal updates under a centralized critic.On the 50×50 grid,coverage and discovery convergence times are reduced by up to 48%and 40%relative to a flat actor-critic benchmark,and the aggregated convergence metric improves by about 12%compared with a stateof-the-art hierarchical method.Qualitative analyses further reveal stable spatial sectorization,low path overlap,and fuel-aware patrolling,indicating that segment-conditioned latent intents provide an effective and scalable mechanism for coordinated multi-UAV search. 展开更多
关键词 Multi-agent reinforcement learning Markov decision process multi-uav cooperative search
在线阅读 下载PDF
Multi-UAV Cooperative Path Planning Based on the Improved MADDPG
2
作者 Cailong Wu Caiyi Chen +2 位作者 Zhengyu Guo Jian Zhang Delin Luo 《Journal of Beijing Institute of Technology》 2026年第1期31-43,共13页
To address real-time path planning requirements for multi-unmanned aerial vehicle(multi-UAV)collaboration in environments,this study proposes an improved multi-agent deep deterministic policy gradient algorithm with p... To address real-time path planning requirements for multi-unmanned aerial vehicle(multi-UAV)collaboration in environments,this study proposes an improved multi-agent deep deterministic policy gradient algorithm with prioritized experience replay(PER-MADDPG).By designing a multi-dimensional state representation incorporating relative positions,velocity vectors,and obstacle distance fields,we construct a composite reward function integrating safe obstacle avoidance,formation maintenance,and energy efficiency for environment perception and multiobjective collaborative optimization.The prioritized experience replay mechanism dynamically adjusts sampling weights based on temporal difference(TD)errors,enhancing learning efficiency for high-value samples.Simulation experiments demonstrate that our method generates real-time collaborative paths in 3D complex obstacle environments,reducing training time by 25.3%and 16.8%compared to traditional MADDPG and multi-agent twin delayed deep deterministic policy gradient(MATD3)algorithms respectively,while achieving smaller path length variances among UAVs.Results validate the effectiveness of prioritized experience replay in multi-agent collaborative decision-making. 展开更多
关键词 multi-unmanned aerial vehicle(multi-uav) path planning deep deterministic policy gradient prioritized experience replay
在线阅读 下载PDF
Adaptive Reinforcement Learning with Multi-Modal Perception for Autonomous Formation Control and Exploration in Large-Scale Multi-UAV Swarms
3
作者 Ziyuan Ma Huajun Gong Xinhua Wang 《Journal of Beijing Institute of Technology》 2026年第1期63-83,共21页
To address the challenge of achieving decentralized,scalable,and adaptive control for large-scale multiple unmanned aerial vehicle(multi-UAV)swarms in dynamic urban environments with obstacles and wind perturbations,w... To address the challenge of achieving decentralized,scalable,and adaptive control for large-scale multiple unmanned aerial vehicle(multi-UAV)swarms in dynamic urban environments with obstacles and wind perturbations,we proposed a hybrid framework integrating adaptive reinforcement learning(RL),multi-modal perception fusion,and enhanced pigeon flock optimization(PFO)with curiosity-driven exploration to enable robust autonomous and formation control.The framework leverages meta-learning to optimize RL policies for real-time adaptation,fuses sensor data for precise state estimation,and enhances PFO with learned leader-follower dynamics and exploration rewards to maintain cohesive formations and explore uncertain areas.For swarms of 10–30 UAVs,it achieves 34%faster convergence,61%reduced stability root mean square error(RMSE),88%fewer collisions and 85.6%–92.3%success rates in target detection and encirclement,outperforming standard multi-agent RL,pure PFO,and single-modality RL.Three-dimensional trajectory visualizations confirm cohesive formations,collision-free maneuvers,and efficient exploration in urban search-and-rescue scenarios.Innovations include meta-RL for rapid adaptation,multi-modal fusion for robust perception,and curiosity-driven PFO for scalable,decentralized control,advancing real-world multi-UAV swarm autonomy and coordination. 展开更多
关键词 multiple unmanned aerial vehicle(multi-uav)swarm autonomous control reinforcement learning(RL) multi-modal perception pigeon flock optimization(PFO)
在线阅读 下载PDF
MAV-UAV combat organization's force formation plan generation based on NSGA-Ⅲ
4
作者 ZHONG Yun WAN Lujun ZHANG Jieyong 《Journal of Systems Engineering and Electronics》 2026年第1期307-317,共11页
Manned aerial vehicle-unmanned aerial vehicle(MAV-UAV)combat organization is a MAV-UAV combat collective formed from the perspective of organization design theory and methodology,and the generation of force formation ... Manned aerial vehicle-unmanned aerial vehicle(MAV-UAV)combat organization is a MAV-UAV combat collective formed from the perspective of organization design theory and methodology,and the generation of force formation plan is a key step in the organizational planning.Based on the description of the problem and the definition of organizational elements,the matching model of platform-target attack wave is constructed to minimize the redundancy of command and decision-making capability,resource capability and the number of platforms used.Based on the non-dominated sorting genetic algorithmⅢ(NSGA-Ⅲ)framework,which includes encoding/decoding method and constraint handling method,the generation model of organizational force formation plan is solved,and the effectiveness and superiority of the algorithm are verified by simulation experiments. 展开更多
关键词 manned-unmanned aerial vehicle combat organization force formation plan command and decision-making capability resource capability non-dominated sorting genetic algorithmⅢ(NSGA-Ⅲ)
在线阅读 下载PDF
Cooperative maneuver decision making for multi-UAV air combat based on incomplete information dynamic game 被引量:8
5
作者 Zhi Ren Dong Zhang +2 位作者 Shuo Tang Wei Xiong Shu-heng Yang 《Defence Technology(防务技术)》 SCIE EI CAS CSCD 2023年第9期308-317,共10页
Cooperative autonomous air combat of multiple unmanned aerial vehicles(UAVs)is one of the main combat modes in future air warfare,which becomes even more complicated with highly changeable situation and uncertain info... Cooperative autonomous air combat of multiple unmanned aerial vehicles(UAVs)is one of the main combat modes in future air warfare,which becomes even more complicated with highly changeable situation and uncertain information of the opponents.As such,this paper presents a cooperative decision-making method based on incomplete information dynamic game to generate maneuver strategies for multiple UAVs in air combat.Firstly,a cooperative situation assessment model is presented to measure the overall combat situation.Secondly,an incomplete information dynamic game model is proposed to model the dynamic process of air combat,and a dynamic Bayesian network is designed to infer the tactical intention of the opponent.Then a reinforcement learning framework based on multiagent deep deterministic policy gradient is established to obtain the perfect Bayes-Nash equilibrium solution of the air combat game model.Finally,a series of simulations are conducted to verify the effectiveness of the proposed method,and the simulation results show effective synergies and cooperative tactics. 展开更多
关键词 Cooperative maneuver decision Air combat Incomplete information dynamic game Perfect bayes-nash equilibrium Reinforcement learning
在线阅读 下载PDF
Enhanced deep reinforcement learning for integrated navigation in multi-UAV systems 被引量:1
6
作者 Zhengyang CAO Gang CHEN 《Chinese Journal of Aeronautics》 2025年第8期119-138,共20页
In multiple Unmanned Aerial Vehicles(UAV)systems,achieving efficient navigation is essential for executing complex tasks and enhancing autonomy.Traditional navigation methods depend on predefined control strategies an... In multiple Unmanned Aerial Vehicles(UAV)systems,achieving efficient navigation is essential for executing complex tasks and enhancing autonomy.Traditional navigation methods depend on predefined control strategies and trajectory planning and often perform poorly in complex environments.To improve the UAV-environment interaction efficiency,this study proposes a multi-UAV integrated navigation algorithm based on Deep Reinforcement Learning(DRL).This algorithm integrates the Inertial Navigation System(INS),Global Navigation Satellite System(GNSS),and Visual Navigation System(VNS)for comprehensive information fusion.Specifically,an improved multi-UAV integrated navigation algorithm called Information Fusion with MultiAgent Deep Deterministic Policy Gradient(IF-MADDPG)was developed.This algorithm enables UAVs to learn collaboratively and optimize their flight trajectories in real time.Through simulations and experiments,test scenarios in GNSS-denied environments were constructed to evaluate the effectiveness of the algorithm.The experimental results demonstrate that the IF-MADDPG algorithm significantly enhances the collaborative navigation capabilities of multiple UAVs in formation maintenance and GNSS-denied environments.Additionally,it has advantages in terms of mission completion time.This study provides a novel approach for efficient collaboration in multi-UAV systems,which significantly improves the robustness and adaptability of navigation systems. 展开更多
关键词 multi-uav system Reinforcement learning Integrated navigation MADDPG Information fusion
原文传递
Multi-UAV path planning for multiple emergency payloads delivery in natural disaster scenarios 被引量:1
7
作者 Zarina Kutpanova Mustafa Kadhim +1 位作者 Xu Zheng Nurkhat Zhakiyev 《Journal of Electronic Science and Technology》 2025年第2期1-18,共18页
Unmanned aerial vehicles(UAVs)are widely used in situations with uncertain and risky areas lacking network coverage.In natural disasters,timely delivery of first aid supplies is crucial.Current UAVs face risks such as... Unmanned aerial vehicles(UAVs)are widely used in situations with uncertain and risky areas lacking network coverage.In natural disasters,timely delivery of first aid supplies is crucial.Current UAVs face risks such as crashing into birds or unexpected structures.Airdrop systems with parachutes risk dispersing payloads away from target locations.The objective here is to use multiple UAVs to distribute payloads cooperatively to assigned locations.The civil defense department must balance coverage,accurate landing,and flight safety while considering battery power and capability.Deep Q-network(DQN)models are commonly used in multi-UAV path planning to effectively represent the surroundings and action spaces.Earlier strategies focused on advanced DQNs for UAV path planning in different configurations,but rarely addressed non-cooperative scenarios and disaster environments.This paper introduces a new DQN framework to tackle challenges in disaster environments.It considers unforeseen structures and birds that could cause UAV crashes and assumes urgent landing zones and winch-based airdrop systems for precise delivery and return.A new DQN model is developed,which incorporates the battery life,safe flying distance between UAVs,and remaining delivery points to encode surrounding hazards into the state space and Q-networks.Additionally,a unique reward system is created to improve UAV action sequences for better delivery coverage and safe landings.The experimental results demonstrate that multi-UAV first aid delivery in disaster environments can achieve advanced performance. 展开更多
关键词 Deep Q-network First aid delivery multi-uav path planning Reinforcement learning Unmanned aerial vehicle(UAV)
在线阅读 下载PDF
A novel trajectory prediction method for UAV air combat based on QCNet-3D 被引量:1
8
作者 Jiahui Zhang Zhijun Meng +2 位作者 Siyuan Liu Jiachi Ji Jiazheng He 《Defence Technology(防务技术)》 2025年第12期151-165,共15页
Unmanned Aerial Vehicle(UAV) trajectory prediction is an important research topic in the field of UAV air combat. In order to address the problem of single-feature extraction scale and scene adaptability in UAV air co... Unmanned Aerial Vehicle(UAV) trajectory prediction is an important research topic in the field of UAV air combat. In order to address the problem of single-feature extraction scale and scene adaptability in UAV air combat trajectory prediction algorithms, this paper proposes an innovative UAV trajectory prediction method QCNet-3D, which can predict the future trajectory of the target UAV and provide the corresponding possibility. Firstly, the UAV trajectory prediction is modeled based on the mixture of Laplace distributions, and the UAV's kinetic equations are employed to construct the UAV trajectory prediction dataset(UAVTP dataset), ensuring high reliability. Secondly, two improvement methods are proposed on the basis of QCNet: multi-scale Fourier mapping and three-dimensional adaptation. The ablation study shows that the improvement methods have reduced the minimum average displacement error, minimum final displacement error, and missing rate by 55.4%, 54.3%, and 68.1% respectively. Finally, QCNet-3D is proposed based on the two improvement methods, and the simulation experiment confirm the proposed algorithm's capability to predict both simple and complex UAV maneuvers, offering the possibility for each predicted trajectory under various prediction future steps and output modes. 展开更多
关键词 Unmanned aerial vehicle(UAV) UAV air combat Trajectory prediction Deep learning Fourier mapping
在线阅读 下载PDF
Exploring crash induction strategies in within-visual-range air combat based on distributional reinforcement learning
9
作者 Zetian HU Xuefeng LIANG +2 位作者 Jun ZHANG Xiaochuan YOU Chengcheng MA 《Chinese Journal of Aeronautics》 2025年第9期350-364,共15页
Within-Visual-Range(WVR)air combat is a highly dynamic and uncertain domain where effective strategies require intelligent and adaptive decision-making.Traditional approaches,including rule-based methods and conventio... Within-Visual-Range(WVR)air combat is a highly dynamic and uncertain domain where effective strategies require intelligent and adaptive decision-making.Traditional approaches,including rule-based methods and conventional Reinforcement Learning(RL)algorithms,often focus on maximizing engagement outcomes through direct combat superiority.However,these methods overlook alternative tactics,such as inducing adversaries to crash,which can achieve decisive victories with lower risk and cost.This study proposes Alpha Crash,a novel distributional-rein forcement-learning-based agent specifically designed to defeat opponents by leveraging crash induction strategies.The approach integrates an improved QR-DQN framework to address uncertainties and adversarial tactics,incorporating advanced pilot experience into its reward functions.Extensive simulations reveal Alpha Crash's robust performance,achieving a 91.2%win rate across diverse scenarios by effectively guiding opponents into critical errors.Visualization and altitude analyses illustrate the agent's three-stage crash induction strategies that exploit adversaries'vulnerabilities.These findings underscore Alpha Crash's potential to enhance autonomous decision-making and strategic innovation in real-world air combat applications. 展开更多
关键词 Unmanned combat aerial vehicle Decision-making Distributional reinforcement learning Within-visual-range air combat Crash induction strategy
原文传递
A sample selection mechanism for multi-UCAV air combat policy training using multi-agent reinforcement learning
10
作者 Zihui YAN Xiaolong LIANG +3 位作者 Yueqi HOU Aiwu YANG Jiaqiang ZHANG Ning WANG 《Chinese Journal of Aeronautics》 2025年第6期501-516,共16页
Policy training against diverse opponents remains a challenge when using Multi-Agent Reinforcement Learning(MARL)in multiple Unmanned Combat Aerial Vehicle(UCAV)air combat scenarios.In view of this,this paper proposes... Policy training against diverse opponents remains a challenge when using Multi-Agent Reinforcement Learning(MARL)in multiple Unmanned Combat Aerial Vehicle(UCAV)air combat scenarios.In view of this,this paper proposes a novel Dominant and Non-dominant strategy sample selection(DoNot)mechanism and a Local Observation Enhanced Multi-Agent Proximal Policy Optimization(LOE-MAPPO)algorithm to train the multi-UCAV air combat policy and improve its generalization.Specifically,the LOE-MAPPO algorithm adopts a mixed state that concatenates the global state and individual agent's local observation to enable efficient value function learning in multi-UCAV air combat.The DoNot mechanism classifies opponents into dominant or non-dominant strategy opponents,and samples from easier to more challenging opponents to form an adaptive training curriculum.Empirical results demonstrate that the proposed LOE-MAPPO algorithm outperforms baseline MARL algorithms in multi-UCAV air combat scenarios,and the DoNot mechanism leads to stronger policy generalization when facing diverse opponents.The results pave the way for the fast generation of cooperative strategies for air combat agents with MARLalgorithms. 展开更多
关键词 Unmanned combat aerial vehicle Air combat Sample selection Multi-agent reinforcement learning Policyproximal optimization
原文传递
Disintegration of heterogeneous combat network based on double deep Q-learning
11
作者 CHEN Wenhao CHEN Gang +1 位作者 LI Jichao JIANG Jiang 《Journal of Systems Engineering and Electronics》 2025年第5期1235-1246,共12页
The rapid development of military technology has prompted different types of equipment to break the limits of operational domains and emerged through complex interactions to form a vast combat system of systems(CSoS),... The rapid development of military technology has prompted different types of equipment to break the limits of operational domains and emerged through complex interactions to form a vast combat system of systems(CSoS),which can be abstracted as a heterogeneous combat network(HCN).It is of great military significance to study the disintegration strategy of combat networks to achieve the breakdown of the enemy’s CSoS.To this end,this paper proposes an integrated framework called HCN disintegration based on double deep Q-learning(HCN-DDQL).Firstly,the enemy’s CSoS is abstracted as an HCN,and an evaluation index based on the capability and attack costs of nodes is proposed.Meanwhile,a mathematical optimization model for HCN disintegration is established.Secondly,the learning environment and double deep Q-network model of HCN-DDQL are established to train the HCN’s disintegration strategy.Then,based on the learned HCN-DDQL model,an algorithm for calculating the HCN’s optimal disintegration strategy under different states is proposed.Finally,a case study is used to demonstrate the reliability and effectiveness of HCNDDQL,and the results demonstrate that HCN-DDQL can disintegrate HCNs more effectively than baseline methods. 展开更多
关键词 heterogeneous combat network(HCN) combat system of systems(CSoS) network disintegration reinforcement learning
在线阅读 下载PDF
Evolution and Characteristics of Traditional Wushu as a Combat Art
12
作者 Huang Xiaohua 《Contemporary Social Sciences》 2025年第5期17-30,共14页
During its interaction with modern sports,traditional Wushu has faced increasing doubts about its combat effectiveness,raising concerns about its cultural identity.How traditional Wushu is understood as a combat art n... During its interaction with modern sports,traditional Wushu has faced increasing doubts about its combat effectiveness,raising concerns about its cultural identity.How traditional Wushu is understood as a combat art not only helps define its cultural essence but also carries important implications for its long-term development.It is an objective fact that combat represents the practical manifestation of traditional Wushu in history.Combat reflects similarities among traditional Wushu forms that emerged throughout history.Combat reflects the historical law governing the evolution of traditional Wushu and represents an abstraction of repetitive phenomena in traditional Wushu.A correct understanding of this objectivity,these similarities,and this repeatability is conducive to promoting and carrying forward traditional Wushu,thereby facilitating an objective analysis of differences among different traditional Wushu forms and the discovery of their evolution paradigm.In the contemporary context,it is essential for traditional Wushu to emphasize its distinctive cultural roots,thereby facilitating creative transformation and innovative development. 展开更多
关键词 traditional Wushu combat evolutionary characteristics cultural identity
在线阅读 下载PDF
Multi-UAV Collaborative Path Planning Method Fusing Multi-Head Attention and SAC
13
作者 Ziyi Zhu Ji Huang Wangye Jiang 《Instrumentation》 2025年第4期57-62,共6页
Aiming at the problem of low convergence efficiency of traditional multi-UAV path planning algorithms in unknown complex environments,this paper proposes a deep reinforcement learning algorithm incorporating the atten... Aiming at the problem of low convergence efficiency of traditional multi-UAV path planning algorithms in unknown complex environments,this paper proposes a deep reinforcement learning algorithm incorporating the attention mechanism.The method is based on the Soft Actor-Critic(SAC)framework,which introduces a multi-attention mechanism in the Critic network,dynamically learns the dependency relationship between intelligences,and realizes key information screening and conflict avoidance.An environment with multiple random obstacles is designed to simulate complex emergent situations.The results show that the proposed algorithm significantly improves the mission success rate and average reward,significantly extends the survival time and exploration range of the UAVs,and verifies the effectiveness of the attention mechanism in enhancing the efficiency,robustness,and long-term planning capability of multi-UAV collaboration,as compared to the baseline method that does not use attention. 展开更多
关键词 multi-uav path planning soft actor-critic attention mechanism
原文传递
Dynamic Decoupling-Driven Cooperative Pursuit for Multi-UAV Systems:A Multi-Agent Reinforcement Learning Policy Optimization Approach
14
作者 Lei Lei Chengfu Wu Huaimin Chen 《Computers, Materials & Continua》 2025年第10期1339-1363,共25页
This paper proposes a Multi-Agent Attention Proximal Policy Optimization(MA2PPO)algorithm aiming at the problems such as credit assignment,low collaboration efficiency and weak strategy generalization ability existing... This paper proposes a Multi-Agent Attention Proximal Policy Optimization(MA2PPO)algorithm aiming at the problems such as credit assignment,low collaboration efficiency and weak strategy generalization ability existing in the cooperative pursuit tasks of multiple unmanned aerial vehicles(UAVs).Traditional algorithms often fail to effectively identify critical cooperative relationships in such tasks,leading to low capture efficiency and a significant decline in performance when the scale expands.To tackle these issues,based on the proximal policy optimization(PPO)algorithm,MA2PPO adopts the centralized training with decentralized execution(CTDE)framework and introduces a dynamic decoupling mechanism,that is,sharing the multi-head attention(MHA)mechanism for critics during centralized training to solve the credit assignment problem.This method enables the pursuers to identify highly correlated interactions with their teammates,effectively eliminate irrelevant and weakly relevant interactions,and decompose large-scale cooperation problems into decoupled sub-problems,thereby enhancing the collaborative efficiency and policy stability among multiple agents.Furthermore,a reward function has been devised to facilitate the pursuers to encircle the escapee by combining a formation reward with a distance reward,which incentivizes UAVs to develop sophisticated cooperative pursuit strategies.Experimental results demonstrate the effectiveness of the proposed algorithm in achieving multi-UAV cooperative pursuit and inducing diverse cooperative pursuit behaviors among UAVs.Moreover,experiments on scalability have demonstrated that the algorithm is suitable for large-scale multi-UAV systems. 展开更多
关键词 Multi-agent reinforcement learning multi-uav systems pursuit-evasion games
在线阅读 下载PDF
Dung Beetle Optimization Algorithm Based on Bounded Reflection Optimization and Multi-Strategy Fusion for Multi-UAV Trajectory Planning
15
作者 Weicong Tan Qiwu Wu +2 位作者 Lingzhi Jiang Tao Tong Yunchen Su 《Computers, Materials & Continua》 2025年第11期3621-3652,共32页
This study introduces a novel algorithm known as the dung beetle optimization algorithm based on bounded reflection optimization andmulti-strategy fusion(BFDBO),which is designed to tackle the complexities associated ... This study introduces a novel algorithm known as the dung beetle optimization algorithm based on bounded reflection optimization andmulti-strategy fusion(BFDBO),which is designed to tackle the complexities associated with multi-UAV collaborative trajectory planning in intricate battlefield environments.Initially,a collaborative planning cost function for the multi-UAV system is formulated,thereby converting the trajectory planning challenge into an optimization problem.Building on the foundational dung beetle optimization(DBO)algorithm,BFDBO incorporates three significant innovations:a boundary reflection mechanism,an adaptive mixed exploration strategy,and a dynamic multi-scale mutation strategy.These enhancements are intended to optimize the equilibrium between local exploration and global exploitation,facilitating the discovery of globally optimal trajectories thatminimize the cost function.Numerical simulations utilizing the CEC2022 benchmark function indicate that all three enhancements of BFDBOpositively influence its performance,resulting in accelerated convergence and improved optimization accuracy relative to leading optimization algorithms.In two battlefield scenarios of varying complexities,BFDBO achieved a minimum of a 39% reduction in total trajectory planning costs when compared to DBO and three other highperformance variants,while also demonstrating superior average runtime.This evidence underscores the effectiveness and applicability of BFDBO in practical,real-world contexts. 展开更多
关键词 Dung beetle optimizer algorithm swarm intelligence multi-uav trajectory planning complex environments
在线阅读 下载PDF
Decision-making and confrontation in close-range air combat based on reinforcement learning
16
作者 Mengchao YANG Shengzhe SHAN Weiwei ZHANG 《Chinese Journal of Aeronautics》 2025年第9期401-420,共20页
The high maneuverability of modern fighters in close air combat imposes significant cognitive demands on pilots,making rapid,accurate decision-making challenging.While reinforcement learning(RL)has shown promise in th... The high maneuverability of modern fighters in close air combat imposes significant cognitive demands on pilots,making rapid,accurate decision-making challenging.While reinforcement learning(RL)has shown promise in this domain,the existing methods often lack strategic depth and generalization in complex,high-dimensional environments.To address these limitations,this paper proposes an optimized self-play method enhanced by advancements in fighter modeling,neural network design,and algorithmic frameworks.This study employs a six-degree-of-freedom(6-DOF)F-16 fighter model based on open-source aerodynamic data,featuring airborne equipment and a realistic visual simulation platform,unlike traditional 3-DOF models.To capture temporal dynamics,Long Short-Term Memory(LSTM)layers are integrated into the neural network,complemented by delayed input stacking.The RL environment incorporates expert strategies,curiositydriven rewards,and curriculum learning to improve adaptability and strategic decision-making.Experimental results demonstrate that the proposed approach achieves a winning rate exceeding90%against classical single-agent methods.Additionally,through enhanced 3D visual platforms,we conducted human-agent confrontation experiments,where the agent attained an average winning rate of over 75%.The agent's maneuver trajectories closely align with human pilot strategies,showcasing its potential in decision-making and pilot training applications.This study highlights the effectiveness of integrating advanced modeling and self-play techniques in developing robust air combat decision-making systems. 展开更多
关键词 Air combat Decision making Flight simulation Reinforcement learning Self-play
原文传递
Multi-UAV Cooperative Target Search Based on Autonomous Connectivity in Uncertain Network Environment
17
作者 Wang Shan Sun Sheng +4 位作者 Liu Min Wang Yuwei Chen Yali Liu Danni Lin Fuhong 《China Communications》 2025年第8期257-280,共24页
Multiple UAVs cooperative target search has been widely used in various environments,such as emergency rescue and traffic monitoring.However,uncertain communication network among UAVs exhibits unstable links and rapid... Multiple UAVs cooperative target search has been widely used in various environments,such as emergency rescue and traffic monitoring.However,uncertain communication network among UAVs exhibits unstable links and rapid topological fluctuations due to mission complexity and unpredictable environmental states.This limitation hinders timely information sharing and insightful path decisions for UAVs,resulting in inefficient or even failed collaborative search.Aiming at this issue,this paper proposes a multi-UAV cooperative search strategy by developing a real-time trajectory decision that incorporates autonomous connectivity to reinforce multi-UAV collaboration and achieve search acceleration in uncertain search environments.Specifically,an autonomous connectivity strategy based on node cognitive information and network states is introduced to enable effective message transmission and adapt to the dynamic network environment.Based on the fused information,we formalize the trajectory planning as a multiobjective optimization problem by jointly considering search performance and UAV energy harnessing.A multi-agent deep reinforcement learning based algorithm is proposed to solve it,where the reward-guided real-time path is determined to achieve an energyefficient search.Finally,extensive experimental results show that the proposed algorithm outperforms existing works in terms of average search rate and coverage rate with reduced energy consumption under uncertain search environments. 展开更多
关键词 autonomous connectivity multi-agent reinforcement learning multi-uav collaboration path planning target search
在线阅读 下载PDF
Functional cartography of heterogeneous combat networks using operational chain-based label propagation algorithm
18
作者 CHEN Kebin JIANG Xuping +2 位作者 ZENG Guangjun YANG Wenjing ZHENG Xue 《Journal of Systems Engineering and Electronics》 2025年第5期1202-1215,共14页
To extract and display the significant information of combat systems,this paper introduces the methodology of functional cartography into combat networks and proposes an integrated framework named“functional cartogra... To extract and display the significant information of combat systems,this paper introduces the methodology of functional cartography into combat networks and proposes an integrated framework named“functional cartography of heterogeneous combat networks based on the operational chain”(FCBOC).In this framework,a functional module detection algorithm named operational chain-based label propagation algorithm(OCLPA),which considers the cooperation and interactions among combat entities and can thus naturally tackle network heterogeneity,is proposed to identify the functional modules of the network.Then,the nodes and their modules are classified into different roles according to their properties.A case study shows that FCBOC can provide a simplified description of disorderly information of combat networks and enable us to identify their functional and structural network characteristics.The results provide useful information to help commanders make precise and accurate decisions regarding the protection,disintegration or optimization of combat networks.Three algorithms are also compared with OCLPA to show that FCBOC can most effectively find functional modules with practical meaning. 展开更多
关键词 functional cartography heterogeneous combat network functional module label propagation algorithm operational chain
在线阅读 下载PDF
Integrated threat assessment method of beyond-visual-range air combat
19
作者 WANG Xingyu YANG Zhen +3 位作者 CHAI Shiyuan HE Yupeng HUO Weiyu ZHOU Deyun 《Journal of Systems Engineering and Electronics》 2025年第1期176-193,共18页
Beyond-visual-range(BVR)air combat threat assessment has attracted wide attention as the support of situation awareness and autonomous decision-making.However,the traditional threat assessment method is flawed in its ... Beyond-visual-range(BVR)air combat threat assessment has attracted wide attention as the support of situation awareness and autonomous decision-making.However,the traditional threat assessment method is flawed in its failure to consider the intention and event of the target,resulting in inaccurate assessment results.In view of this,an integrated threat assessment method is proposed to address the existing problems,such as overly subjective determination of index weight and imbalance of situation.The process and characteristics of BVR air combat are analyzed to establish a threat assessment model in terms of target intention,event,situation,and capability.On this basis,a distributed weight-solving algorithm is proposed to determine index and attribute weight respectively.Then,variable weight and game theory are introduced to effectively deal with the situation imbalance and achieve the combination of subjective and objective.The performance of the model and algorithm is evaluated through multiple simulation experiments.The assessment results demonstrate the accuracy of the proposed method in BVR air combat,indicating its potential practical significance in real air combat scenarios. 展开更多
关键词 beyond-visual-range(BVR) air combat threat assessment game theory variable weight theory
在线阅读 下载PDF
面向作战的体系韧性评估方法研究综述及展望
20
作者 杨克巍 徐任杰 +3 位作者 姜九瑶 李际超 杨志伟 宫琳 《系统工程与电子技术》 北大核心 2026年第1期157-171,共15页
未来战争中呈现出作战要素分布式更广、协同性更强、自主性更高等智能化特征,战争对抗中体系韧性则综合诠释和反映上述特点,科学可信的韧性评估对设计未来战争概念、指导智能化体系建设等具有重要的理论与应用价值。在界定韧性评估概念... 未来战争中呈现出作战要素分布式更广、协同性更强、自主性更高等智能化特征,战争对抗中体系韧性则综合诠释和反映上述特点,科学可信的韧性评估对设计未来战争概念、指导智能化体系建设等具有重要的理论与应用价值。在界定韧性评估概念基础上,梳理面向作战的体系韧性评估方法并对其未来发展趋势进行展望。总结面向作战的体系韧性的概念内涵、演化模型及特征,借助系统思想从结构、功能、行为及综合视角对面向作战的体系韧性评估方法进行了分析,提出体系韧性评估的未来发展趋势。对其他领域体系韧性评估工作提供参考,全面推动智能化时代体系研究的快速发展。 展开更多
关键词 体系韧性 面向作战 韧性评估 系统思想
在线阅读 下载PDF
上一页 1 2 250 下一页 到第
使用帮助 返回顶部