Lying in her makeshift hospital bed,Joyce Tembo thanked medical personnel for evacuating her to the designated national cholera treatment centre,6 km north of Zambia’s capital Lusaka.She was recently diagnosed with d...Lying in her makeshift hospital bed,Joyce Tembo thanked medical personnel for evacuating her to the designated national cholera treatment centre,6 km north of Zambia’s capital Lusaka.She was recently diagnosed with diarrhoeal disease.Tembo,43,commended the medical sta!stationed at the treatment centre for their great service to thousands of patients,especially women and children seeking urgent treatment.“I am very grateful to the Chinese doctors who attended to me as soon as the ambulance rushed me to the clinic where I received urgent treatment;they have really saved my life,”Tembo told ChinAfrica.But not all residents in her community are as lucky as her.Many in the densely populated slums die every day due to the area’s poor sanitation-one of the major causes of the cholera outbreak.展开更多
On May 11, 2011, 13 mem- bers of the Council of Europe signed a legally binding international in-strument, the Council of Europe Convention on Preventing and Combating Violence Against Women and Domestic Violence, as ...On May 11, 2011, 13 mem- bers of the Council of Europe signed a legally binding international in-strument, the Council of Europe Convention on Preventing and Combating Violence Against Women and Domestic Violence, as part of a program to protect and aid women. The Convention aims to increase aware- ness and understanding among the general public of the different mani- festations of all forms of violence and their consequences on children,展开更多
The Assembly of the Republic of Mozambique approved a law on the prevention of early marriages on July 15. The head of the Commission of Human Rights, Constitutional Affairs and Legality Edson Macuacua told the parlia...The Assembly of the Republic of Mozambique approved a law on the prevention of early marriages on July 15. The head of the Commission of Human Rights, Constitutional Affairs and Legality Edson Macuacua told the parliamentary members that with the law, there will be fewer girls dropping out of school and fewer girls forced to marry at an early age.展开更多
Neuroinflammation and a-synuclein (a-syn) aggregation are both neuropathological hallmarks of Parkinson’s disease (PD). Microglia are crucial participants in eliciting neuroinflammatory responses themselves, as well ...Neuroinflammation and a-synuclein (a-syn) aggregation are both neuropathological hallmarks of Parkinson’s disease (PD). Microglia are crucial participants in eliciting neuroinflammatory responses themselves, as well as modulating neurotoxic activity in astrocytes, therefore forming a pathway to neurodegeneration induced by both central and peripheral insults [1, 2].展开更多
The survival and development of human society highly depends on the water availability. Driven by the growth of population and economy, global water demand has increased more than eightfold since the 1900s. Meanwhile,...The survival and development of human society highly depends on the water availability. Driven by the growth of population and economy, global water demand has increased more than eightfold since the 1900s. Meanwhile, the commonly deteriorated freshwater quality cause a large proportion of available water resources unsuitable for human uses. This inter-coupled challenge of insufficient water quantity and inadequate water quality has rendered water scarcity a widespread problem in many parts of the world.展开更多
This is a story about a Chinese herbalist Ing“Doc”Hay who combated the 1918–1919 influenza pandemic in the America West.As an immigrant,he came to the States as a laborer,but he had knowledge of Chinese herbal medi...This is a story about a Chinese herbalist Ing“Doc”Hay who combated the 1918–1919 influenza pandemic in the America West.As an immigrant,he came to the States as a laborer,but he had knowledge of Chinese herbal medicine due to his family heritage.This made it possible for him to start practicing in the Chinese community in John Day,Oregon,until 1948 when he retired.During the time of the pandemic running wild in the 1910s,he prescribed formulas aimed at flu and boiled herbal decoction,personally delivering it to a working site for those Chinese laborers as well as non‑Chinese patients.None of the laborer patients treated by him died during this deadly pandemic.Due to his success and fame,his practice was booming even after the Chinese community disappeared in John Day in later years.Doc Hay is always remembered in the history of earlier development in eastern Oregon,so that the site of his practicing,Kam Wah Chung and Co.Building,is now a national historic landmark.And more importantly,he has also been remembered by Chinese herbal medicine practitioners in the United States.展开更多
Upper-lower computer mode is the main architecture design of the amphibious combat simulation system(ACSS)at present.Through continuous improvement of real-time performance,software and hardware infrastructure,the exp...Upper-lower computer mode is the main architecture design of the amphibious combat simulation system(ACSS)at present.Through continuous improvement of real-time performance,software and hardware infrastructure,the exponential growth of operational network data scale is realized,but the availability performance of ACSS declines.The reliability of the working host as the key node has become the bottleneck of the overall availability of network nodes in the ACSS.To optimize the network node architecture of ACSS,this paper presents an effective optimization solution by designing the dual redundancy warm-standby module of the mission computer and I/O port,the algorithm of selecting output path of the mission computer in network nodes,the decision-making algorithm upon the on-duty host and output,and the video output decision-making algorithm upon the upper host.Lastly,the complete process of operational data from the input to output and the opposite is implemented well to guarantee the overall availability of network nodes in the ACSS.It has great advantages of wide applicability,strong reliability and high real-time switching speed.展开更多
This is a story about a Chinese herbalist Ing"Doc"Hay who combated the 1918-1919 influenza pandemic in the America West.As an immigrant,he came to the States as a laborer,but he had knowledge of Chinese herb...This is a story about a Chinese herbalist Ing"Doc"Hay who combated the 1918-1919 influenza pandemic in the America West.As an immigrant,he came to the States as a laborer,but he had knowledge of Chinese herbal medicine due to his family heritage.This made it possible for him to start practicing in the Chinese community in John Day,Oregon,until 1948 when he retired.During the time of the pandemic miming wild in the 1910s,he prescribed formulas aimed at flu and boiled herbal decoction,personally delivering it to a working site for those Chinese laborers as well as non-Chinese patients.None of the laborer patients treated by him died during this deadly pandemic.Due to his success and fame,his practice was booming even after the Chinese community disappeared in John Day in later years.Doc Hay is always remembered in the history of earlier development in eastern Oregon,so that the site of his practicing,Kam Wah Chung and Co.Building,is now a national historic landmark.And more importantly,he has also been remembered by Chinese herbal medicine practitioners in the United States.展开更多
Tuft cells are a type of intestinal epithelial cells that play a critical role in the immune system.These cells are found in epithelial barriers and are important for protecting the body against infection by parasites...Tuft cells are a type of intestinal epithelial cells that play a critical role in the immune system.These cells are found in epithelial barriers and are important for protecting the body against infection by parasites.However,until now it was not clear whether Tuft cellsalso play a role in combating bacterial infections.展开更多
Climate change is getting worse and worse,and we're seeing moreextreme(极端的)weather.This is causing(导致)a big challenge.Nowthe question is,what should we do?
Since the beginning of European integration,the European Community has been committed to building an internal single market.Economically,it has been encouraging free competition,combating monopolies,and cautiously usi...Since the beginning of European integration,the European Community has been committed to building an internal single market.Economically,it has been encouraging free competition,combating monopolies,and cautiously using industrial policies.展开更多
Periodontitis is indeed a chronic inflammatory disease caused by microorganisms, and it is a leading cause of tooth loss in adults worldwide [1–3]. The immune system typically maintains a balance with pathogenic bact...Periodontitis is indeed a chronic inflammatory disease caused by microorganisms, and it is a leading cause of tooth loss in adults worldwide [1–3]. The immune system typically maintains a balance with pathogenic bacteria in the body, and the local mucosal immune system effectively monitors and controls these microorganisms to prevent excessive inflammation.展开更多
In recent years,advancements in nanomaterial production have significantly enhanced the efficiency of electrochemical sensors,facilitating rapid and accurate analysis of specific analytes.Electrochemical sensors have ...In recent years,advancements in nanomaterial production have significantly enhanced the efficiency of electrochemical sensors,facilitating rapid and accurate analysis of specific analytes.Electrochemical sensors have become promising tools for detecting and measuring pharmacological compounds due to their user-friendly operation,cost-effectiveness,and high sensitivity.展开更多
Mauritania, located in the Western Sahara, is one of the least developed countries in the Sahara Desert. Its capital, Nouakchott, which is home to 23% of its population, suffers from soil erosion from the Sahara and s...Mauritania, located in the Western Sahara, is one of the least developed countries in the Sahara Desert. Its capital, Nouakchott, which is home to 23% of its population, suffers from soil erosion from the Sahara and saltwater intrusion from the Atlantic Ocean. The local environment is under pressure from the combined effects of climate and socio-economic factors, with desertification being recognized as the greatest threat to life. In this context, high-resolution remote sensing images of Nouakchott obtained during the winters of 1985, 1988, 2000, 2006, and 2010 are selected for interpretation and classification. Analysis of the types of desertification and land use reveals the temporal and spatial characteristics of five distinct time periods from 1985 to 2010. This study analyzes the current status of desertification in Nouakchott and suggests five preventive measures.展开更多
Within-Visual-Range(WVR)air combat is a highly dynamic and uncertain domain where effective strategies require intelligent and adaptive decision-making.Traditional approaches,including rule-based methods and conventio...Within-Visual-Range(WVR)air combat is a highly dynamic and uncertain domain where effective strategies require intelligent and adaptive decision-making.Traditional approaches,including rule-based methods and conventional Reinforcement Learning(RL)algorithms,often focus on maximizing engagement outcomes through direct combat superiority.However,these methods overlook alternative tactics,such as inducing adversaries to crash,which can achieve decisive victories with lower risk and cost.This study proposes Alpha Crash,a novel distributional-rein forcement-learning-based agent specifically designed to defeat opponents by leveraging crash induction strategies.The approach integrates an improved QR-DQN framework to address uncertainties and adversarial tactics,incorporating advanced pilot experience into its reward functions.Extensive simulations reveal Alpha Crash's robust performance,achieving a 91.2%win rate across diverse scenarios by effectively guiding opponents into critical errors.Visualization and altitude analyses illustrate the agent's three-stage crash induction strategies that exploit adversaries'vulnerabilities.These findings underscore Alpha Crash's potential to enhance autonomous decision-making and strategic innovation in real-world air combat applications.展开更多
Policy training against diverse opponents remains a challenge when using Multi-Agent Reinforcement Learning(MARL)in multiple Unmanned Combat Aerial Vehicle(UCAV)air combat scenarios.In view of this,this paper proposes...Policy training against diverse opponents remains a challenge when using Multi-Agent Reinforcement Learning(MARL)in multiple Unmanned Combat Aerial Vehicle(UCAV)air combat scenarios.In view of this,this paper proposes a novel Dominant and Non-dominant strategy sample selection(DoNot)mechanism and a Local Observation Enhanced Multi-Agent Proximal Policy Optimization(LOE-MAPPO)algorithm to train the multi-UCAV air combat policy and improve its generalization.Specifically,the LOE-MAPPO algorithm adopts a mixed state that concatenates the global state and individual agent's local observation to enable efficient value function learning in multi-UCAV air combat.The DoNot mechanism classifies opponents into dominant or non-dominant strategy opponents,and samples from easier to more challenging opponents to form an adaptive training curriculum.Empirical results demonstrate that the proposed LOE-MAPPO algorithm outperforms baseline MARL algorithms in multi-UCAV air combat scenarios,and the DoNot mechanism leads to stronger policy generalization when facing diverse opponents.The results pave the way for the fast generation of cooperative strategies for air combat agents with MARLalgorithms.展开更多
The rapid development of military technology has prompted different types of equipment to break the limits of operational domains and emerged through complex interactions to form a vast combat system of systems(CSoS),...The rapid development of military technology has prompted different types of equipment to break the limits of operational domains and emerged through complex interactions to form a vast combat system of systems(CSoS),which can be abstracted as a heterogeneous combat network(HCN).It is of great military significance to study the disintegration strategy of combat networks to achieve the breakdown of the enemy’s CSoS.To this end,this paper proposes an integrated framework called HCN disintegration based on double deep Q-learning(HCN-DDQL).Firstly,the enemy’s CSoS is abstracted as an HCN,and an evaluation index based on the capability and attack costs of nodes is proposed.Meanwhile,a mathematical optimization model for HCN disintegration is established.Secondly,the learning environment and double deep Q-network model of HCN-DDQL are established to train the HCN’s disintegration strategy.Then,based on the learned HCN-DDQL model,an algorithm for calculating the HCN’s optimal disintegration strategy under different states is proposed.Finally,a case study is used to demonstrate the reliability and effectiveness of HCNDDQL,and the results demonstrate that HCN-DDQL can disintegrate HCNs more effectively than baseline methods.展开更多
The high maneuverability of modern fighters in close air combat imposes significant cognitive demands on pilots,making rapid,accurate decision-making challenging.While reinforcement learning(RL)has shown promise in th...The high maneuverability of modern fighters in close air combat imposes significant cognitive demands on pilots,making rapid,accurate decision-making challenging.While reinforcement learning(RL)has shown promise in this domain,the existing methods often lack strategic depth and generalization in complex,high-dimensional environments.To address these limitations,this paper proposes an optimized self-play method enhanced by advancements in fighter modeling,neural network design,and algorithmic frameworks.This study employs a six-degree-of-freedom(6-DOF)F-16 fighter model based on open-source aerodynamic data,featuring airborne equipment and a realistic visual simulation platform,unlike traditional 3-DOF models.To capture temporal dynamics,Long Short-Term Memory(LSTM)layers are integrated into the neural network,complemented by delayed input stacking.The RL environment incorporates expert strategies,curiositydriven rewards,and curriculum learning to improve adaptability and strategic decision-making.Experimental results demonstrate that the proposed approach achieves a winning rate exceeding90%against classical single-agent methods.Additionally,through enhanced 3D visual platforms,we conducted human-agent confrontation experiments,where the agent attained an average winning rate of over 75%.The agent's maneuver trajectories closely align with human pilot strategies,showcasing its potential in decision-making and pilot training applications.This study highlights the effectiveness of integrating advanced modeling and self-play techniques in developing robust air combat decision-making systems.展开更多
文摘Lying in her makeshift hospital bed,Joyce Tembo thanked medical personnel for evacuating her to the designated national cholera treatment centre,6 km north of Zambia’s capital Lusaka.She was recently diagnosed with diarrhoeal disease.Tembo,43,commended the medical sta!stationed at the treatment centre for their great service to thousands of patients,especially women and children seeking urgent treatment.“I am very grateful to the Chinese doctors who attended to me as soon as the ambulance rushed me to the clinic where I received urgent treatment;they have really saved my life,”Tembo told ChinAfrica.But not all residents in her community are as lucky as her.Many in the densely populated slums die every day due to the area’s poor sanitation-one of the major causes of the cholera outbreak.
文摘On May 11, 2011, 13 mem- bers of the Council of Europe signed a legally binding international in-strument, the Council of Europe Convention on Preventing and Combating Violence Against Women and Domestic Violence, as part of a program to protect and aid women. The Convention aims to increase aware- ness and understanding among the general public of the different mani- festations of all forms of violence and their consequences on children,
文摘The Assembly of the Republic of Mozambique approved a law on the prevention of early marriages on July 15. The head of the Commission of Human Rights, Constitutional Affairs and Legality Edson Macuacua told the parliamentary members that with the law, there will be fewer girls dropping out of school and fewer girls forced to marry at an early age.
基金supported by grants from the National Foundation of Natural Science of China (31871049, 31771124, and 31800893)。
文摘Neuroinflammation and a-synuclein (a-syn) aggregation are both neuropathological hallmarks of Parkinson’s disease (PD). Microglia are crucial participants in eliciting neuroinflammatory responses themselves, as well as modulating neurotoxic activity in astrocytes, therefore forming a pathway to neurodegeneration induced by both central and peripheral insults [1, 2].
文摘The survival and development of human society highly depends on the water availability. Driven by the growth of population and economy, global water demand has increased more than eightfold since the 1900s. Meanwhile, the commonly deteriorated freshwater quality cause a large proportion of available water resources unsuitable for human uses. This inter-coupled challenge of insufficient water quantity and inadequate water quality has rendered water scarcity a widespread problem in many parts of the world.
文摘This is a story about a Chinese herbalist Ing“Doc”Hay who combated the 1918–1919 influenza pandemic in the America West.As an immigrant,he came to the States as a laborer,but he had knowledge of Chinese herbal medicine due to his family heritage.This made it possible for him to start practicing in the Chinese community in John Day,Oregon,until 1948 when he retired.During the time of the pandemic running wild in the 1910s,he prescribed formulas aimed at flu and boiled herbal decoction,personally delivering it to a working site for those Chinese laborers as well as non‑Chinese patients.None of the laborer patients treated by him died during this deadly pandemic.Due to his success and fame,his practice was booming even after the Chinese community disappeared in John Day in later years.Doc Hay is always remembered in the history of earlier development in eastern Oregon,so that the site of his practicing,Kam Wah Chung and Co.Building,is now a national historic landmark.And more importantly,he has also been remembered by Chinese herbal medicine practitioners in the United States.
基金Supported by the National Natural Science Foundation of China(61401496)
文摘Upper-lower computer mode is the main architecture design of the amphibious combat simulation system(ACSS)at present.Through continuous improvement of real-time performance,software and hardware infrastructure,the exponential growth of operational network data scale is realized,but the availability performance of ACSS declines.The reliability of the working host as the key node has become the bottleneck of the overall availability of network nodes in the ACSS.To optimize the network node architecture of ACSS,this paper presents an effective optimization solution by designing the dual redundancy warm-standby module of the mission computer and I/O port,the algorithm of selecting output path of the mission computer in network nodes,the decision-making algorithm upon the on-duty host and output,and the video output decision-making algorithm upon the upper host.Lastly,the complete process of operational data from the input to output and the opposite is implemented well to guarantee the overall availability of network nodes in the ACSS.It has great advantages of wide applicability,strong reliability and high real-time switching speed.
文摘This is a story about a Chinese herbalist Ing"Doc"Hay who combated the 1918-1919 influenza pandemic in the America West.As an immigrant,he came to the States as a laborer,but he had knowledge of Chinese herbal medicine due to his family heritage.This made it possible for him to start practicing in the Chinese community in John Day,Oregon,until 1948 when he retired.During the time of the pandemic miming wild in the 1910s,he prescribed formulas aimed at flu and boiled herbal decoction,personally delivering it to a working site for those Chinese laborers as well as non-Chinese patients.None of the laborer patients treated by him died during this deadly pandemic.Due to his success and fame,his practice was booming even after the Chinese community disappeared in John Day in later years.Doc Hay is always remembered in the history of earlier development in eastern Oregon,so that the site of his practicing,Kam Wah Chung and Co.Building,is now a national historic landmark.And more importantly,he has also been remembered by Chinese herbal medicine practitioners in the United States.
文摘Tuft cells are a type of intestinal epithelial cells that play a critical role in the immune system.These cells are found in epithelial barriers and are important for protecting the body against infection by parasites.However,until now it was not clear whether Tuft cellsalso play a role in combating bacterial infections.
文摘Climate change is getting worse and worse,and we're seeing moreextreme(极端的)weather.This is causing(导致)a big challenge.Nowthe question is,what should we do?
文摘Since the beginning of European integration,the European Community has been committed to building an internal single market.Economically,it has been encouraging free competition,combating monopolies,and cautiously using industrial policies.
文摘Periodontitis is indeed a chronic inflammatory disease caused by microorganisms, and it is a leading cause of tooth loss in adults worldwide [1–3]. The immune system typically maintains a balance with pathogenic bacteria in the body, and the local mucosal immune system effectively monitors and controls these microorganisms to prevent excessive inflammation.
文摘In recent years,advancements in nanomaterial production have significantly enhanced the efficiency of electrochemical sensors,facilitating rapid and accurate analysis of specific analytes.Electrochemical sensors have become promising tools for detecting and measuring pharmacological compounds due to their user-friendly operation,cost-effectiveness,and high sensitivity.
基金Chinese Academy of Sciences(2017-XBQNXZ-B-018)Science and Technology Partnership Program,Ministry of Science and Technology of China(KY201702010)China–Africa Joint Research Centre Project of the Chinese Academy of Sciences(SAJC201610)
文摘Mauritania, located in the Western Sahara, is one of the least developed countries in the Sahara Desert. Its capital, Nouakchott, which is home to 23% of its population, suffers from soil erosion from the Sahara and saltwater intrusion from the Atlantic Ocean. The local environment is under pressure from the combined effects of climate and socio-economic factors, with desertification being recognized as the greatest threat to life. In this context, high-resolution remote sensing images of Nouakchott obtained during the winters of 1985, 1988, 2000, 2006, and 2010 are selected for interpretation and classification. Analysis of the types of desertification and land use reveals the temporal and spatial characteristics of five distinct time periods from 1985 to 2010. This study analyzes the current status of desertification in Nouakchott and suggests five preventive measures.
基金supported by the National Key R&D Program of China(No.2021YFB3300602)。
文摘Within-Visual-Range(WVR)air combat is a highly dynamic and uncertain domain where effective strategies require intelligent and adaptive decision-making.Traditional approaches,including rule-based methods and conventional Reinforcement Learning(RL)algorithms,often focus on maximizing engagement outcomes through direct combat superiority.However,these methods overlook alternative tactics,such as inducing adversaries to crash,which can achieve decisive victories with lower risk and cost.This study proposes Alpha Crash,a novel distributional-rein forcement-learning-based agent specifically designed to defeat opponents by leveraging crash induction strategies.The approach integrates an improved QR-DQN framework to address uncertainties and adversarial tactics,incorporating advanced pilot experience into its reward functions.Extensive simulations reveal Alpha Crash's robust performance,achieving a 91.2%win rate across diverse scenarios by effectively guiding opponents into critical errors.Visualization and altitude analyses illustrate the agent's three-stage crash induction strategies that exploit adversaries'vulnerabilities.These findings underscore Alpha Crash's potential to enhance autonomous decision-making and strategic innovation in real-world air combat applications.
文摘Policy training against diverse opponents remains a challenge when using Multi-Agent Reinforcement Learning(MARL)in multiple Unmanned Combat Aerial Vehicle(UCAV)air combat scenarios.In view of this,this paper proposes a novel Dominant and Non-dominant strategy sample selection(DoNot)mechanism and a Local Observation Enhanced Multi-Agent Proximal Policy Optimization(LOE-MAPPO)algorithm to train the multi-UCAV air combat policy and improve its generalization.Specifically,the LOE-MAPPO algorithm adopts a mixed state that concatenates the global state and individual agent's local observation to enable efficient value function learning in multi-UCAV air combat.The DoNot mechanism classifies opponents into dominant or non-dominant strategy opponents,and samples from easier to more challenging opponents to form an adaptive training curriculum.Empirical results demonstrate that the proposed LOE-MAPPO algorithm outperforms baseline MARL algorithms in multi-UCAV air combat scenarios,and the DoNot mechanism leads to stronger policy generalization when facing diverse opponents.The results pave the way for the fast generation of cooperative strategies for air combat agents with MARLalgorithms.
基金supported by the National Natural Science Foundation of China(7200120972231011+2 种基金72071206)the Science and Technology Innovative Research Team in Higher Educational Institutions of Hunan Province(2020RC4046)the Science Foundation for Outstanding Youth Scholars of Hunan Province(2022JJ20047).
文摘The rapid development of military technology has prompted different types of equipment to break the limits of operational domains and emerged through complex interactions to form a vast combat system of systems(CSoS),which can be abstracted as a heterogeneous combat network(HCN).It is of great military significance to study the disintegration strategy of combat networks to achieve the breakdown of the enemy’s CSoS.To this end,this paper proposes an integrated framework called HCN disintegration based on double deep Q-learning(HCN-DDQL).Firstly,the enemy’s CSoS is abstracted as an HCN,and an evaluation index based on the capability and attack costs of nodes is proposed.Meanwhile,a mathematical optimization model for HCN disintegration is established.Secondly,the learning environment and double deep Q-network model of HCN-DDQL are established to train the HCN’s disintegration strategy.Then,based on the learned HCN-DDQL model,an algorithm for calculating the HCN’s optimal disintegration strategy under different states is proposed.Finally,a case study is used to demonstrate the reliability and effectiveness of HCNDDQL,and the results demonstrate that HCN-DDQL can disintegrate HCNs more effectively than baseline methods.
基金co-supported by the National Natural Science Foundation of China(No.91852115)。
文摘The high maneuverability of modern fighters in close air combat imposes significant cognitive demands on pilots,making rapid,accurate decision-making challenging.While reinforcement learning(RL)has shown promise in this domain,the existing methods often lack strategic depth and generalization in complex,high-dimensional environments.To address these limitations,this paper proposes an optimized self-play method enhanced by advancements in fighter modeling,neural network design,and algorithmic frameworks.This study employs a six-degree-of-freedom(6-DOF)F-16 fighter model based on open-source aerodynamic data,featuring airborne equipment and a realistic visual simulation platform,unlike traditional 3-DOF models.To capture temporal dynamics,Long Short-Term Memory(LSTM)layers are integrated into the neural network,complemented by delayed input stacking.The RL environment incorporates expert strategies,curiositydriven rewards,and curriculum learning to improve adaptability and strategic decision-making.Experimental results demonstrate that the proposed approach achieves a winning rate exceeding90%against classical single-agent methods.Additionally,through enhanced 3D visual platforms,we conducted human-agent confrontation experiments,where the agent attained an average winning rate of over 75%.The agent's maneuver trajectories closely align with human pilot strategies,showcasing its potential in decision-making and pilot training applications.This study highlights the effectiveness of integrating advanced modeling and self-play techniques in developing robust air combat decision-making systems.