Aiming at the flexible manufacturing system with multi-machining and multi-assembly equipment, a new scheduling algorithm is proposed to decompose the assembly structure of the products, thus obtaining simple scheduli...Aiming at the flexible manufacturing system with multi-machining and multi-assembly equipment, a new scheduling algorithm is proposed to decompose the assembly structure of the products, thus obtaining simple scheduling problems and forming the cOrrespOnding agents. Then, the importance and the restriction of each agent are cOnsidered, to obtain an order of simple scheduling problems based on the cooperation game theory. With this order, the scheduling of sub-questions is implemented in term of rules, and the almost optimal scheduling results for meeting the restriction can be obtained. Experimental results verify the effectiveness of the proposed scheduling algorithm.展开更多
In multi-agent systems, autonomous agents may form coalition to increase the efficiency of problem solving. But the current coalition algorithm is very complex, and cannot satisfy the condition of optimality and stabl...In multi-agent systems, autonomous agents may form coalition to increase the efficiency of problem solving. But the current coalition algorithm is very complex, and cannot satisfy the condition of optimality and stableness simultaneously. To solve the problem, an algorithm that uses the mechanism of distribution according to work for coalition formation is presented, which can achieve global optimal and stable solution in subadditive task oriented domains. The validity of the algorithm is demonstrated by both experiments and theory.展开更多
A general multi-agent architecture is proposed for intelligent decision support system (MAIDSS). The agent in MAIDSS is built based on an extension of BDI framework. Several agents form a team working together on a de...A general multi-agent architecture is proposed for intelligent decision support system (MAIDSS). The agent in MAIDSS is built based on an extension of BDI framework. Several agents form a team working together on a decision problem; several agent teams are defined to stand for the benefits of different people in the real world. The decision making process is based on multi-agent cooperation, and a logical framework for a team of agents cooperating to create the solution for the decision problem is discussed in detail.展开更多
With the advent of sixth-generation mobile communications(6G),space-air-ground integrated networks have become mainstream.This paper focuses on collaborative scheduling for mobile edge computing(MEC)under a three-tier...With the advent of sixth-generation mobile communications(6G),space-air-ground integrated networks have become mainstream.This paper focuses on collaborative scheduling for mobile edge computing(MEC)under a three-tier heterogeneous architecture composed of mobile devices,unmanned aerial vehicles(UAVs),and macro base stations(BSs).This scenario typically faces fast channel fading,dynamic computational loads,and energy constraints,whereas classical queuing-theoretic or convex-optimization approaches struggle to yield robust solutions in highly dynamic settings.To address this issue,we formulate a multi-agent Markov decision process(MDP)for an air-ground-fused MEC system,unify link selection,bandwidth/power allocation,and task offloading into a continuous action space and propose a joint scheduling strategy that is based on an improved MATD3 algorithm.The improvements include Alternating Layer Normalization(ALN)in the actor to suppress gradient variance,Residual Orthogonalization(RO)in the critic to reduce the correlation between the twin Q-value estimates,and a dynamic-temperature reward to enable adaptive trade-offs during training.On a multi-user,dual-link simulation platform,we conduct ablation and baseline comparisons.The results reveal that the proposed method has better convergence and stability.Compared with MADDPG,TD3,and DSAC,our algorithm achieves more robust performance across key metrics.展开更多
With the release of the electricity sales side,large-scale small-capacity distributed power generation units are connected to the distribution side,forming multi-type market entities such as microgrids,integrated ener...With the release of the electricity sales side,large-scale small-capacity distributed power generation units are connected to the distribution side,forming multi-type market entities such as microgrids,integrated energy systems,and virtual power plants.With the large-scale integration of distributed energy,the energy market under the energy internet is different from a traditional transmission grid.It is currently developing in the direction of diversified entities and commodities,a flat structure,and a flexible and competitive multi-agent market mechanism.In this context,this study analyzes the value of combining blockchain and the electricity market presents the design of a blockchain trading framework for multi-agent cooperation and sharing of the energy internet.The nodes in market transactions are modeled through power system modeling in the physical layer and the transaction consensus strategy in the cyber layer;moreover,the nodes are verified in a modified IEEE 13 testing feeder of a distribution network.A transaction example is demonstrated using the multi-agent cooperation and sharing transaction platform based on the Ethereum private blockchain.展开更多
Educational cooperation,as the cornerstone and vanguard of people-to-people exchanges and practical cooperation between China and Africa,holds irreplaceable strategic significance for enhancing the quality of Africa...Educational cooperation,as the cornerstone and vanguard of people-to-people exchanges and practical cooperation between China and Africa,holds irreplaceable strategic significance for enhancing the quality of Africa's human capital and accelerating its structural transformation and modernization process.This paper employs“demand–supply–adaptation”as its core analytical framework and aims to systematically explore how the educational cooperation between China and Africa can serve Africa's goal of modernization more precisely and effectively.First,through reviewing and analyzing the domestic and international research literature,this paper clarifies the focus,paradigms,and shortcomings of existing research,identifying the knowledge contribution of this study.Second,utilizing detailed macro-level data,case studies,and comparative research methods,it comprehensively presents the multidimensional status,structural characteristics,and development trends of China–Africa educational cooperation in areas such as student exchanges,cooperative education,vocational and technical training,language and cultural exchange,and emerging digital education.This paper also deeply analyzes the pressing and specific demands placed on the education system by Africa's modernization development across key dimensions like economic diversification,industrialization,agricultural modernization,social governance upgrading,and digital transformation.Third,based on the cooperative principle of“Africa's needs and China's strengths,”this paper innovatively proposes systematic countermeasures and suggestions for constructing new,multi-level,high-quality,sustainable,and future-oriented pathways for China–Africa educational cooperation:(a)promoting the strategic focus of cooperation to extend from“hard infrastructure”support to empowering“soft infrastructure”;(b)deepening the integration of industry and education and school–enterprise collaboration to precisely align with Africa's industrial development needs;vigorously developing digital education and jointly building a smart education ecosystem to help Africa bridge the digital divide;(c)improving an evidence-based,third-party evaluation system for cooperative effectiveness and a full-process quality assurance system;and(d)promoting the collaborative participation of multiple actors including governments,schools,enterprises,think tanks,and social organizations to build a new cooperative pattern of coconstruction,co-governance,and shared benefits.展开更多
Experts and officials shared their insights on poverty reduction cooperation and sustainable development during the 2025 International Seminar on Global Poverty Reduction Partnerships.
Cooperative multi-agent reinforcement learning(MARL)is a key technology for enabling cooperation in complex multi-agent systems.It has achieved remarkable progress in areas such as gaming,autonomous driving,and multi-...Cooperative multi-agent reinforcement learning(MARL)is a key technology for enabling cooperation in complex multi-agent systems.It has achieved remarkable progress in areas such as gaming,autonomous driving,and multi-robot control.Empowering cooperative MARL with multi-task decision-making capabilities is expected to further broaden its application scope.In multi-task scenarios,cooperative MARL algorithms need to address 3 types of multi-task problems:reward-related multi-task,arising from different reward functions;multi-domain multi-task,caused by differences in state and action spaces,state transition functions;and scalability-related multi-task,resulting from the dynamic variation in the number of agents.Most existing studies focus on scalability-related multitask problems.However,with the increasing integration between large language models(LLMs)and multi-agent systems,a growing number of LLM-based multi-agent systems have emerged,enabling more complex multi-task cooperation.This paper provides a comprehensive review of the latest advances in this field.By combining multi-task reinforcement learning with cooperative MARL,we categorize and analyze the 3 major types of multi-task problems under multi-agent settings,offering more fine-grained classifications and summarizing key insights for each.In addition,we summarize commonly used benchmarks and discuss future directions of research in this area,which hold promise for further enhancing the multi-task cooperation capabilities of multi-agent systems and expanding their practical applications in the real world.展开更多
China provides a compelling reference for how governments can promote global climate governance and sustainable innovative development.SENIOR government officials,technical experts,and policy researchers from 14 devel...China provides a compelling reference for how governments can promote global climate governance and sustainable innovative development.SENIOR government officials,technical experts,and policy researchers from 14 developing countries in Asia,Africa,Latin America,and Pacific Island countries gathered at Tsinghua University(THU),Beijing,on September 1-13,2025 for in-depth study and exchange on“climate finance and low-carbon transformation.”展开更多
International investment constitutes a key dimension of high-quality cooperation under the Belt and Road Initiative.However,affected by unilateralism,protectionist measures,and growing regulatory uncertainty,it faces ...International investment constitutes a key dimension of high-quality cooperation under the Belt and Road Initiative.However,affected by unilateralism,protectionist measures,and growing regulatory uncertainty,it faces increasingly complex and diverse risks.Deepening the rule of law cooperation based on an international order governed by international law is not only a fundamental way to mitigate investment risks associated with the Belt and Road Initiative but also an essential requirement for building a more just and reasonable system of global economic governance.At present,investment rule of law cooperation under the Belt and Road Initiative faces three main challenges:limited legal cooperation capacity among some participating countries at the subject level,insufficient coordination between hard law and soft law at the normative level,and relatively weak regional enforcement and dispute-prevention mechanisms at the implementation level.To advance regional investment rule of law cooperation and promote the high-quality development of the Belt and Road Initiative,China should foster consultative governance on investment rule of law cooperation among partner countries,comprehensively upgrade hard law and soft law instruments regulating investment,and strengthen enforcement and dispute settlement capacity,thereby contributing a more just,reasonable,and inclusive model of investment governance to the world.展开更多
The development of chassis active safety control technology has improved vehicle stability under extreme conditions.However,its cross-system and multi-functional characteristics make the controller difficult to achiev...The development of chassis active safety control technology has improved vehicle stability under extreme conditions.However,its cross-system and multi-functional characteristics make the controller difficult to achieve cooperative goals.In addition,the chassis system,which has high complexity,numerous subsystems,and strong coupling,will also lead to low computing efficiency and poor control effect of the controller.Therefore,this paper proposes a scenario-driven hybrid distributed model predictive control algorithm with variable control topology.This algorithm divides multiple stability regions based on the vehicle’s β−γ phase plane,forming a mapping relationship between the control structure and the vehicle’s state.A control input fusion mechanism within the transition domain is designed to mitigate the problems of system state oscillation and control input jitter caused by switching control structures.Then,a distributed state-space equation with state coupling and input coupling characteristics is constructed,and a weighted local agent cost function in quadratic programming is derived.Through cost coupling,local agents can coordinate global performance goals.Finally,through Simulink/CarSim joint simulation and hardware-in-the-loop(HIL)test,the proposed algorithm is validated to improve vehicle stability while ensuring trajectory tracking accuracy and has good applicability for multi-objective coordinated control.This paper combines the advantages of distributed MPC and decentralized MPC,achieving a balance between approximating the global optimal results and the solution’s efficiency.展开更多
This paper proposes a Multi-Agent Attention Proximal Policy Optimization(MA2PPO)algorithm aiming at the problems such as credit assignment,low collaboration efficiency and weak strategy generalization ability existing...This paper proposes a Multi-Agent Attention Proximal Policy Optimization(MA2PPO)algorithm aiming at the problems such as credit assignment,low collaboration efficiency and weak strategy generalization ability existing in the cooperative pursuit tasks of multiple unmanned aerial vehicles(UAVs).Traditional algorithms often fail to effectively identify critical cooperative relationships in such tasks,leading to low capture efficiency and a significant decline in performance when the scale expands.To tackle these issues,based on the proximal policy optimization(PPO)algorithm,MA2PPO adopts the centralized training with decentralized execution(CTDE)framework and introduces a dynamic decoupling mechanism,that is,sharing the multi-head attention(MHA)mechanism for critics during centralized training to solve the credit assignment problem.This method enables the pursuers to identify highly correlated interactions with their teammates,effectively eliminate irrelevant and weakly relevant interactions,and decompose large-scale cooperation problems into decoupled sub-problems,thereby enhancing the collaborative efficiency and policy stability among multiple agents.Furthermore,a reward function has been devised to facilitate the pursuers to encircle the escapee by combining a formation reward with a distance reward,which incentivizes UAVs to develop sophisticated cooperative pursuit strategies.Experimental results demonstrate the effectiveness of the proposed algorithm in achieving multi-UAV cooperative pursuit and inducing diverse cooperative pursuit behaviors among UAVs.Moreover,experiments on scalability have demonstrated that the algorithm is suitable for large-scale multi-UAV systems.展开更多
The Internet of Unmanned Aerial Vehicles(I-UAVs)is expected to execute latency-sensitive tasks,but limited by co-channel interference and malicious jamming.In the face of unknown prior environmental knowledge,defendin...The Internet of Unmanned Aerial Vehicles(I-UAVs)is expected to execute latency-sensitive tasks,but limited by co-channel interference and malicious jamming.In the face of unknown prior environmental knowledge,defending against jamming and interference through spectrum allocation becomes challenging,especially when each UAV pair makes decisions independently.In this paper,we propose a cooperative multi-agent reinforcement learning(MARL)-based anti-jamming framework for I-UAVs,enabling UAV pairs to learn their own policies cooperatively.Specifically,we first model the problem as a modelfree multi-agent Markov decision process(MAMDP)to maximize the long-term expected system throughput.Then,for improving the exploration of the optimal policy,we resort to optimizing a MARL objective function with a mutual-information(MI)regularizer between states and actions,which can dynamically assign the probability for actions frequently used by the optimal policy.Next,through sharing their current channel selections and local learning experience(their soft Q-values),the UAV pairs can learn their own policies cooperatively relying on only preceding observed information and predicting others’actions.Our simulation results show that for both sweep jamming and Markov jamming patterns,the proposed scheme outperforms the benchmarkers in terms of throughput,convergence and stability for different numbers of jammers,channels and UAV pairs.展开更多
Within the context of ground-air cooperation,the distributed formation trajectory tracking control problems for the Heterogeneous Multi-Agent Systems(HMASs)is studied.First,considering external disturbances and model ...Within the context of ground-air cooperation,the distributed formation trajectory tracking control problems for the Heterogeneous Multi-Agent Systems(HMASs)is studied.First,considering external disturbances and model uncertainties,a graph theory-based formation control protocol is designed for the HMASs consisting of Unmanned Aerial Vehicles(UAVs)and Unmanned Ground Vehicles(UGVs).Subsequently,a formation trajectory tracking control strategy employing adaptive Fractional-Order Sliding Mode Control(FOSMC)method is developed,and a Feedback Multilayer Fuzzy Neural Network(FMFNN)is introduced to estimate the lumped uncertainties.This approach empowers HMASs to adaptively follow the expected trajectory and adopt the designated formation configuration,even in the presence of various uncertainties.Additionally,an event-triggered mechanism is incorporated into the controller to reduce the update frequency of the controller and minimize the communication exchange among the agents,and the absence of Zeno behavior is rigorously demonstrated by an integral inequality analysis.Finally,to confirm the effectiveness of the proposed formation control protocol,some numerical simulations are presented.展开更多
The accomplishment of a complex problem usually involves cooperation between participators with different knowledge background concerned. This paper identifies interdependency between different sub problems (through p...The accomplishment of a complex problem usually involves cooperation between participators with different knowledge background concerned. This paper identifies interdependency between different sub problems (through problem decomposition) as the major factor that influences cooperative relations in multi-Agent systems, based on which we propose an efficient means to measure cooperation coefficient (degree) between different Agents. Then cognitive cooperation between Agents is analyzed which aims at collecting the wisdom of the cognitive community for a systematic solution to the overall problem.展开更多
In this paper, rough set theory is introduced into the interface multi-agent system (MAS) for industrial supervisory system. Taking advantages of rough set in data mining, a cooperation model for MAS is built. Rules...In this paper, rough set theory is introduced into the interface multi-agent system (MAS) for industrial supervisory system. Taking advantages of rough set in data mining, a cooperation model for MAS is built. Rules for avoiding cooperation conflict are deduced. An optimization algorithm is used to enhance security and real time attributes of the system. An application based on the proposed algorithm and rules are given.展开更多
The application of reinforcement learning is widely used by multi-agent systems in recent years. An agent uses a multi-agent system to cooperate with other agents to accomplish the given task, and one agent′s behavio...The application of reinforcement learning is widely used by multi-agent systems in recent years. An agent uses a multi-agent system to cooperate with other agents to accomplish the given task, and one agent′s behavior usually affects the others′ behaviors. In traditional reinforcement learning, one agent takes the others location, so it is difficult to consider the others′ behavior, which decreases the learning efficiency. This paper proposes multi-agent reinforcement learning with cooperation based on eligibility traces, i.e. one agent estimates the other agent′s behavior with the other agent′s eligibility traces. The results of this simulation prove the validity of the proposed learning method.展开更多
The cooperative control and stability analysis problems for the multi-agent system with sampled com- munication are investigated. Distributed state feedback controllers are adopted for the cooperation of networked age...The cooperative control and stability analysis problems for the multi-agent system with sampled com- munication are investigated. Distributed state feedback controllers are adopted for the cooperation of networked agents. A theorem in the form of linear matrix inequalities(LMI) is derived to analyze the system stability. An- other theorem in the form of optimization problem subject to LMI constraints is proposed to design the controller, and then the algorithm is presented. The simulation results verify the validity and the effectiveness of the pro- posed approach.展开更多
文摘Aiming at the flexible manufacturing system with multi-machining and multi-assembly equipment, a new scheduling algorithm is proposed to decompose the assembly structure of the products, thus obtaining simple scheduling problems and forming the cOrrespOnding agents. Then, the importance and the restriction of each agent are cOnsidered, to obtain an order of simple scheduling problems based on the cooperation game theory. With this order, the scheduling of sub-questions is implemented in term of rules, and the almost optimal scheduling results for meeting the restriction can be obtained. Experimental results verify the effectiveness of the proposed scheduling algorithm.
文摘In multi-agent systems, autonomous agents may form coalition to increase the efficiency of problem solving. But the current coalition algorithm is very complex, and cannot satisfy the condition of optimality and stableness simultaneously. To solve the problem, an algorithm that uses the mechanism of distribution according to work for coalition formation is presented, which can achieve global optimal and stable solution in subadditive task oriented domains. The validity of the algorithm is demonstrated by both experiments and theory.
文摘A general multi-agent architecture is proposed for intelligent decision support system (MAIDSS). The agent in MAIDSS is built based on an extension of BDI framework. Several agents form a team working together on a decision problem; several agent teams are defined to stand for the benefits of different people in the real world. The decision making process is based on multi-agent cooperation, and a logical framework for a team of agents cooperating to create the solution for the decision problem is discussed in detail.
文摘With the advent of sixth-generation mobile communications(6G),space-air-ground integrated networks have become mainstream.This paper focuses on collaborative scheduling for mobile edge computing(MEC)under a three-tier heterogeneous architecture composed of mobile devices,unmanned aerial vehicles(UAVs),and macro base stations(BSs).This scenario typically faces fast channel fading,dynamic computational loads,and energy constraints,whereas classical queuing-theoretic or convex-optimization approaches struggle to yield robust solutions in highly dynamic settings.To address this issue,we formulate a multi-agent Markov decision process(MDP)for an air-ground-fused MEC system,unify link selection,bandwidth/power allocation,and task offloading into a continuous action space and propose a joint scheduling strategy that is based on an improved MATD3 algorithm.The improvements include Alternating Layer Normalization(ALN)in the actor to suppress gradient variance,Residual Orthogonalization(RO)in the critic to reduce the correlation between the twin Q-value estimates,and a dynamic-temperature reward to enable adaptive trade-offs during training.On a multi-user,dual-link simulation platform,we conduct ablation and baseline comparisons.The results reveal that the proposed method has better convergence and stability.Compared with MADDPG,TD3,and DSAC,our algorithm achieves more robust performance across key metrics.
基金the Smart Grid Joint Fund of the National Natural Science Foundation of China(No.U2066209)the Science and Technology Project of the China Electric Power Research Institute(No.AI83-20-002).
文摘With the release of the electricity sales side,large-scale small-capacity distributed power generation units are connected to the distribution side,forming multi-type market entities such as microgrids,integrated energy systems,and virtual power plants.With the large-scale integration of distributed energy,the energy market under the energy internet is different from a traditional transmission grid.It is currently developing in the direction of diversified entities and commodities,a flat structure,and a flexible and competitive multi-agent market mechanism.In this context,this study analyzes the value of combining blockchain and the electricity market presents the design of a blockchain trading framework for multi-agent cooperation and sharing of the energy internet.The nodes in market transactions are modeled through power system modeling in the physical layer and the transaction consensus strategy in the cyber layer;moreover,the nodes are verified in a modified IEEE 13 testing feeder of a distribution network.A transaction example is demonstrated using the multi-agent cooperation and sharing transaction platform based on the Ethereum private blockchain.
文摘Educational cooperation,as the cornerstone and vanguard of people-to-people exchanges and practical cooperation between China and Africa,holds irreplaceable strategic significance for enhancing the quality of Africa's human capital and accelerating its structural transformation and modernization process.This paper employs“demand–supply–adaptation”as its core analytical framework and aims to systematically explore how the educational cooperation between China and Africa can serve Africa's goal of modernization more precisely and effectively.First,through reviewing and analyzing the domestic and international research literature,this paper clarifies the focus,paradigms,and shortcomings of existing research,identifying the knowledge contribution of this study.Second,utilizing detailed macro-level data,case studies,and comparative research methods,it comprehensively presents the multidimensional status,structural characteristics,and development trends of China–Africa educational cooperation in areas such as student exchanges,cooperative education,vocational and technical training,language and cultural exchange,and emerging digital education.This paper also deeply analyzes the pressing and specific demands placed on the education system by Africa's modernization development across key dimensions like economic diversification,industrialization,agricultural modernization,social governance upgrading,and digital transformation.Third,based on the cooperative principle of“Africa's needs and China's strengths,”this paper innovatively proposes systematic countermeasures and suggestions for constructing new,multi-level,high-quality,sustainable,and future-oriented pathways for China–Africa educational cooperation:(a)promoting the strategic focus of cooperation to extend from“hard infrastructure”support to empowering“soft infrastructure”;(b)deepening the integration of industry and education and school–enterprise collaboration to precisely align with Africa's industrial development needs;vigorously developing digital education and jointly building a smart education ecosystem to help Africa bridge the digital divide;(c)improving an evidence-based,third-party evaluation system for cooperative effectiveness and a full-process quality assurance system;and(d)promoting the collaborative participation of multiple actors including governments,schools,enterprises,think tanks,and social organizations to build a new cooperative pattern of coconstruction,co-governance,and shared benefits.
文摘Experts and officials shared their insights on poverty reduction cooperation and sustainable development during the 2025 International Seminar on Global Poverty Reduction Partnerships.
基金The National Natural Science Foundation of China(62136008,62293541)The Beijing Natural Science Foundation(4232056)The Beijing Nova Program(20240484514).
文摘Cooperative multi-agent reinforcement learning(MARL)is a key technology for enabling cooperation in complex multi-agent systems.It has achieved remarkable progress in areas such as gaming,autonomous driving,and multi-robot control.Empowering cooperative MARL with multi-task decision-making capabilities is expected to further broaden its application scope.In multi-task scenarios,cooperative MARL algorithms need to address 3 types of multi-task problems:reward-related multi-task,arising from different reward functions;multi-domain multi-task,caused by differences in state and action spaces,state transition functions;and scalability-related multi-task,resulting from the dynamic variation in the number of agents.Most existing studies focus on scalability-related multitask problems.However,with the increasing integration between large language models(LLMs)and multi-agent systems,a growing number of LLM-based multi-agent systems have emerged,enabling more complex multi-task cooperation.This paper provides a comprehensive review of the latest advances in this field.By combining multi-task reinforcement learning with cooperative MARL,we categorize and analyze the 3 major types of multi-task problems under multi-agent settings,offering more fine-grained classifications and summarizing key insights for each.In addition,we summarize commonly used benchmarks and discuss future directions of research in this area,which hold promise for further enhancing the multi-task cooperation capabilities of multi-agent systems and expanding their practical applications in the real world.
文摘China provides a compelling reference for how governments can promote global climate governance and sustainable innovative development.SENIOR government officials,technical experts,and policy researchers from 14 developing countries in Asia,Africa,Latin America,and Pacific Island countries gathered at Tsinghua University(THU),Beijing,on September 1-13,2025 for in-depth study and exchange on“climate finance and low-carbon transformation.”
基金funded by the National Social Science Fund of China(Grant No.20CFX083)。
文摘International investment constitutes a key dimension of high-quality cooperation under the Belt and Road Initiative.However,affected by unilateralism,protectionist measures,and growing regulatory uncertainty,it faces increasingly complex and diverse risks.Deepening the rule of law cooperation based on an international order governed by international law is not only a fundamental way to mitigate investment risks associated with the Belt and Road Initiative but also an essential requirement for building a more just and reasonable system of global economic governance.At present,investment rule of law cooperation under the Belt and Road Initiative faces three main challenges:limited legal cooperation capacity among some participating countries at the subject level,insufficient coordination between hard law and soft law at the normative level,and relatively weak regional enforcement and dispute-prevention mechanisms at the implementation level.To advance regional investment rule of law cooperation and promote the high-quality development of the Belt and Road Initiative,China should foster consultative governance on investment rule of law cooperation among partner countries,comprehensively upgrade hard law and soft law instruments regulating investment,and strengthen enforcement and dispute settlement capacity,thereby contributing a more just,reasonable,and inclusive model of investment governance to the world.
基金Supported by National Natural Science Foundation of China(Grant Nos.52225212,52272418,U22A20100)National Key Research and Development Program of China(Grant No.2022YFB2503302).
文摘The development of chassis active safety control technology has improved vehicle stability under extreme conditions.However,its cross-system and multi-functional characteristics make the controller difficult to achieve cooperative goals.In addition,the chassis system,which has high complexity,numerous subsystems,and strong coupling,will also lead to low computing efficiency and poor control effect of the controller.Therefore,this paper proposes a scenario-driven hybrid distributed model predictive control algorithm with variable control topology.This algorithm divides multiple stability regions based on the vehicle’s β−γ phase plane,forming a mapping relationship between the control structure and the vehicle’s state.A control input fusion mechanism within the transition domain is designed to mitigate the problems of system state oscillation and control input jitter caused by switching control structures.Then,a distributed state-space equation with state coupling and input coupling characteristics is constructed,and a weighted local agent cost function in quadratic programming is derived.Through cost coupling,local agents can coordinate global performance goals.Finally,through Simulink/CarSim joint simulation and hardware-in-the-loop(HIL)test,the proposed algorithm is validated to improve vehicle stability while ensuring trajectory tracking accuracy and has good applicability for multi-objective coordinated control.This paper combines the advantages of distributed MPC and decentralized MPC,achieving a balance between approximating the global optimal results and the solution’s efficiency.
基金supported by the National Research and Development Program of China under Grant JCKY2018607C019in part by the Key Laboratory Fund of UAV of Northwestern Polytechnical University under Grant 2021JCJQLB0710L.
文摘This paper proposes a Multi-Agent Attention Proximal Policy Optimization(MA2PPO)algorithm aiming at the problems such as credit assignment,low collaboration efficiency and weak strategy generalization ability existing in the cooperative pursuit tasks of multiple unmanned aerial vehicles(UAVs).Traditional algorithms often fail to effectively identify critical cooperative relationships in such tasks,leading to low capture efficiency and a significant decline in performance when the scale expands.To tackle these issues,based on the proximal policy optimization(PPO)algorithm,MA2PPO adopts the centralized training with decentralized execution(CTDE)framework and introduces a dynamic decoupling mechanism,that is,sharing the multi-head attention(MHA)mechanism for critics during centralized training to solve the credit assignment problem.This method enables the pursuers to identify highly correlated interactions with their teammates,effectively eliminate irrelevant and weakly relevant interactions,and decompose large-scale cooperation problems into decoupled sub-problems,thereby enhancing the collaborative efficiency and policy stability among multiple agents.Furthermore,a reward function has been devised to facilitate the pursuers to encircle the escapee by combining a formation reward with a distance reward,which incentivizes UAVs to develop sophisticated cooperative pursuit strategies.Experimental results demonstrate the effectiveness of the proposed algorithm in achieving multi-UAV cooperative pursuit and inducing diverse cooperative pursuit behaviors among UAVs.Moreover,experiments on scalability have demonstrated that the algorithm is suitable for large-scale multi-UAV systems.
基金supported in part by the National Natural Science Foundation of China under Grants 62001225,62071236,62071234 and U22A2002in part by the Major Science and Technology plan of Hainan Province under Grant ZDKJ2021022+1 种基金in part by the Scientific Research Fund Project of Hainan University under Grant KYQD(ZR)-21008in part by the Key Technologies R&D Program of Jiangsu(Prospective and Key Technologies for Industry)under Grants BE2023022 and BE2023022-2.
文摘The Internet of Unmanned Aerial Vehicles(I-UAVs)is expected to execute latency-sensitive tasks,but limited by co-channel interference and malicious jamming.In the face of unknown prior environmental knowledge,defending against jamming and interference through spectrum allocation becomes challenging,especially when each UAV pair makes decisions independently.In this paper,we propose a cooperative multi-agent reinforcement learning(MARL)-based anti-jamming framework for I-UAVs,enabling UAV pairs to learn their own policies cooperatively.Specifically,we first model the problem as a modelfree multi-agent Markov decision process(MAMDP)to maximize the long-term expected system throughput.Then,for improving the exploration of the optimal policy,we resort to optimizing a MARL objective function with a mutual-information(MI)regularizer between states and actions,which can dynamically assign the probability for actions frequently used by the optimal policy.Next,through sharing their current channel selections and local learning experience(their soft Q-values),the UAV pairs can learn their own policies cooperatively relying on only preceding observed information and predicting others’actions.Our simulation results show that for both sweep jamming and Markov jamming patterns,the proposed scheme outperforms the benchmarkers in terms of throughput,convergence and stability for different numbers of jammers,channels and UAV pairs.
基金supported by the Beijing Municipal Science&Technology Commission China(No.Z19111000270000)the National Natural Science Foundation of China(Nos.62203050,51774042).
文摘Within the context of ground-air cooperation,the distributed formation trajectory tracking control problems for the Heterogeneous Multi-Agent Systems(HMASs)is studied.First,considering external disturbances and model uncertainties,a graph theory-based formation control protocol is designed for the HMASs consisting of Unmanned Aerial Vehicles(UAVs)and Unmanned Ground Vehicles(UGVs).Subsequently,a formation trajectory tracking control strategy employing adaptive Fractional-Order Sliding Mode Control(FOSMC)method is developed,and a Feedback Multilayer Fuzzy Neural Network(FMFNN)is introduced to estimate the lumped uncertainties.This approach empowers HMASs to adaptively follow the expected trajectory and adopt the designated formation configuration,even in the presence of various uncertainties.Additionally,an event-triggered mechanism is incorporated into the controller to reduce the update frequency of the controller and minimize the communication exchange among the agents,and the absence of Zeno behavior is rigorously demonstrated by an integral inequality analysis.Finally,to confirm the effectiveness of the proposed formation control protocol,some numerical simulations are presented.
基金Supported by the National Natural Science Foun-dation of China (60303025 )and the Natural Science Foundation ofJiangsu Province for Youth Scholar (BK2004411)
文摘The accomplishment of a complex problem usually involves cooperation between participators with different knowledge background concerned. This paper identifies interdependency between different sub problems (through problem decomposition) as the major factor that influences cooperative relations in multi-Agent systems, based on which we propose an efficient means to measure cooperation coefficient (degree) between different Agents. Then cognitive cooperation between Agents is analyzed which aims at collecting the wisdom of the cognitive community for a systematic solution to the overall problem.
基金Project supported by Science Foundation of Shanghai MunicipalCommission of Science and Technology (Grant Nos .025111052 ,04JC14038)
文摘In this paper, rough set theory is introduced into the interface multi-agent system (MAS) for industrial supervisory system. Taking advantages of rough set in data mining, a cooperation model for MAS is built. Rules for avoiding cooperation conflict are deduced. An optimization algorithm is used to enhance security and real time attributes of the system. An application based on the proposed algorithm and rules are given.
文摘The application of reinforcement learning is widely used by multi-agent systems in recent years. An agent uses a multi-agent system to cooperate with other agents to accomplish the given task, and one agent′s behavior usually affects the others′ behaviors. In traditional reinforcement learning, one agent takes the others location, so it is difficult to consider the others′ behavior, which decreases the learning efficiency. This paper proposes multi-agent reinforcement learning with cooperation based on eligibility traces, i.e. one agent estimates the other agent′s behavior with the other agent′s eligibility traces. The results of this simulation prove the validity of the proposed learning method.
基金Supported by the National Natural Science Foundation of China(91016017)the National Aviation Found of China(20115868009)~~
文摘The cooperative control and stability analysis problems for the multi-agent system with sampled com- munication are investigated. Distributed state feedback controllers are adopted for the cooperation of networked agents. A theorem in the form of linear matrix inequalities(LMI) is derived to analyze the system stability. An- other theorem in the form of optimization problem subject to LMI constraints is proposed to design the controller, and then the algorithm is presented. The simulation results verify the validity and the effectiveness of the pro- posed approach.