The integration of substantial renewable energy and controllable resources disrupts the supply-demand balance in distribution grids.Secure operations are dependent on the participation of user-side resources in demand...The integration of substantial renewable energy and controllable resources disrupts the supply-demand balance in distribution grids.Secure operations are dependent on the participation of user-side resources in demand response at both the day-ahead and intraday levels.Current studies typically overlook the spatial--temporal variations and coordination between these timescales,leading to significant day-ahead optimization errors,high intraday costs,and slow convergence.To address these challenges,we developed a multiagent,multitimescale aggregated regulation method for spatial--temporal coordinated demand response of user-side resources.Firstly,we established a framework considering the spatial--temporal coordinated characteristics of user-side resources with the objective to min-imize the total regulation cost and weighted sum of distribution grid losses.The optimization problem was then solved for two different timescales:day-ahead and intraday.For the day-ahead timescale,we developed an improved particle swarm optimization(IPSO)algo-rithm that dynamically adjusts the number of particles based on intraday outcomes to optimize the regulation strategies.For the intraday timescale,we developed an improved alternating direction method of multipliers(IADMM)algorithm that distributes tasks across edge distribution stations,dynamically adjusting penalty factors by using historical day-ahead data to synchronize the regulations and enhance precision.The simulation results indicate that this method can fully achieve multitimescale spatial--temporal coordinated aggregated reg-ulation between day-ahead and intraday,effectively reduce the total regulation cost and distribution grid losses,and enhance smart grid resilience.展开更多
The increasing adoption of unmanned aerial vehicles(UAVs)in urban low-altitude logistics systems,particularly for time-sensitive applications like parcel delivery and supply distribution,necessitates sophisticated coo...The increasing adoption of unmanned aerial vehicles(UAVs)in urban low-altitude logistics systems,particularly for time-sensitive applications like parcel delivery and supply distribution,necessitates sophisticated coordination mechanisms to optimize operational efficiency.However,the limited capability of UAVs to extract stateaction information in complex environments poses significant challenges to achieving effective cooperation in dynamic and uncertain scenarios.To address this,we presents an Improved Multi-Agent Hybrid Attention Critic(IMAHAC)framework that advances multi-agent deep reinforcement learning(MADRL)through two key innovations.Firstly,a Temporal Difference Error and Time-based Prioritized Experience Replay(TT-PER)mechanism that dynamically adjusts sample weights based on temporal relevance and prediction error magnitude,effectively reducing the interference from obsolete collaborative experiences while maintaining training stability.Secondly,a hybrid attention mechanism is developed,integrating a sensor fusion layer—which aggregates features from multi-sensor data to enhance decision-making—and a dissimilarity layer that evaluates the similarity between key-value pairs and query values.By combining this hybrid attention mechanism with theMulti-Actor Attention Critic(MAAC)framework,our approach strengthens UAVs’capability to extract critical state-action features in diverse environments.Comprehensive simulations in urban air mobility scenarios demonstrate IMAHAC’s superiority over conventional MADRL baselines and MAAC,achieving higher cumulative rewards,fewer collisions,and enhanced cooperative capabilities.This work provides both algorithmic advancements and empirical validation for developing robust autonomous aerial systems in smart city infrastructures.展开更多
In this paper, distributed event-triggered performance constraint control is proposed for Heterogeneous Multiagent Systems (HMASs) including quadrotor unmanned aerial vehicles and unmanned ground vehicles in the prese...In this paper, distributed event-triggered performance constraint control is proposed for Heterogeneous Multiagent Systems (HMASs) including quadrotor unmanned aerial vehicles and unmanned ground vehicles in the presence of unknown external disturbances. To tackle the problem of different dynamic characteristics and facilitate the controller design, the virtual variable is introduced in the z axis of the nonlinear model of unmanned ground vehicles. By using this approach, a universal model is established for the HMAS. Moreover, a distributed disturbance observer is established to cope with the adverse influence of the external disturbances. Then, an Appointed-Time Prescribed Performance Function (ATPPF) is designed to restrict the tracking error in the predefined regions. On this basis, the distributed performance constraint controller is proposed for the HMAS based on the ATPPF and the distributed disturbance observer. Furthermore, the improved event-triggered mechanism is proposed with a dynamic threshold, which depends on the distance between the tracking error and the boundary of the ATPPF. Finally, the effectiveness of the proposed control method is verified by the comparative experiments on an HMAS.展开更多
Dear Editor,This letter studies output consensus problem of heterogeneous linear multiagent systems over directed graphs. A novel adaptive dynamic event-triggered controller is presented based only on the feedback com...Dear Editor,This letter studies output consensus problem of heterogeneous linear multiagent systems over directed graphs. A novel adaptive dynamic event-triggered controller is presented based only on the feedback combination of the agent's own state and neighbors' output,which can achieve exponential output consensus through intermittent communication. The controller is obtained by solving two linear matrix equations, and Zeno behavior is excluded.展开更多
In this paper, the containment control problem in nonlinear multi-agent systems(NMASs) under denial-of-service(DoS) attacks is addressed. Firstly, a prediction model is obtained using the broad learning technique to t...In this paper, the containment control problem in nonlinear multi-agent systems(NMASs) under denial-of-service(DoS) attacks is addressed. Firstly, a prediction model is obtained using the broad learning technique to train historical data generated by the system offline without DoS attacks. Secondly, the dynamic linearization method is used to obtain the equivalent linearization model of NMASs. Then, a novel model-free adaptive predictive control(MFAPC) framework based on historical and online data generated by the system is proposed, which combines the trained prediction model with the model-free adaptive control method. The development of the MFAPC method motivates a much simpler robust predictive control solution that is convenient to use in the case of DoS attacks. Meanwhile, the MFAPC algorithm provides a unified predictive framework for solving consensus tracking and containment control problems. The boundedness of the containment error can be proven by using the contraction mapping principle and the mathematical induction method. Finally, the proposed MFAPC is assessed through comparative experiments.展开更多
In this paper,fixed-time consensus tracking for mul-tiagent systems(MASs)with dynamics in the form of strict feed-back affine nonlinearity is addressed.A fixed-time antidistur-bance consensus tracking protocol is prop...In this paper,fixed-time consensus tracking for mul-tiagent systems(MASs)with dynamics in the form of strict feed-back affine nonlinearity is addressed.A fixed-time antidistur-bance consensus tracking protocol is proposed,which consists of a distributed fixed-time observer,a fixed-time disturbance observer,a nonsmooth antidisturbance backstepping controller,and the fixed-time stability analysis is conducted by using the Lyapunov theory correspondingly.This paper includes three main improvements.First,a distributed fixed-time observer is developed for each follower to obtain an estimate of the leader’s output by utilizing the topology of the communication network.Second,a fixed-time disturbance observer is given to estimate the lumped disturbances for feedforward compensation.Finally,a nonsmooth antidisturbance backstepping tracking controller with feedforward compensation for lumped disturbances is designed.In order to mitigate the“explosion of complexity”in the tradi-tional backstepping approach,we have implemented a modified nonsmooth command filter to enhance the performance of the closed-loop system.The simulation results show that the pro-posed method is effective.展开更多
In this paper,a class of time-varying output group formation containment control problem of general linear hetero-geneous multiagent systems(MASs)is investigated under directed topology.The MAS is composed of a number...In this paper,a class of time-varying output group formation containment control problem of general linear hetero-geneous multiagent systems(MASs)is investigated under directed topology.The MAS is composed of a number of tracking leaders,formation leaders and followers,where two different types of leaders are used to provide reference trajectories for movement and to achieve certain formations,respectively.Firstly,compen-sators are designed whose states are estimations of tracking lead-ers,based on which,a controller is developed for each formation leader to accomplish the expected formation.Secondly,two event-triggered compensators are proposed for each follower to evalu-ate the state and formation information of the formation leaders in the same group,respectively.Subsequently,a control protocol is designed for each follower,utilizing the output information,to guide the output towards the convex hull generated by the forma-tion leaders within the group.Next,the triggering sequence in this paper is decomposed into two sequences,and the inter-event intervals of these two triggering conditions are provided to rule out the Zeno behavior.Finally,a numerical simulation is intro-duced to confirm the validity of the proposed results.展开更多
Using the differences and complementarities between human intelligence and artificial intelligence(AI),a hybrid-augmented intelligence,that is both stronger than human intelligence and AI,is created through Human-AI C...Using the differences and complementarities between human intelligence and artificial intelligence(AI),a hybrid-augmented intelligence,that is both stronger than human intelligence and AI,is created through Human-AI Cooperation(HAC)for teaching and learning.Human-AI Cooperation is infiltrating into all links of education,and recent research has focused a lot on the impact of teaching,learning,management,and evaluation with Human-AI Cooperation.However,AI still has its limits of intelligence,and cannot cooperate as humans.Thus,it is critical to study the obstacles of Human-AI Cooperation in education,as AI plays a role as a partner,not a tool.This study discussed for the first time how teachers and AI cooperate based on Multiple Intelligences of AI proposed by Andrzej Cichocki and puts forward a new Human-AI Cooperation teaching mode:human in the loop and teaching as leadership.It is proposed that humans in the loop and teaching as leadership can solve the problem that AI cannot cope with complex and dynamic teaching tasks in open situations,as well as the limits of intelligence for AI.展开更多
The static nature of cyber defense systems gives attackers a sufficient amount of time to explore and further exploit the vulnerabilities of information technology systems.In this paper,we investigate a problem where ...The static nature of cyber defense systems gives attackers a sufficient amount of time to explore and further exploit the vulnerabilities of information technology systems.In this paper,we investigate a problem where multiagent sys-tems sensing and acting in an environment contribute to adaptive cyber defense.We present a learning strategy that enables multiple agents to learn optimal poli-cies using multiagent reinforcement learning(MARL).Our proposed approach is inspired by the multiarmed bandits(MAB)learning technique for multiple agents to cooperate in decision making or to work independently.We study a MAB approach in which defenders visit a system multiple times in an alternating fash-ion to maximize their rewards and protect their system.We find that this game can be modeled from an individual player’s perspective as a restless MAB problem.We discover further results when the MAB takes the form of a pure birth process,such as a myopic optimal policy,as well as providing environments that offer the necessary incentives required for cooperation in multiplayer projects.展开更多
基金supported by Science and Technology Program of China Southern Power Grid Corporation under grant number 036000KK52222004(GDKJXM20222117)National Key R&D Program of China for International S&T Cooperation Projects(2019YFE0118700).
文摘The integration of substantial renewable energy and controllable resources disrupts the supply-demand balance in distribution grids.Secure operations are dependent on the participation of user-side resources in demand response at both the day-ahead and intraday levels.Current studies typically overlook the spatial--temporal variations and coordination between these timescales,leading to significant day-ahead optimization errors,high intraday costs,and slow convergence.To address these challenges,we developed a multiagent,multitimescale aggregated regulation method for spatial--temporal coordinated demand response of user-side resources.Firstly,we established a framework considering the spatial--temporal coordinated characteristics of user-side resources with the objective to min-imize the total regulation cost and weighted sum of distribution grid losses.The optimization problem was then solved for two different timescales:day-ahead and intraday.For the day-ahead timescale,we developed an improved particle swarm optimization(IPSO)algo-rithm that dynamically adjusts the number of particles based on intraday outcomes to optimize the regulation strategies.For the intraday timescale,we developed an improved alternating direction method of multipliers(IADMM)algorithm that distributes tasks across edge distribution stations,dynamically adjusting penalty factors by using historical day-ahead data to synchronize the regulations and enhance precision.The simulation results indicate that this method can fully achieve multitimescale spatial--temporal coordinated aggregated reg-ulation between day-ahead and intraday,effectively reduce the total regulation cost and distribution grid losses,and enhance smart grid resilience.
基金supported by theHubei Provincial Technology Innovation Special Project and the Natural Science Foundation of Hubei Province under Grants 2023BEB024,2024AFC066,respectively.
文摘The increasing adoption of unmanned aerial vehicles(UAVs)in urban low-altitude logistics systems,particularly for time-sensitive applications like parcel delivery and supply distribution,necessitates sophisticated coordination mechanisms to optimize operational efficiency.However,the limited capability of UAVs to extract stateaction information in complex environments poses significant challenges to achieving effective cooperation in dynamic and uncertain scenarios.To address this,we presents an Improved Multi-Agent Hybrid Attention Critic(IMAHAC)framework that advances multi-agent deep reinforcement learning(MADRL)through two key innovations.Firstly,a Temporal Difference Error and Time-based Prioritized Experience Replay(TT-PER)mechanism that dynamically adjusts sample weights based on temporal relevance and prediction error magnitude,effectively reducing the interference from obsolete collaborative experiences while maintaining training stability.Secondly,a hybrid attention mechanism is developed,integrating a sensor fusion layer—which aggregates features from multi-sensor data to enhance decision-making—and a dissimilarity layer that evaluates the similarity between key-value pairs and query values.By combining this hybrid attention mechanism with theMulti-Actor Attention Critic(MAAC)framework,our approach strengthens UAVs’capability to extract critical state-action features in diverse environments.Comprehensive simulations in urban air mobility scenarios demonstrate IMAHAC’s superiority over conventional MADRL baselines and MAAC,achieving higher cumulative rewards,fewer collisions,and enhanced cooperative capabilities.This work provides both algorithmic advancements and empirical validation for developing robust autonomous aerial systems in smart city infrastructures.
基金supported in part by the National Natural Science Foundation of China(Nos.U23B2036,U2013201).
文摘In this paper, distributed event-triggered performance constraint control is proposed for Heterogeneous Multiagent Systems (HMASs) including quadrotor unmanned aerial vehicles and unmanned ground vehicles in the presence of unknown external disturbances. To tackle the problem of different dynamic characteristics and facilitate the controller design, the virtual variable is introduced in the z axis of the nonlinear model of unmanned ground vehicles. By using this approach, a universal model is established for the HMAS. Moreover, a distributed disturbance observer is established to cope with the adverse influence of the external disturbances. Then, an Appointed-Time Prescribed Performance Function (ATPPF) is designed to restrict the tracking error in the predefined regions. On this basis, the distributed performance constraint controller is proposed for the HMAS based on the ATPPF and the distributed disturbance observer. Furthermore, the improved event-triggered mechanism is proposed with a dynamic threshold, which depends on the distance between the tracking error and the boundary of the ATPPF. Finally, the effectiveness of the proposed control method is verified by the comparative experiments on an HMAS.
基金supported by the National Science and Technology Innovation 2030-Major Program(2022ZD 0115403)the National Natural Science Foundation of China(61991414)+1 种基金Chongqing Natural Science Foundation(CSTB2023NSCQJQX0018)Beijing Natural Science Foundation(L221005)
文摘Dear Editor,This letter studies output consensus problem of heterogeneous linear multiagent systems over directed graphs. A novel adaptive dynamic event-triggered controller is presented based only on the feedback combination of the agent's own state and neighbors' output,which can achieve exponential output consensus through intermittent communication. The controller is obtained by solving two linear matrix equations, and Zeno behavior is excluded.
基金supported in part by the National Natural Science Foundation of China(62403396,62433018,62373113)the Guangdong Basic and Applied Basic Research Foundation(2023A1515011527,2023B1515120010)the Postdoctoral Fellowship Program of CPSF(GZB20240621)
文摘In this paper, the containment control problem in nonlinear multi-agent systems(NMASs) under denial-of-service(DoS) attacks is addressed. Firstly, a prediction model is obtained using the broad learning technique to train historical data generated by the system offline without DoS attacks. Secondly, the dynamic linearization method is used to obtain the equivalent linearization model of NMASs. Then, a novel model-free adaptive predictive control(MFAPC) framework based on historical and online data generated by the system is proposed, which combines the trained prediction model with the model-free adaptive control method. The development of the MFAPC method motivates a much simpler robust predictive control solution that is convenient to use in the case of DoS attacks. Meanwhile, the MFAPC algorithm provides a unified predictive framework for solving consensus tracking and containment control problems. The boundedness of the containment error can be proven by using the contraction mapping principle and the mathematical induction method. Finally, the proposed MFAPC is assessed through comparative experiments.
基金supported by the National Defense Basic Scientific Research Project(JCKY2020130C025)the National Science and Technology Major Project(J2019-III-0020-0064,J2019-V-0014-0109)。
文摘In this paper,fixed-time consensus tracking for mul-tiagent systems(MASs)with dynamics in the form of strict feed-back affine nonlinearity is addressed.A fixed-time antidistur-bance consensus tracking protocol is proposed,which consists of a distributed fixed-time observer,a fixed-time disturbance observer,a nonsmooth antidisturbance backstepping controller,and the fixed-time stability analysis is conducted by using the Lyapunov theory correspondingly.This paper includes three main improvements.First,a distributed fixed-time observer is developed for each follower to obtain an estimate of the leader’s output by utilizing the topology of the communication network.Second,a fixed-time disturbance observer is given to estimate the lumped disturbances for feedforward compensation.Finally,a nonsmooth antidisturbance backstepping tracking controller with feedforward compensation for lumped disturbances is designed.In order to mitigate the“explosion of complexity”in the tradi-tional backstepping approach,we have implemented a modified nonsmooth command filter to enhance the performance of the closed-loop system.The simulation results show that the pro-posed method is effective.
基金supported in part by the National Key Research and Development Program of China(2018YFA0702200)the National Natural Science Foundation of China(52377079,62203097,62373196)。
文摘In this paper,a class of time-varying output group formation containment control problem of general linear hetero-geneous multiagent systems(MASs)is investigated under directed topology.The MAS is composed of a number of tracking leaders,formation leaders and followers,where two different types of leaders are used to provide reference trajectories for movement and to achieve certain formations,respectively.Firstly,compen-sators are designed whose states are estimations of tracking lead-ers,based on which,a controller is developed for each formation leader to accomplish the expected formation.Secondly,two event-triggered compensators are proposed for each follower to evalu-ate the state and formation information of the formation leaders in the same group,respectively.Subsequently,a control protocol is designed for each follower,utilizing the output information,to guide the output towards the convex hull generated by the forma-tion leaders within the group.Next,the triggering sequence in this paper is decomposed into two sequences,and the inter-event intervals of these two triggering conditions are provided to rule out the Zeno behavior.Finally,a numerical simulation is intro-duced to confirm the validity of the proposed results.
基金This research was supported by"Zhejiang Soft Science Research Program,Grant no:2021C35016".
文摘Using the differences and complementarities between human intelligence and artificial intelligence(AI),a hybrid-augmented intelligence,that is both stronger than human intelligence and AI,is created through Human-AI Cooperation(HAC)for teaching and learning.Human-AI Cooperation is infiltrating into all links of education,and recent research has focused a lot on the impact of teaching,learning,management,and evaluation with Human-AI Cooperation.However,AI still has its limits of intelligence,and cannot cooperate as humans.Thus,it is critical to study the obstacles of Human-AI Cooperation in education,as AI plays a role as a partner,not a tool.This study discussed for the first time how teachers and AI cooperate based on Multiple Intelligences of AI proposed by Andrzej Cichocki and puts forward a new Human-AI Cooperation teaching mode:human in the loop and teaching as leadership.It is proposed that humans in the loop and teaching as leadership can solve the problem that AI cannot cope with complex and dynamic teaching tasks in open situations,as well as the limits of intelligence for AI.
基金This work is funded by the Deanship of Scientific Research(DSR)the University of Jeddah,under Grant No.(UJ-22-DR-1).
文摘The static nature of cyber defense systems gives attackers a sufficient amount of time to explore and further exploit the vulnerabilities of information technology systems.In this paper,we investigate a problem where multiagent sys-tems sensing and acting in an environment contribute to adaptive cyber defense.We present a learning strategy that enables multiple agents to learn optimal poli-cies using multiagent reinforcement learning(MARL).Our proposed approach is inspired by the multiarmed bandits(MAB)learning technique for multiple agents to cooperate in decision making or to work independently.We study a MAB approach in which defenders visit a system multiple times in an alternating fash-ion to maximize their rewards and protect their system.We find that this game can be modeled from an individual player’s perspective as a restless MAB problem.We discover further results when the MAB takes the form of a pure birth process,such as a myopic optimal policy,as well as providing environments that offer the necessary incentives required for cooperation in multiplayer projects.