Multi-agent technology has been used in many complex distributed and concurrent systems. A railway system is such a safety critical system and careful inves- tigation of the functional components is very important. St...Multi-agent technology has been used in many complex distributed and concurrent systems. A railway system is such a safety critical system and careful inves- tigation of the functional components is very important. Study of the various functional components in communi- cation-based train control (CBTC) system necessitates a good structural design followed by its validation and ver- ification through a formal modelling technique. The work presented here is the follow up of our multi-agent-based CBTC system for Indian railway designed using the methodology for engineering system of software agents. Behavioural analysis of the designed system involves several operating scenarios that arise during train run, and helps in understanding the reaction of the system to such situations. This validation and verification are very important as it allows the system designer to critically evaluate the desired function of the system and to correct the design errors, if any, before its actual implementation. Modelling, validation and verification of the structural design through Coloured petri net (CPN) are central to this paper. Analysis of simulation results validates the efficacy of the design.展开更多
As an important mechanism in multi-agent interaction,communication can make agents form complex team relationships rather than constitute a simple set of multiple independent agents.However,the existing communication ...As an important mechanism in multi-agent interaction,communication can make agents form complex team relationships rather than constitute a simple set of multiple independent agents.However,the existing communication schemes can bring much timing redundancy and irrelevant messages,which seriously affects their practical application.To solve this problem,this paper proposes a targeted multiagent communication algorithm based on state control(SCTC).The SCTC uses a gating mechanism based on state control to reduce the timing redundancy of communication between agents and determines the interaction relationship between agents and the importance weight of a communication message through a series connection of hard-and self-attention mechanisms,realizing targeted communication message processing.In addition,by minimizing the difference between the fusion message generated from a real communication message of each agent and a fusion message generated from the buffered message,the correctness of the final action choice of the agent is ensured.Our evaluation using a challenging set of Star Craft II benchmarks indicates that the SCTC can significantly improve the learning performance and reduce the communication overhead between agents,thus ensuring better cooperation between agents.展开更多
Due to the characteristics of line-of-sight(LoS)communication in unmanned aerial vehicle(UAV)networks,these systems are highly susceptible to eavesdropping and surveillance.To effectively address the security concerns...Due to the characteristics of line-of-sight(LoS)communication in unmanned aerial vehicle(UAV)networks,these systems are highly susceptible to eavesdropping and surveillance.To effectively address the security concerns in UAV communication,covert communication methods have been adopted.This paper explores the joint optimization problem of trajectory and transmission power in a multi-hop UAV relay covert communication system.Considering the communication covertness,power constraints,and trajectory limitations,an algorithm based on multi-agent proximal policy optimization(MAPPO),named covert-MAPPO(C-MAPPO),is proposed.The proposed method leverages the strengths of both optimization algorithms and reinforcement learning to analyze and make joint decisions on the transmission power and flight trajectory strategies for UAVs to achieve cooperation.Simulation results demonstrate that the proposed method can maximize the system throughput while satisfying covertness constraints,and it outperforms benchmark algorithms in terms of system throughput and reward convergence speed.展开更多
This paper studies the problem of jamming decision-making for dynamic multiple communication links in wireless communication networks(WCNs).We propose a novel jamming channel allocation and power decision-making(JCAPD...This paper studies the problem of jamming decision-making for dynamic multiple communication links in wireless communication networks(WCNs).We propose a novel jamming channel allocation and power decision-making(JCAPD)approach based on multi-agent deep reinforcement learning(MADRL).In high-dynamic and multi-target aviation communication environments,the rapid changes in channels make it difficult for sensors to accurately capture instantaneous channel state information.This poses a challenge to make centralized jamming decisions with single-agent deep reinforcement learning(DRL)approaches.In response,we design a distributed multi-agent decision architecture(DMADA).We formulate multi-jammer resource allocation as a multiagent Markov decision process(MDP)and propose a fingerprint-based double deep Q-Network(FBDDQN)algorithm for solving it.Each jammer functions as an agent that interacts with the environment in this framework.Through the design of a reasonable reward and training mechanism,our approach enables jammers to achieve distributed cooperation,significantly improving the jamming success rate while considering jamming power cost,and reducing the transmission rate of links.Our experimental results show the FBDDQN algorithm is superior to the baseline methods.展开更多
Dear Editor,This letter is concerned with the problem of time-varying formation tracking for heterogeneous multi-agent systems(MASs) under directed switching networks. For this purpose, our first step is to present so...Dear Editor,This letter is concerned with the problem of time-varying formation tracking for heterogeneous multi-agent systems(MASs) under directed switching networks. For this purpose, our first step is to present some sufficient conditions for the exponential stability of a particular category of switched systems.展开更多
The cooperative control and stability analysis problems for the multi-agent system with sampled com- munication are investigated. Distributed state feedback controllers are adopted for the cooperation of networked age...The cooperative control and stability analysis problems for the multi-agent system with sampled com- munication are investigated. Distributed state feedback controllers are adopted for the cooperation of networked agents. A theorem in the form of linear matrix inequalities(LMI) is derived to analyze the system stability. An- other theorem in the form of optimization problem subject to LMI constraints is proposed to design the controller, and then the algorithm is presented. The simulation results verify the validity and the effectiveness of the pro- posed approach.展开更多
In this paper, a distributed control scheme has been developed for consensus of single integrator multi-agent systems with directed fixed communication topology for arbitrarily large constant, time-varying or distribu...In this paper, a distributed control scheme has been developed for consensus of single integrator multi-agent systems with directed fixed communication topology for arbitrarily large constant, time-varying or distributed communication delays. It is proved that the closed loop control system can reach consensus with an exponential convergence rate if and only if the topology is quasi-strongly connected. Simulation results are also provided to demonstrate the effectiveness of the proposed controller.展开更多
To solve the dynamical consensus problem of second-order multi-agent systems with communication delay,delay-dependent compensations are added into the normal asynchronously-coupled consensus algorithm so as to make th...To solve the dynamical consensus problem of second-order multi-agent systems with communication delay,delay-dependent compensations are added into the normal asynchronously-coupled consensus algorithm so as to make the agents achieve a dynamical consensus. Based on frequency-domain analysis, sufficient conditions are gained for second-order multi-agent systems with communication delay under leaderless and leader-following consensus algorithms respectively. Simulation illustrates the correctness of the results.展开更多
Leader-following stationary consensus problem is investigated for the second-order multi-agent systems with timevarying communication delay and switching topology. Based on Lyapunov-Krasovskii functional and Lyapunov-...Leader-following stationary consensus problem is investigated for the second-order multi-agent systems with timevarying communication delay and switching topology. Based on Lyapunov-Krasovskii functional and Lyapunov-Razumikhin functions respectively, consensus criterions in the form of linear matrix inequality (LMI) are obtained for the system with time-varying communication delays under static interconnection topology con- verging to the leader's states. Moreover, the delay-dependent consensus criterion in the form of LMI is also obtained for the system with time-invariant communication delay and switching topologies by constructing Lyapunov-Krasovskii functional. Numerical simulations present the correctness of the results.展开更多
This paper studies the connectivity-maintaining consensus of multi-agent systems.Considering the impact of the sensing ranges of agents for connectivity and communication energy consumption,a novel communication manag...This paper studies the connectivity-maintaining consensus of multi-agent systems.Considering the impact of the sensing ranges of agents for connectivity and communication energy consumption,a novel communication management strategy is proposed for multi-agent systems so that the connectivity of the system can be maintained and the communication energy can be saved.In this paper,communication management means a strategy about how the sensing ranges of agents are adjusted in the process of reaching consensus.The proposed communication management in this paper is not coupled with controller but only imposes a constraint for controller,so there is more freedom to develop an appropriate control strategy for achieving consensus.For the multi-agent systems with this novel communication management,a predictive control based strategy is developed for achieving consensus.Simulation results indicate the effectiveness and advantages of our scheme.展开更多
The paper addresses the issue of H_(∞)couple-group consensus for a class of discrete-time stochastic multi-agent systems via output-feedback control.Both fixed and Markovian switching communication topologies are con...The paper addresses the issue of H_(∞)couple-group consensus for a class of discrete-time stochastic multi-agent systems via output-feedback control.Both fixed and Markovian switching communication topologies are considered.By employing linear transformations,the closed-loop systems are converted into reduced-order systems and the H_(∞)couplegroup consensus issue under consideration is changed into a stochastic H_(∞)control problem.New conditions for the mean-square asymptotic stability and H_(∞)performance of the reduced-order systems are proposed.On the basis of these conditions,constructive approaches for the design of the output-feedback control protocols are developed for the fixed communication topology and the Markovian switching communication topologies,respectively.Finally,two numerical examples are given to illustrate the applicability of the present design approaches.展开更多
The main contribution of this paper is the design of an event-triggered formation control for leader-following consensus in second-order multi-agent systems(MASs)under communication faults.All the agents must follow t...The main contribution of this paper is the design of an event-triggered formation control for leader-following consensus in second-order multi-agent systems(MASs)under communication faults.All the agents must follow the trajectories of a virtual leader despite communication faults considered as smooth time-varying delays dependent on the distance between the agents.Linear matrix inequalities(LMIs)-based conditions are obtained to synthesize a controller gain that guarantees stability of the synchronization error.Based on the closed-loop system,an event-triggered mechanism is designed to reduce the control law update and information exchange in order to reduce energy consumption.The proposed approach is implemented in a real platform of a fleet of unmanned aerial vehicles(UAVs)under communication faults.A comparison between a state-of-the-art technique and the proposed technique has been provided,demonstrating the performance improvement brought by the proposed approach.展开更多
The network performance and the unmanned aerial vehicle(UAV)number are important objectives when UAVs are placed as communication relays to enhance the multi-agent information exchange.The problem is a non-determinist...The network performance and the unmanned aerial vehicle(UAV)number are important objectives when UAVs are placed as communication relays to enhance the multi-agent information exchange.The problem is a non-deterministic polynomial hard(NP-hard)multi-objective optimization problem,instead of generating a Pareto solution,this work focuses on considering both objectives at the same level so as to achieve a balanced solution between them.Based on the property that agents connected to the same UAV are a cluster,two clustering-based algorithms,M-K-means(MKM)and modified fast search and find density of peaks(MFSFDP)methods,are first proposed.Since the former algorithm requires too much computational time and the latter one requires too many relays,an algorithm for the balanced network performance and relay number(BPN)is proposed by discretizing the area to avoid missing the optimal relay positions and defining a new local density function to reflect the network performance metric.Simulation results demonstrate that the proposed algorithms are feasible and effective.Comparisons between these algorithms show that the BPN algorithm uses fewer relay UAVs than the MFSFDP and classic set-covering based algorithm,and its computational time is far less than the MKM algorithm.展开更多
In this paper,we consider the average-consensus problem with communication time delays and noisy links.We analyze two different cases of coupling topologies:fixed and switching topologies.By utilizing the stability t...In this paper,we consider the average-consensus problem with communication time delays and noisy links.We analyze two different cases of coupling topologies:fixed and switching topologies.By utilizing the stability theory of the stochastic differential equations,we analytically show that the average consensus could be achieved almost surely with the perturbation of noise and the communication time delays even if the time delay is time-varying.The theoretical results show that multi-agent systems can tolerate relatively large time delays if the noise is weak,and they can tolerate relatively strong noise if the time delays are low.The simulation results show that systems with strong noise intensities yield slow convergence.展开更多
Intelligent Reflecting Surface(IRS),with the potential capability to reconstruct the electromagnetic propagation environment,evolves a new IRSassisted covert communications paradigm to eliminate the negligible detecti...Intelligent Reflecting Surface(IRS),with the potential capability to reconstruct the electromagnetic propagation environment,evolves a new IRSassisted covert communications paradigm to eliminate the negligible detection of malicious eavesdroppers by coherently beaming the scattered signals and suppressing the signals leakage.However,when multiple IRSs are involved,accurate channel estimation is still a challenge due to the extra hardware complexity and communication overhead.Besides the crossinterference caused by massive reflecting paths,it is hard to obtain the close-formed solution for the optimization of covert communications.On this basis,the paper improves a heterogeneous multi-agent deep deterministic policy gradient(MADDPG)approach for the joint active and passive beamforming(Joint A&P BF)optimization without the channel estimation,where the base station(BS)and multiple IRSs are taken as different types of agents and learn to enhance the covert spectrum efficiency(CSE)cooperatively.Thanks to the‘centralized training and distributed execution’feature of MADDPG,each agent can execute the active or passive beamforming independently based on its partial observation without referring to others.Numeral results demonstrate that the proposed deep reinforcement learning(DRL)approach could not only obtain a preferable CSE of legitimate users and a low detection of probability(LPD)of warden,but also alleviate the communication overhead and simplify the IRSs deployment.展开更多
With the improvement of mobile equipment performance and development of Pervasive Computing,interactive computational applications such as Multi-Agent (MA) systems in Pervasive Computing Environments (PCE) become more...With the improvement of mobile equipment performance and development of Pervasive Computing,interactive computational applications such as Multi-Agent (MA) systems in Pervasive Computing Environments (PCE) become more and more prevalent. Many applications in PCE require Agent communication,manual control,and diversity of devices. Hence system in PCE must be designed flexible,and optimize the use of network,storage and computing resources. However,traditional MA software framework cannot completely adapt to these new features. A new MA software framework and its Agent Communication Modules to solve the problem brought by PCE was proposed. To describe more precisely,it presents Wright/ADL (Architecture Description Language) description of the new framework. Then,it displays an application called AI Eleven based on this new framework. AI Eleven achieves Agent-Agent communication and good collaboration for a task. Two experiments on AI Eleven will demonstrate the new framework's practicability and superiority.展开更多
Device-to-Device(D2D)communication is a promising technology that can reduce the burden on cellular networks while increasing network capacity.In this paper,we focus on the channel resource allocation and power contro...Device-to-Device(D2D)communication is a promising technology that can reduce the burden on cellular networks while increasing network capacity.In this paper,we focus on the channel resource allocation and power control to improve the system resource utilization and network throughput.Firstly,we treat each D2D pair as an independent agent.Each agent makes decisions based on the local channel states information observed by itself.The multi-agent Reinforcement Learning(RL)algorithm is proposed for our multi-user system.We assume that the D2D pair do not possess any information on the availability and quality of the resource block to be selected,so the problem is modeled as a stochastic non-cooperative game.Hence,each agent becomes a player and they make decisions together to achieve global optimization.Thereby,the multi-agent Q-learning algorithm based on game theory is established.Secondly,in order to accelerate the convergence rate of multi-agent Q-learning,we consider a power allocation strategy based on Fuzzy C-means(FCM)algorithm.The strategy firstly groups the D2D users by FCM,and treats each group as an agent,and then performs multi-agent Q-learning algorithm to determine the power for each group of D2D users.The simulation results show that the Q-learning algorithm based on multi-agent can improve the throughput of the system.In particular,FCM can greatly speed up the convergence of the multi-agent Q-learning algorithm while improving system throughput.展开更多
It is essential to maximize capacity while satisfying the transmission time delay of unmanned aerial vehicle(UAV)swarm communication system.In order to address this challenge,a dynamic decentralized optimization mecha...It is essential to maximize capacity while satisfying the transmission time delay of unmanned aerial vehicle(UAV)swarm communication system.In order to address this challenge,a dynamic decentralized optimization mechanism is presented for the realization of joint spectrum and power(JSAP)resource allocation based on deep Q-learning networks(DQNs).Each UAV to UAV(U2U)link is regarded as an agent that is capable of identifying the optimal spectrum and power to communicate with one another.The convolutional neural network,target network,and experience replay are adopted while training.The findings of the simulation indicate that the proposed method has the potential to improve both communication capacity and probability of successful data transmission when compared with random centralized assignment and multichannel access methods.展开更多
The cooperation of multi-robot that is based on the multi-agent system (MAS) theory of distributed artificial intelligence has become a hotspot in the robotics R&D. In the research the multi-robot is regarded as m...The cooperation of multi-robot that is based on the multi-agent system (MAS) theory of distributed artificial intelligence has become a hotspot in the robotics R&D. In the research the multi-robot is regarded as multi-agent. So the communication and cooperation of multi-agent become the key problem for gaining the dynamic running information of cooperating robots. In this paper the authors introduce the communication modes for agent and provide a common strategy which aims at the communication resources of multi-agent model-the CSMA/CD (Carrier Sense Multiple Access with Collision Detection) protocol which is based on the transmittal medium. It supports the cable-communication of multi-robot and the experiments prove its validity.展开更多
This paper introduces a process planning system communication model based on a Multi-agent and all levels of the communication process are in described in detail. The KQML( Knowledge Query and Manipulation Language)...This paper introduces a process planning system communication model based on a Multi-agent and all levels of the communication process are in described in detail. The KQML( Knowledge Query and Manipulation Language) language communication is introduced emphatically using the communication performatives of the KQML language to achieve communication between the agents among the process planning.展开更多
基金The work is a part of project named "'Multi- Agent based Train Operation in Moving Block Setup" funded by Department of Information Technology (DIT), Ministry of Commu- nications and Information Technology, Government of India, vide Grant Number 2(6)/2010-EC dated 21/03/2011.
文摘Multi-agent technology has been used in many complex distributed and concurrent systems. A railway system is such a safety critical system and careful inves- tigation of the functional components is very important. Study of the various functional components in communi- cation-based train control (CBTC) system necessitates a good structural design followed by its validation and ver- ification through a formal modelling technique. The work presented here is the follow up of our multi-agent-based CBTC system for Indian railway designed using the methodology for engineering system of software agents. Behavioural analysis of the designed system involves several operating scenarios that arise during train run, and helps in understanding the reaction of the system to such situations. This validation and verification are very important as it allows the system designer to critically evaluate the desired function of the system and to correct the design errors, if any, before its actual implementation. Modelling, validation and verification of the structural design through Coloured petri net (CPN) are central to this paper. Analysis of simulation results validates the efficacy of the design.
文摘As an important mechanism in multi-agent interaction,communication can make agents form complex team relationships rather than constitute a simple set of multiple independent agents.However,the existing communication schemes can bring much timing redundancy and irrelevant messages,which seriously affects their practical application.To solve this problem,this paper proposes a targeted multiagent communication algorithm based on state control(SCTC).The SCTC uses a gating mechanism based on state control to reduce the timing redundancy of communication between agents and determines the interaction relationship between agents and the importance weight of a communication message through a series connection of hard-and self-attention mechanisms,realizing targeted communication message processing.In addition,by minimizing the difference between the fusion message generated from a real communication message of each agent and a fusion message generated from the buffered message,the correctness of the final action choice of the agent is ensured.Our evaluation using a challenging set of Star Craft II benchmarks indicates that the SCTC can significantly improve the learning performance and reduce the communication overhead between agents,thus ensuring better cooperation between agents.
基金supported by the Natural Science Foundation of Jiangsu Province,China(No.BK20240200)in part by the National Natural Science Foundation of China(Nos.62271501,62071488,62471489 and U22B2002)+1 种基金in part by the Key Technologies R&D Program of Jiangsu,China(Prospective and Key Technologies for Industry)(Nos.BE2023022 and BE2023022-4)in part by the Post-doctoral Fellowship Program of CPSF,China(No.GZB20240996).
文摘Due to the characteristics of line-of-sight(LoS)communication in unmanned aerial vehicle(UAV)networks,these systems are highly susceptible to eavesdropping and surveillance.To effectively address the security concerns in UAV communication,covert communication methods have been adopted.This paper explores the joint optimization problem of trajectory and transmission power in a multi-hop UAV relay covert communication system.Considering the communication covertness,power constraints,and trajectory limitations,an algorithm based on multi-agent proximal policy optimization(MAPPO),named covert-MAPPO(C-MAPPO),is proposed.The proposed method leverages the strengths of both optimization algorithms and reinforcement learning to analyze and make joint decisions on the transmission power and flight trajectory strategies for UAVs to achieve cooperation.Simulation results demonstrate that the proposed method can maximize the system throughput while satisfying covertness constraints,and it outperforms benchmark algorithms in terms of system throughput and reward convergence speed.
基金supported in part by the National Natural Science Foundation of China(No.61906156).
文摘This paper studies the problem of jamming decision-making for dynamic multiple communication links in wireless communication networks(WCNs).We propose a novel jamming channel allocation and power decision-making(JCAPD)approach based on multi-agent deep reinforcement learning(MADRL).In high-dynamic and multi-target aviation communication environments,the rapid changes in channels make it difficult for sensors to accurately capture instantaneous channel state information.This poses a challenge to make centralized jamming decisions with single-agent deep reinforcement learning(DRL)approaches.In response,we design a distributed multi-agent decision architecture(DMADA).We formulate multi-jammer resource allocation as a multiagent Markov decision process(MDP)and propose a fingerprint-based double deep Q-Network(FBDDQN)algorithm for solving it.Each jammer functions as an agent that interacts with the environment in this framework.Through the design of a reasonable reward and training mechanism,our approach enables jammers to achieve distributed cooperation,significantly improving the jamming success rate while considering jamming power cost,and reducing the transmission rate of links.Our experimental results show the FBDDQN algorithm is superior to the baseline methods.
基金supported in part by the National Natural Science Foundation of China(62273255,62350003,62088101)the Shanghai Science and Technology Cooperation Project(22510712000,21550760900)+1 种基金the Shanghai Municipal Science and Technology Major Project(2021SHZDZX0100)the Fundamental Research Funds for the Central Universities
文摘Dear Editor,This letter is concerned with the problem of time-varying formation tracking for heterogeneous multi-agent systems(MASs) under directed switching networks. For this purpose, our first step is to present some sufficient conditions for the exponential stability of a particular category of switched systems.
基金Supported by the National Natural Science Foundation of China(91016017)the National Aviation Found of China(20115868009)~~
文摘The cooperative control and stability analysis problems for the multi-agent system with sampled com- munication are investigated. Distributed state feedback controllers are adopted for the cooperation of networked agents. A theorem in the form of linear matrix inequalities(LMI) is derived to analyze the system stability. An- other theorem in the form of optimization problem subject to LMI constraints is proposed to design the controller, and then the algorithm is presented. The simulation results verify the validity and the effectiveness of the pro- posed approach.
文摘In this paper, a distributed control scheme has been developed for consensus of single integrator multi-agent systems with directed fixed communication topology for arbitrarily large constant, time-varying or distributed communication delays. It is proved that the closed loop control system can reach consensus with an exponential convergence rate if and only if the topology is quasi-strongly connected. Simulation results are also provided to demonstrate the effectiveness of the proposed controller.
基金Supported by the National Natural Science Foundation of China under Grant Nos.61104092,61134007,and61203147the Priority Academic Program Development of Jiangsu Higher Education Institutions
文摘To solve the dynamical consensus problem of second-order multi-agent systems with communication delay,delay-dependent compensations are added into the normal asynchronously-coupled consensus algorithm so as to make the agents achieve a dynamical consensus. Based on frequency-domain analysis, sufficient conditions are gained for second-order multi-agent systems with communication delay under leaderless and leader-following consensus algorithms respectively. Simulation illustrates the correctness of the results.
基金supported by the Fundamental Research Funds for the Central Universities(JUSRP11020)the Specialized Research Fund for the Doctoral Program of Higher Education of China(20090093120006)
文摘Leader-following stationary consensus problem is investigated for the second-order multi-agent systems with timevarying communication delay and switching topology. Based on Lyapunov-Krasovskii functional and Lyapunov-Razumikhin functions respectively, consensus criterions in the form of linear matrix inequality (LMI) are obtained for the system with time-varying communication delays under static interconnection topology con- verging to the leader's states. Moreover, the delay-dependent consensus criterion in the form of LMI is also obtained for the system with time-invariant communication delay and switching topologies by constructing Lyapunov-Krasovskii functional. Numerical simulations present the correctness of the results.
基金supported by the National Key Research and Development Program of China(2018AAA0101701)the National Natural Science Foundation of China(62173224,61833012)。
文摘This paper studies the connectivity-maintaining consensus of multi-agent systems.Considering the impact of the sensing ranges of agents for connectivity and communication energy consumption,a novel communication management strategy is proposed for multi-agent systems so that the connectivity of the system can be maintained and the communication energy can be saved.In this paper,communication management means a strategy about how the sensing ranges of agents are adjusted in the process of reaching consensus.The proposed communication management in this paper is not coupled with controller but only imposes a constraint for controller,so there is more freedom to develop an appropriate control strategy for achieving consensus.For the multi-agent systems with this novel communication management,a predictive control based strategy is developed for achieving consensus.Simulation results indicate the effectiveness and advantages of our scheme.
基金Project supported by the National Natural Science Foundation of China(Grant Nos.61503002 and 61573008)
文摘The paper addresses the issue of H_(∞)couple-group consensus for a class of discrete-time stochastic multi-agent systems via output-feedback control.Both fixed and Markovian switching communication topologies are considered.By employing linear transformations,the closed-loop systems are converted into reduced-order systems and the H_(∞)couplegroup consensus issue under consideration is changed into a stochastic H_(∞)control problem.New conditions for the mean-square asymptotic stability and H_(∞)performance of the reduced-order systems are proposed.On the basis of these conditions,constructive approaches for the design of the output-feedback control protocols are developed for the fixed communication topology and the Markovian switching communication topologies,respectively.Finally,two numerical examples are given to illustrate the applicability of the present design approaches.
文摘The main contribution of this paper is the design of an event-triggered formation control for leader-following consensus in second-order multi-agent systems(MASs)under communication faults.All the agents must follow the trajectories of a virtual leader despite communication faults considered as smooth time-varying delays dependent on the distance between the agents.Linear matrix inequalities(LMIs)-based conditions are obtained to synthesize a controller gain that guarantees stability of the synchronization error.Based on the closed-loop system,an event-triggered mechanism is designed to reduce the control law update and information exchange in order to reduce energy consumption.The proposed approach is implemented in a real platform of a fleet of unmanned aerial vehicles(UAVs)under communication faults.A comparison between a state-of-the-art technique and the proposed technique has been provided,demonstrating the performance improvement brought by the proposed approach.
基金the National Natural Science Foundation of China(61573285)。
文摘The network performance and the unmanned aerial vehicle(UAV)number are important objectives when UAVs are placed as communication relays to enhance the multi-agent information exchange.The problem is a non-deterministic polynomial hard(NP-hard)multi-objective optimization problem,instead of generating a Pareto solution,this work focuses on considering both objectives at the same level so as to achieve a balanced solution between them.Based on the property that agents connected to the same UAV are a cluster,two clustering-based algorithms,M-K-means(MKM)and modified fast search and find density of peaks(MFSFDP)methods,are first proposed.Since the former algorithm requires too much computational time and the latter one requires too many relays,an algorithm for the balanced network performance and relay number(BPN)is proposed by discretizing the area to avoid missing the optimal relay positions and defining a new local density function to reflect the network performance metric.Simulation results demonstrate that the proposed algorithms are feasible and effective.Comparisons between these algorithms show that the BPN algorithm uses fewer relay UAVs than the MFSFDP and classic set-covering based algorithm,and its computational time is far less than the MKM algorithm.
基金Project supported by the National Natural Science Foundation of China (Grant Nos. 61203304,61203055,and 11226150)the Fundamental Research Funds for the Central Universities,China (Grant Nos. 2011QNA26,2010LKSX04,and 2010LKSX09)
文摘In this paper,we consider the average-consensus problem with communication time delays and noisy links.We analyze two different cases of coupling topologies:fixed and switching topologies.By utilizing the stability theory of the stochastic differential equations,we analytically show that the average consensus could be achieved almost surely with the perturbation of noise and the communication time delays even if the time delay is time-varying.The theoretical results show that multi-agent systems can tolerate relatively large time delays if the noise is weak,and they can tolerate relatively strong noise if the time delays are low.The simulation results show that systems with strong noise intensities yield slow convergence.
基金supported by the Key Laboratory of Near Ground Detection and Perception Technology(No.6142414220406 and 6142414210101)Shaanxi and Taicang Keypoint Research and Invention Program(No.2021GXLH-01-15 and TC2019SF03)。
文摘Intelligent Reflecting Surface(IRS),with the potential capability to reconstruct the electromagnetic propagation environment,evolves a new IRSassisted covert communications paradigm to eliminate the negligible detection of malicious eavesdroppers by coherently beaming the scattered signals and suppressing the signals leakage.However,when multiple IRSs are involved,accurate channel estimation is still a challenge due to the extra hardware complexity and communication overhead.Besides the crossinterference caused by massive reflecting paths,it is hard to obtain the close-formed solution for the optimization of covert communications.On this basis,the paper improves a heterogeneous multi-agent deep deterministic policy gradient(MADDPG)approach for the joint active and passive beamforming(Joint A&P BF)optimization without the channel estimation,where the base station(BS)and multiple IRSs are taken as different types of agents and learn to enhance the covert spectrum efficiency(CSE)cooperatively.Thanks to the‘centralized training and distributed execution’feature of MADDPG,each agent can execute the active or passive beamforming independently based on its partial observation without referring to others.Numeral results demonstrate that the proposed deep reinforcement learning(DRL)approach could not only obtain a preferable CSE of legitimate users and a low detection of probability(LPD)of warden,but also alleviate the communication overhead and simplify the IRSs deployment.
基金Guangdong-Hong Kong Technology Cooperation Funding Scheme, China ( No.2007A010101003)Guangdong-Ministry of Education Industry-University Cooperation Funding Scheme,China (No.2007B090200018)
文摘With the improvement of mobile equipment performance and development of Pervasive Computing,interactive computational applications such as Multi-Agent (MA) systems in Pervasive Computing Environments (PCE) become more and more prevalent. Many applications in PCE require Agent communication,manual control,and diversity of devices. Hence system in PCE must be designed flexible,and optimize the use of network,storage and computing resources. However,traditional MA software framework cannot completely adapt to these new features. A new MA software framework and its Agent Communication Modules to solve the problem brought by PCE was proposed. To describe more precisely,it presents Wright/ADL (Architecture Description Language) description of the new framework. Then,it displays an application called AI Eleven based on this new framework. AI Eleven achieves Agent-Agent communication and good collaboration for a task. Two experiments on AI Eleven will demonstrate the new framework's practicability and superiority.
基金This work was supported by the National Natural Science Foundation of China(61871058)Key Special Project in Intergovernmental International Scientific and Technological Innovation Cooperation of National Key Research and Development Program(2017YFE0118600).
文摘Device-to-Device(D2D)communication is a promising technology that can reduce the burden on cellular networks while increasing network capacity.In this paper,we focus on the channel resource allocation and power control to improve the system resource utilization and network throughput.Firstly,we treat each D2D pair as an independent agent.Each agent makes decisions based on the local channel states information observed by itself.The multi-agent Reinforcement Learning(RL)algorithm is proposed for our multi-user system.We assume that the D2D pair do not possess any information on the availability and quality of the resource block to be selected,so the problem is modeled as a stochastic non-cooperative game.Hence,each agent becomes a player and they make decisions together to achieve global optimization.Thereby,the multi-agent Q-learning algorithm based on game theory is established.Secondly,in order to accelerate the convergence rate of multi-agent Q-learning,we consider a power allocation strategy based on Fuzzy C-means(FCM)algorithm.The strategy firstly groups the D2D users by FCM,and treats each group as an agent,and then performs multi-agent Q-learning algorithm to determine the power for each group of D2D users.The simulation results show that the Q-learning algorithm based on multi-agent can improve the throughput of the system.In particular,FCM can greatly speed up the convergence of the multi-agent Q-learning algorithm while improving system throughput.
基金supported by the National Natural Science Foundation of China(62031017,61971221).
文摘It is essential to maximize capacity while satisfying the transmission time delay of unmanned aerial vehicle(UAV)swarm communication system.In order to address this challenge,a dynamic decentralized optimization mechanism is presented for the realization of joint spectrum and power(JSAP)resource allocation based on deep Q-learning networks(DQNs).Each UAV to UAV(U2U)link is regarded as an agent that is capable of identifying the optimal spectrum and power to communicate with one another.The convolutional neural network,target network,and experience replay are adopted while training.The findings of the simulation indicate that the proposed method has the potential to improve both communication capacity and probability of successful data transmission when compared with random centralized assignment and multichannel access methods.
文摘The cooperation of multi-robot that is based on the multi-agent system (MAS) theory of distributed artificial intelligence has become a hotspot in the robotics R&D. In the research the multi-robot is regarded as multi-agent. So the communication and cooperation of multi-agent become the key problem for gaining the dynamic running information of cooperating robots. In this paper the authors introduce the communication modes for agent and provide a common strategy which aims at the communication resources of multi-agent model-the CSMA/CD (Carrier Sense Multiple Access with Collision Detection) protocol which is based on the transmittal medium. It supports the cable-communication of multi-robot and the experiments prove its validity.
基金supported by the National Nature Science Foundation of China under Grant No. 50805099Excellent Young Academic Leaders Support Program of Colleges and Universities in Shanxi Province under Grant No. 20091091Shanxi Provincial Youth Science and Technology Research Fund of Shanxi Provincial under Grant No. 2008021031
文摘This paper introduces a process planning system communication model based on a Multi-agent and all levels of the communication process are in described in detail. The KQML( Knowledge Query and Manipulation Language) language communication is introduced emphatically using the communication performatives of the KQML language to achieve communication between the agents among the process planning.