期刊文献+
共找到101篇文章
< 1 2 6 >
每页显示 20 50 100
Multiagent,multitimescale aggregated regulation method for demand response considering spatial-temporal complementarity of user-side resources
1
作者 Tingzhe Pan Chao Li +3 位作者 Chen Yang Zijie Meng Zongyi Wang Zean Zhu 《Global Energy Interconnection》 2025年第2期240-257,共18页
The integration of substantial renewable energy and controllable resources disrupts the supply-demand balance in distribution grids.Secure operations are dependent on the participation of user-side resources in demand... The integration of substantial renewable energy and controllable resources disrupts the supply-demand balance in distribution grids.Secure operations are dependent on the participation of user-side resources in demand response at both the day-ahead and intraday levels.Current studies typically overlook the spatial--temporal variations and coordination between these timescales,leading to significant day-ahead optimization errors,high intraday costs,and slow convergence.To address these challenges,we developed a multiagent,multitimescale aggregated regulation method for spatial--temporal coordinated demand response of user-side resources.Firstly,we established a framework considering the spatial--temporal coordinated characteristics of user-side resources with the objective to min-imize the total regulation cost and weighted sum of distribution grid losses.The optimization problem was then solved for two different timescales:day-ahead and intraday.For the day-ahead timescale,we developed an improved particle swarm optimization(IPSO)algo-rithm that dynamically adjusts the number of particles based on intraday outcomes to optimize the regulation strategies.For the intraday timescale,we developed an improved alternating direction method of multipliers(IADMM)algorithm that distributes tasks across edge distribution stations,dynamically adjusting penalty factors by using historical day-ahead data to synchronize the regulations and enhance precision.The simulation results indicate that this method can fully achieve multitimescale spatial--temporal coordinated aggregated reg-ulation between day-ahead and intraday,effectively reduce the total regulation cost and distribution grid losses,and enhance smart grid resilience. 展开更多
关键词 Demand response User-side resources Aggregated regulation Multitimescale Multiagent spatial--temporal coordination
在线阅读 下载PDF
Mixed Motivation Driven Social Multi-Agent Reinforcement Learning for Autonomous Driving
2
作者 Long Chen Peng Deng +1 位作者 Lingxi Li Xuemin Hu 《IEEE/CAA Journal of Automatica Sinica》 2025年第6期1272-1282,共11页
Despite great achievement has been made in autonomous driving technologies,autonomous vehicles(AVs)still exhibit limitations in intelligence and lack social coordination,which is primarily attributed to their reliance... Despite great achievement has been made in autonomous driving technologies,autonomous vehicles(AVs)still exhibit limitations in intelligence and lack social coordination,which is primarily attributed to their reliance on single-agent technologies,neglecting inter-AV interactions.Current research on multi-agent autonomous driving(MAAD)predominantly focuses on either distributed individual learning or centralized cooperative learning,ignoring the mixed-motive nature of MAAD systems,where each agent is not only self-interested in reaching its own destination but also needs to coordinate with other traffic participants to enhance efficiency and safety.Inspired by the mixed motivation of human driving behavior and their learning process,we propose a novel mixed motivation driven social multi-agent reinforcement learning method for autonomous driving.In our method,a multi-agent reinforcement learning(MARL)algorithm,called Social Learning Policy Optimization(SoLPO),which takes advantage of both the individual and social learning paradigms,is proposed to empower agents to rapidly acquire self-interested policies and effectively learn socially coordinated behavior.Based on the proposed SoLPO,we further develop a mixed-motive MARL method for autonomous driving combined with a social reward integration module that can model the mixed-motive nature of MAAD systems by integrating individual and neighbor rewards into a social learning objective for improved learning speed and effectiveness.Experiments conducted on the MetaDrive simulator show that our proposed method outperforms existing state-of-the-art MARL approaches in metrics including the success rate,safety,and efficiency.More-over,the AVs trained by our method form coordinated social norms and exhibit human-like driving behavior,demonstrating a high degree of social coordination. 展开更多
关键词 Autonomous driving(AD) mixed motivation MULTIAGENT reinforcement learning social learning
在线阅读 下载PDF
Prescribed Performance Bipartite Consensus Control for MASs Under Data-Driven Strategy
3
作者 Qi Zhou Caiyun Yin +2 位作者 Hui Ma Hongru Ren Hongyi Li 《IEEE/CAA Journal of Automatica Sinica》 2025年第5期937-946,共10页
This paper investigates the bipartite consensus control problem for discrete time nonlinear multiagent systems(MASs)based on data-driven adaptive method.To begin with,a dynamic linearization strategy is utilized to es... This paper investigates the bipartite consensus control problem for discrete time nonlinear multiagent systems(MASs)based on data-driven adaptive method.To begin with,a dynamic linearization strategy is utilized to establish the relationship between bipartite tracking error and control input for MASs.Secondly,the unknown parameter linearly associated with control input is acquired by the adaptive control approach,and a discrete time extended state observer is designed to estimate nonlinear uncertainties.Thirdly,in order to achieve the prescribed performance,the constrained bipartite consensus error is transformed through a strictly increasing function.Based on the converted equivalent unconstrained error function,a sliding mode controller using only the input and output data of the MASs is designed.Finally,the efficacy of the controller is confirmed by simulations. 展开更多
关键词 DATA-DRIVEN nonlinear multiagent systems(MASs) prescribed performance sliding mode control
在线阅读 下载PDF
Multi-UAV Cooperative Pursuit Strategy With Limited Visual Field in Urban Airspace:A Multi-Agent Reinforcement Learning Approach
4
作者 Zhe Peng Guohua Wu +1 位作者 Biao Luo Ling Wang 《IEEE/CAA Journal of Automatica Sinica》 2025年第7期1350-1367,共18页
The application of multiple unmanned aerial vehicles(UAVs)for the pursuit and capture of unauthorized UAVs has emerged as a novel approach to ensuring the safety of urban airspace.However,pursuit UAVs necessitate the ... The application of multiple unmanned aerial vehicles(UAVs)for the pursuit and capture of unauthorized UAVs has emerged as a novel approach to ensuring the safety of urban airspace.However,pursuit UAVs necessitate the utilization of their own sensors to proactively gather information from the unauthorized UAV.Considering the restricted sensing range of sensors,this paper proposes a multi-UAV with limited visual field pursuit-evasion(MUV-PE)problem.Each pursuer has a visual field characterized by limited perception distance and viewing angle,potentially obstructed by buildings.Only when the unauthorized UAV,i.e.,the evader,enters the visual field of any pursuer can its position be acquired.The objective of the pursuers is to capture the evader as soon as possible without collision.To address this problem,we propose the normalizing flow actor with graph attention critic(NAGC)algorithm,a multi-agent reinforcement learning(MARL)approach.NAGC executes normalizing flows to augment the flexibility of policy network,enabling the agent to sample actions from more intricate distributions rather than common distributions.To enhance the capability of simultaneously comprehending spatial relationships among multiple UAVs and environmental obstacles,NAGC integrates the“obstacle-target”graph attention networks,significantly aiding pursuers in supporting search or pursuit activities.Extensive experiments conducted in a high-precision simulator validate the promising performance of the NAGC algorithm. 展开更多
关键词 Graph attention network limited visual field multiagent reinforcement learning(MARL) normalizing flow pursuitevasion
在线阅读 下载PDF
An Improved Multi-Actor Hybrid Attention Critic Algorithm for Cooperative Navigation in Urban Low-Altitude Logistics Environments
5
作者 Chao Li Quanzhi Feng +1 位作者 Caichang Ding Zhiwei Ye 《Computers, Materials & Continua》 2025年第8期3605-3621,共17页
The increasing adoption of unmanned aerial vehicles(UAVs)in urban low-altitude logistics systems,particularly for time-sensitive applications like parcel delivery and supply distribution,necessitates sophisticated coo... The increasing adoption of unmanned aerial vehicles(UAVs)in urban low-altitude logistics systems,particularly for time-sensitive applications like parcel delivery and supply distribution,necessitates sophisticated coordination mechanisms to optimize operational efficiency.However,the limited capability of UAVs to extract stateaction information in complex environments poses significant challenges to achieving effective cooperation in dynamic and uncertain scenarios.To address this,we presents an Improved Multi-Agent Hybrid Attention Critic(IMAHAC)framework that advances multi-agent deep reinforcement learning(MADRL)through two key innovations.Firstly,a Temporal Difference Error and Time-based Prioritized Experience Replay(TT-PER)mechanism that dynamically adjusts sample weights based on temporal relevance and prediction error magnitude,effectively reducing the interference from obsolete collaborative experiences while maintaining training stability.Secondly,a hybrid attention mechanism is developed,integrating a sensor fusion layer—which aggregates features from multi-sensor data to enhance decision-making—and a dissimilarity layer that evaluates the similarity between key-value pairs and query values.By combining this hybrid attention mechanism with theMulti-Actor Attention Critic(MAAC)framework,our approach strengthens UAVs’capability to extract critical state-action features in diverse environments.Comprehensive simulations in urban air mobility scenarios demonstrate IMAHAC’s superiority over conventional MADRL baselines and MAAC,achieving higher cumulative rewards,fewer collisions,and enhanced cooperative capabilities.This work provides both algorithmic advancements and empirical validation for developing robust autonomous aerial systems in smart city infrastructures. 展开更多
关键词 Unmanned aerial vehicles multiagent deep reinforcement learning attention mechanism
在线阅读 下载PDF
Distributed performance constraint control for heterogeneous multiagent systems with dynamic event-triggered mechanism
6
作者 Hongzhen GUO Mou CHEN Peng ZHANG 《Chinese Journal of Aeronautics》 2025年第3期124-133,共10页
In this paper, distributed event-triggered performance constraint control is proposed for Heterogeneous Multiagent Systems (HMASs) including quadrotor unmanned aerial vehicles and unmanned ground vehicles in the prese... In this paper, distributed event-triggered performance constraint control is proposed for Heterogeneous Multiagent Systems (HMASs) including quadrotor unmanned aerial vehicles and unmanned ground vehicles in the presence of unknown external disturbances. To tackle the problem of different dynamic characteristics and facilitate the controller design, the virtual variable is introduced in the z axis of the nonlinear model of unmanned ground vehicles. By using this approach, a universal model is established for the HMAS. Moreover, a distributed disturbance observer is established to cope with the adverse influence of the external disturbances. Then, an Appointed-Time Prescribed Performance Function (ATPPF) is designed to restrict the tracking error in the predefined regions. On this basis, the distributed performance constraint controller is proposed for the HMAS based on the ATPPF and the distributed disturbance observer. Furthermore, the improved event-triggered mechanism is proposed with a dynamic threshold, which depends on the distance between the tracking error and the boundary of the ATPPF. Finally, the effectiveness of the proposed control method is verified by the comparative experiments on an HMAS. 展开更多
关键词 Heterogeneous multiagent systems Quadrotor unmanned aerial vehicles Unmanned ground vehicles Distributed disturbance observer Appoin ted-timne prescribed performance function Event-triggered mechanism
原文传递
Output Consensus of Heterogeneous Linear MASs via Adaptive Event-Triggered Feedback Combination Control
7
作者 Shuo Yuan Chengpu Yu Jian Sun 《IEEE/CAA Journal of Automatica Sinica》 2025年第1期285-287,共3页
Dear Editor,This letter studies output consensus problem of heterogeneous linear multiagent systems over directed graphs. A novel adaptive dynamic event-triggered controller is presented based only on the feedback com... Dear Editor,This letter studies output consensus problem of heterogeneous linear multiagent systems over directed graphs. A novel adaptive dynamic event-triggered controller is presented based only on the feedback combination of the agent's own state and neighbors' output,which can achieve exponential output consensus through intermittent communication. The controller is obtained by solving two linear matrix equations, and Zeno behavior is excluded. 展开更多
关键词 intermittent communication feedback combination heterogeneous linear multiagent systems exponential output consensus directed graphs output consensus problem output consensus solving two linear matrix equations
在线阅读 下载PDF
Broad-Learning-System-Based Model-Free Adaptive Predictive Control for Nonlinear MASs Under DoS Attacks
8
作者 Hongxing Xiong Guangdeng Chen +1 位作者 Hongru Ren Hongyi Li 《IEEE/CAA Journal of Automatica Sinica》 2025年第2期381-393,共13页
In this paper, the containment control problem in nonlinear multi-agent systems(NMASs) under denial-of-service(DoS) attacks is addressed. Firstly, a prediction model is obtained using the broad learning technique to t... In this paper, the containment control problem in nonlinear multi-agent systems(NMASs) under denial-of-service(DoS) attacks is addressed. Firstly, a prediction model is obtained using the broad learning technique to train historical data generated by the system offline without DoS attacks. Secondly, the dynamic linearization method is used to obtain the equivalent linearization model of NMASs. Then, a novel model-free adaptive predictive control(MFAPC) framework based on historical and online data generated by the system is proposed, which combines the trained prediction model with the model-free adaptive control method. The development of the MFAPC method motivates a much simpler robust predictive control solution that is convenient to use in the case of DoS attacks. Meanwhile, the MFAPC algorithm provides a unified predictive framework for solving consensus tracking and containment control problems. The boundedness of the containment error can be proven by using the contraction mapping principle and the mathematical induction method. Finally, the proposed MFAPC is assessed through comparative experiments. 展开更多
关键词 Broad learning technique denial-of-service(DoS) model-free adaptive predictive control(MFAPC) nonlinear multiagent systems(NMASs)
在线阅读 下载PDF
Hybrid Distributed and Decentralised Reinforcement Learning for Formation Control of Multi-Robots With Obstacle Avoidance
9
作者 Yaoqian Peng Xinglong Zhang +1 位作者 Haibin Xie Xin Xu 《CAAI Transactions on Intelligence Technology》 2025年第5期1337-1349,共13页
Recently,learning-based control for multi-robot systems(MRS)with obstacle avoidance has received increasing attention.The goals of formation control and obstacle avoidance could be intrinsically tied.As a result,devel... Recently,learning-based control for multi-robot systems(MRS)with obstacle avoidance has received increasing attention.The goals of formation control and obstacle avoidance could be intrinsically tied.As a result,developing a safe and near-optimal control policy with the actor-critic structure is challenging.Therefore,a hybrid distributed and decentralised asynchronous actor-critic reinforcement learning(Di-De-RL)technique is proposed to address this problem.First,we decompose the integrated formation control and collision avoidance problem into two successive ones.To solve them,we design a distributed reinforcement learning(Di-RL)algorithm that employs a neural network-based actor-critic structure for formation control,and a decentralised RL(De-RL)algorithm that incorporates a potential-field(PF)-based actor-critic structure for collision avoidance.In Di-RL,the actor-critic pairs are trained in a distributed manner to achieve near-optimal consensus formation control.With the trained policy of Di-RL fixed,the PF actor-critic pairs in De-RL are trained in a decentralised manner for safe collision avoidance.Such an asynchronous training design of the hybrid Di-RL and De-RL enables weight convergence and control safety in the learning process.The simulated and real-world experimental results demonstrate the effectiveness and enhanced performance of the approach in formation control with both static and dynamic obstacle avoidance,highlighting its advantages in resolving the conflict between the safety objective and optimal control. 展开更多
关键词 intelligent multiagent systems intelligent robots
在线阅读 下载PDF
Data-Driven Iterative Learning Consensus Tracking Based on Robust Neural Models for Unknown Heterogeneous Nonlinear Multiagent Systems With Input Constraints
10
作者 Chong Zhang Yunfeng Hu +2 位作者 TingTing Wang Xun Gong Hong Chen 《IEEE/CAA Journal of Automatica Sinica》 2025年第10期2153-2155,共3页
Dear Editor,Aiming at the consensus tracking problem of a class of unknown heterogeneous nonlinear multiagent systems(MASs)with input constraints,a novel data-driven iterative learning consensus control(ILCC)protocol ... Dear Editor,Aiming at the consensus tracking problem of a class of unknown heterogeneous nonlinear multiagent systems(MASs)with input constraints,a novel data-driven iterative learning consensus control(ILCC)protocol based on zeroing neural networks(ZNNs)is proposed.First,a dynamic linearization data model(DLDM)is acquired via dynamic linearization technology(DLT). 展开更多
关键词 dynamic linearization data model dldm consensus tracking problem input constraints consensus tracking unknown heterogeneous nonlinear multiagent systems robust neural models data driven iterative learning zeroing neural networks znns
在线阅读 下载PDF
开发“网络课程”的几种信息技术
11
作者 刘龙 《吉林工程技术师范学院学报》 2006年第6期20-22,共3页
本文阐述了“网络课程”的概念,介绍了与开发优质网络课程相关的四种重要信息技术:Mu ltiA-gents技术,XML技术,GR ID技术和NLP技术。
关键词 网络课程 multiagents XML GRID NLP
在线阅读 下载PDF
基于MultiAgent和CSCW的多媒体教学模型 被引量:3
12
作者 余春艳 朱晓芸 王申康 《计算机工程》 CAS CSCD 北大核心 1999年第9期20-21,31,共3页
生于MultiAgent和CSCW的观点,提出一个多媒体教学模型.探讨了该模型的结构以及模型中Agent的表示、通信等关键技术。
关键词 CAI 多媒体教学 MULTIAGENT CSCW
在线阅读 下载PDF
基于Multiagent生态进化算法实现个性化主动信息服务 被引量:2
13
作者 路海明 徐晋晖 +1 位作者 卢增祥 李衍达 《计算机工程与应用》 CSCD 北大核心 1999年第10期1-2,7,共3页
网络资源的指数膨胀,使得用户在获得自己需要的信息资源时要花费大量的时间和精力。该文采用Multiagent技术协助用户获取需要的信息资源,以节省用户的时间。同时给出了一种生态进化算法,使得Agent可以逐渐把握用户的需求、适应用户... 网络资源的指数膨胀,使得用户在获得自己需要的信息资源时要花费大量的时间和精力。该文采用Multiagent技术协助用户获取需要的信息资源,以节省用户的时间。同时给出了一种生态进化算法,使得Agent可以逐渐把握用户的需求、适应用户需求的变化、探索用户可能感兴趣的领域,为用户提供个性化的主动信息服务。 展开更多
关键词 MULTIAGENT 生态进化算法 信息服务 INTERNET网
在线阅读 下载PDF
基于多智能体的梯级水电联合运行决策支持系统 被引量:3
14
作者 黎育红 周建中 《计算机应用研究》 CSCD 北大核心 2009年第11期4162-4165,共4页
以复杂系统自组织理论与multiagent分布式建模技术为基础,通过对流域梯级水电联合运行与调度过程的自组织行为特征的分析,建立基于multiagent理论的决策模型,实时采集流域的水情、雨情、工情、电情(如各电站机组出力、负荷分配、运行方... 以复杂系统自组织理论与multiagent分布式建模技术为基础,通过对流域梯级水电联合运行与调度过程的自组织行为特征的分析,建立基于multiagent理论的决策模型,实时采集流域的水情、雨情、工情、电情(如各电站机组出力、负荷分配、运行方式、全厂功率总和等),模拟仿真流域梯级水电联合运行与调度的决策过程,合理进行流域梯级水能的综合利用和发电生产以及跨流域补偿联合运行与调度,实现流域梯级水电联合运行与调度重大决策的动态预演,为梯级水电联合运行提供经济、高效、实时的决策支持。 展开更多
关键词 梯级水电 自组织理论 multiagent建模 决策支持系统
在线阅读 下载PDF
多主体撮合交易系统的设计与实现 被引量:1
15
作者 唐亮贵 钟增胜 程代杰 《计算机科学》 CSCD 北大核心 2006年第1期124-126,共3页
研究了电子商务交易模型的实现机制,设计了基于 Multi-Agent 的电子商务交易市场的组织结构,在基于Multi-Agent 的撮合交易系统中,把整个交易过程看成一个动态的交互过程,体现了 Multi—Agent 系统的动态特性,同时引入强化学习算法对竞... 研究了电子商务交易模型的实现机制,设计了基于 Multi-Agent 的电子商务交易市场的组织结构,在基于Multi-Agent 的撮合交易系统中,把整个交易过程看成一个动态的交互过程,体现了 Multi—Agent 系统的动态特性,同时引入强化学习算法对竞标策略进行动态修正,使多主体撮合交易系统具有一定的自均衡和自学习能力。试验表明,基于多主体的撮合交易模型和动态竞标机制具有较好的交易性能。 展开更多
关键词 MULTI-AGENT系统 撮合 动态竞标 交易系统 多主体 MULTI-AGENT MultiAgent系统 设计 强化学习算法 交易模型
在线阅读 下载PDF
基于Multi Agent的图象理解 被引量:1
16
作者 刘传才 《计算机应用研究》 CSCD 北大核心 2001年第10期54-57,共4页
最近十多年来 ,具有专家系统外壳的图象理解软件的开发一直是一个重要的研究课题。然而 ,多数基于知识的图象理解系统或者局限于诸如分割之类的特定图象操作上 ,或者局限于像理解建筑物的航空图象这样特定的应用。由于这些系统的目标和... 最近十多年来 ,具有专家系统外壳的图象理解软件的开发一直是一个重要的研究课题。然而 ,多数基于知识的图象理解系统或者局限于诸如分割之类的特定图象操作上 ,或者局限于像理解建筑物的航空图象这样特定的应用。由于这些系统的目标和知识结构的局限性 ,因而不能将它们推广到其它领域。此研究就是试图解决此问题 :首先将Agent构造环境扩展为一个MultiAgent的图象理解系统 ,而此系统是知识入口的开发工具 ,它为用户提供一个类似于专家系统外壳的界面。这样一方面能加速图象理解系统的开发 ,另一方面方便那些缺乏图象理解知识的人开发图象理解应用程序。 展开更多
关键词 图象理解 色直方图 纹理真方图 区域抽取 图象处理 计算机视觉 MULTIAGENT
在线阅读 下载PDF
Adaptive Cyber Defense Technique Based on Multiagent Reinforcement Learning Strategies 被引量:1
17
作者 Adel Alshamrani Abdullah Alshahrani 《Intelligent Automation & Soft Computing》 SCIE 2023年第6期2757-2771,共15页
The static nature of cyber defense systems gives attackers a sufficient amount of time to explore and further exploit the vulnerabilities of information technology systems.In this paper,we investigate a problem where ... The static nature of cyber defense systems gives attackers a sufficient amount of time to explore and further exploit the vulnerabilities of information technology systems.In this paper,we investigate a problem where multiagent sys-tems sensing and acting in an environment contribute to adaptive cyber defense.We present a learning strategy that enables multiple agents to learn optimal poli-cies using multiagent reinforcement learning(MARL).Our proposed approach is inspired by the multiarmed bandits(MAB)learning technique for multiple agents to cooperate in decision making or to work independently.We study a MAB approach in which defenders visit a system multiple times in an alternating fash-ion to maximize their rewards and protect their system.We find that this game can be modeled from an individual player’s perspective as a restless MAB problem.We discover further results when the MAB takes the form of a pure birth process,such as a myopic optimal policy,as well as providing environments that offer the necessary incentives required for cooperation in multiplayer projects. 展开更多
关键词 Multiarmed bandits reinforcement learning multiagents intrusion detection systems
在线阅读 下载PDF
Multiagent系统通讯及单元可靠性综合评测
18
作者 郭勇 马培军 苏小红 《哈尔滨工业大学学报》 EI CAS CSCD 北大核心 2011年第1期83-88,共6页
为了更准确的评测Multiagent系统可靠性,提出了将通讯可靠性、单元可靠性及系统拓扑结构都考虑在评测过程中的综合评测方法.该方法概括出Multiagent系统的结构,釆用基于马尔可夫过程的方法,分析通讯可靠性、单元可靠性等因素对集中式、... 为了更准确的评测Multiagent系统可靠性,提出了将通讯可靠性、单元可靠性及系统拓扑结构都考虑在评测过程中的综合评测方法.该方法概括出Multiagent系统的结构,釆用基于马尔可夫过程的方法,分析通讯可靠性、单元可靠性等因素对集中式、分布式及联邦式Multiagent系统可靠性的影响,并给出了系统可靠性评测模型.仿真结果表明,该方法能够将复杂系统的可靠性评测问题分解为简单的形式.针对各分解后的结构进行评测,进一步得到了整个系统的可靠性. 展开更多
关键词 Multiagent系统 可靠性评测 MAS结构 备份agent
在线阅读 下载PDF
上一页 1 2 6 下一页 到第
使用帮助 返回顶部