基于分布式联邦强化学习的多区域综合能源系统优化调度

Distributed federated reinforcement learning-based optimal scheduling of multi-regional integrated energy system

下载PDF

导出

摘要针对传统优化方法及集中式联邦强化学习在隐私保护和计算效率方面存在的局限性,提出一种基于分布式联邦强化学习的多区域综合能源系统优化调度方法。每个区域综合能源系统由单独智能体管理,各智能体通过双延迟确定性策略梯度算法优化本地Critic网络的参数,并与邻域智能体进行参数信息交互,无需额外的中央服务器即可高效管理能量调度。为了保证全局最优,参数交互的权重系数由双随机矩阵元素确定。算例分析结果表明,所提方法能在增强对综合能源系统隐私保护的同时,展现出良好的收敛性能,有效降低了运营成本。 Aiming at the limitations of traditional optimization methods and centralized federated reinforcement learning in terms of privacy protection and computational efficiency,a distributed federated reinforcement learning-based optimal scheduling method of multi-regional integrated energy system is proposed.Each regional integrated energy system is managed by an individual agent.Each agent optimizes the parameters of its local Critic network using the twin delayed deep deterministic policy gradient algorithm and exchanges parameter information with the neighboring agents,thereby enabling efficient energy scheduling management without reliance on an additional centralized server.To ensure global optimization,the weight coefficients of parameter interaction are determined by the elements of a doubly stochastic matrix.The results of case study analysis show that the proposed method can enhance the privacy protection of the integrated energy system while demonstrating excellent convergence performance and effectively reducing operation costs.

作者朱新文王家奇李生炜林文杰吴祥郭方洪 ZHU Xinwen;WANG Jiaqi;LI Shengwei;LIN Wenjie;WU Xiang;GUO Fanghong(College of Information Engineering,Zhejiang University of Technology,Hangzhou 310023,China)

机构地区浙江工业大学信息工程学院

出处《电力自动化设备》北大核心 2026年第4期94-102,共9页 Electric Power Automation Equipment

基金国家自然科学基金资助项目(62373328)。

关键词综合能源系统优化调度分布式联邦强化学习智能体双延迟确定性策略梯度算法 integrated energy system optimal scheduling distributed federated reinforcement learning agent twin delayed deep deterministic policy gradient algorithm

分类号 TM73 [电气工程—电力系统及自动化] TK01 [动力工程及工程热物理]

引文网络
相关文献

1李杰,刘苠渝,乔德文,乐俊青,向涛.基于数据集蒸馏的安全高效一次性交互联邦学习[J].西南大学学报(自然科学版),2026,48(4):155-166.
2王李羊.历史文化街区中的秩序感对当代设计的启示——以南通寺街历史文化街区导视设计为例[J].江苏工程职业技术学院学报,2025,25(2):62-68.
3牛福生,张红梅,张晋霞,王研,于晓东.基于PSO-BP算法的平-摆筛参数交互对分层效果的影响[J].矿产综合利用,2026,47(1):151-160.
4李贤壮,邓文泽,程万友.一种求解稀疏约束优化问题的无记忆拟牛顿算法[J].嘉兴大学学报,2025,37(6):14-23.
5姜苏宸.涤纶油剂的分离与剖析[J].时代技术,2025,3(3):52-57.

电力自动化设备

2026年第4期

浏览历史

内容加载中请稍等...

基于分布式联邦强化学习的多区域综合能源系统优化调度

相关作者

相关机构

相关主题

浏览历史