应用深度强化学习的电池热管理系统控制策略

Control Strategy for Battery Thermal Management System Using Deep Reinforcement Learning

下载PDF

导出

摘要针对电动汽车电池热管理系统中传统控制方法温控精度不足及环境适应性差的难题,提出基于深度强化学习的智能控制体方法。基于电池热电耦合模型与制冷空调系统模型,应用强化学习中的双延迟深度确定性策略(TD3)算法进行控制策略训练,通过双重价值网络与延迟策略更新机制,克服传统强化学习中的过高估计问题。结果表明:在夏季充电的训练工况下,能够将电池包的平均温度控制在25℃左右;在冬季充电的训练工况下,能够将电池包的平均温度控制在20℃左右,电池模组之间的最大温差控制在1℃以内。同时,在控制动作上,智能体控制的压缩机转速的调整更为平缓,与比例-积分-微分控制、开关控制相比,智能体控制在夏季放电时最高节能了32.1%,充电时最高节能了15.8%,冬季放电时最高节能了17.0%,充电时最高节能了26.3%。此外,在环境条件变化时,智能体能够及时调整控制动作,将电池包的温度控制在目标温度附近。该研究利用TD3强化学习算法能够在多变的环境条件下平稳、精准地控制电池热管理系统,证明了强化学习在电池热管理中的可行性与有效性。 To address the challenges of insufficient temperature control accuracy and poor environmental adaptability in traditional control methods for electric vehicle battery thermal management systems,an intelligent agent control method based on deep reinforcement learning is proposed.Based on a battery electro-thermal coupling model and an air conditioning refrigeration system model,the twin delayed deep deterministic policy gradient(TD3)algorithm in reinforcement learning is applied to train the control strategy.By utilizing dual critic networks and a delayed policy update mechanism,the issue of overestimation common in traditional reinforcement learning is overcome.Results show that under summer charging conditions,the average battery pack temperature can be controlled around 25℃,while under winter charging conditions,it can be maintained around 20℃,with the maximum temperature difference between battery modules controlled within 1℃.Moreover,the compressor speed adjustments made by the intelligent agent are smoother.Compared with proportional-integral-derivative control and on-off control,the intelligent agent control achieves energy savings of up to 32.1%during summer discharging,15.8%during summer charging,17.0%during winter discharging,and 26.3%during winter charging.Additionally,when environmental conditions change,the agent can promptly adjust control actions to maintain the battery pack temperature near the target.This study demonstrates that the TD3 reinforcement learning algorithm can achieve stable and precise control of the battery thermal management system under varying environmental conditions,proving the feasibility and effectiveness of reinforcement learning in battery thermal management.

作者席椿富赵东鹏黄驰邹子豪黄琨杰谢翌 XI Chunfu;ZHAO Dongpeng;HUANG Chi;ZOU Zihao;HUANG Kunjie;XIE Yi(China Automotive Engineering Research Institute Co.,Ltd.,Chongqing 401122,China;School of Mechanical Engineering,Tianjin University,Tianjin 300350,China;Key Laboratory of Low-Grade Energy Utilization Technologies and Systems of Ministry of Education,Chongqing University,Chongqing 400044,China;School of Energy and Power Engineering,Chongqing University,Chongqing 400044,China;School of Mechanical and Transportation Engineering,Chongqing University,Chongqing 400044,China)

机构地区中国汽车工程研究院股份有限公司天津大学机械工程学院重庆大学低品位能源利用技术及系统教育部重点实验室重庆大学能源与动力工程学院重庆大学机械与运载工程学院

出处《西安交通大学学报》北大核心 2026年第2期24-37,共14页 Journal of Xi'an Jiaotong University

基金国家自然科学基金资助项目(52472375)。

关键词电池热管理电池热电耦合模型强化学习制冷空调系统 battery thermal management battery thermoelectric coupling model reinforcement learning air conditioning modeling

分类号 TM912 [电气工程—电力电子与电力传动]

引文网络
相关文献

1丁伟豪,屈正浩,申凌峰,王光辉,朱政宇,张千坤.基于RIS辅助的UAV物理层安全传输技术[J].无线电工程,2025,55(10):1976-1985.
2郭勇,许杰,郭林文,赵骥.湛江吴川国际机场制冷空调系统增效降耗实践与探索[J].制冷,2025,44(5):18-24.
3王望升,于飞宇,魏成波,陈勇,肖平.卷烟工厂制冷空调系统精准供能模式研究与应用[J].今日制造与升级,2025(6):104-106.
4董涵予.基于强化学习的智能机器人路径规划算法研究[J].通讯世界,2026,33(1):193-195.
5唐波,朱华鑫,王洪伟.制冷空调的节能技术应用及发展趋势探讨[J].数码设计(电子版),2023(2):0311-0313.
6陈志.轮胎硫化机电加热系统节能效率提升研究[J].中国轮胎资源综合利用,2025(12):53-55.
7郭欣,韩晓红,刘晓红.抓住制冷设备更新时机推进制冷剂回收及再生利用[J].中国经贸导刊,2025(7):68-71. 被引量：1
8王宝玉,冯晓东,李智卿.结合ADRC与PID的水下机器人抗扰控制方法设计[J].制造业自动化,2025,47(12):103-114.
9张震.基于智能控制的医院手术室洁净空调系统优化研究[J].张江科技评论,2025(9):146-148.
10刘沁沅,王怀禹.南充地区猪场智能化养殖环境的构建与优化:基于试验场的数据分析[J].猪业科学,2026,43(1):123-125.

西安交通大学学报

2026年第2期

浏览历史

内容加载中请稍等...

应用深度强化学习的电池热管理系统控制策略

相关作者

相关机构

相关主题

浏览历史