Research on Diffusion Reinforcement Learning Method for Vehicle Trajectory Tracking and Collision Avoidance of Autonomous Vehicles

Abstract: Making autonomous vehicles more intelligent is key to transforming and upgrading the automotive industry, and trajectory tracking with collision avoidance is crucial to driving safety. To address the insufficient exploration of existing reinforcement learning control methods, this study proposes a diffusion-based reinforcement learning algorithm. The conventional policy network is replaced with a diffusion-based generative policy network, which brings the multimodal distribution matching capability of diffusion models into the reinforcement learning framework. Combining this policy with the distributional soft actor-critic algorithm yields the diffusion distributional actor-critic (DDAC) algorithm. Simulation and real-vehicle experiments show that the proposed algorithm explores efficiently: on the real vehicle, the mean lateral tracking error is below 0.03 m and the mean speed tracking error is below 0.05 m/s, which verifies the advantage of the algorithm.
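The idea of a diffusion-based generative policy can be illustrated with a minimal sketch. This is not the DDAC implementation from the paper. It only shows how a DDPM-style reverse process turns Gaussian noise into an action, and how such a policy can represent two distinct modes. Everything named here is an assumption for illustration: the function names, the linear noise schedule, the scalar action, and above all `toy_eps_model`, a closed-form stand-in for the learned noise-prediction network. It assumes the desired action is `+state` or `-state` with equal probability (for example, passing an obstacle on either side).

```python
import math
import random


def make_schedule(num_steps=50, beta_min=1e-4, beta_max=0.2):
    """Linear noise schedule (assumed); returns (betas, alphas, alpha_bars)."""
    betas = [beta_min + (beta_max - beta_min) * k / (num_steps - 1)
             for k in range(num_steps)]
    alphas = [1.0 - b for b in betas]
    alpha_bars, prod = [], 1.0
    for a in alphas:
        prod *= a
        alpha_bars.append(prod)
    return betas, alphas, alpha_bars


def toy_eps_model(a_t, alpha_bar, state):
    """Hypothetical stand-in for a learned noise-prediction network.

    Toy assumption: the clean action is +state or -state with equal
    probability, so the optimal noise prediction has a closed form.
    """
    s = math.sqrt(alpha_bar)
    # Posterior mean of the clean action given the noisy action a_t.
    a0_hat = state * math.tanh(s * state * a_t / (1.0 - alpha_bar))
    return (a_t - s * a0_hat) / math.sqrt(1.0 - alpha_bar)


def sample_action(state, rng, schedule, eps_model=toy_eps_model):
    """DDPM-style reverse process: start from noise, denoise step by step."""
    betas, alphas, alpha_bars = schedule
    a = rng.gauss(0.0, 1.0)  # pure noise at the last diffusion step
    for k in reversed(range(len(betas))):
        eps_hat = eps_model(a, alpha_bars[k], state)
        mean = (a - betas[k] / math.sqrt(1.0 - alpha_bars[k]) * eps_hat)
        mean /= math.sqrt(alphas[k])
        # No noise is added on the final denoising step.
        noise = rng.gauss(0.0, 1.0) if k > 0 else 0.0
        a = mean + math.sqrt(betas[k]) * noise
    return a


rng = random.Random(0)
schedule = make_schedule()
actions = [sample_action(1.0, rng, schedule) for _ in range(200)]
num_positive = sum(1 for a in actions if a > 0)
print(f"{num_positive} of {len(actions)} samples chose the positive mode")
```

A unimodal Gaussian policy fitted to the same target would place its mean between the two modes, which is the limitation that multimodal distribution matching is meant to avoid. In an actor-critic setting the noise predictor would be a trained network conditioned on the vehicle state, which this sketch does not attempt to reproduce.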

Authors: Zhao Junjie; Wang Yinuo; Wu Jiang; Wu Sichao; Zou Changdi; Wang Hongda; Li Shengbo Eben; Ma Fei; Duan Jingliang (School of Mechanical Engineering, University of Science and Technology Beijing, Beijing 100083; School of Vehicle and Mobility, Tsinghua University, Beijing 100084)

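Affiliations: School of Mechanical Engineering, University of Science and Technology Beijing; School of Vehicle and Mobility, Tsinghua University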

Source: Automotive Engineering (《汽车工程》), Peking University Core Journal, 2025, No. 8, pp. 1490-1500 (11 pages)

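Funding: Supported by the National Natural Science Foundation of China (52202487, 62273256) and the Fundamental Research Funds for the Central Universities (FRF-OT-23-02).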

Keywords: trajectory tracking; active collision avoidance; distributional reinforcement learning; diffusion model

Classification number: U463.6 [Mechanical Engineering: Vehicle Engineering]
