A Low-Collision and Efficient Grasping Method for Manipulator Based on Safe Reinforcement Learning

下载PDF

导出

摘要 Grasping is one of the most fundamental operations in modern robotics applications.While deep rein-forcement learning(DRL)has demonstrated strong potential in robotics,there is too much emphasis on maximizing the cumulative reward in executing tasks,and the potential safety risks are often ignored.In this paper,an optimization method based on safe reinforcement learning(Safe RL)is proposed to address the robotic grasping problem under safety constraints.Specifically,considering the obstacle avoidance constraints of the system,the grasping problem of the manipulator is modeled as a Constrained Markov Decision Process(CMDP).The Lagrange multiplier and a dynamic weighted mechanism are introduced into the Proximal Policy Optimization(PPO)framework,leading to the development of the dynamic weighted Lagrange PPO(DWL-PPO)algorithm.The behavior of violating safety constraints is punished while the policy is optimized in this proposed method.In addition,the orientation control of the end-effector is included in the reward function,and a compound reward function adapted to changes in pose is designed.Ultimately,the efficacy and advantages of the suggested method are proved by extensive training and testing in the Pybullet simulator.The results of grasping experiments reveal that the recommended approach provides superior safety and efficiency compared with other advanced RL methods and achieves a good trade-off between model learning and risk aversion.

作者 Qinglei Zhang Bai Hu Jiyun Qin Jianguo Duan Ying Zhou

机构地区 China Institute of FTZ Supply Chain

出处《Computers, Materials & Continua》 2025年第4期1257-1273,共17页 计算机、材料和连续体(英文)

关键词 Safe reinforcement learning(Safe RL) manipulator grasping obstacle avoidance constraints lagrange multiplier dynamic weighted

分类号 TP242 [自动化与计算机技术—检测技术与自动化装置]

引文网络
相关文献

1马敏捷,方晓峰.基于5E教学模式的高等数学教学探讨——以条件极值为例[J].创新教育研究,2024,12(12):160-165.
2陈江涛.深度强化学习在无人驾驶路径规划中的应用[J].智能城市应用,2025,8(4):98-100. 被引量：1
3尹昊,陈帆,和红杰.基于深度强化学习的四向协同三维装箱方法[J].自动化学报,2024,50(12):2420-2431. 被引量：1
4Ektor-Ioannis E.Stasinos,Dimitris N.Trakas,Nikos D.Hatziargyriou.Microgrids for power system resilience enhancement[J].iEnergy,2022,1(2):158-169. 被引量：2
5Ziwu Ren,Zhongyuan Wang,Xiaohan Liu,Rui Lin.Parameterization-based trajectory planning for an 8-DOF manipulator with multiple constraints[J].Biomimetic Intelligence & Robotics,2025,5(1):67-76.
6Rajesh Kannan Megalingam,Kondareddy Thanigundala,Sreevatsava Reddy Musani,Hemanth Nidamanuru,Lokesh Gadde.Indian traffic sign detection and recognition using deep learning[J].International Journal of Transportation Science and Technology,2023,12(3):683-699. 被引量：1
7Yanan LU,Ke YOU,Yuxiang WANG,Ying LIU,Cheng ZHOU,Yutian JIANG,Zhangang WU.Deep Reinforcement Learning for automated scheduling of mining earthwork equipment with spatio-temporal safety constraints[J].Frontiers of Engineering Management,2025,12(1):39-58.
8DUO Nanxun,WANG Qinzhao,LYU Qiang,WANG Wei.Tactical reward shaping for large-scale combat by multi-agent reinforcement learning[J].Journal of Systems Engineering and Electronics,2024,35(6):1516-1529. 被引量：1
9Zhifei Shen,Zhiyong Jiang,Jingwang Zhang,Jun Wu,Qiuguo Zhu.Learning-based robot assembly method for peg insertion tasks on inclined hole using time-series force information[J].Biomimetic Intelligence & Robotics,2025,5(1):116-123.
10ZHANG Xiao-cang,XULi-ping.High Energy Normalized Solutions for the Schrodinger Equations with Exponential Critical Growth[J].Chinese Quarterly Journal of Mathematics,2025,40(1):1-19.

Computers, Materials & Continua

2025年第4期

浏览历史

内容加载中请稍等...

A Low-Collision and Efficient Grasping Method for Manipulator Based on Safe Reinforcement Learning

相关作者

相关机构

相关主题

浏览历史