期刊文献+
共找到1篇文章
< 1 >
每页显示 20 50 100
Active Object Detection Based on PPO Learning Algorithm with Decision Knowledge Guidance
1
作者 Fujing Yao Guohui Tian +1 位作者 Yuhao Wang Ning Yang 《Machine Intelligence Research》 2025年第2期386-396,共11页
After detecting a target object,a service robot must approach the target object to perform the associated service task.In active object detection(AOD)tasks,effective feature information representation and comprehensiv... After detecting a target object,a service robot must approach the target object to perform the associated service task.In active object detection(AOD)tasks,effective feature information representation and comprehensive action execution strategies are crucial.Currently,most AOD tasks are accomplished by traditional reinforcement learning algorithms,but there are still problems such as high task failure rates and model training efficiency.To solve these problems,this paper proposes a combined data-driven and knowledge-guided solution.First,semantic information features,depth information features and target object bounding box information are used as inputs to comprehensively represent feature information.Second,a policy network is constructed based on the proximal policy optimizaton(PPO)algorithm.The reward value is set according to the robot′s action,the position of the bounding box,and the distance to the target object,and then applied to the robot′s training process.Finally,the knowledge of the path experience in the task,the robot′s collision avoidance ability and the prediction of target object loss are combined to guide the robot′s behavior,and a comprehensive decision model is proposed to enable the robot to make the best decision.Relevant experiments were conducted on an active vision dataset.The robot achieves an average success rate of 91.36%and an average step size of 9.3631 in performing the AOD task in the test scenes,which verifies the effectiveness of the proposed scheme. 展开更多
关键词 Service robot active object detection reinforcement learning path experience comprehensive decision model
原文传递
上一页 1 下一页 到第
使用帮助 返回顶部