Search Results - VIP Chinese Journal Service Platform


3 articles found


1. A policy gradient algorithm integrating long and short-term rewards for soft continuum arm control. Cited by: 3. Authors: DONG Xiang, ZHANG Jing, CHENG Long, XU WenJun, SU Hang, MEI Tao. Source: Science China(Technological Sciences) (SCIE, EI, CAS, CSCD), 2022, No. 10, pp. 2409-2419 (11 pages). Abstract: The soft continuum arm has extensive application in industrial production and human life due to its superior safety and flexibility. Reinforcement learning is a powerful technique for solving soft arm continuous contr... Keywords: soft arm control; Cosserat rod; deep reinforcement learning; policy gradient algorithm; high sample complexity.

2. A UAV collaborative defense scheme driven by DDPG algorithm. Cited by: 3. Authors: ZHANG Yaozhong, WU Zhuoran, XIONG Zhenkai, CHEN Long. Source: Journal of Systems Engineering and Electronics (SCIE, EI, CSCD), 2023, No. 5, pp. 1211-1224 (14 pages). Abstract: The deep deterministic policy gradient (DDPG) algorithm is an off-policy method that combines two mainstream reinforcement learning methods based on value iteration and policy iteration. Using the DDPG algorithm, agents ... Keywords: deep deterministic policy gradient (DDPG) algorithm; unmanned aerial vehicles (UAVs) swarm; task decision making; deep reinforcement learning; sparse reward problem.

3. Three-degree-of-freedom motion posture stabilization control of platform based on DTW-LSTM-MATD3 under high and low frequency disturbances of ships. Authors: Qin ZHANG, Jingyi ZHOU, Bangping GU, Xiong HU. Source: Journal of Zhejiang University-SCIENCE A, 2026, No. 3, pp. 246-261 (16 pages). Abstract: In the complex and variable deep-sea environment, the compensation control of ship motion ensures the safety and efficiency of equipment installation and transportation in offshore wind farms. However, the ship motion po... Keywords: compensation control; multi-agent twin delayed deep deterministic policy gradient (MATD3) algorithm; dynamic time warping (DTW) algorithm; long short-term memory (LSTM) network.

No.	Title	Authors	Source	Year	Citations
1	A policy gradient algorithm integrating long and short-term rewards for soft continuum arm control	DONG Xiang, ZHANG Jing, CHENG Long, XU WenJun, SU Hang, MEI Tao	Science China(Technological Sciences) (SCIE, EI, CAS, CSCD)	2022	3
2	A UAV collaborative defense scheme driven by DDPG algorithm	ZHANG Yaozhong, WU Zhuoran, XIONG Zhenkai, CHEN Long	Journal of Systems Engineering and Electronics (SCIE, EI, CSCD)	2023	3
3	Three-degree-of-freedom motion posture stabilization control of platform based on DTW-LSTM-MATD3 under high and low frequency disturbances of ships	Qin ZHANG, Jingyi ZHOU, Bangping GU, Xiong HU	Journal of Zhejiang University-SCIENCE A	2026	
