检索结果-维普期刊中文期刊服务平台

期刊文献⁺

任意字段

题名或关键词

题名

关键词

文摘

作者

第一作者

机构

刊名

分类号

参考文献

作者简介

基金资助

栏目信息

共找到5篇文章

< 1 >

每页显示 20 50 100

已选择0条

导出题录引用分析

统计分析

显示方式：

文摘详细列表

相关度排序被引量排序时效性排序

SPaRM: an efficient exploration and planning framework for sparse reward reinforcement learning: 1; 作者 BAN Jian LI Gongyan XU Shaoyun 《High Technology Letters》 EI CAS 2024年第4期344-355,共12页; Due to the issue of long-horizon,a substantial number of visits to the state space is required during the exploration phase of reinforcement learning(RL)to gather valuable information.Addi-tionally,due to the challeng... 展开更多; 关键词 reinforcement learning(RL) sparse reward reward-free exploration(RFE) space partitioning(SP) reverse merging(RM); 在线阅读下载PDF 职称材料

A UAV collaborative defense scheme driven by DDPG algorithm 被引量：3: 2; 作者 ZHANG Yaozhong WU Zhuoran +1 位作者 XIONG Zhenkai CHEN Long 《Journal of Systems Engineering and Electronics》 SCIE EI CSCD 2023年第5期1211-1224,共14页; The deep deterministic policy gradient(DDPG)algo-rithm is an off-policy method that combines two mainstream reinforcement learning methods based on value iteration and policy iteration.Using the DDPG algorithm,agents ... 展开更多; 关键词 deep deterministic policy gradient(DDPG)algorithm unmanned aerial vehicles(UAVs)swarm task decision making deep reinforcement learning sparse reward problem; 在线阅读下载PDF 职称材料

Hierarchical Reinforcement Learning With Automatic Sub-Goal Identification 被引量：1: 3; 作者 Chenghao Liu Fei Zhu +1 位作者 Quan Liu Yuchen Fu 《IEEE/CAA Journal of Automatica Sinica》 SCIE EI CSCD 2021年第10期1686-1696,共11页; In reinforcement learning an agent may explore ineffectively when dealing with sparse reward tasks where finding a reward point is difficult.To solve the problem,we propose an algorithm called hierarchical deep reinfo... 展开更多; 关键词 Hierarchical control hierarchical reinforcement learning OPTION sparse reward sub-goal; 在线阅读下载PDF 职称材料

Prioritization Hindsight Experience Based on Spatial Position Attention for Robots: 4; 作者 Ye Yuan Yu Sha +3 位作者 Feixiang Sun Haofan Lu Shuiping Gou Jie Luo 《Machine Intelligence Research》 2025年第1期160-175,共16页; Sparse rewards pose significant challenges in deep reinforcement learning as agents struggle to learn from experiences with limited reward signals.Hindsight experience replay(HER)addresses this problem by creating“sm... 展开更多; 关键词 Hindsight experience replay spatial position attention sparse reward deep reinforcement learning prioritization hindsight experience; 原文传递

Fast-converging Deep Reinforcement Learning for Optimal Dispatch of Large-scale Power Systems Under Transient Security Constraints: 5; 作者 Tannan Xiao Ying Chen +2 位作者 Han Diao Shaowei Huang Chen Shen 《Journal of Modern Power Systems and Clean Energy》 2025年第5期1495-1506,共12页; Power system optimal dispatch with transient security constraints is commonly represented as transient securityconstrained optimal power flow(TSC-OPF).Deep reinforcement learning(DRL)-based TSC-OPF trains efficient de... 展开更多; 关键词 Large-scale power system optimal dispatch transient security optimal power flow reinforcement learning sparse reward; 原文传递

	题名	作者	出处	发文年	被引量	操作
1	SPaRM: an efficient exploration and planning framework for sparse reward reinforcement learning	BAN Jian LI Gongyan XU Shaoyun	《High Technology Letters》 EI CAS	2024	0	在线阅读下载PDF 职称材料
2	A UAV collaborative defense scheme driven by DDPG algorithm	ZHANG Yaozhong WU Zhuoran XIONG Zhenkai CHEN Long	《Journal of Systems Engineering and Electronics》 SCIE EI CSCD	2023	3	在线阅读下载PDF 职称材料
3	Hierarchical Reinforcement Learning With Automatic Sub-Goal Identification	Chenghao Liu Fei Zhu Quan Liu Yuchen Fu	《IEEE/CAA Journal of Automatica Sinica》 SCIE EI CSCD	2021	1	在线阅读下载PDF 职称材料
4	Prioritization Hindsight Experience Based on Spatial Position Attention for Robots	Ye Yuan Yu Sha Feixiang Sun Haofan Lu Shuiping Gou Jie Luo	《Machine Intelligence Research》	2025	0	原文传递
5	Fast-converging Deep Reinforcement Learning for Optimal Dispatch of Large-scale Power Systems Under Transient Security Constraints	Tannan Xiao Ying Chen Han Diao Shaowei Huang Chen Shen	《Journal of Modern Power Systems and Clean Energy》	2025	0	原文传递

已选择0条

导出题录引用分析

统计分析

使用帮助返回顶部