期刊文献+
共找到376篇文章
< 1 2 19 >
每页显示 20 50 100
Harnessing Trend Theory to Enhance Distributed Proximal Point Algorithm Approaches for Multi-Area Economic Dispatch Optimization
1
作者 Yaming Ren Xing Deng 《Computers, Materials & Continua》 2025年第3期4503-4533,共31页
The exponential growth in the scale of power systems has led to a significant increase in the complexity of dispatch problem resolution,particularly within multi-area interconnected power grids.This complexity necessi... The exponential growth in the scale of power systems has led to a significant increase in the complexity of dispatch problem resolution,particularly within multi-area interconnected power grids.This complexity necessitates the employment of distributed solution methodologies,which are not only essential but also highly desirable.In the realm of computational modelling,the multi-area economic dispatch problem(MAED)can be formulated as a linearly constrained separable convex optimization problem.The proximal point algorithm(PPA)is particularly adept at addressing such mathematical constructs effectively.This study introduces parallel(PPPA)and serial(SPPA)variants of the PPA as distributed algorithms,specifically designed for the computational modelling of the MAED.The PPA introduces a quadratic term into the objective function,which,while potentially complicating the iterative updates of the algorithm,serves to dampen oscillations near the optimal solution,thereby enhancing the convergence characteristics.Furthermore,the convergence efficiency of the PPA is significantly influenced by the parameter c.To address this parameter sensitivity,this research draws on trend theory from stock market analysis to propose trend theory-driven distributed PPPA and SPPA,thereby enhancing the robustness of the computational models.The computational models proposed in this study are anticipated to exhibit superior performance in terms of convergence behaviour,stability,and robustness with respect to parameter selection,potentially outperforming existing methods such as the alternating direction method of multipliers(ADMM)and Auxiliary Problem Principle(APP)in the computational simulation of power system dispatch problems.The simulation results demonstrate that the trend theory-based PPPA,SPPA,ADMM and APP exhibit significant robustness to the initial value of parameter c,and show superior convergence characteristics compared to the residual balancing ADMM. 展开更多
关键词 Multi-area economic dispatch problem proximal point algorithm trend theory
在线阅读 下载PDF
Comparison of two kinds of approximate proximal point algorithms for monotone variational inequalities
2
作者 陶敏 《Journal of Southeast University(English Edition)》 EI CAS 2008年第4期537-540,共4页
This paper proposes two kinds of approximate proximal point algorithms (APPA) for monotone variational inequalities, both of which can be viewed as two extended versions of Solodov and Svaiter's APPA in the paper ... This paper proposes two kinds of approximate proximal point algorithms (APPA) for monotone variational inequalities, both of which can be viewed as two extended versions of Solodov and Svaiter's APPA in the paper "Error bounds for proximal point subproblems and associated inexact proximal point algorithms" published in 2000. They are both prediction- correction methods which use the same inexactness restriction; the only difference is that they use different search directions in the correction steps. This paper also chooses an optimal step size in the two versions of the APPA to improve the profit at each iteration. Analysis also shows that the two APPAs are globally convergent under appropriate assumptions, and we can expect algorithm 2 to get more progress in every iteration than algorithm 1. Numerical experiments indicate that algorithm 2 is more efficient than algorithm 1 with the same correction step size, 展开更多
关键词 monotone variational inequality approximate proximate point algorithm inexactness criterion
在线阅读 下载PDF
PROXIMAL POINT ALGORITHM WITH ERRORS FOR GENERALIZED STRONGLY NONLINEARQUASIVARIATIONAL INCLUSIONS 被引量:1
3
作者 丁协平 《Applied Mathematics and Mechanics(English Edition)》 SCIE EI 1998年第7期637-643,共7页
In this paper, a class of generalized strongly nonlinear quasivariational inclusions are studied. By using the properties of the resolvent operator associated with a maximal monotone; mapping in Hilbert space, an exis... In this paper, a class of generalized strongly nonlinear quasivariational inclusions are studied. By using the properties of the resolvent operator associated with a maximal monotone; mapping in Hilbert space, an existence theorem of solutions for generalized strongly nonlinear quasivariational inclusion is established and a new proximal point algorithm with errors is suggested for finding approximate solutions which strongly converge to the exact solution of the generalized strongly, nonlinear quasivariational inclusion. As special cases, some known results in this field are also discussed. 展开更多
关键词 generalized strongly nonlinear quasivariational inclusion proximal point algorithm with errors
在线阅读 下载PDF
Comparison of two approximal proximal point algorithms for monotone variational inequalities 被引量:1
4
作者 TAO Min 《Journal of Zhejiang University-Science A(Applied Physics & Engineering)》 SCIE EI CAS CSCD 2007年第6期969-977,共9页
Proximal point algorithms (PPA) are attractive methods for solving monotone variational inequalities (MVI). Since solving the sub-problem exactly in each iteration is costly or sometimes impossible, various approx... Proximal point algorithms (PPA) are attractive methods for solving monotone variational inequalities (MVI). Since solving the sub-problem exactly in each iteration is costly or sometimes impossible, various approximate versions ofPPA (APPA) are developed for practical applications. In this paper, we compare two APPA methods, both of which can be viewed as prediction-correction methods. The only difference is that they use different search directions in the correction-step. By extending the general forward-backward splitting methods, we obtain Algorithm Ⅰ; in the same way, Algorithm Ⅱ is proposed by spreading the general extra-gradient methods. Our analysis explains theoretically why Algorithm Ⅱ usually outperforms Algorithm Ⅰ. For computation practice, we consider a class of MVI with a special structure, and choose the extending Algorithm Ⅱ to implement, which is inspired by the idea of Gauss-Seidel iteration method making full use of information about the latest iteration. And in particular, self-adaptive techniques are adopted to adjust relevant parameters for faster convergence. Finally, some numerical experiments are reported on the separated MVI. Numerical results showed that the extending Algorithm II is feasible and easy to implement with relatively low computation load. 展开更多
关键词 Projection and contraction methods proximal point algorithm (PPA) Approximate PPA (APPA) Monotone variational inequality (MVI) Prediction and correction
在线阅读 下载PDF
MODIFIED APPROXIMATE PROXIMAL POINT ALGORITHMS FOR FINDING ROOTS OF MAXIMAL MONOTONE OPERATORS
5
作者 曾六川 《Applied Mathematics(A Journal of Chinese Universities)》 SCIE CSCD 2004年第3期293-301,共9页
In order to find roots of maximal monotone operators, this paper introduces and studies the modified approximate proximal point algorithm with an error sequence {e k} such that || ek || \leqslant hk || xk - [(x)\tilde... In order to find roots of maximal monotone operators, this paper introduces and studies the modified approximate proximal point algorithm with an error sequence {e k} such that || ek || \leqslant hk || xk - [(x)\tilde]k ||\left\| { e^k } \right\| \leqslant \eta _k \left\| { x^k - \tilde x^k } \right\| with ?k = 0¥ ( hk - 1 ) < + ¥\sum\limits_{k = 0}^\infty {\left( {\eta _k - 1} \right)} and infk \geqslant 0 hk = m\geqslant 1\mathop {\inf }\limits_{k \geqslant 0} \eta _k = \mu \geqslant 1 . Here, the restrictions on {η k} are very different from the ones on {η k}, given by He et al (Science in China Ser. A, 2002, 32 (11): 1026–1032.) that supk \geqslant 0 hk = v < 1\mathop {\sup }\limits_{k \geqslant 0} \eta _k = v . Moreover, the characteristic conditions of the convergence of the modified approximate proximal point algorithm are presented by virtue of the new technique very different from the ones given by He et al. 展开更多
关键词 modified approximate proximal point algorithm maximal monotone operator CONVERGENCE
在线阅读 下载PDF
On Over-Relaxed Proximal Point Algorithms for Generalized Nonlinear Operator Equation with (A,η,m)-Monotonicity Framework
6
作者 Fang Li 《International Journal of Modern Nonlinear Theory and Application》 2012年第3期67-72,共6页
In this paper, a new class of over-relaxed proximal point algorithms for solving nonlinear operator equations with (A,η,m)-monotonicity framework in Hilbert spaces is introduced and studied. Further, by using the gen... In this paper, a new class of over-relaxed proximal point algorithms for solving nonlinear operator equations with (A,η,m)-monotonicity framework in Hilbert spaces is introduced and studied. Further, by using the generalized resolvent operator technique associated with the (A,η,m)-monotone operators, the approximation solvability of the operator equation problems and the convergence of iterative sequences generated by the algorithm are discussed. Our results improve and generalize the corresponding results in the literature. 展开更多
关键词 New Over-Relaxed proximal Point algorithm Nonlinear OPERATOR Equation with (A η m)-Monotonicity FRAMEWORK Generalized RESOLVENT OPERATOR Technique Solvability and Convergence
在线阅读 下载PDF
多目标Proximal-Gradient算法的收敛性分析
7
作者 张世豪 张露方 李尹 《杭州师范大学学报(自然科学版)》 2025年第6期657-663,共7页
为研究求解无约束多目标优化问题的Proximal-Gradient算法的收敛性问题,对经典Polyak-Lojasiewicz不等式(P-L不等式)、近点P-L不等式及多目标近点P-L不等式进行推广,引入了带有指数的多目标近点P-L不等式(多目标广义近点P-L不等式).在... 为研究求解无约束多目标优化问题的Proximal-Gradient算法的收敛性问题,对经典Polyak-Lojasiewicz不等式(P-L不等式)、近点P-L不等式及多目标近点P-L不等式进行推广,引入了带有指数的多目标近点P-L不等式(多目标广义近点P-L不等式).在目标函数的可微部分满足梯度Lipschitz连续条件,以及多目标广义P-L不等式成立的条件下,得到了Proximal-Gradient算法的收敛性结果,并在引入指数为1的情况下,得到了Proximal-Gradient算法的线性收敛性. 展开更多
关键词 多目标优化 proximal-Gradient算法 收敛速率 线性收敛
在线阅读 下载PDF
Proximal point algorithm for a new class of fuzzy set-valued variational inclusions with (H,η)-monotone mappings
8
作者 李红刚 《Journal of Chongqing University》 CAS 2008年第1期79-84,共6页
We introduced a new class of fuzzy set-valued variational inclusions with (H,η)-monotone mappings. Using the resolvent operator method in Hilbert spaces, we suggested a new proximal point algorithm for finding approx... We introduced a new class of fuzzy set-valued variational inclusions with (H,η)-monotone mappings. Using the resolvent operator method in Hilbert spaces, we suggested a new proximal point algorithm for finding approximate solutions, which strongly converge to the exact solution of a fuzzy set-valued variational inclusion with (H,η)-monotone. The results improved and generalized the general quasi-variational inclusions with fuzzy set-valued mappings proposed by Jin and Tian Jin MM, Perturbed proximal point algorithm for general quasi-variational inclusions with fuzzy set-valued mappings, OR Transactions, 2005, 9(3): 31-38, (In Chinese); Tian YX, Generalized nonlinear implicit quasi-variational inclusions with fuzzy mappings, Computers & Mathematics with Applications, 2001, 42: 101-108. 展开更多
关键词 variational inclusion (H η)-monotone mapping resolvent operator technique fuzzy set-valued mapping proximal point algorithm convergence of numerical methods
在线阅读 下载PDF
TWO PARALLEL ALGORITHMS FOR A CLASS OF SPLIT COMMON SOLUTION PROBLEMS
9
作者 Truong Minh TUYEN Nguyen Thi TRANG Tran Thi HUONG 《Acta Mathematica Scientia》 2026年第1期505-518,共14页
We study the split common solution problem with multiple output sets for monotone operator equations in Hilbert spaces.To solve this problem,we propose two new parallel algorithms.We establish a weak convergence theor... We study the split common solution problem with multiple output sets for monotone operator equations in Hilbert spaces.To solve this problem,we propose two new parallel algorithms.We establish a weak convergence theorem for the first and a strong convergence theorem for the second. 展开更多
关键词 iterative algorithm Hilbert space metric projection proximal point algorithm
在线阅读 下载PDF
基于FA-LSTM-GRU的日光温室温度预测及拉膜通风控制研究
10
作者 李天华 赵敬德 +4 位作者 韩威 苏国秀 魏珉 张观山 赵秀艳 《农业工程》 2026年第1期61-69,共9页
日光温室作为冬季节能型蔬菜生产设施,内部温度控制面临高热惯性、强非线性与外部扰动大的挑战。传统通风控制策略普遍存在响应滞后与精度不足的问题,难以满足作物稳定生长的环境要求。为提升温室调温系统的智能化与实时性,提出一种基... 日光温室作为冬季节能型蔬菜生产设施,内部温度控制面临高热惯性、强非线性与外部扰动大的挑战。传统通风控制策略普遍存在响应滞后与精度不足的问题,难以满足作物稳定生长的环境要求。为提升温室调温系统的智能化与实时性,提出一种基于萤火虫算法(FA)-优化的长短期记忆网络(LSTM)-门控循环单元(GRU)混合模型(FALSTM-GRU),用于温室温度预测与通风控制。首先,结合LSTM与GRU结构,引入多头注意力机制(MHA)以增强时序特征提取能力,并通过FA优化模型超参数。其次,设计基于预测值的模型预测控制策略,利用近端策略优化(PPO)实现通风前瞻性调节。最后,搭建云服务器与Arduino平台的控制系统,实现闭环集成。试验结果表明,所构建的FALSTM-GRU模型在测试集上获得R2=0.9769、均方根误差0.7708°C的预测性能,控制策略能在±0.6°C范围内稳定温度波动,具备良好的控制精度与系统鲁棒性。 展开更多
关键词 日光温室 温度预测 通风控制 长短期记忆网络 门控循环神经网络 萤火虫算法 近端策略优化
在线阅读 下载PDF
基于双重决策机制的深度符号回归算法
11
作者 郭泽一 李凤莲 徐利春 《计算机应用》 北大核心 2026年第2期406-415,共10页
深度符号回归(DSR)算法由循环神经网络(RNN)自动化生成表达式树,进而获得较高的模型性能,然而,它无法兼顾表达式树的准确性和结构的简洁性。因此,提出一种基于双重决策机制的深度符号回归(DDSR)算法。首先,在RNN初步决策的基础上,利用... 深度符号回归(DSR)算法由循环神经网络(RNN)自动化生成表达式树,进而获得较高的模型性能,然而,它无法兼顾表达式树的准确性和结构的简洁性。因此,提出一种基于双重决策机制的深度符号回归(DDSR)算法。首先,在RNN初步决策的基础上,利用双评分机制综合评估表达式树的结构简洁性和准确性。其次,采用强化学习对表达式树生成进行训练,将表达式树生成视为序列决策过程,并利用风险近端策略优化(RPPO)算法进行奖励反馈以更新下一批次的模型参数。在公共数据集上的实验结果表明,相较于DSR算法,DDSR算法在拟合度相关系数上最多提高了0.396,最少提高了0.001,而整体性能提升了0.116。以上证明了DDSR算法的有效性。 展开更多
关键词 符号回归 深度学习 评分机制 近端策略优化算法 风险寻优策略梯度
在线阅读 下载PDF
基于ACVAE-MPPO算法的端到端自动驾驶算法研究
12
作者 于康鸿 张军 刘元盛 《计算机工程与应用》 北大核心 2026年第4期210-223,共14页
由于道路类型多样、交互实体众多以及环境复杂,在城市环境中实现高效的自动驾驶是当今自动驾驶技术研究的重点和挑战之一。端到端强化学习在自动驾驶应用中,面临表征模型提取特征能力不足和决策模型学习特征间历史联系困难的问题,这些... 由于道路类型多样、交互实体众多以及环境复杂,在城市环境中实现高效的自动驾驶是当今自动驾驶技术研究的重点和挑战之一。端到端强化学习在自动驾驶应用中,面临表征模型提取特征能力不足和决策模型学习特征间历史联系困难的问题,这些限制影响了算法在复杂城市环境中的决策性能。针对上述问题,提出ACVAE-MPPO算法。为了解决特征提取精度低的问题,在变分自编码器(variational auto-encoder,VAE)中加入坐标卷积层,使用判别器进行辅助训练,形成辅助训练坐标卷积变分自编码器(auxiliary training coordinate convolutional variational auto-encoder,ACVAE),最终提升特征提取的精度;为了增强决策模型提取历史特征的能力,在近端策略优化算法(proximal policy optimization,PPO)中引入长短期记忆网络,形成记忆近端策略优化算法(memory proximal policy optimization,MPPO),使PPO能够记忆和有效利用时序信息,提升决策准确性。将两个模型结合形成ACVAE-MPPO算法。Carla仿真器的实验结果表明,ACVAE-MPPO算法能展现出更强的决策能力,实现更稳定且成功率更高的驾驶决策。 展开更多
关键词 变分自编码器 近端策略优化算法 深度强化学习 自动驾驶
在线阅读 下载PDF
“数字附近”:数字平台劳动中的社会性联结与社会资本重构
13
作者 倪晨旭 《人口与社会》 2026年第1期48-60,共13页
立足于中国数字平台经济的现实情境,提出“数字附近”这一核心分析概念,探讨平台零工劳动者在看似原子化的工作环境中如何重构社会性联结与社会资本。平台劳动者依托数字媒介,围绕特定的工作场景、共享的职业身份及生存策略,形成了一种... 立足于中国数字平台经济的现实情境,提出“数字附近”这一核心分析概念,探讨平台零工劳动者在看似原子化的工作环境中如何重构社会性联结与社会资本。平台劳动者依托数字媒介,围绕特定的工作场景、共享的职业身份及生存策略,形成了一种线上与线下一体化的新型社会关系网络。这种“数字附近”修正了关于平台劳动者的原子化论述,不仅承载着信息协同、风险规避等工具性功能,更在特定条件下衍生出情感支持、身份认同乃至潜在集体行动等价值理性功能,从而构成一种独特的嵌合型社会资本。通过与社会资本经典理论对话,证实“数字附近”中外围弱关系负责信息高效流通、核心强关系提供深度互惠支持,同时其内含赋权与规训的双重逻辑,既提升了劳动者的生存韧性,也可能引发内卷竞争与被平台柔性收编的风险。应转变政策思路,从单一规制平台转为赋能劳动者社群的多元治理;建构空间支持机制,让线上的“数字附近”在线下落地生根;探索组织化路径创新,培育适应平台经济特征的新型劳动者组织形态。 展开更多
关键词 数字附近 平台经济 数字技术 算法治理 零工劳动者 社会资本
在线阅读 下载PDF
计及碳排放的电动汽车充电站优化定价策略
14
作者 尹力 盛俊杰 +1 位作者 袁杰 冯燕钧 《电气传动》 2026年第2期50-57,共8页
在碳中和背景下,提出了一种计及碳排放的电动汽车充电站优化定价策略。首先,构建电动汽车用户价格响应特性模型;其次,建立以碳排放最低、负荷峰谷差最小以及充电站收益最高为目标的定价优化模型并将其转化为马尔科夫决策过程;然后,提出... 在碳中和背景下,提出了一种计及碳排放的电动汽车充电站优化定价策略。首先,构建电动汽车用户价格响应特性模型;其次,建立以碳排放最低、负荷峰谷差最小以及充电站收益最高为目标的定价优化模型并将其转化为马尔科夫决策过程;然后,提出一种基于时间差分误差的改进近端策略优化算法,以提高算法的效率与稳定性;最后,通过算例分析表明,所提定价策略能够降低配电网负荷峰谷差,提高充电站经济效益与低碳水平。 展开更多
关键词 充电站 碳排放 定价策略 深度强化学习 改进近端策略优化算法
在线阅读 下载PDF
Improved pruning algorithm for Gaussian mixture probability hypothesis density filter 被引量:8
15
作者 NIE Yongfang ZHANG Tao 《Journal of Systems Engineering and Electronics》 SCIE EI CSCD 2018年第2期229-235,共7页
With the increment of the number of Gaussian components, the computation cost increases in the Gaussian mixture probability hypothesis density(GM-PHD) filter. Based on the theory of Chen et al, we propose an improved ... With the increment of the number of Gaussian components, the computation cost increases in the Gaussian mixture probability hypothesis density(GM-PHD) filter. Based on the theory of Chen et al, we propose an improved pruning algorithm for the GM-PHD filter, which utilizes not only the Gaussian components’ means and covariance, but their weights as a new criterion to improve the estimate accuracy of the conventional pruning algorithm for tracking very closely proximity targets. Moreover, it solves the end-less while-loop problem without the need of a second merging step. Simulation results show that this improved algorithm is easier to implement and more robust than the formal ones. 展开更多
关键词 Gaussian mixture probability hypothesis density(GM-PHD) filter pruning algorithm proximity targets clutter rate
在线阅读 下载PDF
A CLASS OF COLLINEAR SCALING ALGORITHMS FOR UNCONSTRAINED OPTIMIZATON
16
作者 盛松柏 《Numerical Mathematics A Journal of Chinese Universities(English Series)》 SCIE 1997年第2期219-230,共12页
A Class of Collinear Scaling Algorithms for Unconstrained Optimization. An appealing approach to the solution of nonlinear optimization problems based on conic models of the objective function has been in troduced by ... A Class of Collinear Scaling Algorithms for Unconstrained Optimization. An appealing approach to the solution of nonlinear optimization problems based on conic models of the objective function has been in troduced by Davidon (1980). It leads to a broad class of algorithms which can be considered to generalize the existing quasi-Newton methods. One particular member of this class has been deeply discussed by Sorensen (1980), who has proved some interesting theoretical properties. In this paper, we generalize Sorensen’s technique to Spedicato three-parameter family of variable-metric updates. Furthermore, we point out that the collinear scaling three- parameter family is essentially equivalent to the Spedicato three-parameter family. In addition, numerical expriments have been carried out to compare some colliner scaling algorithms with a straightforward implementation of the BFGS quasi-Newton method. 展开更多
关键词 UNCONSTRAINED optimization CONIC models COLLINEAR scaling quasi-newton algorithms.
在线阅读 下载PDF
Existence and algorithm of solutions for a system of generalized mixed implicit equilibrium problems in Banach spaces
17
作者 丁协平 《Applied Mathematics and Mechanics(English Edition)》 SCIE EI 2010年第9期1049-1062,共14页
A new system of generalized mixed implicit equilibrium problems is introduced and studied in Banach spaces. First, the notion of the Yosida proximal mapping for generalized mixed implicit equilibrium problems is intro... A new system of generalized mixed implicit equilibrium problems is introduced and studied in Banach spaces. First, the notion of the Yosida proximal mapping for generalized mixed implicit equilibrium problems is introduced. By using the notion, a system of generalized equation problems is considered, and its equivalence with the system of generalized mixed implicit equilibrium problems is also proved. Next, by applying the system of generalized equation problems, we suggest and analyze an iterative algorithm to compute the approximate solutions of the system of generalized mixed implicit equilibrium problems. The strong convergence of the iterative sequences generated by the algorithm is proved under quite mild conditions. The results are new and unify and generalize some recent results in this field. 展开更多
关键词 generalized mixed implicit equilibrium problem Yosida proximal mapping generalized equation problem iterative algorithm Banach space
在线阅读 下载PDF
决策学习型蜣螂优化算法的无人机协同路径规划 被引量:3
18
作者 张乐 胡毅文 +2 位作者 杨红 杨超 马宏远 《计算机应用研究》 北大核心 2025年第1期196-204,共9页
针对多无人机协同路径规划问题,提出了一种决策学习型蜣螂优化算法(DLDBO)。传统蜣螂优化算法(DBO)种群之间缺乏信息互换,容易陷入局部最优解。因此,利用Pearson相关系数计算个体之间的相似性,通过相似性指标判断并作出决策:若不相似,... 针对多无人机协同路径规划问题,提出了一种决策学习型蜣螂优化算法(DLDBO)。传统蜣螂优化算法(DBO)种群之间缺乏信息互换,容易陷入局部最优解。因此,利用Pearson相关系数计算个体之间的相似性,通过相似性指标判断并作出决策:若不相似,利用折射反向学习计算得到候选解,在一定程度上提高个体之间影响的同时增强算法跳出局部最优的能力;若相似,利用所提出的链式邻近学习引导蜣螂个体,增加影响个体更新的因素,充分促进个体之间的信息交流。在CEC2017测试套件的29个测试函数上进行了充分的对比实验,结果表明,DLDBO性能明显优于其他六种先进的变体算法。利用DLDBO规划无人机群的飞行路径,最终能够得到较为理想的协同路径并且有效避开威胁,优于其余三种优秀的协同路径规划算法,满足了无人机协同飞行的需求。 展开更多
关键词 蜣螂优化算法 折射反向学习 链式邻近学习 无人机协同路径规划
在线阅读 下载PDF
基于改进近端策略优化算法的柔性作业车间调度 被引量:3
19
作者 王艳红 付威通 +2 位作者 张俊 谭园园 田中大 《控制与决策》 北大核心 2025年第6期1883-1891,共9页
柔性作业车间调度是经典且复杂的组合优化问题,对于离散制造系统的生产优化具有重要的理论和实际意义.基于多指针图网络框架和近端策略优化算法设计一种求解柔性作业车间调度问题的深度强化学习算法.首先,将“工序-机器”分配调度过程... 柔性作业车间调度是经典且复杂的组合优化问题,对于离散制造系统的生产优化具有重要的理论和实际意义.基于多指针图网络框架和近端策略优化算法设计一种求解柔性作业车间调度问题的深度强化学习算法.首先,将“工序-机器”分配调度过程表征成由选择工序和分配机器两类动作构成的马尔可夫决策过程;其次,通过解耦策略解除动作之间的耦合关系,并设计新的损失函数和贪婪采样策略以提高算法的验证推理能力;在此基础上扩充状态空间,使评估网络能够更全面地感知与评估,从而进一步提升算法的学习和决策能力.在随机生成算例及基准算例上进行仿真和对比分析,验证算法的良好性能及泛化能力. 展开更多
关键词 柔性作业车间调度 近端策略优化算法 双动作耦合网络 损失函数优化 贪婪采样 深度强化学习
原文传递
基于深度强化学习的游戏智能引导算法 被引量:2
20
作者 白天 吕璐瑶 +1 位作者 李储 何加亮 《吉林大学学报(理学版)》 北大核心 2025年第1期91-98,共8页
针对传统游戏智能体算法存在模型输入维度大及训练时间长的问题,提出一种结合状态信息转换与奖励函数塑形技术的新型深度强化学习游戏智能引导算法.首先,利用Unity引擎提供的接口直接读取游戏后台信息,以有效压缩状态空间的维度,减少输... 针对传统游戏智能体算法存在模型输入维度大及训练时间长的问题,提出一种结合状态信息转换与奖励函数塑形技术的新型深度强化学习游戏智能引导算法.首先,利用Unity引擎提供的接口直接读取游戏后台信息,以有效压缩状态空间的维度,减少输入数据量;其次,通过精细化设计奖励机制,加速模型的收敛过程;最后,从主观定性和客观定量两方面对该算法模型与现有方法进行对比实验,实验结果表明,该算法不仅显著提高了模型的训练效率,还大幅度提高了智能体的性能. 展开更多
关键词 深度强化学习 游戏智能体 奖励函数塑形 近端策略优化算法
在线阅读 下载PDF
上一页 1 2 19 下一页 到第
使用帮助 返回顶部