期刊文献+
共找到396篇文章
< 1 2 20 >
每页显示 20 50 100
Distributed projection subgradient algorithm for two-network zero-sum game with random sleep scheme 被引量:1
1
作者 Hongyun Xiong Jiangxiong Han +1 位作者 Xiaohong Nian Shiling Li 《Control Theory and Technology》 EI CSCD 2021年第3期405-417,共13页
In this paper,a zero-sum game Nash equilibrium computation problem with a common constraint set is investigated under two time-varying multi-agent subnetworks,where the two subnetworks have opposite payoff function.A ... In this paper,a zero-sum game Nash equilibrium computation problem with a common constraint set is investigated under two time-varying multi-agent subnetworks,where the two subnetworks have opposite payoff function.A novel distributed projection subgradient algorithm with random sleep scheme is developed to reduce the calculation amount of agents in the process of computing Nash equilibrium.In our algorithm,each agent is determined by an independent identically distributed Bernoulli decision to compute the subgradient and perform the projection operation or to keep the previous consensus estimate,it effectively reduces the amount of computation and calculation time.Moreover,the traditional assumption of stepsize adopted in the existing methods is removed,and the stepsizes in our algorithm are randomized diminishing.Besides,we prove that all agents converge to Nash equilibrium with probability 1 by our algorithm.Finally,a simulation example verifies the validity of our algorithm. 展开更多
关键词 zero-sum game Nash equilibrium Time-varying multi-agent network Projection subgradient algorithm Random sleep scheme
原文传递
Interactive Fuzzy Approaches for Solving Multiobjective Two-Person Zero-Sum Games
2
作者 Hitoshi Yano Ichiro Nishizaki 《Applied Mathematics》 2016年第5期387-398,共12页
In this paper, we consider multiobjective two-person zero-sum games with vector payoffs and vector fuzzy payoffs. We translate such games into the corresponding multiobjective programming problems and introduce the pe... In this paper, we consider multiobjective two-person zero-sum games with vector payoffs and vector fuzzy payoffs. We translate such games into the corresponding multiobjective programming problems and introduce the pessimistic Pareto optimal solution concept by assuming that a player supposes the opponent adopts the most disadvantage strategy for the self. It is shown that any pessimistic Pareto optimal solution can be obtained on the basis of linear programming techniques even if the membership functions for the objective functions are nonlinear. Moreover, we propose interactive algorithms based on the bisection method to obtain a pessimistic compromise solution from among the set of all pessimistic Pareto optimal solutions. In order to show the efficiency of the proposed method, we illustrate interactive processes of an application to a vegetable shipment problem. 展开更多
关键词 Multiobjective Two-Person zero-sum Games LR Fuzzy Numbers Fuzzy Payoff Matrices Fuzzy Goals Possibility Measure Pareto Optimal Solutions Linear Programming
在线阅读 下载PDF
It Is Not A Zero-Sum Game
3
作者 Liu Xinwei 《China's Foreign Trade》 2018年第6期46-47,共2页
Nowadays,China is the largest developing country in the world,and the US is the largest developed country in the world.Sino-US economic and trade relations are of great significance to the two nations and may have apr... Nowadays,China is the largest developing country in the world,and the US is the largest developed country in the world.Sino-US economic and trade relations are of great significance to the two nations and may have aprominent impact on the stability and development of the global economy. 展开更多
关键词 US It Is Not A zero-sum Game WTO
在线阅读 下载PDF
Polynomial Time Method for Solving Nash Equilibria of Zero-Sum Games
4
作者 Yoshihiro Tanaka Mitsuru Togashi 《American Journal of Computational Mathematics》 2021年第1期23-30,共8页
There are a few studies that focus on solution methods for finding a Nash equilibrium of zero-sum games. We discuss the use of Karmarkar’s interior point method to solve the Nash equilibrium problems of a zero-sum ga... There are a few studies that focus on solution methods for finding a Nash equilibrium of zero-sum games. We discuss the use of Karmarkar’s interior point method to solve the Nash equilibrium problems of a zero-sum game, and prove that it is theoretically a polynomial time algorithm. We implement the Karmarkar method, and a preliminary computational result shows that it performs well for zero-sum games. We also mention an affine scaling method that would help us compute Nash equilibria of general zero-sum games effectively. 展开更多
关键词 zero-sum Games Nash Equilibria Karmarkar’s Method Polynomial Time
在线阅读 下载PDF
Data-based Optimal Control for Discrete-time Zero-sum Games of 2-D Systems Using Adaptive Critic Designs 被引量:8
5
作者 WEI Qing-Lai ZHANG Hua-Guang CUI Li-Li 《自动化学报》 EI CSCD 北大核心 2009年第6期682-692,共11页
关键词 自适应系统 最优控制 离散时间 自动化系统
在线阅读 下载PDF
On-Policy and Off-Policy Value Iteration Algorithms for Stochastic Zero-Sum Dynamic Games
6
作者 GUO Liangyuan WANG Bing-Chang ZHANG Ji-Feng 《Journal of Systems Science & Complexity》 2025年第1期421-435,共15页
This paper considers the value iteration algorithms of stochastic zero-sum linear quadratic games with unkown dynamics.On-policy and off-policy learning algorithms are developed to solve the stochastic zero-sum games,... This paper considers the value iteration algorithms of stochastic zero-sum linear quadratic games with unkown dynamics.On-policy and off-policy learning algorithms are developed to solve the stochastic zero-sum games,where the system dynamics is not required.By analyzing the value function iterations,the convergence of the model-based algorithm is shown.The equivalence of several types of value iteration algorithms is established.The effectiveness of model-free algorithms is demonstrated by a numerical example. 展开更多
关键词 Approximate dynamic programming on-policy off-policy stochastic zero-sum games valueiteration
原文传递
通信受限的双网络零和博弈分布式在线优化
7
作者 廖岚 于湛 +2 位作者 袁德明 张保勇 徐胜元 《自动化学报》 北大核心 2026年第1期108-120,共13页
研究双网络零和博弈中的分布式优化问题,其中两个网络代表两个对立的玩家.每个网络由一组具有时变损失函数的智能体组成,智能体通过通信和协作来优化己方网络在博弈中的收益.考虑到现实优化场景中通信资源受限和信息反馈受限两种通信受... 研究双网络零和博弈中的分布式优化问题,其中两个网络代表两个对立的玩家.每个网络由一组具有时变损失函数的智能体组成,智能体通过通信和协作来优化己方网络在博弈中的收益.考虑到现实优化场景中通信资源受限和信息反馈受限两种通信受限情形,设计基于事件触发通信和两点Bandit反馈的分布式在线优化算法,并采用动态纳什均衡遗憾评估算法的性能.在某些假设条件下,建立相对于总博弈次数为次线性的动态纳什均衡遗憾界,从而验证了算法的有效性.此外,将设计的算法拓展为多周期版本并建立次线性的动态纳什均衡遗憾界.最后,通过双线性矩阵博弈的仿真算例进一步验证了所设计的两个算法的性能. 展开更多
关键词 零和博弈 分布式在线优化 动态纳什均衡遗憾 Bandit反馈 事件触发通信
在线阅读 下载PDF
非零和博弈下具有共同冲击和衍生品交易的时滞最优再保险-投资策略
8
作者 崔雅茹 马世霞 董文泽 《应用数学》 北大核心 2026年第2期594-604,共11页
本文在非零和博弈框架下,研究具有共同冲击和衍生品交易的两家保险公司的时滞最优再保险-投资策略.假设两家保险公司的盈余过程由扩散逼近模型来刻画,并且通过购买比例再保险来分散索赔风险.同时,两家公司都将盈余投资于无风险资产和风... 本文在非零和博弈框架下,研究具有共同冲击和衍生品交易的两家保险公司的时滞最优再保险-投资策略.假设两家保险公司的盈余过程由扩散逼近模型来刻画,并且通过购买比例再保险来分散索赔风险.同时,两家公司都将盈余投资于无风险资产和风险资产,其中风险资产由Heston模型描述的股票及以其为标的资产的衍生品交易组成.在终端期望效用最大化的准则下,应用动态规划原理,建立了拓展的Hamilton-Jacobi-Bellman(HJB)方程并得到最优再保险和投资策略以及相应的最优值函数.最后,通过数值分析探讨了相关参数对最优再保险和投资策略的影响. 展开更多
关键词 共同冲击 延迟 衍生品交易 非零和博弈 最优再保险-投资策略
在线阅读 下载PDF
考虑性能约束的高超声速变体飞行器自适应优化控制
9
作者 苏健欣 廖宇新 +2 位作者 许炜平 谭君岳 高兴 《航空学报》 北大核心 2026年第4期305-320,共16页
针对高超声速变体飞行器变形飞行过程中强非线性、大不确定性、受多源外部干扰等特点,提出了一种考虑性能约束的自适应优化控制方法。首先,基于自适应预设性能控制设计稳态控制器,通过设计性能包络自适应调整策略解决了固定包络误差转... 针对高超声速变体飞行器变形飞行过程中强非线性、大不确定性、受多源外部干扰等特点,提出了一种考虑性能约束的自适应优化控制方法。首先,基于自适应预设性能控制设计稳态控制器,通过设计性能包络自适应调整策略解决了固定包络误差转换时存在的控制量奇异问题;然后,基于自适应动态规划设计最优补偿控制器,通过离线-在线策略迭代实现了在线优化控制,离线策略迭代增强了网络在线更新初始阶段的稳定性,在线策略迭代引入零和博弈思想增强了最优补偿控制器的抗扰能力;最后,基于Lyapunov稳定性理论证明闭环系统的稳定性。仿真结果表明所提出的方法保证了飞行器的瞬态性能与稳态性能,对于变形飞行过程中的不确定性和外部干扰具有较好的鲁棒性与自适应性。 展开更多
关键词 高超声速变体飞行器 姿态控制 预设性能控制 自适应动态规划 策略迭代 零和博弈
原文传递
Zero-Sum Stochastic Games with Average Payoffs:New Optimality Conditions 被引量:2
10
作者 Jie YANG Xian Ping GUO 《Acta Mathematica Sinica,English Series》 SCIE CSCD 2009年第7期1201-1216,共16页
In this paper we study zero-sum stochastic games. The optimality criterion is the long-run expected average criterion, and the payoff function may have neither upper nor lower bounds. We give a new set of conditions f... In this paper we study zero-sum stochastic games. The optimality criterion is the long-run expected average criterion, and the payoff function may have neither upper nor lower bounds. We give a new set of conditions for the existence of a value and a pair of optimal stationary strategies. Our conditions are slightly weaker than those in the previous literature, and some new sufficient conditions for the existence of a pair of optimal stationary strategies are imposed on the primitive data of the model. Our results are illustrated with a queueing system, for which our conditions are satisfied but some of the conditions in some previous literatures fail to hold. 展开更多
关键词 zero-sum stochastic games countable state space expected average criterion new condi- tion a pair of optimal stationary strategies
原文传递
一种针对物理层攻防特性的电力系统受击损失分析模型
11
作者 罗逸夫 喻志谦 +2 位作者 王启瑞 曾志红 刘乃通 《南京师大学报(自然科学版)》 北大核心 2026年第2期98-109,共12页
聚焦于电力系统作为关键基础设施在军事行动和恐怖袭击等外部物理攻击过程中的易受击性,区别于信息网络单点攻击过程,构建了一种考虑物理层攻防范围特性的电力系统受击损失鲁棒优化模型.首先,根据电力系统的地理信息和网架结构,建立了... 聚焦于电力系统作为关键基础设施在军事行动和恐怖袭击等外部物理攻击过程中的易受击性,区别于信息网络单点攻击过程,构建了一种考虑物理层攻防范围特性的电力系统受击损失鲁棒优化模型.首先,根据电力系统的地理信息和网架结构,建立了区域受击毁伤关联传播模型.其次,考虑攻防双方的决策互动,建立了零和博弈双层模型.最后,利用KKT条件和对非线性约束的逻辑展开实现模型线性化和精确求解.算例验证表明,本文所提模型能够准确求解电力系统在面对物理攻击者最优进攻方案时的最小损失;同时,不同防御资源分配方案的优劣会随着攻击者的能力而变化,反映了攻防博弈问题的复杂交互过程,进一步验证模型的合理性. 展开更多
关键词 鲁棒优化 零和博弈 电力系统 安全防御 线性化
在线阅读 下载PDF
A Parallel Control Method for Zero-Sum Game with Unknown Time-Varying System 被引量:3
12
作者 Qinglai Wei Zhenhua Zhu +1 位作者 Jie Zhang Fei-Yue Wang 《The International Journal of Intelligent Control and Systems》 2024年第1期37-41,共5页
In this paper,based on ACP(ACP:artificial societies,computational experiments,and parallel execution)approach,a parallel control method is proposed for zero-sum games of unknown time-varying systems.The process of con... In this paper,based on ACP(ACP:artificial societies,computational experiments,and parallel execution)approach,a parallel control method is proposed for zero-sum games of unknown time-varying systems.The process of constructing a sequence of artificial systems,implementing the computational experiments,and conducting the parallel execution is presented.The artificial systems are constructed to model the real system.Computational experiments adopting adaptive dynamic programming(ADP)are shown to derive control laws for a sequence of artificial systems.The purpose of the parallel execution step is to derive the control laws for the real system.Finally,simulation experiments are provided to show the effectiveness of the proposed method. 展开更多
关键词 zero-sum games parallel control ACP(ACP:artificial societies computational experiments and parallel execution) adaptive dynamic programming(ADP)
在线阅读 下载PDF
平面上一类整点区域的Davenport常数
13
作者 邓贵新 黄彬 程鹏 《南宁师范大学学报(自然科学版)》 2026年第2期152-160,共9页
零和理论是加法组合领域中的一个重要研究方向,主要研究交换群上与零和序列相关的组合性质。计算交换群的子集的Davenport常数是零和理论中一个困难的核心问题。设G是交换群,X是G的非空子集,S是X上的一个序列,如果S中的各元素之和为0,则... 零和理论是加法组合领域中的一个重要研究方向,主要研究交换群上与零和序列相关的组合性质。计算交换群的子集的Davenport常数是零和理论中一个困难的核心问题。设G是交换群,X是G的非空子集,S是X上的一个序列,如果S中的各元素之和为0,则称S是零和的;若S是零和的,且它的任意非空真子列都不是零和的,则称S是极小零和的;X上的所有极小零和序列的长度的上确界称为X的Davenport常数,记为D(X)。目前已经知道当X为秩为1或2的有限交换群时和当X为有限交换p-群时D(X)的值,而对无限交换群G的子集的Davenport常数的研究主要集中在G为自由群的情形。本文证明,当X为平面整点区域[-2,2]×[-2,n](n≥36)时,有D(X)=4n+k,其中k在n≡2(mod 3)时为5,在n≡0,1(mod 3)时为7。 展开更多
关键词 零和问题 零和序列 极小零和序列 Davenport常数
在线阅读 下载PDF
Nash equilibrium computation of two-network zero-sum games with event-triggered communication
14
作者 Hongyun Xiong Jiangxiong Han +1 位作者 Xiaohong Nian Shiling Li 《Journal of Control and Decision》 EI 2022年第3期334-346,共13页
In this paper,a zero-sum game Nash equilibrium computation problem with event-triggered communication is investigated under an undirected weight-balanced multi-agent network.A novel distributed event-triggered project... In this paper,a zero-sum game Nash equilibrium computation problem with event-triggered communication is investigated under an undirected weight-balanced multi-agent network.A novel distributed event-triggered projection subgradient algorithm is developed to reduce the communication burden within the subnetworks.In the proposed algorithm,when the difference between the current state of the agent and the state of the last trigger time exceeds a given threshold,the agent will be triggered to communicate with its neighbours.Moreover,we prove that all agents converge to Nash equilibrium by the proposed algorithm.Finally,two simulation examples verify that our algorithm not only reduces the communication burden but also ensures that the convergence speed and accuracy are close to that of the time-triggered method under the appropriate threshold. 展开更多
关键词 zero-sum game Nash equilibrium multi-agent network event-triggered communication projection subgradient algorithm
原文传递
Policy Iteration Algorithms for Zero-Sum Stochastic Differential Games with Long-Run Average Payoff Criteria
15
作者 JoséDaniel López-Barrientos 《Journal of the Operations Research Society of China》 EI 2014年第4期395-421,共27页
This paper studies the policy iteration algorithm(PIA)for zero-sum stochastic differential games with the basic long-run average criterion,as well as with its more selective version,the so-called bias criterion.The sy... This paper studies the policy iteration algorithm(PIA)for zero-sum stochastic differential games with the basic long-run average criterion,as well as with its more selective version,the so-called bias criterion.The system is assumed to be a nondegenerate diffusion.We use Lyapunov-like stability conditions that ensure the existence and boundedness of the solution to certain Poisson equation.We also ensure the convergence of a sequence of such solutions,of the corresponding sequence of policies,and,ultimately,of the PIA. 展开更多
关键词 Ergodic payoff criterion zero-sum stochastic differential games Policy iteration algorithm Nondegenerate diffusions Poisson equation Schäl convergence Bias game
原文传递
Solution of a zero-sum linear quadratic differential game with singular control cost of minimiser
16
作者 Valery Y.Glizer Oleg Kelis 《Journal of Control and Decision》 EI 2015年第3期155-184,共30页
We consider a finite horizon,zero-sum linear quadratic differential game.The feature of this game is that a weight matrix of the minimiser’s control cost in the cost functional is singular.Due to this singularity,the... We consider a finite horizon,zero-sum linear quadratic differential game.The feature of this game is that a weight matrix of the minimiser’s control cost in the cost functional is singular.Due to this singularity,the game can be solved neither by applying the Isaacs MinMax principle nor using the Bellman–Isaacs equation approach,i.e.this game is singular.Aprevious paper of one of the authors analysed such a game in the case where the cost functional does not contain the minimiser’s control cost at all,i.e.the weight matrix of this cost equals zero.In this case,all coordinates of the minimiser’s control are singular.In the present paper,we study the general case where the weight matrix of the minimiser’s control cost,being singular,is not,in general,zero.This means that only a part of the coordinates of the minimiser’s control is singular,while others are regular.The considered game is treated by a regularisation,i.e.by its approximate conversion to an auxiliary regular game.The latter has the same equation of dynamics and a similar cost functional augmented by an integral of the squares of the singular control coordinates with a small positive weight.Thus,the auxiliary game is a partial cheap control differential game.Based on a singular perturbation’s asymptotic analysis of this auxiliary game,the existence of the value of the original(singular)game is established,and its expression is obtained.The maximiser’s optimal state feedback strategy and the minimising control sequence in the original game are designed.It is shown that the coordinates of the minimising control sequence,corresponding to the regular coordinates of the minimiser’s control,are point-wise convergent in the class of regular functions.The optimal trajectory sequence and the optimal trajectory in the considered singular game also are obtained.An illustrative example is presented. 展开更多
关键词 zero-sum linear quadratic differential game singular minimiser’s control cost REGULARISATION partial cheap control game singular perturbation techniques minimising control sequence optimal trajectory sequence
原文传递
Accelerated Value Iteration for Nonlinear Zero-Sum Games with Convergence Guarantee
17
作者 Yuan Wang Mingming Zhao +1 位作者 Nan Liu Ding Wang 《Guidance, Navigation and Control》 2024年第1期121-148,共28页
In this paper,an accelerated value iteration(VI)algorithm is established to solve the zero-sum game problem with convergence guarantee.First,inspired by the successive over relaxation theory,the convergence rate of th... In this paper,an accelerated value iteration(VI)algorithm is established to solve the zero-sum game problem with convergence guarantee.First,inspired by the successive over relaxation theory,the convergence rate of the iterative value function sequence is accelerated significantly with the relaxation factor.Second,the convergence and monotonicity of the value function sequence are analyzed under different ranges of the relaxation factor.Third,two practical approaches,namely the integrated scheme and the relaxation function,are introduced into the accelerated VI algorithm to guarantee the convergence of the iterative value function sequence for zero-sum games.The integrated scheme consists of the accelerated stage and the convergence stage,and the relaxation function can adjust the value of the relaxation factor.Finally,including the autopilot controller,the fantastic performance of the accelerated VI algorithm is verified through two examples with practical physical backgrounds. 展开更多
关键词 Adaptive dynamic programming convergence rate value iteration zero-sum games
在线阅读 下载PDF
Robust control design for zero-sum differential games problem based on off-policy reinforcement learning technique
18
作者 Hongji Zhuang Hongxu Zhu +3 位作者 Shufan Wu Xiaoliang Wang Zhongcheng Mu Qiang Shen 《Aerospace Systems》 2024年第2期261-269,共9页
This paper aims to figure out the robust zero-sum differential game problem using an off-policy reinforcement learning technique.The robust system model is first established based on the nominal one.The control strate... This paper aims to figure out the robust zero-sum differential game problem using an off-policy reinforcement learning technique.The robust system model is first established based on the nominal one.The control strategy is proposed with the asymptotic stability and optimality being strictly proved.The off-policy reinforcement learning technique is built from the Bellman equation to generate the control policy.Apotentially inaccurate system dynamic model’s influence is avoided because the outcome is attained from the system data set obtained.It is the first-time application of the off-policy RL algorithm on this robust two-player zero-sum differential game problem.Additionally,the final algorithm’s convergence is demonstrated,and a simulation example is run to confirm its efficacy. 展开更多
关键词 Robust control zero-sum game Off-policy reinforcement learning
在线阅读 下载PDF
Convex and Nonconvex Optimization Based on Neurodynamic Method with Zero-Sum Initial Constraint
19
作者 Yiyang Ge Zhanshan Wang Bibo Zheng 《The International Journal of Intelligent Control and Systems》 2024年第4期184-194,共11页
A neurodynamic method(NdM)for convex optimization is proposed in this paper with an equality constraint.The method utilizes a neurodynamic system(NdS)that converges to the optimal solution of a convex optimization pro... A neurodynamic method(NdM)for convex optimization is proposed in this paper with an equality constraint.The method utilizes a neurodynamic system(NdS)that converges to the optimal solution of a convex optimization problem in a fixed time.Due to its mathematical simplicity,it can also be combined with reinforcement learning(RL)to solve a class of nonconvex optimization problems.To maintain the mathematical simplicity of NdS,zero-sum initial constraints are introduced to reduce the number of auxiliary multipliers.First,the initial sum of the state variables must satisfy the equality constraint.Second,the sum of their derivatives is designed to remain zero.In order to apply the proposed convex optimization algorithm to nonconvex optimization with mixed constraints,the virtual actions in RL are redefined to avoid the use of NdS inequality constrained multipliers.The proposed NdM plays an effective search tool in constrained nonconvex optimization algorithms.Numerical examples demonstrate the effectiveness of the proposed algorithm. 展开更多
关键词 Neurodynamic method(NdM) zero-sum initial constraint distribued optimization convex and nonconvex optimization reinforcement learning(RL)
在线阅读 下载PDF
Improved zero-sum distinguisher for full round Keccak-f permutation 被引量:8
20
作者 DUAN Ming LAI XueJia 《Chinese Science Bulletin》 SCIE CAS 2012年第6期694-697,共4页
Keccak is one of the five hash functions selected for the final round of the SHA-3 competition,and its inner primitive is a permutation called Keccak-f.In this paper,we observe that for the inverse of the only nonline... Keccak is one of the five hash functions selected for the final round of the SHA-3 competition,and its inner primitive is a permutation called Keccak-f.In this paper,we observe that for the inverse of the only nonlinear transformation in Keccak-f,the algebraic degree of any output coordinate and the one of the product of any two output coordinates are both 3,which is 2 less than its size of 5.Combining this observation with a proposition on the upper bound of the degree of iterated permutations,we improve the zero-sum distinguisher for the Keccak-f permutation with full 24 rounds by lowering the size of the zero-sum partition from 21590 to 21575. 展开更多
关键词 置换 非线性变换 哈希函数 坐标 输出 排列 迭代
在线阅读 下载PDF
上一页 1 2 20 下一页 到第
使用帮助 返回顶部