期刊文献+
共找到56,341篇文章
< 1 2 250 >
每页显示 20 50 100
Exploiting a No-Regret Opponent in Repeated Zero-Sum Games
1
作者 LI Kai HUANG Wenhan +1 位作者 LI Chenchen DENG Xiaotie 《Journal of Shanghai Jiaotong university(Science)》 2025年第2期385-398,共14页
In repeated zero-sum games,instead of constantly playing an equilibrium strategy of the stage game,learning to exploit the opponent given historical interactions could typically obtain a higher utility.However,when pl... In repeated zero-sum games,instead of constantly playing an equilibrium strategy of the stage game,learning to exploit the opponent given historical interactions could typically obtain a higher utility.However,when playing against a fully adaptive opponent,one would have dificulty identifying the opponent's adaptive dynamics and further exploiting its potential weakness.In this paper,we study the problem of optimizing against the adaptive opponent who uses no-regret learning.No-regret learning is a classic and widely-used branch of adaptive learning algorithms.We propose a general framework for online modeling no-regret opponents and exploiting their weakness.With this framework,one could approximate the opponent's no-regret learning dynamics and then develop a response plan to obtain a significant profit based on the inferences of the opponent's strategies.We employ two system identification architectures,including the recurrent neural network(RNN)and the nonlinear autoregressive exogenous model,and adopt an efficient greedy response plan within the framework.Theoretically,we prove the approximation capability of our RNN architecture at approximating specific no-regret dynamics.Empirically,we demonstrate that during interactions at a low level of non-stationarity,our architectures could approximate the dynamics with a low error,and the derived policies could exploit the no-regret opponent to obtain a decent utility. 展开更多
关键词 no-regret learning repeated game opponent exploitation opponent modeling dynamical system system identification recurrent neural network(RNN)
原文传递
Distributed projection subgradient algorithm for two-network zero-sum game with random sleep scheme 被引量:1
2
作者 Hongyun Xiong Jiangxiong Han +1 位作者 Xiaohong Nian Shiling Li 《Control Theory and Technology》 EI CSCD 2021年第3期405-417,共13页
In this paper,a zero-sum game Nash equilibrium computation problem with a common constraint set is investigated under two time-varying multi-agent subnetworks,where the two subnetworks have opposite payoff function.A ... In this paper,a zero-sum game Nash equilibrium computation problem with a common constraint set is investigated under two time-varying multi-agent subnetworks,where the two subnetworks have opposite payoff function.A novel distributed projection subgradient algorithm with random sleep scheme is developed to reduce the calculation amount of agents in the process of computing Nash equilibrium.In our algorithm,each agent is determined by an independent identically distributed Bernoulli decision to compute the subgradient and perform the projection operation or to keep the previous consensus estimate,it effectively reduces the amount of computation and calculation time.Moreover,the traditional assumption of stepsize adopted in the existing methods is removed,and the stepsizes in our algorithm are randomized diminishing.Besides,we prove that all agents converge to Nash equilibrium with probability 1 by our algorithm.Finally,a simulation example verifies the validity of our algorithm. 展开更多
关键词 zero-sum game Nash equilibrium Time-varying multi-agent network Projection subgradient algorithm Random sleep scheme
原文传递
Interactive Fuzzy Approaches for Solving Multiobjective Two-Person Zero-Sum Games
3
作者 Hitoshi Yano Ichiro Nishizaki 《Applied Mathematics》 2016年第5期387-398,共12页
In this paper, we consider multiobjective two-person zero-sum games with vector payoffs and vector fuzzy payoffs. We translate such games into the corresponding multiobjective programming problems and introduce the pe... In this paper, we consider multiobjective two-person zero-sum games with vector payoffs and vector fuzzy payoffs. We translate such games into the corresponding multiobjective programming problems and introduce the pessimistic Pareto optimal solution concept by assuming that a player supposes the opponent adopts the most disadvantage strategy for the self. It is shown that any pessimistic Pareto optimal solution can be obtained on the basis of linear programming techniques even if the membership functions for the objective functions are nonlinear. Moreover, we propose interactive algorithms based on the bisection method to obtain a pessimistic compromise solution from among the set of all pessimistic Pareto optimal solutions. In order to show the efficiency of the proposed method, we illustrate interactive processes of an application to a vegetable shipment problem. 展开更多
关键词 Multiobjective Two-Person zero-sum games LR Fuzzy Numbers Fuzzy Payoff Matrices Fuzzy Goals Possibility Measure Pareto Optimal Solutions Linear Programming
在线阅读 下载PDF
It Is Not A Zero-Sum Game
4
作者 Liu Xinwei 《China's Foreign Trade》 2018年第6期46-47,共2页
Nowadays,China is the largest developing country in the world,and the US is the largest developed country in the world.Sino-US economic and trade relations are of great significance to the two nations and may have apr... Nowadays,China is the largest developing country in the world,and the US is the largest developed country in the world.Sino-US economic and trade relations are of great significance to the two nations and may have aprominent impact on the stability and development of the global economy. 展开更多
关键词 US It Is Not A zero-sum game WTO
在线阅读 下载PDF
Polynomial Time Method for Solving Nash Equilibria of Zero-Sum Games
5
作者 Yoshihiro Tanaka Mitsuru Togashi 《American Journal of Computational Mathematics》 2021年第1期23-30,共8页
There are a few studies that focus on solution methods for finding a Nash equilibrium of zero-sum games. We discuss the use of Karmarkar’s interior point method to solve the Nash equilibrium problems of a zero-sum ga... There are a few studies that focus on solution methods for finding a Nash equilibrium of zero-sum games. We discuss the use of Karmarkar’s interior point method to solve the Nash equilibrium problems of a zero-sum game, and prove that it is theoretically a polynomial time algorithm. We implement the Karmarkar method, and a preliminary computational result shows that it performs well for zero-sum games. We also mention an affine scaling method that would help us compute Nash equilibria of general zero-sum games effectively. 展开更多
关键词 zero-sum games Nash Equilibria Karmarkar’s Method Polynomial Time
在线阅读 下载PDF
Data-based Optimal Control for Discrete-time Zero-sum Games of 2-D Systems Using Adaptive Critic Designs 被引量:8
6
作者 WEI Qing-Lai ZHANG Hua-Guang CUI Li-Li 《自动化学报》 EI CSCD 北大核心 2009年第6期682-692,共11页
关键词 自适应系统 最优控制 离散时间 自动化系统
在线阅读 下载PDF
Behavior Prediction of Untrusted Relays Based on Nonzero-Sum Game
7
作者 付晓梅 吴晓 汪清 《Transactions of Tianjin University》 EI CAS 2015年第4期371-376,共6页
To keep the secrecy performance from being badly influenced by untrusted relay(UR), a multi-UR network through amplify-and-forward(AF) cooperative scheme is put forward, which takes relay weight and harmful factor int... To keep the secrecy performance from being badly influenced by untrusted relay(UR), a multi-UR network through amplify-and-forward(AF) cooperative scheme is put forward, which takes relay weight and harmful factor into account. A nonzero-sum game is established to capture the interaction among URs and detection strategies. Secrecy capacity is investigated as game payoff to indicate the untrusted behaviors of the relays. The maximum probabilities of the behaviors of relay and the optimal system detection strategy can be obtained by using the proposed algorithm. 展开更多
关键词 physical layer security COOPERATIVE communication untrusted RELAY SECRECY capacity nonzero-sum game
在线阅读 下载PDF
Secure Downlink Transmission Strategies against Active Eavesdropping in NOMA Systems:A Zero-Sum Game Approach
8
作者 Yanqiu Chen Xiaopeng Ji 《Computer Modeling in Engineering & Sciences》 SCIE EI 2023年第7期531-553,共23页
Non-orthogonal multiple access technology(NOMA),as a potentially promising technology in the 5G/B5G era,suffers fromubiquitous security threats due to the broadcast nature of the wirelessmedium.In this paper,we focus ... Non-orthogonal multiple access technology(NOMA),as a potentially promising technology in the 5G/B5G era,suffers fromubiquitous security threats due to the broadcast nature of the wirelessmedium.In this paper,we focus on artificial-signal-assisted and relay-assisted secure downlink transmission schemes against external eavesdropping in the context of physical layer security,respectively.To characterize the non-cooperative confrontation around the secrecy rate between the legitimate communication party and the eavesdropper,their interactions are modeled as a two-person zero-sum game.The existence of the Nash equilibrium of the proposed game models is proved,and the pure strategyNash equilibriumand mixed-strategyNash equilibriumprofiles in the two schemes are solved and analyzed,respectively.The numerical simulations are conducted to validate the analytical results,and showthat the two schemes improve the secrecy rate and further enhance the physical layer security performance of NOMA systems. 展开更多
关键词 Non-orthogonalmultiple access technology(NOMA) physical layer security game theory nash equilibrium zerosum game
在线阅读 下载PDF
Data-Driven Dynamic Output Feedback Nash Strategy for Multi-Player Non-Zero-Sum Games
9
作者 XIE Kedi LU Maobin +2 位作者 DENG Fang SUN Jian CHEN Jie 《Journal of Systems Science & Complexity》 2025年第2期597-612,共16页
This paper investigates the multi-player non-zero-sum game problem for unknown linear continuous-time systems with unmeasurable states.By only accessing the data information of input and output,a data-driven learning ... This paper investigates the multi-player non-zero-sum game problem for unknown linear continuous-time systems with unmeasurable states.By only accessing the data information of input and output,a data-driven learning control approach is proposed to estimate N-tuple dynamic output feedback control policies which can form Nash equilibrium solution to the multi-player non-zero-sum game problem.In particular,the explicit form of dynamic output feedback Nash strategy is constructed by embedding the internal dynamics and solving coupled algebraic Riccati equations.The coupled policy-iteration based iterative learning equations are established to estimate the N-tuple feedback control gains without prior knowledge of system matrices.Finally,an example is used to illustrate the effectiveness of the proposed approach. 展开更多
关键词 Adaptive dynamic programming non-zero-sum games output feedback policy-iteration
原文传递
A Verification Theorem for Feedback Nash Equilibrium in Multiple-Player Nonzero-Sum Impulse Game 被引量:1
10
作者 Ruihai Li Yaoyao Tan +1 位作者 Xiaojie Su Jiangshuai Huang 《IEEE/CAA Journal of Automatica Sinica》 2025年第3期648-650,共3页
Dear Editor,This letter addresses the impulse game problem for a general scope of deterministic,multi-player,nonzero-sum differential games wherein all participants adopt impulse controls.Our objective is to formulate... Dear Editor,This letter addresses the impulse game problem for a general scope of deterministic,multi-player,nonzero-sum differential games wherein all participants adopt impulse controls.Our objective is to formulate this impulse game problem with the modified objective function including interaction costs among the players in a discontinuous fashion,and subsequently,to derive a verification theorem for identifying the feedback Nash equilibrium strategy. 展开更多
关键词 impulse game feedback Nash equilibrium multiple player feedback nash equilibrium strategy impulse game problem nonzero sum modified objective function impulse controlsour
在线阅读 下载PDF
Output Feedback Q-Learning for a Non-Zero-Sum Game Problem in Building HVAC Control
11
作者 ANWAR Junaid RIZVI Syed Ali Asad LIN Zongli 《Journal of Systems Science & Complexity》 2025年第2期739-755,共17页
Building heating,ventilating,and air conditioning(HVAC)systems have one of the largest energy footprint worldwide,which necessitates the design of intelligent control algorithms that improve the energy utilization whi... Building heating,ventilating,and air conditioning(HVAC)systems have one of the largest energy footprint worldwide,which necessitates the design of intelligent control algorithms that improve the energy utilization while still providing thermal comfort.In this work,the authors formulate the HVAC equipment dynamics in the setting of a two-player non-zero-sum cooperative game,which enables two decision variables(mass flow rate and supply air temperature)to perform joint optimization of the control utilization and thermal setpoint tracking by simultaneously exchanging their policies.The HVAC zone serves as a game environment for these two decision variables that act as two players in a game.It is assumed that dynamic models of HVAC equipment are not available.Furthermore,neither the state nor any estimates of HVAC disturbance(heat gains,outside variations,etc.)are accessible,but only the measurement of the zone temperature is available for feedback.Under these constraints,the authors develop a new data-driven Q-learning scheme employing policy iteration and value iteration with a bias compensation mechanism that accounts for unmeasurable disturbances and circumvents the need of full-state measurement.The proposed algorithms are shown to converge to the optimal solution corresponding to the generalized algebraic Riccati equations(GAREs)in dynamic games. 展开更多
关键词 s game theory HVAC control optimal control output feedback Q-LEARNING
原文传递
Nonzero-sum games for continuous-time Markov chains with unbounded transition and average payoff rates 被引量:2
12
作者 ZHANG WenZhao GUO XianPing 《Science China Mathematics》 SCIE 2012年第11期2405-2416,共12页
This paper attempts to study two-person nonzero-sum games for denumerable continuous-time Markov chains determined by transition rates,with an expected average criterion.The transition rates are allowed to be unbounde... This paper attempts to study two-person nonzero-sum games for denumerable continuous-time Markov chains determined by transition rates,with an expected average criterion.The transition rates are allowed to be unbounded,and the payoff functions may be unbounded from above and from below.We give suitable conditions under which the existence of a Nash equilibrium is ensured.More precisely,using the socalled "vanishing discount" approach,a Nash equilibrium for the average criterion is obtained as a limit point of a sequence of equilibrium strategies for the discounted criterion as the discount factors tend to zero.Our results are illustrated with a birth-and-death game. 展开更多
关键词 nonzero-sum game expected average criterion Nash equilibrium unbounded transition rates unbounded payoff function
原文传递
Nash equilibrium computation of two-network zero-sum games with event-triggered communication
13
作者 Hongyun Xiong Jiangxiong Han +1 位作者 Xiaohong Nian Shiling Li 《Journal of Control and Decision》 EI 2022年第3期334-346,共13页
In this paper,a zero-sum game Nash equilibrium computation problem with event-triggered communication is investigated under an undirected weight-balanced multi-agent network.A novel distributed event-triggered project... In this paper,a zero-sum game Nash equilibrium computation problem with event-triggered communication is investigated under an undirected weight-balanced multi-agent network.A novel distributed event-triggered projection subgradient algorithm is developed to reduce the communication burden within the subnetworks.In the proposed algorithm,when the difference between the current state of the agent and the state of the last trigger time exceeds a given threshold,the agent will be triggered to communicate with its neighbours.Moreover,we prove that all agents converge to Nash equilibrium by the proposed algorithm.Finally,two simulation examples verify that our algorithm not only reduces the communication burden but also ensures that the convergence speed and accuracy are close to that of the time-triggered method under the appropriate threshold. 展开更多
关键词 zero-sum game Nash equilibrium multi-agent network event-triggered communication projection subgradient algorithm
原文传递
Optimal synchronization control formulti-agent systems with input saturation:a nonzero-sum game 被引量:1
14
作者 Hongyang LI Qinglai WEI 《Frontiers of Information Technology & Electronic Engineering》 SCIE EI CSCD 2022年第7期1010-1019,共10页
This paper presents a novel optimal synchronization control method for multi-agent systems with input saturation.The multi-agent game theory is introduced to transform the optimal synchronization control problem into ... This paper presents a novel optimal synchronization control method for multi-agent systems with input saturation.The multi-agent game theory is introduced to transform the optimal synchronization control problem into a multi-agent nonzero-sum game.Then,the Nash equilibrium can be achieved by solving the coupled Hamilton–Jacobi–Bellman(HJB)equations with nonquadratic input energy terms.A novel off-policy reinforcement learning method is presented to obtain the Nash equilibrium solution without the system models,and the critic neural networks(NNs)and actor NNs are introduced to implement the presented method.Theoretical analysis is provided,which shows that the iterative control laws converge to the Nash equilibrium.Simulation results show the good performance of the presented method. 展开更多
关键词 Optimal synchronization control Multi-agent systems Nonzero-sum game Adaptive dynamic programming Input saturation Off-policy reinforcement learning Policy iteration
原文传递
LINEAR QUADRATIC NONZERO-SUM DIFFERENTIAL GAMES WITH RANDOM JUMPS 被引量:3
15
作者 吴臻 于志勇 《应用数学和力学》 CSCD 北大核心 2005年第8期945-950,共6页
The existence and uniqueness of the solutions for one kind of forward-backward stochastic differential equations with Brownian motion and Poisson process as the noise source were given under the monotone conditions.Th... The existence and uniqueness of the solutions for one kind of forward-backward stochastic differential equations with Brownian motion and Poisson process as the noise source were given under the monotone conditions.Then these results were applied to nonzero-sum differential games with random jumps to get the explicit form of the open-loop Nash equilibrium point by the solution of the forward-backward stochastic differential equations. 展开更多
关键词 随机微分方程 泊松过程 随机微分对策
在线阅读 下载PDF
A Parallel Control Method for Zero-Sum Game with Unknown Time-Varying System
16
作者 Qinglai Wei Zhenhua Zhu +1 位作者 Jie Zhang Fei-Yue Wang 《The International Journal of Intelligent Control and Systems》 2024年第1期37-41,共5页
In this paper,based on ACP(ACP:artificial societies,computational experiments,and parallel execution)approach,a parallel control method is proposed for zero-sum games of unknown time-varying systems.The process of con... In this paper,based on ACP(ACP:artificial societies,computational experiments,and parallel execution)approach,a parallel control method is proposed for zero-sum games of unknown time-varying systems.The process of constructing a sequence of artificial systems,implementing the computational experiments,and conducting the parallel execution is presented.The artificial systems are constructed to model the real system.Computational experiments adopting adaptive dynamic programming(ADP)are shown to derive control laws for a sequence of artificial systems.The purpose of the parallel execution step is to derive the control laws for the real system.Finally,simulation experiments are provided to show the effectiveness of the proposed method. 展开更多
关键词 zero-sum games parallel control ACP(ACP:artificial societies computational experiments and parallel execution) adaptive dynamic programming(ADP)
在线阅读 下载PDF
Accelerated Value Iteration for Nonlinear Zero-Sum Games with Convergence Guarantee
17
作者 Yuan Wang Mingming Zhao +1 位作者 Nan Liu Ding Wang 《Guidance, Navigation and Control》 2024年第1期121-148,共28页
In this paper,an accelerated value iteration(VI)algorithm is established to solve the zero-sum game problem with convergence guarantee.First,inspired by the successive over relaxation theory,the convergence rate of th... In this paper,an accelerated value iteration(VI)algorithm is established to solve the zero-sum game problem with convergence guarantee.First,inspired by the successive over relaxation theory,the convergence rate of the iterative value function sequence is accelerated significantly with the relaxation factor.Second,the convergence and monotonicity of the value function sequence are analyzed under different ranges of the relaxation factor.Third,two practical approaches,namely the integrated scheme and the relaxation function,are introduced into the accelerated VI algorithm to guarantee the convergence of the iterative value function sequence for zero-sum games.The integrated scheme consists of the accelerated stage and the convergence stage,and the relaxation function can adjust the value of the relaxation factor.Finally,including the autopilot controller,the fantastic performance of the accelerated VI algorithm is verified through two examples with practical physical backgrounds. 展开更多
关键词 Adaptive dynamic programming convergence rate value iteration zero-sum games
在线阅读 下载PDF
LINEAR QUADRATIC NONZERO-SUM DIFFERENTIAL GAMES WITH RANDOM JUMPS 被引量:5
18
作者 WU Zhen(吴臻) YU Zhi-yong(于志勇) 《Applied Mathematics and Mechanics(English Edition)》 SCIE EI 2005年第8期1034-1039,共6页
The existence and uniqueness of the solutions for one kind of forward- backward stochastic differential equations with Brownian motion and Poisson process as the noise source were given under the monotone conditions. ... The existence and uniqueness of the solutions for one kind of forward- backward stochastic differential equations with Brownian motion and Poisson process as the noise source were given under the monotone conditions. Then these results were applied to nonzero-sum differential games with random jumps to get the explicit form of the open-loop Nash equilibrium point by the solution of the forward-backward stochastic differential equations. 展开更多
关键词 stochastic differential equation Poisson process stochastic differential game
在线阅读 下载PDF
Infinite Horizon LQ Zero-Sum Stochastic Differential Games with Markovian Jumps 被引量:2
19
作者 Huai-Nian Zhu Cheng-Ke Zhang Ning Bin 《Applied Mathematics》 2012年第10期1321-1326,共6页
This paper studies a class of continuous-time two person zero-sum stochastic differential games characterized by linear It?’s differential equation with state-dependent noise and Markovian parameter jumps. Under the ... This paper studies a class of continuous-time two person zero-sum stochastic differential games characterized by linear It?’s differential equation with state-dependent noise and Markovian parameter jumps. Under the assumption of stochastic stabilizability, necessary and sufficient condition for the existence of the optimal control strategies is presented by means of a system of coupled algebraic Riccati equations via using the stochastic optimal control theory. Furthermore, the stochastic H∞ control problem for stochastic systems with Markovian jumps is discussed as an immediate application, and meanwhile, an illustrative example is presented. 展开更多
关键词 STOCHASTIC Systems DIFFERENTIAL gameS Markovian JUMPS STOCHASTIC H∞ Control
在线阅读 下载PDF
Two-to-one differential game via improved MOGWO 被引量:1
20
作者 BAI Yu ZHOU Di +2 位作者 ZHANG Bolun HE Zhen HE Ping 《Journal of Systems Engineering and Electronics》 2025年第1期233-255,共23页
When the maneuverability of a pursuer is not significantly higher than that of an evader,it will be difficult to intercept the evader with only one pursuer.Therefore,this article adopts a two-to-one differential game ... When the maneuverability of a pursuer is not significantly higher than that of an evader,it will be difficult to intercept the evader with only one pursuer.Therefore,this article adopts a two-to-one differential game strategy,the game of kind is generally considered to be angle-optimized,which allows unlimited turns,but these practices do not take into account the effect of acceleration,which does not correspond to the actual situation,thus,based on the angle-optimized,the acceleration optimization and the acceleration upper bound constraint are added into the game for consideration.A two-to-one differential game problem is proposed in the three-dimensional space,and an improved multi-objective grey wolf optimization(IMOGWO)algorithm is proposed to solve the optimal game point of this problem.With the equations that describe the relative motions between the pursuers and the evader in the three-dimensional space,a multi-objective function with constraints is given as the performance index to design an optimal strategy for the differential game.Then the optimal game point is solved by using the IMOGWO algorithm.It is proved based on Markov chains that with the IMOGWO,the Pareto solution set is the solution of the differential game.Finally,it is verified through simulations that the pursuers can capture the escapee,and via comparative experiments,it is shown that the IMOGWO algorithm performs well in terms of running time and memory usage. 展开更多
关键词 differential game improved multi-objective grey wolf optimization(IMOGWO) cooperative pursuit optimal game point
在线阅读 下载PDF
上一页 1 2 250 下一页 到第
使用帮助 返回顶部