期刊文献+
共找到56,759篇文章
< 1 2 250 >
每页显示 20 50 100
Exploiting a No-Regret Opponent in Repeated Zero-Sum Games
1
作者 LI Kai HUANG Wenhan +1 位作者 LI Chenchen DENG Xiaotie 《Journal of Shanghai Jiaotong university(Science)》 2025年第2期385-398,共14页
In repeated zero-sum games,instead of constantly playing an equilibrium strategy of the stage game,learning to exploit the opponent given historical interactions could typically obtain a higher utility.However,when pl... In repeated zero-sum games,instead of constantly playing an equilibrium strategy of the stage game,learning to exploit the opponent given historical interactions could typically obtain a higher utility.However,when playing against a fully adaptive opponent,one would have dificulty identifying the opponent's adaptive dynamics and further exploiting its potential weakness.In this paper,we study the problem of optimizing against the adaptive opponent who uses no-regret learning.No-regret learning is a classic and widely-used branch of adaptive learning algorithms.We propose a general framework for online modeling no-regret opponents and exploiting their weakness.With this framework,one could approximate the opponent's no-regret learning dynamics and then develop a response plan to obtain a significant profit based on the inferences of the opponent's strategies.We employ two system identification architectures,including the recurrent neural network(RNN)and the nonlinear autoregressive exogenous model,and adopt an efficient greedy response plan within the framework.Theoretically,we prove the approximation capability of our RNN architecture at approximating specific no-regret dynamics.Empirically,we demonstrate that during interactions at a low level of non-stationarity,our architectures could approximate the dynamics with a low error,and the derived policies could exploit the no-regret opponent to obtain a decent utility. 展开更多
关键词 no-regret learning repeated game opponent exploitation opponent modeling dynamical system system identification recurrent neural network(RNN)
原文传递
Interactive Fuzzy Approaches for Solving Multiobjective Two-Person Zero-Sum Games
2
作者 Hitoshi Yano Ichiro Nishizaki 《Applied Mathematics》 2016年第5期387-398,共12页
In this paper, we consider multiobjective two-person zero-sum games with vector payoffs and vector fuzzy payoffs. We translate such games into the corresponding multiobjective programming problems and introduce the pe... In this paper, we consider multiobjective two-person zero-sum games with vector payoffs and vector fuzzy payoffs. We translate such games into the corresponding multiobjective programming problems and introduce the pessimistic Pareto optimal solution concept by assuming that a player supposes the opponent adopts the most disadvantage strategy for the self. It is shown that any pessimistic Pareto optimal solution can be obtained on the basis of linear programming techniques even if the membership functions for the objective functions are nonlinear. Moreover, we propose interactive algorithms based on the bisection method to obtain a pessimistic compromise solution from among the set of all pessimistic Pareto optimal solutions. In order to show the efficiency of the proposed method, we illustrate interactive processes of an application to a vegetable shipment problem. 展开更多
关键词 Multiobjective Two-Person zero-sum games LR Fuzzy Numbers Fuzzy Payoff Matrices Fuzzy Goals Possibility Measure Pareto Optimal Solutions Linear Programming
在线阅读 下载PDF
Polynomial Time Method for Solving Nash Equilibria of Zero-Sum Games
3
作者 Yoshihiro Tanaka Mitsuru Togashi 《American Journal of Computational Mathematics》 2021年第1期23-30,共8页
There are a few studies that focus on solution methods for finding a Nash equilibrium of zero-sum games. We discuss the use of Karmarkar’s interior point method to solve the Nash equilibrium problems of a zero-sum ga... There are a few studies that focus on solution methods for finding a Nash equilibrium of zero-sum games. We discuss the use of Karmarkar’s interior point method to solve the Nash equilibrium problems of a zero-sum game, and prove that it is theoretically a polynomial time algorithm. We implement the Karmarkar method, and a preliminary computational result shows that it performs well for zero-sum games. We also mention an affine scaling method that would help us compute Nash equilibria of general zero-sum games effectively. 展开更多
关键词 zero-sum games Nash Equilibria Karmarkar’s Method Polynomial Time
在线阅读 下载PDF
Data-based Optimal Control for Discrete-time Zero-sum Games of 2-D Systems Using Adaptive Critic Designs 被引量:8
4
作者 WEI Qing-Lai ZHANG Hua-Guang CUI Li-Li 《自动化学报》 EI CSCD 北大核心 2009年第6期682-692,共11页
关键词 自适应系统 最优控制 离散时间 自动化系统
在线阅读 下载PDF
Data-Driven Dynamic Output Feedback Nash Strategy for Multi-Player Non-Zero-Sum Games
5
作者 XIE Kedi LU Maobin +2 位作者 DENG Fang SUN Jian CHEN Jie 《Journal of Systems Science & Complexity》 2025年第2期597-612,共16页
This paper investigates the multi-player non-zero-sum game problem for unknown linear continuous-time systems with unmeasurable states.By only accessing the data information of input and output,a data-driven learning ... This paper investigates the multi-player non-zero-sum game problem for unknown linear continuous-time systems with unmeasurable states.By only accessing the data information of input and output,a data-driven learning control approach is proposed to estimate N-tuple dynamic output feedback control policies which can form Nash equilibrium solution to the multi-player non-zero-sum game problem.In particular,the explicit form of dynamic output feedback Nash strategy is constructed by embedding the internal dynamics and solving coupled algebraic Riccati equations.The coupled policy-iteration based iterative learning equations are established to estimate the N-tuple feedback control gains without prior knowledge of system matrices.Finally,an example is used to illustrate the effectiveness of the proposed approach. 展开更多
关键词 Adaptive dynamic programming non-zero-sum games output feedback policy-iteration
原文传递
LINEAR QUADRATIC NONZERO-SUM DIFFERENTIAL GAMES WITH RANDOM JUMPS 被引量:3
6
作者 吴臻 于志勇 《应用数学和力学》 CSCD 北大核心 2005年第8期945-950,共6页
The existence and uniqueness of the solutions for one kind of forward-backward stochastic differential equations with Brownian motion and Poisson process as the noise source were given under the monotone conditions.Th... The existence and uniqueness of the solutions for one kind of forward-backward stochastic differential equations with Brownian motion and Poisson process as the noise source were given under the monotone conditions.Then these results were applied to nonzero-sum differential games with random jumps to get the explicit form of the open-loop Nash equilibrium point by the solution of the forward-backward stochastic differential equations. 展开更多
关键词 随机微分方程 泊松过程 随机微分对策
在线阅读 下载PDF
Accelerated Value Iteration for Nonlinear Zero-Sum Games with Convergence Guarantee
7
作者 Yuan Wang Mingming Zhao +1 位作者 Nan Liu Ding Wang 《Guidance, Navigation and Control》 2024年第1期121-148,共28页
In this paper,an accelerated value iteration(VI)algorithm is established to solve the zero-sum game problem with convergence guarantee.First,inspired by the successive over relaxation theory,the convergence rate of th... In this paper,an accelerated value iteration(VI)algorithm is established to solve the zero-sum game problem with convergence guarantee.First,inspired by the successive over relaxation theory,the convergence rate of the iterative value function sequence is accelerated significantly with the relaxation factor.Second,the convergence and monotonicity of the value function sequence are analyzed under different ranges of the relaxation factor.Third,two practical approaches,namely the integrated scheme and the relaxation function,are introduced into the accelerated VI algorithm to guarantee the convergence of the iterative value function sequence for zero-sum games.The integrated scheme consists of the accelerated stage and the convergence stage,and the relaxation function can adjust the value of the relaxation factor.Finally,including the autopilot controller,the fantastic performance of the accelerated VI algorithm is verified through two examples with practical physical backgrounds. 展开更多
关键词 Adaptive dynamic programming convergence rate value iteration zero-sum games
在线阅读 下载PDF
LINEAR QUADRATIC NONZERO-SUM DIFFERENTIAL GAMES WITH RANDOM JUMPS 被引量:5
8
作者 WU Zhen(吴臻) YU Zhi-yong(于志勇) 《Applied Mathematics and Mechanics(English Edition)》 SCIE EI 2005年第8期1034-1039,共6页
The existence and uniqueness of the solutions for one kind of forward- backward stochastic differential equations with Brownian motion and Poisson process as the noise source were given under the monotone conditions. ... The existence and uniqueness of the solutions for one kind of forward- backward stochastic differential equations with Brownian motion and Poisson process as the noise source were given under the monotone conditions. Then these results were applied to nonzero-sum differential games with random jumps to get the explicit form of the open-loop Nash equilibrium point by the solution of the forward-backward stochastic differential equations. 展开更多
关键词 stochastic differential equation Poisson process stochastic differential game
在线阅读 下载PDF
A Verification Theorem for Feedback Nash Equilibrium in Multiple-Player Nonzero-Sum Impulse Game 被引量:1
9
作者 Ruihai Li Yaoyao Tan +1 位作者 Xiaojie Su Jiangshuai Huang 《IEEE/CAA Journal of Automatica Sinica》 2025年第3期648-650,共3页
Dear Editor,This letter addresses the impulse game problem for a general scope of deterministic,multi-player,nonzero-sum differential games wherein all participants adopt impulse controls.Our objective is to formulate... Dear Editor,This letter addresses the impulse game problem for a general scope of deterministic,multi-player,nonzero-sum differential games wherein all participants adopt impulse controls.Our objective is to formulate this impulse game problem with the modified objective function including interaction costs among the players in a discontinuous fashion,and subsequently,to derive a verification theorem for identifying the feedback Nash equilibrium strategy. 展开更多
关键词 impulse game feedback Nash equilibrium multiple player feedback nash equilibrium strategy impulse game problem nonzero sum modified objective function impulse controlsour
在线阅读 下载PDF
Infinite Horizon LQ Zero-Sum Stochastic Differential Games with Markovian Jumps 被引量:2
10
作者 Huai-Nian Zhu Cheng-Ke Zhang Ning Bin 《Applied Mathematics》 2012年第10期1321-1326,共6页
This paper studies a class of continuous-time two person zero-sum stochastic differential games characterized by linear It?’s differential equation with state-dependent noise and Markovian parameter jumps. Under the ... This paper studies a class of continuous-time two person zero-sum stochastic differential games characterized by linear It?’s differential equation with state-dependent noise and Markovian parameter jumps. Under the assumption of stochastic stabilizability, necessary and sufficient condition for the existence of the optimal control strategies is presented by means of a system of coupled algebraic Riccati equations via using the stochastic optimal control theory. Furthermore, the stochastic H∞ control problem for stochastic systems with Markovian jumps is discussed as an immediate application, and meanwhile, an illustrative example is presented. 展开更多
关键词 STOCHASTIC Systems DIFFERENTIAL games Markovian JUMPS STOCHASTIC H∞ Control
在线阅读 下载PDF
Distributed algorithms for aggregative games with multiple uncertain Euler–Lagrange systems over switching networks 被引量:1
11
作者 Zhaocong Liu Jie Huang 《Journal of Automation and Intelligence》 2025年第1期2-9,共8页
In this paper,we investigate the distributed Nash equilibrium(NE)seeking problem for aggregative games with multiple uncertain Euler–Lagrange(EL)systems over jointly connected and weight-balanced switching networks.T... In this paper,we investigate the distributed Nash equilibrium(NE)seeking problem for aggregative games with multiple uncertain Euler–Lagrange(EL)systems over jointly connected and weight-balanced switching networks.The designed distributed controller consists of two parts:a dynamic average consensus part that asymptotically reproduces the unknown NE,and an adaptive reference-tracking module responsible for steering EL systems’positions to track a desired trajectory.The generalized Barbalat’s Lemma is used to overcome the discontinuity of the closed-loop system caused by the switching networks.The proposed algorithm is illustrated by a sensor network deployment problem. 展开更多
关键词 Aggregative games Euler-Lagrange systems Jointly connected networks Adaptive control
在线阅读 下载PDF
Nash equilibrium computation of two-network zero-sum games with event-triggered communication
12
作者 Hongyun Xiong Jiangxiong Han +1 位作者 Xiaohong Nian Shiling Li 《Journal of Control and Decision》 EI 2022年第3期334-346,共13页
In this paper,a zero-sum game Nash equilibrium computation problem with event-triggered communication is investigated under an undirected weight-balanced multi-agent network.A novel distributed event-triggered project... In this paper,a zero-sum game Nash equilibrium computation problem with event-triggered communication is investigated under an undirected weight-balanced multi-agent network.A novel distributed event-triggered projection subgradient algorithm is developed to reduce the communication burden within the subnetworks.In the proposed algorithm,when the difference between the current state of the agent and the state of the last trigger time exceeds a given threshold,the agent will be triggered to communicate with its neighbours.Moreover,we prove that all agents converge to Nash equilibrium by the proposed algorithm.Finally,two simulation examples verify that our algorithm not only reduces the communication burden but also ensures that the convergence speed and accuracy are close to that of the time-triggered method under the appropriate threshold. 展开更多
关键词 zero-sum game Nash equilibrium multi-agent network event-triggered communication projection subgradient algorithm
原文传递
Nonzero-sum games for continuous-time Markov chains with unbounded transition and average payoff rates 被引量:2
13
作者 ZHANG WenZhao GUO XianPing 《Science China Mathematics》 SCIE 2012年第11期2405-2416,共12页
This paper attempts to study two-person nonzero-sum games for denumerable continuous-time Markov chains determined by transition rates,with an expected average criterion.The transition rates are allowed to be unbounde... This paper attempts to study two-person nonzero-sum games for denumerable continuous-time Markov chains determined by transition rates,with an expected average criterion.The transition rates are allowed to be unbounded,and the payoff functions may be unbounded from above and from below.We give suitable conditions under which the existence of a Nash equilibrium is ensured.More precisely,using the socalled "vanishing discount" approach,a Nash equilibrium for the average criterion is obtained as a limit point of a sequence of equilibrium strategies for the discounted criterion as the discount factors tend to zero.Our results are illustrated with a birth-and-death game. 展开更多
关键词 nonzero-sum game expected average criterion Nash equilibrium unbounded transition rates unbounded payoff function
原文传递
Solving Stackelberg prediction games using inexact hyper-gradient methods
14
作者 SHI Xu WANG Jiulin +1 位作者 JIANG Rujun SONG Weizheng 《运筹学学报(中英文)》 北大核心 2025年第3期93-123,共31页
The Stackelberg prediction game(SPG)is a bilevel optimization frame-work for modeling strategic interactions between a learner and a follower.Existing meth-ods for solving this problem with general loss functions are ... The Stackelberg prediction game(SPG)is a bilevel optimization frame-work for modeling strategic interactions between a learner and a follower.Existing meth-ods for solving this problem with general loss functions are computationally expensive and scarce.We propose a novel hyper-gradient type method with a warm-start strategy to address this challenge.Particularly,we first use a Taylor expansion-based approach to obtain a good initial point.Then we apply a hyper-gradient descent method with an ex-plicit approximate hyper-gradient.We establish the convergence results of our algorithm theoretically.Furthermore,when the follower employs the least squares loss function,our method is shown to reach an e-stationary point by solving quadratic subproblems.Numerical experiments show our algorithms are empirically orders of magnitude faster than the state-of-the-art. 展开更多
关键词 Stackelberg prediction game approximate hyper-gradient bilevel opti-mization
在线阅读 下载PDF
Linear Exponential Quadratic Stochastic Differential Games and Applications
15
作者 Su Qing Zhao Jirui 《南开大学学报(自然科学版)》 北大核心 2025年第5期110-120,共11页
The two-player nonzero-sum linear-exponential-quadratic stochastic differential game is studied.The game takes into account the players'attitudes to risk.The nonlinear transformations and change of probability mea... The two-player nonzero-sum linear-exponential-quadratic stochastic differential game is studied.The game takes into account the players'attitudes to risk.The nonlinear transformations and change of probability measure techniques are used to study the existence of both open-loop and closed-loop Nash equilibria for the game.Some examples are constructed to illustrate their differences.Furthermore,theoretical results are applied to solve the risk-sensitive portfolio game problem in the financial market and show the effects of risk attitudes and economic performance on equilibria. 展开更多
关键词 risk-sensitive stochastic differential games linear-quadratic problem Nash equilibria open-loop and closed-loop
原文传递
《Shapley's Conjecture on the Cores of Abstract Market Games》获教育部第九届高等学校科学研究优秀成果奖(人文社会科学)二等奖
16
《北京交通大学学报(社会科学版)》 北大核心 2025年第4期F0002-F0002,共1页
成果名称:Shapley's Conjecture on the Cores of Abstract Market Games主要作者:曹志刚,秦承忠,杨晓光奖项类别:著作论文奖获奖等级:二等奖获奖论文《Shapley's Conjecture on the Cores of Abstract Market Games》发表于博... 成果名称:Shapley's Conjecture on the Cores of Abstract Market Games主要作者:曹志刚,秦承忠,杨晓光奖项类别:著作论文奖获奖等级:二等奖获奖论文《Shapley's Conjecture on the Cores of Abstract Market Games》发表于博弈论领域顶级期刊《Games and Economic Behavior》2018年第2期。论文研究成果初步解决了诺贝尔经济学奖获得者罗伊德·沙普利(Lloyd S. Shapley)提出的抽象市场博弈核非空的猜想。 展开更多
关键词 博弈论 抽象市场博弈 Shapleys Conjecture games and Economic Behavior
在线阅读 下载PDF
An Overview of Distributed Nash Equilibrium Seeking in Noncooperative Games for Multi-agent Systems:A Dynamic Control-Based Perspective
17
作者 Guanghui WEN Xiao FANG Meng LUAN 《Artificial Intelligence Science and Engineering》 2025年第4期239-254,共16页
This paper presents a comprehensive overview of distributed Nash equilibrium(NE)seeking algorithms in non-cooperative games for multiagent systems(MASs),with a distinct emphasis on the dynamic control perspective.It s... This paper presents a comprehensive overview of distributed Nash equilibrium(NE)seeking algorithms in non-cooperative games for multiagent systems(MASs),with a distinct emphasis on the dynamic control perspective.It specifically focuses on the research addressing distributed NE seeking problems in which agents are governed by heterogeneous dynamics.The paper begins by introducing fundamental concepts of general non-cooperative games and the NE,along with definitions of specific game structures such as aggregative games and multi-cluster games.It then systematically reviews existing studies on distributed NE seeking for various classes of MASs from the viewpoint of agent dynamics,including first-order,second-order,high-order,linear,and Euler-Lagrange(EL)systems.Furthermore,the paper highlights practical applications of these theoretical advances in cooperative control scenarios involving autonomous systems with complex dynamics,such as autonomous surface vessels,autonomous aerial vehicles,and other autonomous vehicles.Finally,the paper outlines several promising directions for future research. 展开更多
关键词 non-cooperative game aggregative game multi-cluster game Nash equilibrium dynamic control autonomous system
在线阅读 下载PDF
Framework for adaptive multimodal serious games for early intervention of autistic children
18
作者 Zhiqi XU Yuyong ZHAO +2 位作者 Jie WANG Jian CHANG Yuetian ZHANG 《虚拟现实与智能硬件(中英文)》 2025年第5期523-542,共20页
Background Autism spectrum disorder(ASD)is a pervasive developmental disorder characterized by difficulties in social communication and restricted,repetitive behaviors.Early intervention is essential to improve develo... Background Autism spectrum disorder(ASD)is a pervasive developmental disorder characterized by difficulties in social communication and restricted,repetitive behaviors.Early intervention is essential to improve developmental outcomes in children with ASD.Serious games,which combine educational objectives with game based interactions,have shown potential as tools for early intervention in patients with ASD.However,in China,the development of serious games specifically designed for children with ASD remains in its infancy,with significant gaps in technical frameworks and effective data management methods.Method This paper proposes a framework aimed at facilitating the development of multimodal serious games designed for ASD interventions.We demonstrated the feasibility of the framework by developing and integrating several components,such as web applications,mobile games,and augmented reality games.These tools are interconnected to achieve data connectivity and management.Additionally,adaptive mechanics were employed within the framework to analyze real-time player data,which allowed the game difficulty to be dynamically adjusted and provide a personalized experience for each child.Results The framework successfully integrated various multimodal games,ensuring that real-time data management supported personalized game experiences.This approach ensured that the interventions remained appropriately challenging while still achievable.Conclusion The results indicate that the proposed framework enhances collaboration among therapists,parents,and developers while also improving the effectiveness of ASD interventions.By delivering personalized gameplay experiences that are both challenging and achievable,the framework offers a scalable platform for the future development of serious games. 展开更多
关键词 Autism spectrum disorder(ASD) Serious games Multimodal games Early intervention Technical framework Augmented reality(AR)games Mobile games
在线阅读 下载PDF
Impulsive thrust strategy for orbital pursuit-evasion games based on impulse-like constraint
19
作者 Hongbo WANG Yao ZHANG +1 位作者 Hao LIU Kunpeng ZHANG 《Chinese Journal of Aeronautics》 2025年第1期520-536,共17页
This paper proposes a novel impulsive thrust strategy guided by optimal continuous thrust strategy to address two-player orbital pursuit-evasion game under impulsive thrust control.The strategy seeks to enhance the in... This paper proposes a novel impulsive thrust strategy guided by optimal continuous thrust strategy to address two-player orbital pursuit-evasion game under impulsive thrust control.The strategy seeks to enhance the interpretability of impulsive thrust strategy by integrating it within the framework of differential game in traditional continuous systems.First,this paper introduces an impulse-like constraint,with periodical changes in thrust amplitude,to characterize the impulsive thrust control.Then,the game with the impulse-like constraint is converted into the two-point boundary value problem,which is solved by the combined shooting and deep learning method proposed in this paper.Deep learning and numerical optimization are employed to obtain the guesses for unknown terminal adjoint variables and the game terminal time.Subsequently,the accurate values are solved by the shooting method to yield the optimal continuous thrust strategy with the impulse-like constraint.Finally,the shooting method is iteratively employed at each impulse decision moment to derive the impulsive thrust strategy guided by the optimal continuous thrust strategy.Numerical examples demonstrate the convergence of the combined shooting and deep learning method,even if the strongly nonlinear impulse-like constraint is introduced.The effect of the impulsive thrust strategy guided by the optimal continuous thrust strategy is also discussed. 展开更多
关键词 Orbital pursuit-evasion game Differential game Impulsive thrust Deep learning Shooting method
原文传递
My Experience as a Volunteer for the 2025 Asian Winter Games
20
作者 王冉 张超(指导) 《中学生英语》 2025年第23期7-7,共1页
I was so excited to be a volunteer for the 2025 Asian Winter Games.It was a wonderful chance to meet people from all over Asia.During the Games,I helped players find their way around the stadium.I also answered questi... I was so excited to be a volunteer for the 2025 Asian Winter Games.It was a wonderful chance to meet people from all over Asia.During the Games,I helped players find their way around the stadium.I also answered questions from visitors.Everyone was friendly,and I felt happy to help them. 展开更多
关键词 VOLUNTEER ASSISTANCE Asian Winter games STADIUM VISITORS
在线阅读 下载PDF
上一页 1 2 250 下一页 到第
使用帮助 返回顶部