Dear Editor,This letter proposes a reinforcement learning-based predictive learning algorithm for unknown continuous-time nonlinear systems with observation loss.Firstly,we construct a temporal nonzero-sum game over p...Dear Editor,This letter proposes a reinforcement learning-based predictive learning algorithm for unknown continuous-time nonlinear systems with observation loss.Firstly,we construct a temporal nonzero-sum game over predictive control input sequences,deriving multiple optimal predictive control input sequences from its solution.展开更多
§1. Introduction In this paper we consider the parabolic system (?)t/(?)ui-div(|▽u|p-2▽ui)=0(1≤i≤m), with p>1, where ui=ui(x, t), ▽=gradxand x varies in an open domain Ω(?)RN. The system is degenerate if...§1. Introduction In this paper we consider the parabolic system (?)t/(?)ui-div(|▽u|p-2▽ui)=0(1≤i≤m), with p>1, where ui=ui(x, t), ▽=gradxand x varies in an open domain Ω(?)RN. The system is degenerate if p>2 or singular if 1<p<2. A vector function u=(u1, u2, …, um) defined in ΩT=Ω×[0, T] is called a solution of the system (1.1) if展开更多
基金supported by the National Natural Science Foundation of China(62433014,62373287,62573324,62333005,62273255)in part by the International Exchange Program for Graduate Students of Tongji University(4360143306)+3 种基金in part by the Fundamental Research Funds for Central Universities(22120230311)supported by DeutscheForschungsgemeinschaft(DFG,German Research Foundation)under Germany’s Excellence Strategy(EXC 2075390740016,468094890)support by the Stuttgart Center for Simulation Science(SimTech)the International Max Planck Research School for Intelligent Systems(IMPRS-IS)for supporting Y.Xie。
文摘Dear Editor,This letter proposes a reinforcement learning-based predictive learning algorithm for unknown continuous-time nonlinear systems with observation loss.Firstly,we construct a temporal nonzero-sum game over predictive control input sequences,deriving multiple optimal predictive control input sequences from its solution.
文摘§1. Introduction In this paper we consider the parabolic system (?)t/(?)ui-div(|▽u|p-2▽ui)=0(1≤i≤m), with p>1, where ui=ui(x, t), ▽=gradxand x varies in an open domain Ω(?)RN. The system is degenerate if p>2 or singular if 1<p<2. A vector function u=(u1, u2, …, um) defined in ΩT=Ω×[0, T] is called a solution of the system (1.1) if