In this study,a dynamic model for an inverted pendulum system(IPS)attached to a car is created,and two different control methods are applied to control the system.The designed control algorithms aim to stabilize the p...In this study,a dynamic model for an inverted pendulum system(IPS)attached to a car is created,and two different control methods are applied to control the system.The designed control algorithms aim to stabilize the pendulum arms in the upright position and the car to reach the equilibrium position.Grey Wolf Optimization-based Linear Quadratic Regulator(GWO-LQR)and GWO-based Fuzzy LQR(FLQR)control algorithms are used in the control process.To improve the performance of the LQR and FLQR methods,the optimum values of the coefficients corresponding to the foot points of the membership functions are determined by the GWO algorithm.Both a graphic and a numerical analysis of the outcomes are provided.In the comparative analysis,it is observed that the GWO-based FLQR method reduces the settling time by 22.58% and the maximum peak value by 18.2% when evaluated in terms of the angular response of the pendulum arm.Furthermore,this approach outperformed comparable research in the literature with a settling time of 2.4 s.These findings demonstrate that the suggested GWO-based FLQR controlmethod outperforms existing literature in terms of the time required for the pendulum arm to reach equilibrium.展开更多
This paper discusses the data-driven design of linear quadratic regulators,i.e.,to obtain the regulators directly from experimental data without using the models of plants.In particular,we aim to improve an existing d...This paper discusses the data-driven design of linear quadratic regulators,i.e.,to obtain the regulators directly from experimental data without using the models of plants.In particular,we aim to improve an existing design method by reducing the amount of the required experimental data.Reducing the data amount leads to the cost reduction of experiments and computation for the data-driven design.We present a simplified version of the existing method,where parameters yielding the gain of the regulator are estimated from only part of the data required in the existing method.We then show that the data amount required in the presented method is less than half of that in the existing method under certain conditions.In addition,assuming the presence of measurement noise,we analyze the relations between the expectations and variances of the estimated parameters and the noise.As a result,it is shown that using a larger amount of the experimental data might mitigate the effects of the noise on the estimated parameters.These results are verified by numerical examples.展开更多
Double cost function linear quadratic regulator (DLQR) is developed from LQR theory to solve an optimal control problem with a general nonlinear cost function. In addition to the traditional LQ cost function, anothe...Double cost function linear quadratic regulator (DLQR) is developed from LQR theory to solve an optimal control problem with a general nonlinear cost function. In addition to the traditional LQ cost function, another free form cost function was introduced to express the physical need plainly and optimize weights of LQ cost function using the search algorithms. As an instance, DLQR was applied in determining the control input in the front steering angle compensation control (FSAC) model for heavy duty vehicles. The brief simulations show that DLQR is powerful enough to specify the engineering requirements correctly and balance many factors effectively. The concept and applicable field of LQR are expanded by DLQR to optimize the system with a free form cost function.展开更多
In order to intercept the future targets that are characterized by high maneuverability, multiple interceptors may be launched and aimed at single target. The scenario of two missiles P and Q intercepting a single tar...In order to intercept the future targets that are characterized by high maneuverability, multiple interceptors may be launched and aimed at single target. The scenario of two missiles P and Q intercepting a single target is modeled as a two-pursuit single-evader non-zero-sum linear quadratic differential game. The intercept space is decomposed into three subspaces which are mutually disjoint and their union covers the entire intercept space. The effect of adding the second interceptor arises in the intercept space of both P and Q (PQ-intercept space). A guidance law is derived from the Nash equilibrium strategy set (NESS) of the game. Simulation studies are focused on the PQ-intercept space. It is indicated that 1) increasing the target's maneuverability will enlarge PQ-intercept space; 2) the handover conditions will be released if the initial zero-effort-miss (ZEM) of both interceptors has opposite sign; 3) overvaluation of the target's maneuverability by choosing a small weight coefficient will generate robust performance with respect to the target maneuvering command switch time and decrease the fuel requirement; and 4) cooperation between interceptors increases the interception probability.展开更多
In this paper,the problem of inverse quadratic optimal control over fnite time-horizon for discrete-time linear systems is considered.Our goal is to recover the corresponding quadratic objective function using noisy o...In this paper,the problem of inverse quadratic optimal control over fnite time-horizon for discrete-time linear systems is considered.Our goal is to recover the corresponding quadratic objective function using noisy observations.First,the identifability of the model structure for the inverse optimal control problem is analyzed under relative degree assumption and we show the model structure is strictly globally identifable.Next,we study the inverse optimal control problem whose initial state distribution and the observation noise distribution are unknown,yet the exact observations on the initial states are available.We formulate the problem as a risk minimization problem and approximate the problem using empirical average.It is further shown that the solution to the approximated problem is statistically consistent under the assumption of relative degrees.We then study the case where the exact observations on the initial states are not available,yet the observation noises are known to be white Gaussian distributed and the distribution of the initial state is also Gaussian(with unknown mean and covariance).EM-algorithm is used to estimate the parameters in the objective function.The efectiveness of our results are demonstrated by numerical examples.展开更多
This paper deals with the problem of singular linear quadratic performance with the worst-disturbance rejection for descriptor systems. Under the conditions we give, the worst-disturbance and the optimal control-state...This paper deals with the problem of singular linear quadratic performance with the worst-disturbance rejection for descriptor systems. Under the conditions we give, the worst-disturbance and the optimal control-state pair are unique respectively, the optimal control can be synthesized as state feedback and the closed-loop system is regular, stable and impulse-free.展开更多
The quarter model of an active suspension is established in the form of controllable autoregressive moving average (CARMA) model. An accelerometer can be mounted on the wheel hub for measuring road disturbance; this...The quarter model of an active suspension is established in the form of controllable autoregressive moving average (CARMA) model. An accelerometer can be mounted on the wheel hub for measuring road disturbance; this signal is used to identify the CARMA model parameters by recursive forgetting factors least square method. The linear quadratic integral (LQI) control method for the active suspension is presented. The LQI control algorithm is fit for vehicle suspension control, for the control performance index can comprise multi controlled variables. The simulation results show that the vertical acceleration and suspension travel both are decreased with the LQI control in the low frequency band, and the suspension travel is increased with the LQI control in the middle or high frequency band. The suspension travel is very small in the middle or high frequency band, the suspension bottoming stop will not happen, so the vehicle ride quality can be improved apparently by the LQI control.展开更多
Optimal control system of state space is a conservative system, whose approximate method should be symplectic conservation. Based on the precise integration method, an algorithm of symplectic conservative perturbation...Optimal control system of state space is a conservative system, whose approximate method should be symplectic conservation. Based on the precise integration method, an algorithm of symplectic conservative perturbation is presented. It gives a uniform way to solve the linear quadratic control (LQ control) problems for linear timevarying systems accurately and efficiently, whose key points are solutions of differential Riccati equation (DRE) with variable coefficients and the state feedback equation. The method is symplectic conservative and has a good numerical stability and high precision. Numerical examples demonstrate the effectiveness of the proposed method.展开更多
Most of the processes in the industry have nonlinear behavior. Control of such processes with conventional control methods could lead to unstable, suboptimal, etc., results. On the other hand, the adaptive control is ...Most of the processes in the industry have nonlinear behavior. Control of such processes with conventional control methods could lead to unstable, suboptimal, etc., results. On the other hand, the adaptive control is a technique widely used for controlling of nonlinear systems. The approach here is based on the recursive identification of the external linear model as a linear representation of the originally nonlinear system. The controller then reacts to the change of the working point or disturbances which could occur by the change of the parameters, structure, etc. The polynomial synthesis together with the linear quadratic(LQ) approach is employed here for the controller synthesis. These techniques satisfy basic control requirements such as the stability, the reference signal tracking and the disturbance attenuation. Resulted controller could be tuned with the choice of weighting factors in LQ approach. This work investigates the effect of these factors on control results. Proposed methods are tested on the mathematical model of the isothermal continuous stirred-tank reactor and simulated results are also verified on the real model of the continuous stirred tank reactor.展开更多
In this paper, the Nash equilibria for differential games with multiple players is studied. A method for solving the Riccati-type matrix differential equations for open-loop Nash strategy in linear quadratic game with...In this paper, the Nash equilibria for differential games with multiple players is studied. A method for solving the Riccati-type matrix differential equations for open-loop Nash strategy in linear quadratic game with multiple players is presented and analytical solution is given for a type of differential games in which the system matrixcan be diagonalizable. As the special cases, the Nash equilibria for some type of differential games with particular structure is studied also, and some results in previous literatures are extended. Finally, a numerical example is given to illustrate the effectiveness of the solution procedure.展开更多
This paper deals with Furuta Pendulum(FP)or Rotary Inverted Pendulum(RIP),which is an under-actuated non-minimum unstable non-linear process.The process considered along with uncertainties which are unmodelled and ana...This paper deals with Furuta Pendulum(FP)or Rotary Inverted Pendulum(RIP),which is an under-actuated non-minimum unstable non-linear process.The process considered along with uncertainties which are unmodelled and analyses the performance of Linear Quadratic Regulator(LQR)with Kalman filter and H∞filter as two filter configurations.The LQR is a technique for developing practical feedback,in addition the desired x shows the vector of desirable states and is used as the external input to the closed-loop system.The effectiveness of the two filters in FP or RIP are measured and contrasted with rise time,peak time,settling time and maximum peak overshoot for time domain performance.The filters are also tested with gain margin,phase margin,disk stability margins for frequency domain performance and worst case stability margins for performance due to uncertainties.The H-infinity filter reduces the estimate error to a minimum,making it resilient in the worst case than the standard Kalman filter.Further,when theβrestriction value lowers,the H∞filter becomes more robust.The worst case gain performance is also focused for the two filter configurations and tested where H∞filter is found to outperform towards robust stability and performance.Also the switchover between the two filters is dependent upon a user-specified co-efficient that gives the flexibility in the design of non-linear systems.The non-linear process is tested for set point tracking,disturbance rejection,un-modelled noise dynamics and uncertainties,which records robust performance towards stability.展开更多
In this paper,we focus on a control-constrained stochastic LQ optimal control problem via backward stochastic differential equation(BSDE in short)with deterministic coefficients.One of the significant features in this...In this paper,we focus on a control-constrained stochastic LQ optimal control problem via backward stochastic differential equation(BSDE in short)with deterministic coefficients.One of the significant features in this framework,in contrast to the classical LQ issue,embodies that the admissible control set needs to satisfy more than the square integrability.By introducing two kinds of new generalized Riccati equations,we are able to announce the explicit optimal control and the solution to the corresponding H-J-B equation.A linear quadratic recursive utility portfolio optimization problem in the financial engineering is discussed as an explicitly illustrated example of the main result with short-selling prohibited.Feasibility of the mean-variance portfolio selection problem via BSDE for a financial market is characterized,and associated efficient portfolios are given in a closed form.展开更多
In this paper,the authors revisit decentralized control of linear quadratic(LQ)systems.Instead of imposing an assumption that the process and observation noises are Gaussian,the authors assume that the controllers are...In this paper,the authors revisit decentralized control of linear quadratic(LQ)systems.Instead of imposing an assumption that the process and observation noises are Gaussian,the authors assume that the controllers are restricted to be linear.The authors show that the multiple decentralized control models,the form of the best linear controllers is identical to the optimal controllers obtained under the Gaussian noise assumption.The main contribution of the paper is the solution technique.Traditionally,optimal controllers for decentralized LQ systems are identified using dynamic programming,maximum principle,or spectral decomposition.The authors present an alternative approach which is based by combining elementary building blocks from linear systems,namely,completion of squares,state splitting,static reduction,orthogonal projection,(conditional)independence of state processes,and decentralized estimation.展开更多
The author studies a stochastic linear quadratic(SLQ for short)optimal control problem for systems governed by stochastic evolution equations,where the control operator in the drift term may be unbounded.Under the con...The author studies a stochastic linear quadratic(SLQ for short)optimal control problem for systems governed by stochastic evolution equations,where the control operator in the drift term may be unbounded.Under the condition that the cost functional is uniformly convex,the well-posedness of the operator-valued Riccati equation is proved.Based on that,the optimal feedback control of the control problem is given.展开更多
This paper reviews the mean field social(MFS)optimal control problem for multi-agent dynamic systems and the mean-field-type(MFT)optimal control problem for single-agent dynamic systems within the linear quadratic(LQ)...This paper reviews the mean field social(MFS)optimal control problem for multi-agent dynamic systems and the mean-field-type(MFT)optimal control problem for single-agent dynamic systems within the linear quadratic(LQ)framework.For the MFS control problem,this review discusses the existing conclusions on optimization in dynamic systems affected by both additive and multiplicative noises.In exploring MFT optimization,the authors first revisit researches associated with single-player systems constrained by these dynamics.The authors then extend the proposed review to scenarios that include multiple players engaged in Nash games,Stackelberg games,and cooperative Pareto games.Finally,the paper concludes by emphasizing future research on intelligent algorithms for mean field optimization,particularly using reinforcement learning method to design strategies for models with unknown parameters.展开更多
This paper focuses on linear-quadratic(LQ)optimal control for a class of systems governed by first-order hyperbolic partial differential equations(PDEs).Different from most of the previous works,an approach of discret...This paper focuses on linear-quadratic(LQ)optimal control for a class of systems governed by first-order hyperbolic partial differential equations(PDEs).Different from most of the previous works,an approach of discretization-then-continuousization is proposed in this paper to cope with the infinite-dimensional nature of PDE systems.The contributions of this paper consist of the following aspects:(1)The differential Riccati equations and the solvability condition of the LQ optimal control problems are obtained via the discretization-then-continuousization method.(2)A numerical calculation way of the differential Riccati equations and a practical design way of the optimal controller are proposed.Meanwhile,the relationship between the optimal costate and the optimal state is established by solving a set of forward and backward partial difference equations(FBPDEs).(3)The correctness of the method used in this paper is verified by a complementary continuous method and the comparative analysis with the existing operator results is presented.It is shown that the proposed results not only contain the classic results of the standard LQ control problem of systems governed by ordinary differential equations as a special case,but also support the existing operator results and give a more convenient form of computation.展开更多
In this paper,we study the linear quadratic(LQ)optimal control of time-varying difference system with terminal state constraints.The main contribution is to provide the Q-learning algorithm for the optimal controller ...In this paper,we study the linear quadratic(LQ)optimal control of time-varying difference system with terminal state constraints.The main contribution is to provide the Q-learning algorithm for the optimal controller under the case that the time-varying system matrices and input matrices are both unknown,which consists of learning the solution of the Riccati equation and calculating the specific Lagrange multiplier from the data-driven matrix equation.Different from the existing Q-learning algorithms that mainly focus on unconstrained optimal control problems,the novelty of the proposed algorithm can be applied to handle situations with terminal state constraints.The effectiveness of the proposed Q-learning algorithm is demonstrated through a numerical example.展开更多
In this paper, tile authors first study two kinds of stochastic differential equations (SDEs) with Levy processes as noise source. Based on the existence and uniqueness of the solutions of these SDEs and multi-dimen...In this paper, tile authors first study two kinds of stochastic differential equations (SDEs) with Levy processes as noise source. Based on the existence and uniqueness of the solutions of these SDEs and multi-dimensional backward stochastic differential equations (BSDEs) driven by Levy pro- cesses, the authors proceed to study a stochastic linear quadratic (LQ) optimal control problem with a Levy process, where the cost weighting matrices of the state and control are allowed to be indefinite. One kind of new stochastic Riccati equation that involves equality and inequality constraints is derived from the idea of square completion and its solvability is proved to be sufficient for the well-posedness and the existence of optimal control which can be of either state feedback or open-loop form of the LQ problems. Moreover, the authors obtain the existence and uniqueness of the solution to the Riccati equation for some special cases. Finally, two examples are presented to illustrate these theoretical results.展开更多
This paper studies an asymptotic solvability problem for linear quadratic(LQ)mean field games with controlled diffusions and indefinite weights for the state and control in the costs.The authors employ a rescaling app...This paper studies an asymptotic solvability problem for linear quadratic(LQ)mean field games with controlled diffusions and indefinite weights for the state and control in the costs.The authors employ a rescaling approach to derive a low dimensional Riccati ordinary differential equation(ODE)system,which characterizes a necessary and sufficient condition for asymptotic solvability.The rescaling technique is further used for performance estimates,establishing an O(1/N)-Nash equilibrium for the obtained decentralized strategies.展开更多
We present an iterative linear quadratic regulator(ILQR) method for trajectory tracking control of a wheeled mobile robot system.The proposed scheme involves a kinematic model linearization technique,a global trajecto...We present an iterative linear quadratic regulator(ILQR) method for trajectory tracking control of a wheeled mobile robot system.The proposed scheme involves a kinematic model linearization technique,a global trajectory generation algorithm,and trajectory tracking controller design.A lattice planner,which searches over a 3D(x,y,θ) configuration space,is adopted to generate the global trajectory.The ILQR method is used to design a local trajectory tracking controller.The effectiveness of the proposed method is demonstrated in simulation and experiment with a significantly asymmetric differential drive robot.The performance of the local controller is analyzed and compared with that of the existing linear quadratic regulator(LQR) method.According to the experiments,the new controller improves the control sequences(v,ω) iteratively and produces slightly better results.Specifically,two trajectories,'S' and '8' courses,are followed with sufficient accuracy using the proposed controller.展开更多
文摘In this study,a dynamic model for an inverted pendulum system(IPS)attached to a car is created,and two different control methods are applied to control the system.The designed control algorithms aim to stabilize the pendulum arms in the upright position and the car to reach the equilibrium position.Grey Wolf Optimization-based Linear Quadratic Regulator(GWO-LQR)and GWO-based Fuzzy LQR(FLQR)control algorithms are used in the control process.To improve the performance of the LQR and FLQR methods,the optimum values of the coefficients corresponding to the foot points of the membership functions are determined by the GWO algorithm.Both a graphic and a numerical analysis of the outcomes are provided.In the comparative analysis,it is observed that the GWO-based FLQR method reduces the settling time by 22.58% and the maximum peak value by 18.2% when evaluated in terms of the angular response of the pendulum arm.Furthermore,this approach outperformed comparable research in the literature with a settling time of 2.4 s.These findings demonstrate that the suggested GWO-based FLQR controlmethod outperforms existing literature in terms of the time required for the pendulum arm to reach equilibrium.
文摘This paper discusses the data-driven design of linear quadratic regulators,i.e.,to obtain the regulators directly from experimental data without using the models of plants.In particular,we aim to improve an existing design method by reducing the amount of the required experimental data.Reducing the data amount leads to the cost reduction of experiments and computation for the data-driven design.We present a simplified version of the existing method,where parameters yielding the gain of the regulator are estimated from only part of the data required in the existing method.We then show that the data amount required in the presented method is less than half of that in the existing method under certain conditions.In addition,assuming the presence of measurement noise,we analyze the relations between the expectations and variances of the estimated parameters and the noise.As a result,it is shown that using a larger amount of the experimental data might mitigate the effects of the noise on the estimated parameters.These results are verified by numerical examples.
文摘Double cost function linear quadratic regulator (DLQR) is developed from LQR theory to solve an optimal control problem with a general nonlinear cost function. In addition to the traditional LQ cost function, another free form cost function was introduced to express the physical need plainly and optimize weights of LQ cost function using the search algorithms. As an instance, DLQR was applied in determining the control input in the front steering angle compensation control (FSAC) model for heavy duty vehicles. The brief simulations show that DLQR is powerful enough to specify the engineering requirements correctly and balance many factors effectively. The concept and applicable field of LQR are expanded by DLQR to optimize the system with a free form cost function.
文摘In order to intercept the future targets that are characterized by high maneuverability, multiple interceptors may be launched and aimed at single target. The scenario of two missiles P and Q intercepting a single target is modeled as a two-pursuit single-evader non-zero-sum linear quadratic differential game. The intercept space is decomposed into three subspaces which are mutually disjoint and their union covers the entire intercept space. The effect of adding the second interceptor arises in the intercept space of both P and Q (PQ-intercept space). A guidance law is derived from the Nash equilibrium strategy set (NESS) of the game. Simulation studies are focused on the PQ-intercept space. It is indicated that 1) increasing the target's maneuverability will enlarge PQ-intercept space; 2) the handover conditions will be released if the initial zero-effort-miss (ZEM) of both interceptors has opposite sign; 3) overvaluation of the target's maneuverability by choosing a small weight coefficient will generate robust performance with respect to the target maneuvering command switch time and decrease the fuel requirement; and 4) cooperation between interceptors increases the interception probability.
文摘In this paper,the problem of inverse quadratic optimal control over fnite time-horizon for discrete-time linear systems is considered.Our goal is to recover the corresponding quadratic objective function using noisy observations.First,the identifability of the model structure for the inverse optimal control problem is analyzed under relative degree assumption and we show the model structure is strictly globally identifable.Next,we study the inverse optimal control problem whose initial state distribution and the observation noise distribution are unknown,yet the exact observations on the initial states are available.We formulate the problem as a risk minimization problem and approximate the problem using empirical average.It is further shown that the solution to the approximated problem is statistically consistent under the assumption of relative degrees.We then study the case where the exact observations on the initial states are not available,yet the observation noises are known to be white Gaussian distributed and the distribution of the initial state is also Gaussian(with unknown mean and covariance).EM-algorithm is used to estimate the parameters in the objective function.The efectiveness of our results are demonstrated by numerical examples.
基金This work was supported by Natural Science Foundation of Shandong Province (No. Y2004A05, Y2004A07)Science Technology Planning Project of Shandong Provincial Education Department(No. J05P51) and Science Research Foundation of Shandong Economic University
文摘This paper deals with the problem of singular linear quadratic performance with the worst-disturbance rejection for descriptor systems. Under the conditions we give, the worst-disturbance and the optimal control-state pair are unique respectively, the optimal control can be synthesized as state feedback and the closed-loop system is regular, stable and impulse-free.
文摘The quarter model of an active suspension is established in the form of controllable autoregressive moving average (CARMA) model. An accelerometer can be mounted on the wheel hub for measuring road disturbance; this signal is used to identify the CARMA model parameters by recursive forgetting factors least square method. The linear quadratic integral (LQI) control method for the active suspension is presented. The LQI control algorithm is fit for vehicle suspension control, for the control performance index can comprise multi controlled variables. The simulation results show that the vertical acceleration and suspension travel both are decreased with the LQI control in the low frequency band, and the suspension travel is increased with the LQI control in the middle or high frequency band. The suspension travel is very small in the middle or high frequency band, the suspension bottoming stop will not happen, so the vehicle ride quality can be improved apparently by the LQI control.
基金Project supported by the National Natural Science Foundation of China (No.10202004)
文摘Optimal control system of state space is a conservative system, whose approximate method should be symplectic conservation. Based on the precise integration method, an algorithm of symplectic conservative perturbation is presented. It gives a uniform way to solve the linear quadratic control (LQ control) problems for linear timevarying systems accurately and efficiently, whose key points are solutions of differential Riccati equation (DRE) with variable coefficients and the state feedback equation. The method is symplectic conservative and has a good numerical stability and high precision. Numerical examples demonstrate the effectiveness of the proposed method.
文摘Most of the processes in the industry have nonlinear behavior. Control of such processes with conventional control methods could lead to unstable, suboptimal, etc., results. On the other hand, the adaptive control is a technique widely used for controlling of nonlinear systems. The approach here is based on the recursive identification of the external linear model as a linear representation of the originally nonlinear system. The controller then reacts to the change of the working point or disturbances which could occur by the change of the parameters, structure, etc. The polynomial synthesis together with the linear quadratic(LQ) approach is employed here for the controller synthesis. These techniques satisfy basic control requirements such as the stability, the reference signal tracking and the disturbance attenuation. Resulted controller could be tuned with the choice of weighting factors in LQ approach. This work investigates the effect of these factors on control results. Proposed methods are tested on the mathematical model of the isothermal continuous stirred-tank reactor and simulated results are also verified on the real model of the continuous stirred tank reactor.
基金This work was supported by the National Natural Science Foundation of China(No.60474029)China Postdoctoral Science Foundation (No.2005038558)
文摘In this paper, the Nash equilibria for differential games with multiple players is studied. A method for solving the Riccati-type matrix differential equations for open-loop Nash strategy in linear quadratic game with multiple players is presented and analytical solution is given for a type of differential games in which the system matrixcan be diagonalizable. As the special cases, the Nash equilibria for some type of differential games with particular structure is studied also, and some results in previous literatures are extended. Finally, a numerical example is given to illustrate the effectiveness of the solution procedure.
文摘This paper deals with Furuta Pendulum(FP)or Rotary Inverted Pendulum(RIP),which is an under-actuated non-minimum unstable non-linear process.The process considered along with uncertainties which are unmodelled and analyses the performance of Linear Quadratic Regulator(LQR)with Kalman filter and H∞filter as two filter configurations.The LQR is a technique for developing practical feedback,in addition the desired x shows the vector of desirable states and is used as the external input to the closed-loop system.The effectiveness of the two filters in FP or RIP are measured and contrasted with rise time,peak time,settling time and maximum peak overshoot for time domain performance.The filters are also tested with gain margin,phase margin,disk stability margins for frequency domain performance and worst case stability margins for performance due to uncertainties.The H-infinity filter reduces the estimate error to a minimum,making it resilient in the worst case than the standard Kalman filter.Further,when theβrestriction value lowers,the H∞filter becomes more robust.The worst case gain performance is also focused for the two filter configurations and tested where H∞filter is found to outperform towards robust stability and performance.Also the switchover between the two filters is dependent upon a user-specified co-efficient that gives the flexibility in the design of non-linear systems.The non-linear process is tested for set point tracking,disturbance rejection,un-modelled noise dynamics and uncertainties,which records robust performance towards stability.
基金financial support partly by the National Nature Science Foundation of China(Grant No.12171053,11701040,11871010&61871058)the Fundamental Research Funds for the Central Universities+2 种基金the Research Funds of Renmin University of China(No.23XNKJ05)the financial support partly by the National Nature Science Foundation of China(Grant No.11871010,11971040)the Fundamental Research Funds for the Central Universities(No.2019XD-A11).
文摘In this paper,we focus on a control-constrained stochastic LQ optimal control problem via backward stochastic differential equation(BSDE in short)with deterministic coefficients.One of the significant features in this framework,in contrast to the classical LQ issue,embodies that the admissible control set needs to satisfy more than the square integrability.By introducing two kinds of new generalized Riccati equations,we are able to announce the explicit optimal control and the solution to the corresponding H-J-B equation.A linear quadratic recursive utility portfolio optimization problem in the financial engineering is discussed as an explicitly illustrated example of the main result with short-selling prohibited.Feasibility of the mean-variance portfolio selection problem via BSDE for a financial market is characterized,and associated efficient portfolios are given in a closed form.
基金supported in part by the Natural Sciences and Engineering Research Council of Canada(NSERC)Discovery Grant under Grant No.RGPIN-2021-0351.
文摘In this paper,the authors revisit decentralized control of linear quadratic(LQ)systems.Instead of imposing an assumption that the process and observation noises are Gaussian,the authors assume that the controllers are restricted to be linear.The authors show that the multiple decentralized control models,the form of the best linear controllers is identical to the optimal controllers obtained under the Gaussian noise assumption.The main contribution of the paper is the solution technique.Traditionally,optimal controllers for decentralized LQ systems are identified using dynamic programming,maximum principle,or spectral decomposition.The authors present an alternative approach which is based by combining elementary building blocks from linear systems,namely,completion of squares,state splitting,static reduction,orthogonal projection,(conditional)independence of state processes,and decentralized estimation.
基金supported by the National Natural Science Foundation of China(Nos.11971334,12025105)。
文摘The author studies a stochastic linear quadratic(SLQ for short)optimal control problem for systems governed by stochastic evolution equations,where the control operator in the drift term may be unbounded.Under the condition that the cost functional is uniformly convex,the well-posedness of the operator-valued Riccati equation is proved.Based on that,the optimal feedback control of the control problem is given.
基金supported by the National Natural Science Foundation of China under Grant Nos.62103442,12326343,62373229the Research Grants Council of the Hong Kong Special Administrative Region,China under Grant Nos.CityU 11213023,11205724+3 种基金the Natural Science Foundation of Shandong Province under Grant No.ZR2021QF080the Taishan Scholar Project of Shandong Province under Grant No.tsqn202408110the Fundamental Research Foundation of the Central Universities under Grant No.23CX06024Athe Outstanding Youth Innovation Team in Shandong Higher Education Institutions under Grant No.2023KJ061.
文摘This paper reviews the mean field social(MFS)optimal control problem for multi-agent dynamic systems and the mean-field-type(MFT)optimal control problem for single-agent dynamic systems within the linear quadratic(LQ)framework.For the MFS control problem,this review discusses the existing conclusions on optimization in dynamic systems affected by both additive and multiplicative noises.In exploring MFT optimization,the authors first revisit researches associated with single-player systems constrained by these dynamics.The authors then extend the proposed review to scenarios that include multiple players engaged in Nash games,Stackelberg games,and cooperative Pareto games.Finally,the paper concludes by emphasizing future research on intelligent algorithms for mean field optimization,particularly using reinforcement learning method to design strategies for models with unknown parameters.
基金supported by the National Natural Science Foundation of China under Grant Nos.61821004 and 62250056the Natural Science Foundation of Shandong Province under Grant Nos.ZR2021ZD14 and ZR2021JQ24+1 种基金Science and Technology Project of Qingdao West Coast New Area under Grant Nos.2019-32,2020-20,2020-1-4,High-level Talent Team Project of Qingdao West Coast New Area under Grant No.RCTDJC-2019-05Key Research and Development Program of Shandong Province under Grant No.2020CXGC01208.
文摘This paper focuses on linear-quadratic(LQ)optimal control for a class of systems governed by first-order hyperbolic partial differential equations(PDEs).Different from most of the previous works,an approach of discretization-then-continuousization is proposed in this paper to cope with the infinite-dimensional nature of PDE systems.The contributions of this paper consist of the following aspects:(1)The differential Riccati equations and the solvability condition of the LQ optimal control problems are obtained via the discretization-then-continuousization method.(2)A numerical calculation way of the differential Riccati equations and a practical design way of the optimal controller are proposed.Meanwhile,the relationship between the optimal costate and the optimal state is established by solving a set of forward and backward partial difference equations(FBPDEs).(3)The correctness of the method used in this paper is verified by a complementary continuous method and the comparative analysis with the existing operator results is presented.It is shown that the proposed results not only contain the classic results of the standard LQ control problem of systems governed by ordinary differential equations as a special case,but also support the existing operator results and give a more convenient form of computation.
基金supported by the Natural Science Foundation of Shandong Province(Nos.ZR2021ZD14,ZR2021JQ24,and ZR2024QF198).
文摘In this paper,we study the linear quadratic(LQ)optimal control of time-varying difference system with terminal state constraints.The main contribution is to provide the Q-learning algorithm for the optimal controller under the case that the time-varying system matrices and input matrices are both unknown,which consists of learning the solution of the Riccati equation and calculating the specific Lagrange multiplier from the data-driven matrix equation.Different from the existing Q-learning algorithms that mainly focus on unconstrained optimal control problems,the novelty of the proposed algorithm can be applied to handle situations with terminal state constraints.The effectiveness of the proposed Q-learning algorithm is demonstrated through a numerical example.
基金This work was supported by the National Basic Research Program of China (973 Program) under Grant No. 2007CB814904the Natural Science Foundation of China under Grant No. 10671112+1 种基金Shandong Province under Grant No. Z2006A01Research Fund for the Doctoral Program of Higher Education of China under Grant No. 20060422018
文摘In this paper, tile authors first study two kinds of stochastic differential equations (SDEs) with Levy processes as noise source. Based on the existence and uniqueness of the solutions of these SDEs and multi-dimensional backward stochastic differential equations (BSDEs) driven by Levy pro- cesses, the authors proceed to study a stochastic linear quadratic (LQ) optimal control problem with a Levy process, where the cost weighting matrices of the state and control are allowed to be indefinite. One kind of new stochastic Riccati equation that involves equality and inequality constraints is derived from the idea of square completion and its solvability is proved to be sufficient for the well-posedness and the existence of optimal control which can be of either state feedback or open-loop form of the LQ problems. Moreover, the authors obtain the existence and uniqueness of the solution to the Riccati equation for some special cases. Finally, two examples are presented to illustrate these theoretical results.
基金Natural Sciences and Engineering Research Council(NSERC)of Canada。
文摘This paper studies an asymptotic solvability problem for linear quadratic(LQ)mean field games with controlled diffusions and indefinite weights for the state and control in the costs.The authors employ a rescaling approach to derive a low dimensional Riccati ordinary differential equation(ODE)system,which characterizes a necessary and sufficient condition for asymptotic solvability.The rescaling technique is further used for performance estimates,establishing an O(1/N)-Nash equilibrium for the obtained decentralized strategies.
基金Project (Nos. 90920304 and 91120015) supported by the National Natural Science Foundation of China
文摘We present an iterative linear quadratic regulator(ILQR) method for trajectory tracking control of a wheeled mobile robot system.The proposed scheme involves a kinematic model linearization technique,a global trajectory generation algorithm,and trajectory tracking controller design.A lattice planner,which searches over a 3D(x,y,θ) configuration space,is adopted to generate the global trajectory.The ILQR method is used to design a local trajectory tracking controller.The effectiveness of the proposed method is demonstrated in simulation and experiment with a significantly asymmetric differential drive robot.The performance of the local controller is analyzed and compared with that of the existing linear quadratic regulator(LQR) method.According to the experiments,the new controller improves the control sequences(v,ω) iteratively and produces slightly better results.Specifically,two trajectories,'S' and '8' courses,are followed with sufficient accuracy using the proposed controller.