Funding: This work was supported by the National Natural Science Foundation of China (61822307, 61773188).
Abstract: In this paper, an adaptive neural-network (NN) output feedback optimal control problem is studied for a class of strict-feedback nonlinear systems with unknown internal dynamics, input saturation and state constraints. Neural networks are used to approximate the unknown internal dynamics, and an adaptive NN state observer is developed to estimate the unmeasurable states. Under the framework of backstepping design, the virtual and actual optimal controllers are developed by employing the actor-critic architecture and constructing a tan-type barrier Lyapunov function (BLF). To accomplish optimal control effectively, a simplified reinforcement learning (RL) algorithm is designed by deriving the updating laws from the negative gradient of a simple positive function, instead of employing existing optimal control methods. In addition, all signals in the closed-loop system are guaranteed to be bounded, the output follows the reference signal within a bounded error, and all state variables remain confined within their compact sets at all times. Finally, a simulation example is given to illustrate the effectiveness of the proposed control strategy.
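The abstract does not state the barrier function itself. As an illustration only, one tan-type BLF that appears widely in the constrained-control literature is shown below. The symbols z (a tracking or backstepping error) and k_b (its constraint bound) are assumptions introduced here, not notation taken from the paper.

```latex
V(z) = \frac{k_b^{2}}{\pi}\,\tan\!\left(\frac{\pi z^{2}}{2 k_b^{2}}\right), \qquad |z| < k_b
```

V grows without bound as |z| approaches k_b, so keeping V bounded keeps |z| < k_b. As k_b tends to infinity, V(z) tends to z^2/2, which recovers the usual quadratic Lyapunov function for the unconstrained case.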
Funding: Supported by the National Natural Science Foundation of China (61321062, 61503100) and the China Postdoctoral Science Foundation (2014M550189).
Abstract: The stabilization problem of linear time-varying systems with both state and input constraints is considered. Sufficient conditions for the existence of a solution to this problem are derived, and a gain-switched (gain-scheduled) state feedback control scheme is built to stabilize the constrained time-varying system. The design problem is transformed into a series of convex feasibility problems that can be solved efficiently. A design example is given to illustrate the effect of the proposed algorithm.
Abstract: This paper deals with the maximum principle for optimal control problems governed by elliptic variational inequalities. Some state constraints are discussed. The basic techniques used here are based on those in [1] and on a new penalty functional defined in this paper.
Abstract: Optimal control problems for hyperbolic H-hemivariational inequalities with state constraints and a nonmonotone multivalued mapping term are considered. The optimal solutions are obtained. In addition, the corresponding approximating problems are studied.
Abstract: The optimal control problem for parabolic variational inequalities with a state constraint and a nonlinear, discontinuous, nonmonotone multivalued mapping term is studied, together with its approximating problem. The results generalize some previously obtained results.
Abstract: A class of direct methods is presented for the solution of optimal control problems with state constraints. These methods are sequential quadratic programming methods. At every iteration, a quadratic programming subproblem, obtained from a quadratic approximation to the Lagrangian function and linear approximations to the constraints, is solved to get a search direction for a merit function. The merit function is formulated by augmenting the Lagrangian function with a penalty term. A line search is carried out along the search direction to determine a step length such that the merit function is decreased. The methods presented in this paper include continuous sequential quadratic programming methods and discrete sequential quadratic programming methods.
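The iteration described in this abstract (solve a quadratic subproblem, then line-search on an augmented-Lagrangian merit function) can be sketched for a small finite-dimensional problem. This is a minimal illustration, not the paper's algorithm: it assumes equality constraints only, uses the objective Hessian as the quadratic model, and all names (`sqp_equality`, `rho`, the toy problem) are hypothetical.

```python
import numpy as np

def sqp_equality(f, grad, hess, c, jac, x0, rho=10.0, tol=1e-8, max_iter=50):
    """Sketch of SQP for: minimize f(x) subject to c(x) = 0."""
    x = np.asarray(x0, dtype=float)
    for _ in range(max_iter):
        g, H = grad(x), hess(x)
        cx = np.atleast_1d(c(x))
        A = np.atleast_2d(jac(x))
        n, m = x.size, cx.size
        # QP subproblem: min g.p + 0.5 p.H.p  s.t.  A p + c = 0  (KKT system)
        K = np.block([[H, A.T], [A, np.zeros((m, m))]])
        sol = np.linalg.solve(K, -np.concatenate([g, cx]))
        p, lam = sol[:n], sol[n:]
        if np.linalg.norm(p) < tol:
            break

        # Merit function: Lagrangian augmented with a quadratic penalty term
        def merit(z):
            cz = np.atleast_1d(c(z))
            return f(z) + lam @ cz + 0.5 * rho * (cz @ cz)

        # Directional derivative of the merit function along p (with lam fixed)
        slope = -(p @ H @ p) - rho * (cx @ cx)
        alpha, phi0 = 1.0, merit(x)
        # Backtracking (Armijo) line search so that the merit function decreases
        while merit(x + alpha * p) > phi0 + 1e-4 * alpha * slope and alpha > 1e-10:
            alpha *= 0.5
        x = x + alpha * p
    return x

# Toy problem (assumed for illustration): min x0^2 + x1^2  s.t.  x0 + x1 = 1
x_opt = sqp_equality(
    f=lambda x: x @ x,
    grad=lambda x: 2.0 * x,
    hess=lambda x: 2.0 * np.eye(2),
    c=lambda x: x[0] + x[1] - 1.0,
    jac=lambda x: np.array([[1.0, 1.0]]),
    x0=[2.0, 0.0],
)
```

For an optimal control problem, `x` would stack the discretized states and controls, and `c` would collect the discretized dynamics; state inequality constraints would additionally require an inequality-capable QP solver.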
Abstract: Rice yield is still low in Nigeria despite the country's ecological advantages, and several challenges have been traced to its production. This study investigates whether other rice-producing nations face similar challenges, at what magnitude, and, more importantly, what can be learned to improve rice yield in Nigeria. Based on a 2013/2014 survey, a total sample of 400 farmers was randomly interviewed: 164 from Niger State of Nigeria and 236 from Hainan Province of China. The study collates the farmers' perceptions of rice production constraints, categorized as biotic, abiotic and socioeconomic. Biplot analysis was employed to examine the multivariate pattern of their perceptions of production constraints. This multivariate technique simultaneously displays the different yield levels and constraint factors in the data matrix, providing the inter-unit distances, variances and correlations of the variables. According to the study, Niger State farmers identified socioeconomic constraints as the major factors limiting production and attributed them to a lack of, or insufficient, investment, while the Hainan farmers mainly identified abiotic constraints. The study also indicates that great potential remains to further improve rice yield in both regions, especially in Nigeria, given appropriate investment in essential inputs. This study is of great use to extension officers; moreover, given the investment in Africa, policy makers can take advantage of bilateral and multilateral relationships to invest in and ease the transfer of agricultural information and technologies between or among partners.
Abstract: Necessary conditions for optimality are proved for smooth infinite horizon optimal control problems with unilateral state constraints (pathwise constraints) and with terminal conditions on the states at the infinite horizon. The aim of the paper is to obtain strong necessary conditions, including transversality conditions at infinity, which in many cases lead to a set of candidates for optimality containing only a few elements, similar to what is the case in finite horizon problems. However, strong growth conditions are needed for the results to hold.
Funding: Supported in part by the National Science and Technology Major Project (2021ZD0112302) and the National Natural Science Foundation of China (62222301, 61890930-5, 62021003).
Abstract: This article develops a novel data-driven safe Q-learning method to design a safe optimal controller that guarantees the constrained states of nonlinear systems always stay in the safe region while providing optimal performance. First, we design an augmented utility function consisting of an adjustable positive definite control obstacle function and a quadratic form of the next state to ensure safety and optimality. Second, by exploiting a pre-designed admissible policy for initialization, an off-policy stabilizing value iteration Q-learning (SVIQL) algorithm is presented to seek the safe optimal policy by using offline data within the safe region rather than a mathematical model. Third, the monotonicity, safety, and optimality of the SVIQL algorithm are theoretically proven. To obtain the initial admissible policy for SVIQL, an offline VIQL algorithm with zero initialization is constructed and a new admissibility criterion is established for immature iterative policies. Moreover, critic and action networks with precise approximation ability are established to support the operation of the VIQL and SVIQL algorithms. Finally, three simulation experiments are conducted to demonstrate the merits and superiority of the developed safe Q-learning method.
Abstract: This paper investigates State Space Model Predictive Control (SSMPC) of an aerothermic process. The process is a pilot-scale heating and ventilation system equipped with a heater grid and a centrifugal blower, fully connected through a data acquisition system for real-time control. The interaction between the process variables is shown to be challenging for single-variable controllers; therefore, multi-variable control is worth considering. A multi-variable state space model is obtained from on-line experimental data. The controller design is translated into a Quadratic Programming (QP) problem, in which a cost function is minimized subject to linear inequality constraints on the actuators. The experimental results show that the main control objectives, such as set-point tracking and perturbation rejection under actuator constraints, are well achieved for both controlled variables simultaneously.
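The translation of a state-space MPC design into a QP with actuator bounds can be sketched as follows. This is a minimal illustration under stated assumptions, not the paper's controller: the first-order model, weights, horizon and bounds are invented for the example, and the box-constrained QP is solved with a simple projected-gradient loop rather than a dedicated QP solver.

```python
import numpy as np

def condensed_mpc_qp(A, B, Q, R, x0, x_ref, N):
    """Build H, q so the MPC cost is 0.5*U'HU + q'U (+ const) over horizon N."""
    n, m = B.shape
    # Stacked prediction: X = F x0 + G U, with X = [x_1; ...; x_N]
    F = np.vstack([np.linalg.matrix_power(A, k) for k in range(1, N + 1)])
    G = np.zeros((N * n, N * m))
    for i in range(N):
        for j in range(i + 1):
            G[i*n:(i+1)*n, j*m:(j+1)*m] = np.linalg.matrix_power(A, i - j) @ B
    Qbar = np.kron(np.eye(N), Q)   # block-diagonal state weights
    Rbar = np.kron(np.eye(N), R)   # block-diagonal input weights
    H = G.T @ Qbar @ G + Rbar
    q = G.T @ Qbar @ (F @ x0 - np.tile(x_ref, N))
    return H, q

def solve_box_qp(H, q, lo, hi, iters=5000):
    """Projected gradient for: min 0.5*U'HU + q'U  s.t.  lo <= U <= hi."""
    step = 1.0 / np.linalg.eigvalsh(H).max()   # 1/L step size for a convex QP
    U = np.clip(np.zeros_like(q), lo, hi)
    for _ in range(iters):
        U = np.clip(U - step * (H @ U + q), lo, hi)
    return U

# Hypothetical first-order thermal model: x+ = 0.9 x + 0.1 u, set-point 1.0
A = np.array([[0.9]]); B = np.array([[0.1]])
Q = np.array([[1.0]]); R = np.array([[0.01]])
H, q = condensed_mpc_qp(A, B, Q, R, x0=np.array([0.0]), x_ref=np.array([1.0]), N=10)
U = solve_box_qp(H, q, lo=0.0, hi=0.5)   # actuator limited to [0, 0.5]
u_now = U[0]                             # receding horizon: apply the first move only
```

In this toy setup the set-point is unreachable within the actuator limit, so the planned inputs sit on the upper bound, which is the kind of saturation behaviour the inequality constraints are meant to handle explicitly.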
Funding: Supported by the National Key Basic Research Development Project (973 Program) (2012CB821205) and the Natural Scientific Research Innovation Foundation in Harbin Institute of Technology (HIT.NSRIF.2009004).
Abstract: This paper aims at solving the state filtering problem for linear systems with state constraints. Three classes of typical state constraints, i.e., linear equality, quadratic equality and inequality constraints, are discussed. By using the linear relationships among different state variables, a reduced-order Kalman filter is derived for the system with linear equality constraints. This solution is then applied to the cases of the quadratic equality constraint and the inequality constraints, and the two constrained state filtering problems are transformed into two corresponding constrained optimization problems, which are solved by the Lagrangian multiplier and linear matrix inequality techniques, respectively. Finally, two simple tracking examples are provided to illustrate the effectiveness of the reduced-order filters.
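For the linear equality case, a commonly used alternative to the reduced-order filter described in the abstract is to project the unconstrained Kalman estimate onto the constraint set. The sketch below shows that projection step only; it is a different, simpler technique than the paper's, and the function name and numerical values are assumptions made for illustration.

```python
import numpy as np

def project_estimate(x, P, D, d):
    """Project estimate (x, P) onto {x : D x = d}, weighting by the covariance P."""
    x = np.asarray(x, dtype=float)
    P = np.asarray(P, dtype=float)
    D = np.atleast_2d(np.asarray(D, dtype=float))
    d = np.atleast_1d(np.asarray(d, dtype=float))
    S = D @ P @ D.T                      # covariance of the constraint residual
    K = np.linalg.solve(S, D @ P).T      # K = P D' S^{-1} (S and P are symmetric)
    x_c = x - K @ (D @ x - d)            # constrained estimate
    P_c = P - K @ D @ P                  # covariance after enforcing the constraint
    return x_c, P_c

# Hypothetical two-state example with the constraint x0 + x1 = 2
x_hat = np.array([1.0, 2.0])
P_hat = np.diag([1.0, 4.0])
D = np.array([[1.0, 1.0]])
d = np.array([2.0])
x_c, P_c = project_estimate(x_hat, P_hat, D, d)
```

The correction is distributed according to the covariance: the less certain second state absorbs most of the constraint residual, and the projected covariance has no remaining uncertainty along the constraint direction.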
Funding: Supported by the National Natural Science Foundation of China (61503302) and the joint fund of the National Natural Science Foundation Committee and China Academy of Engineering Physics (U1630127).
Abstract: This paper proposes a multiple-constraints-guaranteed midcourse guidance law for the interception of hypersonic targets. In traditional midcourse guidance law design, the constraints of aero-thermal heating are rarely taken into consideration. As a result, the performance of the infrared detection system may be degraded and instability of the flight control system may be induced. To address this problem, a state-constrained model predictive static programming method is introduced such that both the terminal constraints (position and angle) and optimal energy consumption can be ensured. As a result, a sub-optimal midcourse guidance law that guarantees the aforementioned multiple constraints are never violated is synthesized. Simulation results demonstrate the effectiveness of the proposed method.
Funding: Supported by the National Natural Science Foundation of China (No. 11202024).
Abstract: This paper presents a novel three-dimensional autonomous entry guidance approach for vehicles with a relatively high lift-to-drag ratio, satisfying geographic constraints and other path constraints. The guidance is composed of onboard trajectory planning and robust trajectory tracking. For trajectory planning, a longitudinal sub-planner is introduced to generate a feasible drag-versus-energy profile by interpolating between the upper and lower boundaries of the entry corridor to obtain the desired trajectory length. The associated magnitude of the bank angle is specified by the drag profile, while the sign of the bank angle is determined by the lateral sub-planner. A two-reverse mode is used to satisfy waypoint constraints, and a dynamic heading error corridor is used to satisfy no-fly zone constraints. The longitudinal and lateral sub-planners are employed iteratively until all of the path constraints are satisfied. For trajectory tracking, a novel tracking law based on active disturbance rejection control is introduced. Finally, adaptability tests and Monte Carlo simulations of the entry guidance approach are performed. Results show that the proposed approach can adapt to different entry missions and makes the vehicle reach the prescribed target point precisely in spite of geographic constraints.
Abstract: This paper presents a novel movement planning algorithm for a guard robot in an indoor environment, imitating the job of a human security guard. A movement planner is employed by the guard robot to continuously observe a certain person. This problem differs from the person-following problem, in which the robot continuously follows the target. Instead, the movement planner aims to reduce movement and energy consumption while keeping the target person within the robot's field of view. The proposed algorithm exploits the topological features of the environment to obtain a set of viewpoint candidates, which is then optimized by solving a cost-based set covering problem. Both the robot and the target person are modeled using a geodesic motion model that considers the shape of the environment. Subsequently, a particle model-based planner, which considers chance constraints on the robot's visibility, is employed to choose an optimal action for the robot. Simulation results using a 3D simulator and experiments in a real environment are provided to show the feasibility and effectiveness of the algorithm.
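The viewpoint-selection step can be illustrated with a weighted set cover solved by the classical greedy heuristic. This is a sketch only: the abstract does not say how the set covering problem is solved, and the viewpoint names, covered regions and costs below are hypothetical.

```python
def greedy_weighted_set_cover(universe, candidates, costs):
    """Pick viewpoints greedily by lowest cost per newly covered region.

    candidates: dict mapping viewpoint name -> set of regions it can observe
    costs:      dict mapping viewpoint name -> positive cost of using it
    """
    uncovered = set(universe)
    chosen = []
    while uncovered:
        best, best_ratio = None, float("inf")
        for name, regions in candidates.items():
            gain = len(regions & uncovered)
            if gain == 0:
                continue  # this viewpoint adds nothing new
            ratio = costs[name] / gain
            if ratio < best_ratio:
                best, best_ratio = name, ratio
        if best is None:
            raise ValueError("some regions cannot be observed from any viewpoint")
        chosen.append(best)
        uncovered -= candidates[best]
    return chosen

# Hypothetical indoor map with six regions and four candidate viewpoints
regions = {1, 2, 3, 4, 5, 6}
viewpoints = {"A": {1, 2, 3}, "B": {3, 4}, "C": {4, 5, 6}, "D": {1, 2, 3, 4, 5, 6}}
costs = {"A": 1.0, "B": 1.0, "C": 1.5, "D": 5.0}
selected = greedy_weighted_set_cover(regions, viewpoints, costs)
```

The greedy rule is an approximation and does not guarantee the minimum-cost cover in general; an exact integer-programming formulation would be the alternative when the candidate set is small.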