This article presents an adaptive optimal control method for a semi-active suspension system.The model of the suspension system is built,in which the components of uncertain parameters and exogenous disturbance are de...This article presents an adaptive optimal control method for a semi-active suspension system.The model of the suspension system is built,in which the components of uncertain parameters and exogenous disturbance are described.The adaptive optimal control law consists of the sum of the optimal control component and the adaptive control component.First,the optimal control law is designed for the model of the suspension system after ignoring the components of uncertain parameters and exogenous disturbance caused by the road surface.The optimal control law expresses the desired dynamic characteristics of the suspension system.Next,the adaptive component is designed with the purpose of compensating for the effects caused by uncertain parameters and exogenous disturbance caused by the road surface;the adaptive component has adaptive parameter rules to estimate uncertain parameters and exogenous disturbance.When exogenous disturbances are eliminated,the system responds with an optimal controller designed.By separating theoretically the dynamic of a semi-active suspension system,this solution allows the design of two separate controllers easily and has reduced the computational burden and the use of too many tools,thus allowing for more convenient hardware implementation.The simulation results also show the effectiveness of damping oscillations of the proposed solution in this article.展开更多
The controlled objects are uncertain stable,but the optimal control systems which are constituted by the controlled objects under certain conditions are certain stable. This paper analyses stability of optimal control...The controlled objects are uncertain stable,but the optimal control systems which are constituted by the controlled objects under certain conditions are certain stable. This paper analyses stability of optimal control systems which have quadric performance index via Liapunov method.展开更多
To guarantee bus priority with a minimum impact on car traffic at intersections, an optimal control system of the intermittent bus-only approach (IBA) was proposed. The problems of the existing system are first solv...To guarantee bus priority with a minimum impact on car traffic at intersections, an optimal control system of the intermittent bus-only approach (IBA) was proposed. The problems of the existing system are first solved through optimization: the judgment time of the IBA system was advanced to allow a bus to jump car queues if the bus was detected to arrive at the intersection, and the instant that the IBA lane became available to cars was controlled dynamically to increase the capacity of the IBA lane. The total car delay in one cycle was then analyzed quantitatively when implementing the optimal control system. The results show that in comparison with the existing system of the IBA, the car delay is greatly reduced and the probability of a car stopping twice is low after optimizing the IBA system.展开更多
Trajectory tracking for nonlinear robotic systems remains a fundamental yet challenging problem in control engineering,particularly when both precision and efficiency must be ensured.Conventional control methods are o...Trajectory tracking for nonlinear robotic systems remains a fundamental yet challenging problem in control engineering,particularly when both precision and efficiency must be ensured.Conventional control methods are often effective for stabilization but may not directly optimize long-term performance.To address this limitation,this study develops an integrated framework that combines optimal control principles with reinforcement learning for a single-link robotic manipulator.The proposed scheme adopts an actor–critic structure,where the critic network approximates the value function associated with the Hamilton–Jacobi–Bellman equation,and the actor network generates near-optimal control signals in real time.This dual adaptation enables the controller to refine its policy online without explicit system knowledge.Stability of the closed-loop system is analyzed through Lyapunov theory,ensuring boundedness of the tracking error.Numerical simulations on the single-link manipulator demonstrate that themethod achieves accurate trajectory followingwhile maintaining lowcontrol effort.The results further showthat the actor–critic learning mechanism accelerates convergence of the control policy compared with conventional optimization-based strategies.This work highlights the potential of reinforcement learning integrated with optimal control for robotic manipulators and provides a foundation for future extensions to more complex multi-degree-of-freedom systems.The proposed controller is further validated in a physics-based virtual Gazebo environment,demonstrating stable adaptation and real-time feasibility.展开更多
In this paper,we consider the maximal positive definite solution of the nonlinear matrix equation.By using the idea of Algorithm 2.1 in ZHANG(2013),a new inversion-free method with a stepsize parameter is proposed to ...In this paper,we consider the maximal positive definite solution of the nonlinear matrix equation.By using the idea of Algorithm 2.1 in ZHANG(2013),a new inversion-free method with a stepsize parameter is proposed to obtain the maximal positive definite solution of nonlinear matrix equation X+A^(*)X|^(-α)A=Q with the case 0<α≤1.Based on this method,a new iterative algorithm is developed,and its convergence proof is given.Finally,two numerical examples are provided to show the effectiveness of the proposed method.展开更多
This paper investigates the edge-based dynamic event-triggered inverse optimal formation control problem for multiple quadrotor unmanned aerial vehicles(QUAVs) with attitude constraints. To improve communication effic...This paper investigates the edge-based dynamic event-triggered inverse optimal formation control problem for multiple quadrotor unmanned aerial vehicles(QUAVs) with attitude constraints. To improve communication efficiency, an edge-based dynamic event-triggered mechanism is developed for the communication channels between neighboring QUAVs. However, this edge-based dynamic event-triggered communication(DETC) may cause discontinuities in the reference signals. To solve this problem, a distributed estimator is designed for each QUAV to obtain the leader's output signals. Considering the safety of QUAV formation flying, this paper designs a function transformation method that constrains the attitudes of the QUAVs to a strictly safe region. Furthermore, an inverse optimal control strategy is proposed based on the backstepping methodology. This scheme not only minimizes the cost function but also avoids the necessity of solving the Hamilton-Jacobi-Bellman equation. Finally, the stability of the QUAV systems is proven using Lyapunov theory, and the effectiveness of the proposed control method is verified through simulation.展开更多
Quantum optimal control(QOC)relies on accurately modeling system dynamics and is often challenged by unknown or inaccessible interactions in real systems.Taking an unknown collective spin system as an example,this wor...Quantum optimal control(QOC)relies on accurately modeling system dynamics and is often challenged by unknown or inaccessible interactions in real systems.Taking an unknown collective spin system as an example,this work introduces a machine-learning-based,data-driven scheme to overcome the challenges encountered,with a trained neural network(NN)assuming the role of a surrogate model that captures the system’s dynamics and subsequently enables QOC to be performed on the NN instead of on the real system.The trained NN surrogate proves effective for practical QOC tasks and is further demonstrated to be adaptable to different experimental conditions,remaining robust across varying system sizes and pulse durations.展开更多
The electromagnetic levitation system(EMLS)serves as the most important part of any magnetic levitation system.However,its characteristics are defined by its highly nonlinear dynamics and instability.Furthermore,the u...The electromagnetic levitation system(EMLS)serves as the most important part of any magnetic levitation system.However,its characteristics are defined by its highly nonlinear dynamics and instability.Furthermore,the uncertainties in the dynamics of an electromagnetic levitation system make the controller design more difficult.Therefore,it is necessary to design a robust control law that will ensure the system’s stability in the presence of these uncertainties.In this framework,the dynamics of an electromagnetic levitation system are addressed in terms of matched and unmatched uncertainties.The robust control problem is translated into the optimal control problem,where the uncertainties of the electromagnetic levitation system are directly reflected in the cost function.The optimal control method is used to solve the robust control problem.The solution to the optimal control problem for the electromagnetic levitation system is indeed a solution to the robust control problem of the electromagnetic levitation system under matched and unmatched uncertainties.The simulation and experimental results demonstrate the performance of the designed control scheme.The performance indices such as integral absolute error(IAE),integral square error(ISE),integral time absolute error(ITAE),and integral time square error(ITSE)are compared for both uncertainties to showcase the robustness of the designed control scheme.展开更多
This paper investigates the problem of fuzzy adaptive finite-time inverse optimal control for active suspension systems(ASSs).The fuzzy logic systems(FLSs)are utilized to learn the unknown non-linear dynamics and an a...This paper investigates the problem of fuzzy adaptive finite-time inverse optimal control for active suspension systems(ASSs).The fuzzy logic systems(FLSs)are utilized to learn the unknown non-linear dynamics and an auxiliary system is established.Based on the finite-time stability theory and inverse optimal theory,a fuzzy adaptive inverse finite-time inverse optimal control method is proposed.It is proven that the formulated control approach guarantees the stability of the controlled systems,while ensuring that errors converge to a small neighborhood of zero within finite time.Moreover,the optimized control performance can be achieved.Eventually,the simulation results demonstrate the effectiveness of the proposed fuzzy adaptive finite-time inverse optimal control scheme.展开更多
Inverse reinforcement learning optimal control is under the framework of learner-expert.The learner system can imitate the expert system's demonstrated behaviors and does not require the predefined cost function,s...Inverse reinforcement learning optimal control is under the framework of learner-expert.The learner system can imitate the expert system's demonstrated behaviors and does not require the predefined cost function,so it can handle optimal control problems effectively.This paper proposes an inverse reinforcement learning optimal control method for Takagi-Sugeno(T-S)fuzzy systems.Based on learner systems,an expert system is constructed,where the learner system only knows the expert system's optimal control policy.To reconstruct the unknown cost function,we firstly develop a model-based inverse reinforcement learning algorithm for the case that systems dynamics are known.The developed model-based learning algorithm is consists of two learning stages:an inner reinforcement learning loop and an outer inverse optimal control loop.The inner loop desires to obtain optimal control policy via learner's cost function and the outer loop aims to update learner's state-penalty matrices via only using expert's optimal control policy.Then,to eliminate the requirement that the system dynamics must be known,a data-driven integral learning algorithm is presented.It is proved that the presented two algorithms are convergent and the developed inverse reinforcement learning optimal control scheme can ensure the controlled fuzzy learner systems to be asymptotically stable.Finally,we apply the proposed fuzzy optimal control to the truck-trailer system,and the computer simulation results verify the effectiveness of the presented approach.展开更多
Optimal impulse control and impulse games provide the cutting-edge frameworks for modeling systems where control actions occur at discrete time points,and optimizing objectives under discontinuous interventions.This r...Optimal impulse control and impulse games provide the cutting-edge frameworks for modeling systems where control actions occur at discrete time points,and optimizing objectives under discontinuous interventions.This review synthesizes the theoretical advancements,computational approaches,emerging challenges,and possible research directions in the field.Firstly,we briefly review the fundamental theory of continuous-time optimal control,including Pontryagin's maximum principle(PMP)and dynamic programming principle(DPP).Secondly,we present the foundational results in optimal impulse control,including necessary conditions and sufficient conditions.Thirdly,we systematize impulse game methodologies,from Nash equilibrium existence theory to the connection between Nash equilibrium and systems stability.Fourthly,we summarize the numerical algorithms including the intelligent computation approaches.Finally,we examine the new trends and challenges in theory and applications as well as computational considerations.展开更多
This paper highlights the utilization of parallel control and adaptive dynamic programming(ADP) for event-triggered robust parallel optimal consensus control(ETRPOC) of uncertain nonlinear continuous-time multiagent s...This paper highlights the utilization of parallel control and adaptive dynamic programming(ADP) for event-triggered robust parallel optimal consensus control(ETRPOC) of uncertain nonlinear continuous-time multiagent systems(MASs).First, the parallel control system, which consists of a virtual control variable and a specific auxiliary variable obtained from the coupled Hamiltonian, allows general systems to be transformed into affine systems. Of interest is the fact that the parallel control technique's introduction provides an unprecedented perspective on eliminating the negative effects of disturbance. Then, an eventtriggered mechanism is adopted to save communication resources while ensuring the system's stability. The coupled HamiltonJacobi(HJ) equation's solution is approximated using a critic neural network(NN), whose weights are updated in response to events. Furthermore, theoretical analysis reveals that the weight estimation error is uniformly ultimately bounded(UUB). Finally,numerical simulations demonstrate the effectiveness of the developed ETRPOC method.展开更多
Dear Editor,The attacker is always going to intrude covertly networked control systems(NCSs)by dynamically changing false data injection attacks(FDIAs)strategy,while the defender try their best to resist attacks by de...Dear Editor,The attacker is always going to intrude covertly networked control systems(NCSs)by dynamically changing false data injection attacks(FDIAs)strategy,while the defender try their best to resist attacks by designing defense strategy on the basis of identifying attack strategy,maintaining stable operation of NCSs.To solve this attack-defense game problem,this letter investigates optimal secure control of NCSs under FDIAs.First,for the alterations of energy caused by false data,a novel attack-defense game model is constructed,which considers the changes of energy caused by the actions of the defender and attacker in the forward and feedback channels.展开更多
In order to solve the multiple power extreme value point problem caused by system frequency splitting during wireless energy transmission at short distances a transmission model of the system is established.With the c...In order to solve the multiple power extreme value point problem caused by system frequency splitting during wireless energy transmission at short distances a transmission model of the system is established.With the comprehensive consideration of the resonance frequency load parameters and the coupling between coils the internal factors of frequency splitting and boundary conditions are discussed.The results show that under the condition of the fixed load the higher the natural resonance frequency the easier the frequency splitting. As the frequency splitting occurs the frequency of the maximum power transfer is no longer with the natural resonance frequency which can make the system unstable and the transfer power more difficult to control. Therefore a decreasing-frequency method is proposed to avoid the system frequency splitting. And decreasing the system resonance frequency can make the system successfully withdraw the frequency splitting area at a short-distance range.Under the fixed load condition the transmission power of the system can be increased by 400% and the transmission efficiency is reduced by only 14% which greatly improves the transmission performance of the system.展开更多
Linear quadratic regulator(LQR) and proportional-integral-derivative(PID) control methods, which are generally used for control of linear dynamical systems, are used in this paper to control the nonlinear dynamical sy...Linear quadratic regulator(LQR) and proportional-integral-derivative(PID) control methods, which are generally used for control of linear dynamical systems, are used in this paper to control the nonlinear dynamical system. LQR is one of the optimal control techniques, which takes into account the states of the dynamical system and control input to make the optimal control decisions.The nonlinear system states are fed to LQR which is designed using a linear state-space model. This is simple as well as robust. The inverted pendulum, a highly nonlinear unstable system, is used as a benchmark for implementing the control methods. Here the control objective is to control the system such that the cart reaches a desired position and the inverted pendulum stabilizes in the upright position. In this paper, the modeling and simulation for optimal control design of nonlinear inverted pendulum-cart dynamic system using PID controller and LQR have been presented for both cases of without and with disturbance input. The Matlab-Simulink models have been developed for simulation and performance analysis of the control schemes. The simulation results justify the comparative advantage of LQR control method.展开更多
This article is concerned with a class of control systems with Markovian switching, in which an It5 formula for Markov-modulated processes is derived. Moreover, an optimal control law satisfying the generalized Hamilt...This article is concerned with a class of control systems with Markovian switching, in which an It5 formula for Markov-modulated processes is derived. Moreover, an optimal control law satisfying the generalized Hamilton-Jacobi-Bellman (HJB) equation with Markovian switching is characterized. Then, through the generalized HJB equation, we study an optimal consumption and portfolio problem with the financial markets of Markovian switching and inflation. Thus, we deduce the optimal policies and show that a modified Mutual Fund Theorem consisting of three funds holds. Finally, for the CRRA utility function, we explicitly give the optimal consumption and portfolio policies. Numerical examples are included to illustrate the obtained results.展开更多
In order to solve the aero-propulsion system acceleration optimal problem,the necessity of inlet control is discussed,and a fully new aero-propulsion system acceleration process control design including the inlet,engi...In order to solve the aero-propulsion system acceleration optimal problem,the necessity of inlet control is discussed,and a fully new aero-propulsion system acceleration process control design including the inlet,engine,and nozzle is proposed in this paper.In the proposed propulsion system control scheme,the inlet,engine,and nozzle are simultaneously adjusted through the FSQP method.In order to implement the control scheme design,an aero-propulsion system componentlevel model is built to simulate the inlet working performance and the matching problems between the inlet and engine.Meanwhile,a stabilizing inlet control scheme is designed to solve the inlet control problems.In optimal control of the aero-propulsion system acceleration process,the inlet is an emphasized control unit in the optimal acceleration control system.Two inlet control patterns are discussed in the simulation.The simulation results prove that by taking the inlet ramp angle as an active control variable instead of being modulated passively,acceleration performance could be obviously enhanced.Acceleration objectives could be obtained with a faster acceleration time by5%.展开更多
The linear systems affected by additive external sinusoidal disturbances is studied. The problem is to damp this forced oscillation in an optimal fashion. The main result of this paper is a new design approach is prop...The linear systems affected by additive external sinusoidal disturbances is studied. The problem is to damp this forced oscillation in an optimal fashion. The main result of this paper is a new design approach is proposed of realizable feedforward and feedback optimal control law for a linear time invariant system with sinusoidal disturbances. The algorithm of solving the optimal control law is given. It is shown that the control law is easily realized and is robust with respect to errors produced by the external sinusoidal disturbances through simulation results.展开更多
A control algorithm for improving vehicle handling was proposed by applying right angle to the steering wheel,based on the nonlinear adaptive optimal control(NAOC).A nonlinear 4-DOF model was initially developed,then ...A control algorithm for improving vehicle handling was proposed by applying right angle to the steering wheel,based on the nonlinear adaptive optimal control(NAOC).A nonlinear 4-DOF model was initially developed,then it was simplified to a 2-DOF model with reasonable assumptions to design observer and optimal controllers.Then a simplified model was developed for steering system.The numerical simulations were carried out using vehicle parameters for standard maneuvers in dry and wet road conditions.Moreover,the hardware in the loop method was implemented to prove the controller ability in realistic conditions.Simulation results obviously show the effectiveness of NAOC on vehicle handling and reveal that the proposed controller can significantly improve vehicle handling during severe maneuvers.展开更多
基金supported in part by the Thai Nguyen University of Technology,Vietnam.
文摘This article presents an adaptive optimal control method for a semi-active suspension system.The model of the suspension system is built,in which the components of uncertain parameters and exogenous disturbance are described.The adaptive optimal control law consists of the sum of the optimal control component and the adaptive control component.First,the optimal control law is designed for the model of the suspension system after ignoring the components of uncertain parameters and exogenous disturbance caused by the road surface.The optimal control law expresses the desired dynamic characteristics of the suspension system.Next,the adaptive component is designed with the purpose of compensating for the effects caused by uncertain parameters and exogenous disturbance caused by the road surface;the adaptive component has adaptive parameter rules to estimate uncertain parameters and exogenous disturbance.When exogenous disturbances are eliminated,the system responds with an optimal controller designed.By separating theoretically the dynamic of a semi-active suspension system,this solution allows the design of two separate controllers easily and has reduced the computational burden and the use of too many tools,thus allowing for more convenient hardware implementation.The simulation results also show the effectiveness of damping oscillations of the proposed solution in this article.
文摘The controlled objects are uncertain stable,but the optimal control systems which are constituted by the controlled objects under certain conditions are certain stable. This paper analyses stability of optimal control systems which have quadric performance index via Liapunov method.
文摘To guarantee bus priority with a minimum impact on car traffic at intersections, an optimal control system of the intermittent bus-only approach (IBA) was proposed. The problems of the existing system are first solved through optimization: the judgment time of the IBA system was advanced to allow a bus to jump car queues if the bus was detected to arrive at the intersection, and the instant that the IBA lane became available to cars was controlled dynamically to increase the capacity of the IBA lane. The total car delay in one cycle was then analyzed quantitatively when implementing the optimal control system. The results show that in comparison with the existing system of the IBA, the car delay is greatly reduced and the probability of a car stopping twice is low after optimizing the IBA system.
基金supported in part by the National Science and Technology Council under Grant NSTC 114-2221-E-027-104.
文摘Trajectory tracking for nonlinear robotic systems remains a fundamental yet challenging problem in control engineering,particularly when both precision and efficiency must be ensured.Conventional control methods are often effective for stabilization but may not directly optimize long-term performance.To address this limitation,this study develops an integrated framework that combines optimal control principles with reinforcement learning for a single-link robotic manipulator.The proposed scheme adopts an actor–critic structure,where the critic network approximates the value function associated with the Hamilton–Jacobi–Bellman equation,and the actor network generates near-optimal control signals in real time.This dual adaptation enables the controller to refine its policy online without explicit system knowledge.Stability of the closed-loop system is analyzed through Lyapunov theory,ensuring boundedness of the tracking error.Numerical simulations on the single-link manipulator demonstrate that themethod achieves accurate trajectory followingwhile maintaining lowcontrol effort.The results further showthat the actor–critic learning mechanism accelerates convergence of the control policy compared with conventional optimization-based strategies.This work highlights the potential of reinforcement learning integrated with optimal control for robotic manipulators and provides a foundation for future extensions to more complex multi-degree-of-freedom systems.The proposed controller is further validated in a physics-based virtual Gazebo environment,demonstrating stable adaptation and real-time feasibility.
基金Supported in part by Natural Science Foundation of Guangxi(2023GXNSFAA026246)in part by the Central Government's Guide to Local Science and Technology Development Fund(GuikeZY23055044)in part by the National Natural Science Foundation of China(62363003)。
文摘In this paper,we consider the maximal positive definite solution of the nonlinear matrix equation.By using the idea of Algorithm 2.1 in ZHANG(2013),a new inversion-free method with a stepsize parameter is proposed to obtain the maximal positive definite solution of nonlinear matrix equation X+A^(*)X|^(-α)A=Q with the case 0<α≤1.Based on this method,a new iterative algorithm is developed,and its convergence proof is given.Finally,two numerical examples are provided to show the effectiveness of the proposed method.
基金supported by the National Natural Science Foundation of China (Grant Nos.62573134,62473100,62433018)the Guangdong Basic and Applied Basic Research Foundation(Grant Nos.2025A1515060017,2025A1515011436,2025B1515020065,2025A1515011789)the Guangzhou Basic and Applied Basic Research Project (Grant No.2025A04J3534)。
文摘This paper investigates the edge-based dynamic event-triggered inverse optimal formation control problem for multiple quadrotor unmanned aerial vehicles(QUAVs) with attitude constraints. To improve communication efficiency, an edge-based dynamic event-triggered mechanism is developed for the communication channels between neighboring QUAVs. However, this edge-based dynamic event-triggered communication(DETC) may cause discontinuities in the reference signals. To solve this problem, a distributed estimator is designed for each QUAV to obtain the leader's output signals. Considering the safety of QUAV formation flying, this paper designs a function transformation method that constrains the attitudes of the QUAVs to a strictly safe region. Furthermore, an inverse optimal control strategy is proposed based on the backstepping methodology. This scheme not only minimizes the cost function but also avoids the necessity of solving the Hamilton-Jacobi-Bellman equation. Finally, the stability of the QUAV systems is proven using Lyapunov theory, and the effectiveness of the proposed control method is verified through simulation.
基金supported by the Innovation Program for Quantum Science and Technology(Grant No.2021ZD0302100)the National Natural Science Foundation of China(Grant Nos.12361131576,92265205,and 92476205).
文摘Quantum optimal control(QOC)relies on accurately modeling system dynamics and is often challenged by unknown or inaccessible interactions in real systems.Taking an unknown collective spin system as an example,this work introduces a machine-learning-based,data-driven scheme to overcome the challenges encountered,with a trained neural network(NN)assuming the role of a surrogate model that captures the system’s dynamics and subsequently enables QOC to be performed on the NN instead of on the real system.The trained NN surrogate proves effective for practical QOC tasks and is further demonstrated to be adaptable to different experimental conditions,remaining robust across varying system sizes and pulse durations.
文摘The electromagnetic levitation system(EMLS)serves as the most important part of any magnetic levitation system.However,its characteristics are defined by its highly nonlinear dynamics and instability.Furthermore,the uncertainties in the dynamics of an electromagnetic levitation system make the controller design more difficult.Therefore,it is necessary to design a robust control law that will ensure the system’s stability in the presence of these uncertainties.In this framework,the dynamics of an electromagnetic levitation system are addressed in terms of matched and unmatched uncertainties.The robust control problem is translated into the optimal control problem,where the uncertainties of the electromagnetic levitation system are directly reflected in the cost function.The optimal control method is used to solve the robust control problem.The solution to the optimal control problem for the electromagnetic levitation system is indeed a solution to the robust control problem of the electromagnetic levitation system under matched and unmatched uncertainties.The simulation and experimental results demonstrate the performance of the designed control scheme.The performance indices such as integral absolute error(IAE),integral square error(ISE),integral time absolute error(ITAE),and integral time square error(ITSE)are compared for both uncertainties to showcase the robustness of the designed control scheme.
基金supported by the National Natural Science Foundation of China under 62173172。
文摘This paper investigates the problem of fuzzy adaptive finite-time inverse optimal control for active suspension systems(ASSs).The fuzzy logic systems(FLSs)are utilized to learn the unknown non-linear dynamics and an auxiliary system is established.Based on the finite-time stability theory and inverse optimal theory,a fuzzy adaptive inverse finite-time inverse optimal control method is proposed.It is proven that the formulated control approach guarantees the stability of the controlled systems,while ensuring that errors converge to a small neighborhood of zero within finite time.Moreover,the optimized control performance can be achieved.Eventually,the simulation results demonstrate the effectiveness of the proposed fuzzy adaptive finite-time inverse optimal control scheme.
基金The National Natural Science Foundation of China(62173172).
文摘Inverse reinforcement learning optimal control is under the framework of learner-expert.The learner system can imitate the expert system's demonstrated behaviors and does not require the predefined cost function,so it can handle optimal control problems effectively.This paper proposes an inverse reinforcement learning optimal control method for Takagi-Sugeno(T-S)fuzzy systems.Based on learner systems,an expert system is constructed,where the learner system only knows the expert system's optimal control policy.To reconstruct the unknown cost function,we firstly develop a model-based inverse reinforcement learning algorithm for the case that systems dynamics are known.The developed model-based learning algorithm is consists of two learning stages:an inner reinforcement learning loop and an outer inverse optimal control loop.The inner loop desires to obtain optimal control policy via learner's cost function and the outer loop aims to update learner's state-penalty matrices via only using expert's optimal control policy.Then,to eliminate the requirement that the system dynamics must be known,a data-driven integral learning algorithm is presented.It is proved that the presented two algorithms are convergent and the developed inverse reinforcement learning optimal control scheme can ensure the controlled fuzzy learner systems to be asymptotically stable.Finally,we apply the proposed fuzzy optimal control to the truck-trailer system,and the computer simulation results verify the effectiveness of the presented approach.
文摘Optimal impulse control and impulse games provide the cutting-edge frameworks for modeling systems where control actions occur at discrete time points,and optimizing objectives under discontinuous interventions.This review synthesizes the theoretical advancements,computational approaches,emerging challenges,and possible research directions in the field.Firstly,we briefly review the fundamental theory of continuous-time optimal control,including Pontryagin's maximum principle(PMP)and dynamic programming principle(DPP).Secondly,we present the foundational results in optimal impulse control,including necessary conditions and sufficient conditions.Thirdly,we systematize impulse game methodologies,from Nash equilibrium existence theory to the connection between Nash equilibrium and systems stability.Fourthly,we summarize the numerical algorithms including the intelligent computation approaches.Finally,we examine the new trends and challenges in theory and applications as well as computational considerations.
基金supported in part by the National Key Research and Development Program of China(2021YFE0206100)the National Natural Science Foundation of China(62425310,62073321)+2 种基金the National Defense Basic Scientific Research Program(JCKY2019203C029,JCKY2020130C025)the Science and Technology Development FundMacao SAR(FDCT-22-009-MISE,0060/2021/A2,0015/2020/AMJ)
文摘This paper highlights the utilization of parallel control and adaptive dynamic programming(ADP) for event-triggered robust parallel optimal consensus control(ETRPOC) of uncertain nonlinear continuous-time multiagent systems(MASs).First, the parallel control system, which consists of a virtual control variable and a specific auxiliary variable obtained from the coupled Hamiltonian, allows general systems to be transformed into affine systems. Of interest is the fact that the parallel control technique's introduction provides an unprecedented perspective on eliminating the negative effects of disturbance. Then, an eventtriggered mechanism is adopted to save communication resources while ensuring the system's stability. The coupled HamiltonJacobi(HJ) equation's solution is approximated using a critic neural network(NN), whose weights are updated in response to events. Furthermore, theoretical analysis reveals that the weight estimation error is uniformly ultimately bounded(UUB). Finally,numerical simulations demonstrate the effectiveness of the developed ETRPOC method.
基金supported in part by the National Science Foundation of China(62373240,62273224,U24A20259).
文摘Dear Editor,The attacker is always going to intrude covertly networked control systems(NCSs)by dynamically changing false data injection attacks(FDIAs)strategy,while the defender try their best to resist attacks by designing defense strategy on the basis of identifying attack strategy,maintaining stable operation of NCSs.To solve this attack-defense game problem,this letter investigates optimal secure control of NCSs under FDIAs.First,for the alterations of energy caused by false data,a novel attack-defense game model is constructed,which considers the changes of energy caused by the actions of the defender and attacker in the forward and feedback channels.
基金Scholarship Award for Excellent Doctoral Student granted by Ministry of Education of Chinathe Scientific Innovation Research of College Graduates in Jiangsu Province(No.CXZZ11-0150)+1 种基金the National Natural Science Foundation of China(No.51177011)the National High Technology Research and Development Program of China(863 Program)(No.2012AA050210)
文摘In order to solve the multiple power extreme value point problem caused by system frequency splitting during wireless energy transmission at short distances a transmission model of the system is established.With the comprehensive consideration of the resonance frequency load parameters and the coupling between coils the internal factors of frequency splitting and boundary conditions are discussed.The results show that under the condition of the fixed load the higher the natural resonance frequency the easier the frequency splitting. As the frequency splitting occurs the frequency of the maximum power transfer is no longer with the natural resonance frequency which can make the system unstable and the transfer power more difficult to control. Therefore a decreasing-frequency method is proposed to avoid the system frequency splitting. And decreasing the system resonance frequency can make the system successfully withdraw the frequency splitting area at a short-distance range.Under the fixed load condition the transmission power of the system can be increased by 400% and the transmission efficiency is reduced by only 14% which greatly improves the transmission performance of the system.
基金supported by the Natural Sciences and Engineering Research Council of Canada(N00892)in part by National Natural Science Foundation of China(51405436,51375452,61573174)
文摘Linear quadratic regulator(LQR) and proportional-integral-derivative(PID) control methods, which are generally used for control of linear dynamical systems, are used in this paper to control the nonlinear dynamical system. LQR is one of the optimal control techniques, which takes into account the states of the dynamical system and control input to make the optimal control decisions.The nonlinear system states are fed to LQR which is designed using a linear state-space model. This is simple as well as robust. The inverted pendulum, a highly nonlinear unstable system, is used as a benchmark for implementing the control methods. Here the control objective is to control the system such that the cart reaches a desired position and the inverted pendulum stabilizes in the upright position. In this paper, the modeling and simulation for optimal control design of nonlinear inverted pendulum-cart dynamic system using PID controller and LQR have been presented for both cases of without and with disturbance input. The Matlab-Simulink models have been developed for simulation and performance analysis of the control schemes. The simulation results justify the comparative advantage of LQR control method.
基金supported by National Natural Science Foundation of China(71171003)Anhui Natural Science Foundation(10040606003)Anhui Natural Science Foundation of Universities(KJ2012B019,KJ2013B023)
文摘This article is concerned with a class of control systems with Markovian switching, in which an It5 formula for Markov-modulated processes is derived. Moreover, an optimal control law satisfying the generalized Hamilton-Jacobi-Bellman (HJB) equation with Markovian switching is characterized. Then, through the generalized HJB equation, we study an optimal consumption and portfolio problem with the financial markets of Markovian switching and inflation. Thus, we deduce the optimal policies and show that a modified Mutual Fund Theorem consisting of three funds holds. Finally, for the CRRA utility function, we explicitly give the optimal consumption and portfolio policies. Numerical examples are included to illustrate the obtained results.
基金co-supported by the Fundamental Research Funds for the Central Universities(No:NZ2016103)the National Natural Science Foundation of China(No: 51576096)
文摘In order to solve the aero-propulsion system acceleration optimal problem,the necessity of inlet control is discussed,and a fully new aero-propulsion system acceleration process control design including the inlet,engine,and nozzle is proposed in this paper.In the proposed propulsion system control scheme,the inlet,engine,and nozzle are simultaneously adjusted through the FSQP method.In order to implement the control scheme design,an aero-propulsion system componentlevel model is built to simulate the inlet working performance and the matching problems between the inlet and engine.Meanwhile,a stabilizing inlet control scheme is designed to solve the inlet control problems.In optimal control of the aero-propulsion system acceleration process,the inlet is an emphasized control unit in the optimal acceleration control system.Two inlet control patterns are discussed in the simulation.The simulation results prove that by taking the inlet ramp angle as an active control variable instead of being modulated passively,acceleration performance could be obviously enhanced.Acceleration objectives could be obtained with a faster acceleration time by5%.
文摘The linear systems affected by additive external sinusoidal disturbances is studied. The problem is to damp this forced oscillation in an optimal fashion. The main result of this paper is a new design approach is proposed of realizable feedforward and feedback optimal control law for a linear time invariant system with sinusoidal disturbances. The algorithm of solving the optimal control law is given. It is shown that the control law is easily realized and is robust with respect to errors produced by the external sinusoidal disturbances through simulation results.
文摘A control algorithm for improving vehicle handling was proposed by applying right angle to the steering wheel,based on the nonlinear adaptive optimal control(NAOC).A nonlinear 4-DOF model was initially developed,then it was simplified to a 2-DOF model with reasonable assumptions to design observer and optimal controllers.Then a simplified model was developed for steering system.The numerical simulations were carried out using vehicle parameters for standard maneuvers in dry and wet road conditions.Moreover,the hardware in the loop method was implemented to prove the controller ability in realistic conditions.Simulation results obviously show the effectiveness of NAOC on vehicle handling and reveal that the proposed controller can significantly improve vehicle handling during severe maneuvers.