This article presents an adaptive optimal control method for a semi-active suspension system.The model of the suspension system is built,in which the components of uncertain parameters and exogenous disturbance are de...This article presents an adaptive optimal control method for a semi-active suspension system.The model of the suspension system is built,in which the components of uncertain parameters and exogenous disturbance are described.The adaptive optimal control law consists of the sum of the optimal control component and the adaptive control component.First,the optimal control law is designed for the model of the suspension system after ignoring the components of uncertain parameters and exogenous disturbance caused by the road surface.The optimal control law expresses the desired dynamic characteristics of the suspension system.Next,the adaptive component is designed with the purpose of compensating for the effects caused by uncertain parameters and exogenous disturbance caused by the road surface;the adaptive component has adaptive parameter rules to estimate uncertain parameters and exogenous disturbance.When exogenous disturbances are eliminated,the system responds with an optimal controller designed.By separating theoretically the dynamic of a semi-active suspension system,this solution allows the design of two separate controllers easily and has reduced the computational burden and the use of too many tools,thus allowing for more convenient hardware implementation.The simulation results also show the effectiveness of damping oscillations of the proposed solution in this article.展开更多
The controlled objects are uncertain stable,but the optimal control systems which are constituted by the controlled objects under certain conditions are certain stable. This paper analyses stability of optimal control...The controlled objects are uncertain stable,but the optimal control systems which are constituted by the controlled objects under certain conditions are certain stable. This paper analyses stability of optimal control systems which have quadric performance index via Liapunov method.展开更多
To guarantee bus priority with a minimum impact on car traffic at intersections, an optimal control system of the intermittent bus-only approach (IBA) was proposed. The problems of the existing system are first solv...To guarantee bus priority with a minimum impact on car traffic at intersections, an optimal control system of the intermittent bus-only approach (IBA) was proposed. The problems of the existing system are first solved through optimization: the judgment time of the IBA system was advanced to allow a bus to jump car queues if the bus was detected to arrive at the intersection, and the instant that the IBA lane became available to cars was controlled dynamically to increase the capacity of the IBA lane. The total car delay in one cycle was then analyzed quantitatively when implementing the optimal control system. The results show that in comparison with the existing system of the IBA, the car delay is greatly reduced and the probability of a car stopping twice is low after optimizing the IBA system.展开更多
Trajectory tracking for nonlinear robotic systems remains a fundamental yet challenging problem in control engineering,particularly when both precision and efficiency must be ensured.Conventional control methods are o...Trajectory tracking for nonlinear robotic systems remains a fundamental yet challenging problem in control engineering,particularly when both precision and efficiency must be ensured.Conventional control methods are often effective for stabilization but may not directly optimize long-term performance.To address this limitation,this study develops an integrated framework that combines optimal control principles with reinforcement learning for a single-link robotic manipulator.The proposed scheme adopts an actor–critic structure,where the critic network approximates the value function associated with the Hamilton–Jacobi–Bellman equation,and the actor network generates near-optimal control signals in real time.This dual adaptation enables the controller to refine its policy online without explicit system knowledge.Stability of the closed-loop system is analyzed through Lyapunov theory,ensuring boundedness of the tracking error.Numerical simulations on the single-link manipulator demonstrate that themethod achieves accurate trajectory followingwhile maintaining lowcontrol effort.The results further showthat the actor–critic learning mechanism accelerates convergence of the control policy compared with conventional optimization-based strategies.This work highlights the potential of reinforcement learning integrated with optimal control for robotic manipulators and provides a foundation for future extensions to more complex multi-degree-of-freedom systems.The proposed controller is further validated in a physics-based virtual Gazebo environment,demonstrating stable adaptation and real-time feasibility.展开更多
Dear Editor,In this letter,we focus on the algebraic relationship between the coefficient matrices and the solution of the stochastic algebraic Riccati equation.It is revealed that,if the coefficient matrices are in a...Dear Editor,In this letter,we focus on the algebraic relationship between the coefficient matrices and the solution of the stochastic algebraic Riccati equation.It is revealed that,if the coefficient matrices are in an algebra,then the solution(and also the control gain in many cases)is also in the same algebra.The main result is verified by a numerical simulation.展开更多
In this paper,we consider the maximal positive definite solution of the nonlinear matrix equation.By using the idea of Algorithm 2.1 in ZHANG(2013),a new inversion-free method with a stepsize parameter is proposed to ...In this paper,we consider the maximal positive definite solution of the nonlinear matrix equation.By using the idea of Algorithm 2.1 in ZHANG(2013),a new inversion-free method with a stepsize parameter is proposed to obtain the maximal positive definite solution of nonlinear matrix equation X+A^(*)X|^(-α)A=Q with the case 0<α≤1.Based on this method,a new iterative algorithm is developed,and its convergence proof is given.Finally,two numerical examples are provided to show the effectiveness of the proposed method.展开更多
The augmented evolution equation is established under the framework of the Variation Evolving Method(VEM)that seeks optimal solutions by solving the transformed Initial-Value Problems(IVPs).To improve the numerical pe...The augmented evolution equation is established under the framework of the Variation Evolving Method(VEM)that seeks optimal solutions by solving the transformed Initial-Value Problems(IVPs).To improve the numerical performance,its compact form is developed herein.Through replacing the states and costates variation evolution with that of the controls,the dimension-reduced Evolution Partial Differential Equation(EPDE)only solves the control variables along the variation time to get the optimal solution,and the initial conditions for the definite solution may be arbitrary.With this equation,the scale of the resulting IVPs,obtained via the semi-discrete method,is significantly reduced and they may be solved with common Ordinary Differential Equation(ODE)integration methods conveniently.Meanwhile,the state and the costate dynamics share consistent stability in the numerical computation and this avoids the intrinsic numerical difficulty as in the indirect methods.Numerical examples are solved and it is shown that the compact form evolution equation outperforms the primary form in the precision,and the efficiency may be higher for the dense discretization.Actually,it is uncovered that the compact form of the augmented evolution equation is a continuous realization of the Newton type iteration mechanism.展开更多
Realizing optimal control performance for continuum robots(CRs) poses huge challenges on traditional modelbased optimal control approaches due to their high degrees of freedom,complex nonlinear dynamics and soft conti...Realizing optimal control performance for continuum robots(CRs) poses huge challenges on traditional modelbased optimal control approaches due to their high degrees of freedom,complex nonlinear dynamics and soft continuum morphologies which are difficult to explicitly model.This paper proposes a model-free adaptive optimal control algorithm(ADAPT)for CRs.In our strategy,we consider CRs as a class of nonlinear continuous-time dynamical systems in the state space,wherein the position of the end-effector is considered as the state and the input torque is mapped as the control input.Then,the optimized Hamilton-Jacobi-Bellman(HJB) equation is derived by optimal control principles,and subsequently solved by the proposed ADAPT algorithm without requiring knowledge of the original system dynamics.Under some mild assumptions,the global stability and convergence of the closed-loop control approach are guaranteed.Several simulation experiments are conducted on a magnetic CR(MCR) to demonstrate the practicality and effectiveness of the ADAPT algorithm.展开更多
Dear Editor,This letter proposes a reinforcement learning-based predictive learning algorithm for unknown continuous-time nonlinear systems with observation loss.Firstly,we construct a temporal nonzero-sum game over p...Dear Editor,This letter proposes a reinforcement learning-based predictive learning algorithm for unknown continuous-time nonlinear systems with observation loss.Firstly,we construct a temporal nonzero-sum game over predictive control input sequences,deriving multiple optimal predictive control input sequences from its solution.展开更多
Quantum optimal control(QOC)relies on accurately modeling system dynamics and is often challenged by unknown or inaccessible interactions in real systems.Taking an unknown collective spin system as an example,this wor...Quantum optimal control(QOC)relies on accurately modeling system dynamics and is often challenged by unknown or inaccessible interactions in real systems.Taking an unknown collective spin system as an example,this work introduces a machine-learning-based,data-driven scheme to overcome the challenges encountered,with a trained neural network(NN)assuming the role of a surrogate model that captures the system’s dynamics and subsequently enables QOC to be performed on the NN instead of on the real system.The trained NN surrogate proves effective for practical QOC tasks and is further demonstrated to be adaptable to different experimental conditions,remaining robust across varying system sizes and pulse durations.展开更多
The electromagnetic levitation system(EMLS)serves as the most important part of any magnetic levitation system.However,its characteristics are defined by its highly nonlinear dynamics and instability.Furthermore,the u...The electromagnetic levitation system(EMLS)serves as the most important part of any magnetic levitation system.However,its characteristics are defined by its highly nonlinear dynamics and instability.Furthermore,the uncertainties in the dynamics of an electromagnetic levitation system make the controller design more difficult.Therefore,it is necessary to design a robust control law that will ensure the system’s stability in the presence of these uncertainties.In this framework,the dynamics of an electromagnetic levitation system are addressed in terms of matched and unmatched uncertainties.The robust control problem is translated into the optimal control problem,where the uncertainties of the electromagnetic levitation system are directly reflected in the cost function.The optimal control method is used to solve the robust control problem.The solution to the optimal control problem for the electromagnetic levitation system is indeed a solution to the robust control problem of the electromagnetic levitation system under matched and unmatched uncertainties.The simulation and experimental results demonstrate the performance of the designed control scheme.The performance indices such as integral absolute error(IAE),integral square error(ISE),integral time absolute error(ITAE),and integral time square error(ITSE)are compared for both uncertainties to showcase the robustness of the designed control scheme.展开更多
Inverse reinforcement learning optimal control is under the framework of learner-expert.The learner system can imitate the expert system's demonstrated behaviors and does not require the predefined cost function,s...Inverse reinforcement learning optimal control is under the framework of learner-expert.The learner system can imitate the expert system's demonstrated behaviors and does not require the predefined cost function,so it can handle optimal control problems effectively.This paper proposes an inverse reinforcement learning optimal control method for Takagi-Sugeno(T-S)fuzzy systems.Based on learner systems,an expert system is constructed,where the learner system only knows the expert system's optimal control policy.To reconstruct the unknown cost function,we firstly develop a model-based inverse reinforcement learning algorithm for the case that systems dynamics are known.The developed model-based learning algorithm is consists of two learning stages:an inner reinforcement learning loop and an outer inverse optimal control loop.The inner loop desires to obtain optimal control policy via learner's cost function and the outer loop aims to update learner's state-penalty matrices via only using expert's optimal control policy.Then,to eliminate the requirement that the system dynamics must be known,a data-driven integral learning algorithm is presented.It is proved that the presented two algorithms are convergent and the developed inverse reinforcement learning optimal control scheme can ensure the controlled fuzzy learner systems to be asymptotically stable.Finally,we apply the proposed fuzzy optimal control to the truck-trailer system,and the computer simulation results verify the effectiveness of the presented approach.展开更多
This paper investigates the problem of fuzzy adaptive finite-time inverse optimal control for active suspension systems(ASSs).The fuzzy logic systems(FLSs)are utilized to learn the unknown non-linear dynamics and an a...This paper investigates the problem of fuzzy adaptive finite-time inverse optimal control for active suspension systems(ASSs).The fuzzy logic systems(FLSs)are utilized to learn the unknown non-linear dynamics and an auxiliary system is established.Based on the finite-time stability theory and inverse optimal theory,a fuzzy adaptive inverse finite-time inverse optimal control method is proposed.It is proven that the formulated control approach guarantees the stability of the controlled systems,while ensuring that errors converge to a small neighborhood of zero within finite time.Moreover,the optimized control performance can be achieved.Eventually,the simulation results demonstrate the effectiveness of the proposed fuzzy adaptive finite-time inverse optimal control scheme.展开更多
Optimal impulse control and impulse games provide the cutting-edge frameworks for modeling systems where control actions occur at discrete time points,and optimizing objectives under discontinuous interventions.This r...Optimal impulse control and impulse games provide the cutting-edge frameworks for modeling systems where control actions occur at discrete time points,and optimizing objectives under discontinuous interventions.This review synthesizes the theoretical advancements,computational approaches,emerging challenges,and possible research directions in the field.Firstly,we briefly review the fundamental theory of continuous-time optimal control,including Pontryagin's maximum principle(PMP)and dynamic programming principle(DPP).Secondly,we present the foundational results in optimal impulse control,including necessary conditions and sufficient conditions.Thirdly,we systematize impulse game methodologies,from Nash equilibrium existence theory to the connection between Nash equilibrium and systems stability.Fourthly,we summarize the numerical algorithms including the intelligent computation approaches.Finally,we examine the new trends and challenges in theory and applications as well as computational considerations.展开更多
This paper highlights the utilization of parallel control and adaptive dynamic programming(ADP) for event-triggered robust parallel optimal consensus control(ETRPOC) of uncertain nonlinear continuous-time multiagent s...This paper highlights the utilization of parallel control and adaptive dynamic programming(ADP) for event-triggered robust parallel optimal consensus control(ETRPOC) of uncertain nonlinear continuous-time multiagent systems(MASs).First, the parallel control system, which consists of a virtual control variable and a specific auxiliary variable obtained from the coupled Hamiltonian, allows general systems to be transformed into affine systems. Of interest is the fact that the parallel control technique's introduction provides an unprecedented perspective on eliminating the negative effects of disturbance. Then, an eventtriggered mechanism is adopted to save communication resources while ensuring the system's stability. The coupled HamiltonJacobi(HJ) equation's solution is approximated using a critic neural network(NN), whose weights are updated in response to events. Furthermore, theoretical analysis reveals that the weight estimation error is uniformly ultimately bounded(UUB). Finally,numerical simulations demonstrate the effectiveness of the developed ETRPOC method.展开更多
Dear Editor,The attacker is always going to intrude covertly networked control systems(NCSs)by dynamically changing false data injection attacks(FDIAs)strategy,while the defender try their best to resist attacks by de...Dear Editor,The attacker is always going to intrude covertly networked control systems(NCSs)by dynamically changing false data injection attacks(FDIAs)strategy,while the defender try their best to resist attacks by designing defense strategy on the basis of identifying attack strategy,maintaining stable operation of NCSs.To solve this attack-defense game problem,this letter investigates optimal secure control of NCSs under FDIAs.First,for the alterations of energy caused by false data,a novel attack-defense game model is constructed,which considers the changes of energy caused by the actions of the defender and attacker in the forward and feedback channels.展开更多
In order to solve the multiple power extreme value point problem caused by system frequency splitting during wireless energy transmission at short distances a transmission model of the system is established.With the c...In order to solve the multiple power extreme value point problem caused by system frequency splitting during wireless energy transmission at short distances a transmission model of the system is established.With the comprehensive consideration of the resonance frequency load parameters and the coupling between coils the internal factors of frequency splitting and boundary conditions are discussed.The results show that under the condition of the fixed load the higher the natural resonance frequency the easier the frequency splitting. As the frequency splitting occurs the frequency of the maximum power transfer is no longer with the natural resonance frequency which can make the system unstable and the transfer power more difficult to control. Therefore a decreasing-frequency method is proposed to avoid the system frequency splitting. And decreasing the system resonance frequency can make the system successfully withdraw the frequency splitting area at a short-distance range.Under the fixed load condition the transmission power of the system can be increased by 400% and the transmission efficiency is reduced by only 14% which greatly improves the transmission performance of the system.展开更多
Linear quadratic regulator(LQR) and proportional-integral-derivative(PID) control methods, which are generally used for control of linear dynamical systems, are used in this paper to control the nonlinear dynamical sy...Linear quadratic regulator(LQR) and proportional-integral-derivative(PID) control methods, which are generally used for control of linear dynamical systems, are used in this paper to control the nonlinear dynamical system. LQR is one of the optimal control techniques, which takes into account the states of the dynamical system and control input to make the optimal control decisions.The nonlinear system states are fed to LQR which is designed using a linear state-space model. This is simple as well as robust. The inverted pendulum, a highly nonlinear unstable system, is used as a benchmark for implementing the control methods. Here the control objective is to control the system such that the cart reaches a desired position and the inverted pendulum stabilizes in the upright position. In this paper, the modeling and simulation for optimal control design of nonlinear inverted pendulum-cart dynamic system using PID controller and LQR have been presented for both cases of without and with disturbance input. The Matlab-Simulink models have been developed for simulation and performance analysis of the control schemes. The simulation results justify the comparative advantage of LQR control method.展开更多
This article is concerned with a class of control systems with Markovian switching, in which an It5 formula for Markov-modulated processes is derived. Moreover, an optimal control law satisfying the generalized Hamilt...This article is concerned with a class of control systems with Markovian switching, in which an It5 formula for Markov-modulated processes is derived. Moreover, an optimal control law satisfying the generalized Hamilton-Jacobi-Bellman (HJB) equation with Markovian switching is characterized. Then, through the generalized HJB equation, we study an optimal consumption and portfolio problem with the financial markets of Markovian switching and inflation. Thus, we deduce the optimal policies and show that a modified Mutual Fund Theorem consisting of three funds holds. Finally, for the CRRA utility function, we explicitly give the optimal consumption and portfolio policies. Numerical examples are included to illustrate the obtained results.展开更多
基金supported in part by the Thai Nguyen University of Technology,Vietnam.
文摘This article presents an adaptive optimal control method for a semi-active suspension system.The model of the suspension system is built,in which the components of uncertain parameters and exogenous disturbance are described.The adaptive optimal control law consists of the sum of the optimal control component and the adaptive control component.First,the optimal control law is designed for the model of the suspension system after ignoring the components of uncertain parameters and exogenous disturbance caused by the road surface.The optimal control law expresses the desired dynamic characteristics of the suspension system.Next,the adaptive component is designed with the purpose of compensating for the effects caused by uncertain parameters and exogenous disturbance caused by the road surface;the adaptive component has adaptive parameter rules to estimate uncertain parameters and exogenous disturbance.When exogenous disturbances are eliminated,the system responds with an optimal controller designed.By separating theoretically the dynamic of a semi-active suspension system,this solution allows the design of two separate controllers easily and has reduced the computational burden and the use of too many tools,thus allowing for more convenient hardware implementation.The simulation results also show the effectiveness of damping oscillations of the proposed solution in this article.
文摘The controlled objects are uncertain stable,but the optimal control systems which are constituted by the controlled objects under certain conditions are certain stable. This paper analyses stability of optimal control systems which have quadric performance index via Liapunov method.
文摘To guarantee bus priority with a minimum impact on car traffic at intersections, an optimal control system of the intermittent bus-only approach (IBA) was proposed. The problems of the existing system are first solved through optimization: the judgment time of the IBA system was advanced to allow a bus to jump car queues if the bus was detected to arrive at the intersection, and the instant that the IBA lane became available to cars was controlled dynamically to increase the capacity of the IBA lane. The total car delay in one cycle was then analyzed quantitatively when implementing the optimal control system. The results show that in comparison with the existing system of the IBA, the car delay is greatly reduced and the probability of a car stopping twice is low after optimizing the IBA system.
基金supported in part by the National Science and Technology Council under Grant NSTC 114-2221-E-027-104.
文摘Trajectory tracking for nonlinear robotic systems remains a fundamental yet challenging problem in control engineering,particularly when both precision and efficiency must be ensured.Conventional control methods are often effective for stabilization but may not directly optimize long-term performance.To address this limitation,this study develops an integrated framework that combines optimal control principles with reinforcement learning for a single-link robotic manipulator.The proposed scheme adopts an actor–critic structure,where the critic network approximates the value function associated with the Hamilton–Jacobi–Bellman equation,and the actor network generates near-optimal control signals in real time.This dual adaptation enables the controller to refine its policy online without explicit system knowledge.Stability of the closed-loop system is analyzed through Lyapunov theory,ensuring boundedness of the tracking error.Numerical simulations on the single-link manipulator demonstrate that themethod achieves accurate trajectory followingwhile maintaining lowcontrol effort.The results further showthat the actor–critic learning mechanism accelerates convergence of the control policy compared with conventional optimization-based strategies.This work highlights the potential of reinforcement learning integrated with optimal control for robotic manipulators and provides a foundation for future extensions to more complex multi-degree-of-freedom systems.The proposed controller is further validated in a physics-based virtual Gazebo environment,demonstrating stable adaptation and real-time feasibility.
文摘Dear Editor,In this letter,we focus on the algebraic relationship between the coefficient matrices and the solution of the stochastic algebraic Riccati equation.It is revealed that,if the coefficient matrices are in an algebra,then the solution(and also the control gain in many cases)is also in the same algebra.The main result is verified by a numerical simulation.
基金Supported in part by Natural Science Foundation of Guangxi(2023GXNSFAA026246)in part by the Central Government's Guide to Local Science and Technology Development Fund(GuikeZY23055044)in part by the National Natural Science Foundation of China(62363003)。
文摘In this paper,we consider the maximal positive definite solution of the nonlinear matrix equation.By using the idea of Algorithm 2.1 in ZHANG(2013),a new inversion-free method with a stepsize parameter is proposed to obtain the maximal positive definite solution of nonlinear matrix equation X+A^(*)X|^(-α)A=Q with the case 0<α≤1.Based on this method,a new iterative algorithm is developed,and its convergence proof is given.Finally,two numerical examples are provided to show the effectiveness of the proposed method.
基金supported by the National Nature Science Foundation of China under Grant No.11902332。
文摘The augmented evolution equation is established under the framework of the Variation Evolving Method(VEM)that seeks optimal solutions by solving the transformed Initial-Value Problems(IVPs).To improve the numerical performance,its compact form is developed herein.Through replacing the states and costates variation evolution with that of the controls,the dimension-reduced Evolution Partial Differential Equation(EPDE)only solves the control variables along the variation time to get the optimal solution,and the initial conditions for the definite solution may be arbitrary.With this equation,the scale of the resulting IVPs,obtained via the semi-discrete method,is significantly reduced and they may be solved with common Ordinary Differential Equation(ODE)integration methods conveniently.Meanwhile,the state and the costate dynamics share consistent stability in the numerical computation and this avoids the intrinsic numerical difficulty as in the indirect methods.Numerical examples are solved and it is shown that the compact form evolution equation outperforms the primary form in the precision,and the efficiency may be higher for the dense discretization.Actually,it is uncovered that the compact form of the augmented evolution equation is a continuous realization of the Newton type iteration mechanism.
基金supported in part by the Innovation and Technology Commission of Hong Kong,China(ITS/136/20,ITS/234/21,MHP/096/22,ITS/235/22)Multi-Scale Medical Robotics Center,InnoHK,China(8312051)+1 种基金Research Grants Council(RGC) of Hong Kong,China(CUHK 14217822,CUHK14207823,AoE/E-407/24-N)The Chinese University of Hong Kong(CUHK) Direct Grant。
文摘Realizing optimal control performance for continuum robots(CRs) poses huge challenges on traditional modelbased optimal control approaches due to their high degrees of freedom,complex nonlinear dynamics and soft continuum morphologies which are difficult to explicitly model.This paper proposes a model-free adaptive optimal control algorithm(ADAPT)for CRs.In our strategy,we consider CRs as a class of nonlinear continuous-time dynamical systems in the state space,wherein the position of the end-effector is considered as the state and the input torque is mapped as the control input.Then,the optimized Hamilton-Jacobi-Bellman(HJB) equation is derived by optimal control principles,and subsequently solved by the proposed ADAPT algorithm without requiring knowledge of the original system dynamics.Under some mild assumptions,the global stability and convergence of the closed-loop control approach are guaranteed.Several simulation experiments are conducted on a magnetic CR(MCR) to demonstrate the practicality and effectiveness of the ADAPT algorithm.
基金supported by the National Natural Science Foundation of China(62433014,62373287,62573324,62333005,62273255)in part by the International Exchange Program for Graduate Students of Tongji University(4360143306)+3 种基金in part by the Fundamental Research Funds for Central Universities(22120230311)supported by DeutscheForschungsgemeinschaft(DFG,German Research Foundation)under Germany’s Excellence Strategy(EXC 2075390740016,468094890)support by the Stuttgart Center for Simulation Science(SimTech)the International Max Planck Research School for Intelligent Systems(IMPRS-IS)for supporting Y.Xie。
文摘Dear Editor,This letter proposes a reinforcement learning-based predictive learning algorithm for unknown continuous-time nonlinear systems with observation loss.Firstly,we construct a temporal nonzero-sum game over predictive control input sequences,deriving multiple optimal predictive control input sequences from its solution.
基金supported by the Innovation Program for Quantum Science and Technology(Grant No.2021ZD0302100)the National Natural Science Foundation of China(Grant Nos.12361131576,92265205,and 92476205).
文摘Quantum optimal control(QOC)relies on accurately modeling system dynamics and is often challenged by unknown or inaccessible interactions in real systems.Taking an unknown collective spin system as an example,this work introduces a machine-learning-based,data-driven scheme to overcome the challenges encountered,with a trained neural network(NN)assuming the role of a surrogate model that captures the system’s dynamics and subsequently enables QOC to be performed on the NN instead of on the real system.The trained NN surrogate proves effective for practical QOC tasks and is further demonstrated to be adaptable to different experimental conditions,remaining robust across varying system sizes and pulse durations.
文摘The electromagnetic levitation system(EMLS)serves as the most important part of any magnetic levitation system.However,its characteristics are defined by its highly nonlinear dynamics and instability.Furthermore,the uncertainties in the dynamics of an electromagnetic levitation system make the controller design more difficult.Therefore,it is necessary to design a robust control law that will ensure the system’s stability in the presence of these uncertainties.In this framework,the dynamics of an electromagnetic levitation system are addressed in terms of matched and unmatched uncertainties.The robust control problem is translated into the optimal control problem,where the uncertainties of the electromagnetic levitation system are directly reflected in the cost function.The optimal control method is used to solve the robust control problem.The solution to the optimal control problem for the electromagnetic levitation system is indeed a solution to the robust control problem of the electromagnetic levitation system under matched and unmatched uncertainties.The simulation and experimental results demonstrate the performance of the designed control scheme.The performance indices such as integral absolute error(IAE),integral square error(ISE),integral time absolute error(ITAE),and integral time square error(ITSE)are compared for both uncertainties to showcase the robustness of the designed control scheme.
基金The National Natural Science Foundation of China(62173172).
文摘Inverse reinforcement learning optimal control is under the framework of learner-expert.The learner system can imitate the expert system's demonstrated behaviors and does not require the predefined cost function,so it can handle optimal control problems effectively.This paper proposes an inverse reinforcement learning optimal control method for Takagi-Sugeno(T-S)fuzzy systems.Based on learner systems,an expert system is constructed,where the learner system only knows the expert system's optimal control policy.To reconstruct the unknown cost function,we firstly develop a model-based inverse reinforcement learning algorithm for the case that systems dynamics are known.The developed model-based learning algorithm is consists of two learning stages:an inner reinforcement learning loop and an outer inverse optimal control loop.The inner loop desires to obtain optimal control policy via learner's cost function and the outer loop aims to update learner's state-penalty matrices via only using expert's optimal control policy.Then,to eliminate the requirement that the system dynamics must be known,a data-driven integral learning algorithm is presented.It is proved that the presented two algorithms are convergent and the developed inverse reinforcement learning optimal control scheme can ensure the controlled fuzzy learner systems to be asymptotically stable.Finally,we apply the proposed fuzzy optimal control to the truck-trailer system,and the computer simulation results verify the effectiveness of the presented approach.
基金supported by the National Natural Science Foundation of China under 62173172。
文摘This paper investigates the problem of fuzzy adaptive finite-time inverse optimal control for active suspension systems(ASSs).The fuzzy logic systems(FLSs)are utilized to learn the unknown non-linear dynamics and an auxiliary system is established.Based on the finite-time stability theory and inverse optimal theory,a fuzzy adaptive inverse finite-time inverse optimal control method is proposed.It is proven that the formulated control approach guarantees the stability of the controlled systems,while ensuring that errors converge to a small neighborhood of zero within finite time.Moreover,the optimized control performance can be achieved.Eventually,the simulation results demonstrate the effectiveness of the proposed fuzzy adaptive finite-time inverse optimal control scheme.
文摘Optimal impulse control and impulse games provide the cutting-edge frameworks for modeling systems where control actions occur at discrete time points,and optimizing objectives under discontinuous interventions.This review synthesizes the theoretical advancements,computational approaches,emerging challenges,and possible research directions in the field.Firstly,we briefly review the fundamental theory of continuous-time optimal control,including Pontryagin's maximum principle(PMP)and dynamic programming principle(DPP).Secondly,we present the foundational results in optimal impulse control,including necessary conditions and sufficient conditions.Thirdly,we systematize impulse game methodologies,from Nash equilibrium existence theory to the connection between Nash equilibrium and systems stability.Fourthly,we summarize the numerical algorithms including the intelligent computation approaches.Finally,we examine the new trends and challenges in theory and applications as well as computational considerations.
基金supported in part by the National Key Research and Development Program of China(2021YFE0206100)the National Natural Science Foundation of China(62425310,62073321)+2 种基金the National Defense Basic Scientific Research Program(JCKY2019203C029,JCKY2020130C025)the Science and Technology Development FundMacao SAR(FDCT-22-009-MISE,0060/2021/A2,0015/2020/AMJ)
文摘This paper highlights the utilization of parallel control and adaptive dynamic programming(ADP) for event-triggered robust parallel optimal consensus control(ETRPOC) of uncertain nonlinear continuous-time multiagent systems(MASs).First, the parallel control system, which consists of a virtual control variable and a specific auxiliary variable obtained from the coupled Hamiltonian, allows general systems to be transformed into affine systems. Of interest is the fact that the parallel control technique's introduction provides an unprecedented perspective on eliminating the negative effects of disturbance. Then, an eventtriggered mechanism is adopted to save communication resources while ensuring the system's stability. The coupled HamiltonJacobi(HJ) equation's solution is approximated using a critic neural network(NN), whose weights are updated in response to events. Furthermore, theoretical analysis reveals that the weight estimation error is uniformly ultimately bounded(UUB). Finally,numerical simulations demonstrate the effectiveness of the developed ETRPOC method.
基金supported in part by the National Science Foundation of China(62373240,62273224,U24A20259).
文摘Dear Editor,The attacker is always going to intrude covertly networked control systems(NCSs)by dynamically changing false data injection attacks(FDIAs)strategy,while the defender try their best to resist attacks by designing defense strategy on the basis of identifying attack strategy,maintaining stable operation of NCSs.To solve this attack-defense game problem,this letter investigates optimal secure control of NCSs under FDIAs.First,for the alterations of energy caused by false data,a novel attack-defense game model is constructed,which considers the changes of energy caused by the actions of the defender and attacker in the forward and feedback channels.
基金Scholarship Award for Excellent Doctoral Student granted by Ministry of Education of Chinathe Scientific Innovation Research of College Graduates in Jiangsu Province(No.CXZZ11-0150)+1 种基金the National Natural Science Foundation of China(No.51177011)the National High Technology Research and Development Program of China(863 Program)(No.2012AA050210)
文摘In order to solve the multiple power extreme value point problem caused by system frequency splitting during wireless energy transmission at short distances a transmission model of the system is established.With the comprehensive consideration of the resonance frequency load parameters and the coupling between coils the internal factors of frequency splitting and boundary conditions are discussed.The results show that under the condition of the fixed load the higher the natural resonance frequency the easier the frequency splitting. As the frequency splitting occurs the frequency of the maximum power transfer is no longer with the natural resonance frequency which can make the system unstable and the transfer power more difficult to control. Therefore a decreasing-frequency method is proposed to avoid the system frequency splitting. And decreasing the system resonance frequency can make the system successfully withdraw the frequency splitting area at a short-distance range.Under the fixed load condition the transmission power of the system can be increased by 400% and the transmission efficiency is reduced by only 14% which greatly improves the transmission performance of the system.
基金supported by the Natural Sciences and Engineering Research Council of Canada(N00892)in part by National Natural Science Foundation of China(51405436,51375452,61573174)
文摘Linear quadratic regulator(LQR) and proportional-integral-derivative(PID) control methods, which are generally used for control of linear dynamical systems, are used in this paper to control the nonlinear dynamical system. LQR is one of the optimal control techniques, which takes into account the states of the dynamical system and control input to make the optimal control decisions.The nonlinear system states are fed to LQR which is designed using a linear state-space model. This is simple as well as robust. The inverted pendulum, a highly nonlinear unstable system, is used as a benchmark for implementing the control methods. Here the control objective is to control the system such that the cart reaches a desired position and the inverted pendulum stabilizes in the upright position. In this paper, the modeling and simulation for optimal control design of nonlinear inverted pendulum-cart dynamic system using PID controller and LQR have been presented for both cases of without and with disturbance input. The Matlab-Simulink models have been developed for simulation and performance analysis of the control schemes. The simulation results justify the comparative advantage of LQR control method.
基金supported by National Natural Science Foundation of China(71171003)Anhui Natural Science Foundation(10040606003)Anhui Natural Science Foundation of Universities(KJ2012B019,KJ2013B023)
文摘This article is concerned with a class of control systems with Markovian switching, in which an It5 formula for Markov-modulated processes is derived. Moreover, an optimal control law satisfying the generalized Hamilton-Jacobi-Bellman (HJB) equation with Markovian switching is characterized. Then, through the generalized HJB equation, we study an optimal consumption and portfolio problem with the financial markets of Markovian switching and inflation. Thus, we deduce the optimal policies and show that a modified Mutual Fund Theorem consisting of three funds holds. Finally, for the CRRA utility function, we explicitly give the optimal consumption and portfolio policies. Numerical examples are included to illustrate the obtained results.