In this paper,we consider the maximal positive definite solution of the nonlinear matrix equation.By using the idea of Algorithm 2.1 in ZHANG(2013),a new inversion-free method with a stepsize parameter is proposed to ...In this paper,we consider the maximal positive definite solution of the nonlinear matrix equation.By using the idea of Algorithm 2.1 in ZHANG(2013),a new inversion-free method with a stepsize parameter is proposed to obtain the maximal positive definite solution of nonlinear matrix equation X+A^(*)X|^(-α)A=Q with the case 0<α≤1.Based on this method,a new iterative algorithm is developed,and its convergence proof is given.Finally,two numerical examples are provided to show the effectiveness of the proposed method.展开更多
Trajectory tracking for nonlinear robotic systems remains a fundamental yet challenging problem in control engineering,particularly when both precision and efficiency must be ensured.Conventional control methods are o...Trajectory tracking for nonlinear robotic systems remains a fundamental yet challenging problem in control engineering,particularly when both precision and efficiency must be ensured.Conventional control methods are often effective for stabilization but may not directly optimize long-term performance.To address this limitation,this study develops an integrated framework that combines optimal control principles with reinforcement learning for a single-link robotic manipulator.The proposed scheme adopts an actor–critic structure,where the critic network approximates the value function associated with the Hamilton–Jacobi–Bellman equation,and the actor network generates near-optimal control signals in real time.This dual adaptation enables the controller to refine its policy online without explicit system knowledge.Stability of the closed-loop system is analyzed through Lyapunov theory,ensuring boundedness of the tracking error.Numerical simulations on the single-link manipulator demonstrate that themethod achieves accurate trajectory followingwhile maintaining lowcontrol effort.The results further showthat the actor–critic learning mechanism accelerates convergence of the control policy compared with conventional optimization-based strategies.This work highlights the potential of reinforcement learning integrated with optimal control for robotic manipulators and provides a foundation for future extensions to more complex multi-degree-of-freedom systems.The proposed controller is further validated in a physics-based virtual Gazebo environment,demonstrating stable adaptation and real-time feasibility.展开更多
The augmented evolution equation is established under the framework of the Variation Evolving Method(VEM)that seeks optimal solutions by solving the transformed Initial-Value Problems(IVPs).To improve the numerical pe...The augmented evolution equation is established under the framework of the Variation Evolving Method(VEM)that seeks optimal solutions by solving the transformed Initial-Value Problems(IVPs).To improve the numerical performance,its compact form is developed herein.Through replacing the states and costates variation evolution with that of the controls,the dimension-reduced Evolution Partial Differential Equation(EPDE)only solves the control variables along the variation time to get the optimal solution,and the initial conditions for the definite solution may be arbitrary.With this equation,the scale of the resulting IVPs,obtained via the semi-discrete method,is significantly reduced and they may be solved with common Ordinary Differential Equation(ODE)integration methods conveniently.Meanwhile,the state and the costate dynamics share consistent stability in the numerical computation and this avoids the intrinsic numerical difficulty as in the indirect methods.Numerical examples are solved and it is shown that the compact form evolution equation outperforms the primary form in the precision,and the efficiency may be higher for the dense discretization.Actually,it is uncovered that the compact form of the augmented evolution equation is a continuous realization of the Newton type iteration mechanism.展开更多
To better complete various missions, it is necessary to plan an optimal trajectory or provide the optimal control law for the multirole missile according to the actual situation, including launch conditions and target...To better complete various missions, it is necessary to plan an optimal trajectory or provide the optimal control law for the multirole missile according to the actual situation, including launch conditions and target location. Since trajectory optimization struggles to meet real-time requirements, the emergence of data-based generation methods has become a significant focus in contemporary research. However, due to the large differences in the characteristics of the optimal control laws caused by the diversity of tasks, it is difficult to achieve good prediction results by modeling all data with one single model.Therefore, the modeling idea of the mixture of experts(MoE) is adopted. Firstly, the K-means clustering algorithm is used to partition the sample data set, and the corresponding neural network classification model is established as the gate switch of MoE. Then, the expert models, i.e., the mappings from the generation conditions to the optimal control law represented by the results of principal component analysis(PCA), are represented by Kriging models. Finally, multiple rounds of accuracy evaluation, sample supplementation, and model updating are conducted to improve the generation accuracy. The Monte Carlo simulation shows that the accuracy of the proposed model reaches 96% and the generation efficiency meets the real-time requirement.展开更多
Assessing the impact of anthropogenic volatile organic compounds(VOCs)on ozone(O_(3))formation is vital for themanagement of emission reduction and pollution control.Continuousmeasurement of O_(3)and the major precurs...Assessing the impact of anthropogenic volatile organic compounds(VOCs)on ozone(O_(3))formation is vital for themanagement of emission reduction and pollution control.Continuousmeasurement of O_(3)and the major precursorswas conducted in a typical light industrial city in the YRD region from 1 May to 25 July in 2021.Alkanes were the most abundant VOC group,contributing to 55.0%of TVOCs concentration(56.43±21.10 ppb).OVOCs,aromatics,halides,alkenes,and alkynes contributed 18.7%,9.6%,9.3%,5.2%and 1.9%,respectively.The observational site shifted from a typical VOC control regime to a mixed regime from May to July,which can be explained by the significant increase of RO_(x)production,resulting in the transition of environment from NOx saturation to radical saturation with respect to O_(3)production.The optimal O_(3)control strategy should be dynamically changed depending on the transition of control regime.Under NOx saturation condition,minimizing the proportion of NOx in reduction could lead to better achievement of O_(3)alleviation.Under mixed control regime,the cut percentage gets the top priority for the effectiveness of O_(3)control.Five VOCs sources were identified:temperature dependent source(28.1%),vehicular exhausts(19.9%),petrochemical industries(7.2%),solvent&gasoline usage(32.3%)and manufacturing industries(12.6%).The increase of temperature and radiation would enhance the evaporation related VOC emissions,resulting in the increase of VOC concentration and the change of RO_(x)circulation.Our results highlight determination of the optimal control strategies for O_(3)pollution in a typical YRD industrial city.展开更多
We present a robust quantum optimal control framework for implementing fast entangling gates on ion-trap quantum processors.The framework leverages tailored laser pulses to drive the multiple vibrational sidebands of ...We present a robust quantum optimal control framework for implementing fast entangling gates on ion-trap quantum processors.The framework leverages tailored laser pulses to drive the multiple vibrational sidebands of the ions to create phonon-mediated entangling gates and,unlike the state of the art,requires neither weakcoupling Lamb-Dicke approximation nor perturbation treatment.With the application of gradient-based optimal control,it enables finding amplitude-and phase-modulated laser control protocols that work without the Lamb-Dicke approximation,promising gate speeds on the order of microseconds comparable to the characteristic trap frequencies.Also,robustness requirements on the temperature of the ions and initial optical phase can be conveniently included to pursue high-quality fast gates against experimental imperfections.Our approach represents a step in speeding up quantum gates to achieve larger quantum circuits for quantum computation and simulation,and thus can find applications in near-future experiments.展开更多
Quantum optimal control(QOC)relies on accurately modeling system dynamics and is often challenged by unknown or inaccessible interactions in real systems.Taking an unknown collective spin system as an example,this wor...Quantum optimal control(QOC)relies on accurately modeling system dynamics and is often challenged by unknown or inaccessible interactions in real systems.Taking an unknown collective spin system as an example,this work introduces a machine-learning-based,data-driven scheme to overcome the challenges encountered,with a trained neural network(NN)assuming the role of a surrogate model that captures the system’s dynamics and subsequently enables QOC to be performed on the NN instead of on the real system.The trained NN surrogate proves effective for practical QOC tasks and is further demonstrated to be adaptable to different experimental conditions,remaining robust across varying system sizes and pulse durations.展开更多
This paper addresses the Singular Optimal Control Problem(SOCP)for a surface-to-air missile with limited control,fully considering aerodynamic effects with a parabolic drag polar.This problem is an extension of the ty...This paper addresses the Singular Optimal Control Problem(SOCP)for a surface-to-air missile with limited control,fully considering aerodynamic effects with a parabolic drag polar.This problem is an extension of the typical Goddard problem.First,the classical Legendre-Clebsch condition is applied to derive optimal conditions for the singular angle of attack,revealing that the missile turns by gravity along the singular arc.Second,the higher-order differentiation of the switching function provides the necessary conditions to determine the optimal thrust,expressed as linear functions of the costate variables.The vanishing coefficient determinant is then employed to decouple the control and costate variables,yielding the singular thrust solely dependent on state variables and identifying the singular surface.Moreover,the analytical singular control can be regarded as path constraints subject to the typical Optimal Control Problem(OCP),enabling the GPOPS-Ⅱ,a direct method framework that does not involve the singular condition,to solve the SOCP.Finally,three cases with different structures are presented to evaluate the performance of the proposed method.The results show that it takes a few steps to obtain the numerical optimal solution,which is consistent with the analytical solution derived from the calculus of variations,highlighting its great computational accuracy and effectiveness.展开更多
The co-infection of corona and influenza viruses has emerged as a significant threat to global public health due to their shared modes of transmission and overlapping clinical symptoms.This article presents a novel ma...The co-infection of corona and influenza viruses has emerged as a significant threat to global public health due to their shared modes of transmission and overlapping clinical symptoms.This article presents a novel mathematical model that addresses the dynamics of this co-infection by extending the SEIR(Susceptible-Exposed-Infectious-Recovered)framework to incorporate treatment and hospitalization compartments.The population is divided into eight compartments,with infectious individuals further categorized into influenza infectious,corona infectious,and co-infection cases.The proposed mathematical model is constrained to adhere to fundamental epidemiological properties,such as non-negativity and boundedness within a feasible region.Additionally,the model is demonstrated to be well-posed with a unique solution.Equilibrium points,including the disease-free and endemic equilibria,are identified,and various properties related to these equilibrium points,such as the basic reproduction number,are determined.Local and global sensitivity analyses are performed to identify the parameters that highly influence disease dynamics and the reproduction number.Knowing the most influential parameters is crucial for understanding their impact on the co-infection’s spread and severity.Furthermore,an optimal control problem is defined to minimize disease transmission and to control strategy costs.The purpose of our study is to identify the most effective(optimal)control strategies for mitigating the spread of the co-infection with minimum cost of the controls.The results illustrate the effectiveness of the implemented control strategies in managing the co-infection’s impact on the population’s health.This mathematical modeling and control strategy framework provides valuable tools for understanding and combating the dual threat of corona and influenza co-infection,helping public health authorities and policymakers make informed decisions in the face of these intertwined epidemics.展开更多
The electromagnetic levitation system(EMLS)serves as the most important part of any magnetic levitation system.However,its characteristics are defined by its highly nonlinear dynamics and instability.Furthermore,the u...The electromagnetic levitation system(EMLS)serves as the most important part of any magnetic levitation system.However,its characteristics are defined by its highly nonlinear dynamics and instability.Furthermore,the uncertainties in the dynamics of an electromagnetic levitation system make the controller design more difficult.Therefore,it is necessary to design a robust control law that will ensure the system’s stability in the presence of these uncertainties.In this framework,the dynamics of an electromagnetic levitation system are addressed in terms of matched and unmatched uncertainties.The robust control problem is translated into the optimal control problem,where the uncertainties of the electromagnetic levitation system are directly reflected in the cost function.The optimal control method is used to solve the robust control problem.The solution to the optimal control problem for the electromagnetic levitation system is indeed a solution to the robust control problem of the electromagnetic levitation system under matched and unmatched uncertainties.The simulation and experimental results demonstrate the performance of the designed control scheme.The performance indices such as integral absolute error(IAE),integral square error(ISE),integral time absolute error(ITAE),and integral time square error(ITSE)are compared for both uncertainties to showcase the robustness of the designed control scheme.展开更多
This article presents an adaptive optimal control method for a semi-active suspension system.The model of the suspension system is built,in which the components of uncertain parameters and exogenous disturbance are de...This article presents an adaptive optimal control method for a semi-active suspension system.The model of the suspension system is built,in which the components of uncertain parameters and exogenous disturbance are described.The adaptive optimal control law consists of the sum of the optimal control component and the adaptive control component.First,the optimal control law is designed for the model of the suspension system after ignoring the components of uncertain parameters and exogenous disturbance caused by the road surface.The optimal control law expresses the desired dynamic characteristics of the suspension system.Next,the adaptive component is designed with the purpose of compensating for the effects caused by uncertain parameters and exogenous disturbance caused by the road surface;the adaptive component has adaptive parameter rules to estimate uncertain parameters and exogenous disturbance.When exogenous disturbances are eliminated,the system responds with an optimal controller designed.By separating theoretically the dynamic of a semi-active suspension system,this solution allows the design of two separate controllers easily and has reduced the computational burden and the use of too many tools,thus allowing for more convenient hardware implementation.The simulation results also show the effectiveness of damping oscillations of the proposed solution in this article.展开更多
In the industrial roller kiln,the time-delay characteristic in heat transfer causes the temperature field to be affected by both the current and historical temperature states.It presents a poor control performance and...In the industrial roller kiln,the time-delay characteristic in heat transfer causes the temperature field to be affected by both the current and historical temperature states.It presents a poor control performance and brings a significant challenge to the process precise control.Considering high complexity of precise modeling,a data-driven time-delay optimal control method for temperature field of roller kiln is proposed based on a large amount of process data.First,the control challenges and problem description brought by time-delay are demonstrated,where the cost function for the time-delay partial differential equation system is constructed.To obtain the optimal control law,the policy iteration in adaptive dynamic programming is adopted to design the time-delay temperature field controller,and neural network is used for the critic network in policy iteration to approximate the optimal time-delay cost function.The closed-loop system stability is proved by designing the Lyapunov function which contains the time-delay information.Finally,through establishing the time-delay temperature field model for roller kiln,the effectiveness and convergence of the proposed method is verified and proved.展开更多
In this paper,based on the SVIQR model we develop a stochastic epidemic model with multiple vaccinations and time delay.Firstly,we prove the existence and uniqueness of the global positive solution of the model,and co...In this paper,based on the SVIQR model we develop a stochastic epidemic model with multiple vaccinations and time delay.Firstly,we prove the existence and uniqueness of the global positive solution of the model,and construct suitable functions to obtain sufficient conditions for disease extinction.Secondly,in order to effectively control the spread of the disease,appropriate control strategies are formulated by using optimal control theory.Finally,the results are verified by numerical simulation.展开更多
An optimal feedback guidance law with disturbance rejection objective is proposed for endoatmospheric powered descent.This guidance law with an affine form is derived by solving a novel problem called Endoatmospheric ...An optimal feedback guidance law with disturbance rejection objective is proposed for endoatmospheric powered descent.This guidance law with an affine form is derived by solving a novel problem called Endoatmospheric Powered Descent Guidance with Disturbance Rejection(Endo-PDG-DR).The key idea of formulating the Endo-PDG-DR problem is dividing disturbances into two parts,modeled and unmodeled disturbances:the modeled disturbance is proactively exploited by augmenting it as a new state of a dynamics model;the unmodeled disturbance is reactively attenuated in terms of its effect on the guidance performance by adjoining a parameterized time-varying quadratic performance index in the proposed optimal guidance problem.A Pseudospectral Differential Dynamic Programming(PDDP)method is developed to solve the Endo-PDG-DR problem,and correspondingly a robust neighboring optimal state feedback law is obtained,which has two synergistic functionalities.One is adaptive optimal steering to accommodate the modeled disturbance,and the other is disturbance attenuation to compensate for the state perturbation effect induced by the unmodeled disturbance.Using the derived feedback guidance law,a disturbance rejection level is quantified,and is correspondingly optimized by designing a quadratic weighting parameter tuning law.The numerical computations of interest are performed within a pseudospectral setting,ensuring polynomial analytical solution,high computational efficiency,and reliable convergence.展开更多
This paper investigates a new SEIQR(susceptible–exposed–infected–quarantined–recovered) epidemic model with quarantine mechanism on heterogeneous complex networks. Firstly, the nonlinear SEIQR epidemic spreading d...This paper investigates a new SEIQR(susceptible–exposed–infected–quarantined–recovered) epidemic model with quarantine mechanism on heterogeneous complex networks. Firstly, the nonlinear SEIQR epidemic spreading dynamic differential coupling model is proposed. Then, by using mean-field theory and the next-generation matrix method, the equilibriums and basic reproduction number are derived. Theoretical results indicate that the basic reproduction number significantly relies on model parameters and topology of the underlying networks. In addition, the globally asymptotic stability of equilibrium and the permanence of the disease are proved in detail by the Routh–Hurwitz criterion, Lyapunov method and La Salle's invariance principle. Furthermore, we find that the quarantine mechanism, that is the quarantine rate(γ1, γ2), has a significant effect on epidemic spreading through sensitivity analysis of basic reproduction number and model parameters. Meanwhile, the optimal control model of quarantined rate and analysis method are proposed, which can optimize the government control strategies and reduce the number of infected individual. Finally, numerical simulations are given to verify the correctness of theoretical results and a practice application is proposed to predict and control the spreading of COVID-19.展开更多
This paper proposes an optimal midcourse guidance method for dual pulse air-to-air missiles,which is based on the framework of the linear Gauss pseudospectral model predictive control method.Firstly,a multistage optim...This paper proposes an optimal midcourse guidance method for dual pulse air-to-air missiles,which is based on the framework of the linear Gauss pseudospectral model predictive control method.Firstly,a multistage optimal control problem with unspecified terminal time is formulated.Secondly,the control and terminal time update formulas are derived analytically.In contrast to previous work,the derivation process fully considers the Hamiltonian function corresponding to the unspecified terminal time,which is coupled with control,state,and costate.On the assumption of small perturbation,a special algebraic equation is provided to represent the equivalent optimal condition for the terminal time.Also,using Gauss pseudospectral collocation,error propagation dynamical equations involving the first-order correction term of the terminal time are transformed into a set of algebraic equations.Furthermore,analytical modification formulas can be derived by associating those equations and optimal conditions to eliminate terminal error and approach nonlinear optimal control.Even with their mathematical complexity,these formulas produce more accurate control and terminal time corrections and remove reliance on task-related parameters.Finally,several numerical simulations,comparisons with typical methods,and Monte Carlo simulations have been done to verify its optimality,high convergence rate,great stability and robustness.展开更多
This paper investigates the problem of fuzzy adaptive finite-time inverse optimal control for active suspension systems(ASSs).The fuzzy logic systems(FLSs)are utilized to learn the unknown non-linear dynamics and an a...This paper investigates the problem of fuzzy adaptive finite-time inverse optimal control for active suspension systems(ASSs).The fuzzy logic systems(FLSs)are utilized to learn the unknown non-linear dynamics and an auxiliary system is established.Based on the finite-time stability theory and inverse optimal theory,a fuzzy adaptive inverse finite-time inverse optimal control method is proposed.It is proven that the formulated control approach guarantees the stability of the controlled systems,while ensuring that errors converge to a small neighborhood of zero within finite time.Moreover,the optimized control performance can be achieved.Eventually,the simulation results demonstrate the effectiveness of the proposed fuzzy adaptive finite-time inverse optimal control scheme.展开更多
To establish the optimal reference trajectory for a near-space vehicle under free terminal time,a time-optimal model predictive static programming method is proposed with adaptive fish swarm optimization.First,the mod...To establish the optimal reference trajectory for a near-space vehicle under free terminal time,a time-optimal model predictive static programming method is proposed with adaptive fish swarm optimization.First,the model predictive static programming method is developed by incorporating neighboring terms and trust region,enabling rapid generation of precise optimal solutions.Next,an adaptive fish swarm optimization technique is employed to identify a sub-optimal solution,while a momentum gradient descent method with learning rate decay ensures the convergence to the global optimal solution.To validate the feasibility and accuracy of the proposed method,a near-space vehicle example is analyzed and simulated during its glide phase.The simulation results demonstrate that the proposed method aligns with theoretical derivations and outperforms existing methods in terms of convergence speed and accuracy.Therefore,the proposed method offers significant practical value for solving the fast trajectory optimization problem in near-space vehicle applications.展开更多
Intercepting high-maneuverability hypersonic targets in near-space environments poses significant challenges due to their extreme speeds and evasive capabilities.To address these challenges,this study presents an inte...Intercepting high-maneuverability hypersonic targets in near-space environments poses significant challenges due to their extreme speeds and evasive capabilities.To address these challenges,this study presents an integrated approach that combines a Three-Dimensional Finite-Time Optimal Cooperative Guidance Law(FTOC)with an Information Fusion Anti-saturation Predefined-time Observer(IFAPO).The proposed FTOC guidance law employs a nonlinear,non-quadratic finite-time optimal control strategy designed for rapid convergence within the limited timeframes of near-space interceptions,avoiding the need for remaining flight time estimation or linear decoupling inherent in traditional methods.To complement the guidance strategy,the IFAPO leverages multi-source information fusion theory and incorporates anti-saturation mechanisms to enhance target maneuver estimation.This method ensures accurate and real-time prediction of target acceleration while maintaining predefined convergence performance,even under complex interception conditions.By integrating the FTOC guidance law and IFAPO,the approach optimizes cooperative missile positioning,improves interception success rates,and minimizes fuel consumption,addressing practical constraints in military applications.Simulation results and comparative analyses confirm the effectiveness of the integrated approach,demonstrating its capability to achieve cooperative interception of highly maneuvering targets with enhanced efficiency and reduced economic costs,aligning with realistic combat scenarios.展开更多
Inverse reinforcement learning optimal control is under the framework of learner-expert.The learner system can imitate the expert system's demonstrated behaviors and does not require the predefined cost function,s...Inverse reinforcement learning optimal control is under the framework of learner-expert.The learner system can imitate the expert system's demonstrated behaviors and does not require the predefined cost function,so it can handle optimal control problems effectively.This paper proposes an inverse reinforcement learning optimal control method for Takagi-Sugeno(T-S)fuzzy systems.Based on learner systems,an expert system is constructed,where the learner system only knows the expert system's optimal control policy.To reconstruct the unknown cost function,we firstly develop a model-based inverse reinforcement learning algorithm for the case that systems dynamics are known.The developed model-based learning algorithm is consists of two learning stages:an inner reinforcement learning loop and an outer inverse optimal control loop.The inner loop desires to obtain optimal control policy via learner's cost function and the outer loop aims to update learner's state-penalty matrices via only using expert's optimal control policy.Then,to eliminate the requirement that the system dynamics must be known,a data-driven integral learning algorithm is presented.It is proved that the presented two algorithms are convergent and the developed inverse reinforcement learning optimal control scheme can ensure the controlled fuzzy learner systems to be asymptotically stable.Finally,we apply the proposed fuzzy optimal control to the truck-trailer system,and the computer simulation results verify the effectiveness of the presented approach.展开更多
基金Supported in part by Natural Science Foundation of Guangxi(2023GXNSFAA026246)in part by the Central Government's Guide to Local Science and Technology Development Fund(GuikeZY23055044)in part by the National Natural Science Foundation of China(62363003)。
文摘In this paper,we consider the maximal positive definite solution of the nonlinear matrix equation.By using the idea of Algorithm 2.1 in ZHANG(2013),a new inversion-free method with a stepsize parameter is proposed to obtain the maximal positive definite solution of nonlinear matrix equation X+A^(*)X|^(-α)A=Q with the case 0<α≤1.Based on this method,a new iterative algorithm is developed,and its convergence proof is given.Finally,two numerical examples are provided to show the effectiveness of the proposed method.
基金supported in part by the National Science and Technology Council under Grant NSTC 114-2221-E-027-104.
文摘Trajectory tracking for nonlinear robotic systems remains a fundamental yet challenging problem in control engineering,particularly when both precision and efficiency must be ensured.Conventional control methods are often effective for stabilization but may not directly optimize long-term performance.To address this limitation,this study develops an integrated framework that combines optimal control principles with reinforcement learning for a single-link robotic manipulator.The proposed scheme adopts an actor–critic structure,where the critic network approximates the value function associated with the Hamilton–Jacobi–Bellman equation,and the actor network generates near-optimal control signals in real time.This dual adaptation enables the controller to refine its policy online without explicit system knowledge.Stability of the closed-loop system is analyzed through Lyapunov theory,ensuring boundedness of the tracking error.Numerical simulations on the single-link manipulator demonstrate that themethod achieves accurate trajectory followingwhile maintaining lowcontrol effort.The results further showthat the actor–critic learning mechanism accelerates convergence of the control policy compared with conventional optimization-based strategies.This work highlights the potential of reinforcement learning integrated with optimal control for robotic manipulators and provides a foundation for future extensions to more complex multi-degree-of-freedom systems.The proposed controller is further validated in a physics-based virtual Gazebo environment,demonstrating stable adaptation and real-time feasibility.
基金supported by the National Nature Science Foundation of China under Grant No.11902332。
文摘The augmented evolution equation is established under the framework of the Variation Evolving Method(VEM)that seeks optimal solutions by solving the transformed Initial-Value Problems(IVPs).To improve the numerical performance,its compact form is developed herein.Through replacing the states and costates variation evolution with that of the controls,the dimension-reduced Evolution Partial Differential Equation(EPDE)only solves the control variables along the variation time to get the optimal solution,and the initial conditions for the definite solution may be arbitrary.With this equation,the scale of the resulting IVPs,obtained via the semi-discrete method,is significantly reduced and they may be solved with common Ordinary Differential Equation(ODE)integration methods conveniently.Meanwhile,the state and the costate dynamics share consistent stability in the numerical computation and this avoids the intrinsic numerical difficulty as in the indirect methods.Numerical examples are solved and it is shown that the compact form evolution equation outperforms the primary form in the precision,and the efficiency may be higher for the dense discretization.Actually,it is uncovered that the compact form of the augmented evolution equation is a continuous realization of the Newton type iteration mechanism.
基金Defense Industrial Technology Development Program (JCKY2020204B016)National Natural Science Foundation of China (92471206)。
文摘To better complete various missions, it is necessary to plan an optimal trajectory or provide the optimal control law for the multirole missile according to the actual situation, including launch conditions and target location. Since trajectory optimization struggles to meet real-time requirements, the emergence of data-based generation methods has become a significant focus in contemporary research. However, due to the large differences in the characteristics of the optimal control laws caused by the diversity of tasks, it is difficult to achieve good prediction results by modeling all data with one single model.Therefore, the modeling idea of the mixture of experts(MoE) is adopted. Firstly, the K-means clustering algorithm is used to partition the sample data set, and the corresponding neural network classification model is established as the gate switch of MoE. Then, the expert models, i.e., the mappings from the generation conditions to the optimal control law represented by the results of principal component analysis(PCA), are represented by Kriging models. Finally, multiple rounds of accuracy evaluation, sample supplementation, and model updating are conducted to improve the generation accuracy. The Monte Carlo simulation shows that the accuracy of the proposed model reaches 96% and the generation efficiency meets the real-time requirement.
基金supported by the National Natural Science Foundation of China(Nos.42005086,91844301,and 41805100)the National Key Research and Development Programof China(No.2022YFC3703500)+2 种基金China Postdoctoral Science Foundation(No.2023M733028)the Key Research and Development Program of Zhejiang Province(Nos.2021C03165 and 2022C03084)the Ecological and Environmental Scientific Research and Achievement Promotion Project of Zhejiang Province(No.2020HT0048).
文摘Assessing the impact of anthropogenic volatile organic compounds(VOCs)on ozone(O_(3))formation is vital for themanagement of emission reduction and pollution control.Continuousmeasurement of O_(3)and the major precursorswas conducted in a typical light industrial city in the YRD region from 1 May to 25 July in 2021.Alkanes were the most abundant VOC group,contributing to 55.0%of TVOCs concentration(56.43±21.10 ppb).OVOCs,aromatics,halides,alkenes,and alkynes contributed 18.7%,9.6%,9.3%,5.2%and 1.9%,respectively.The observational site shifted from a typical VOC control regime to a mixed regime from May to July,which can be explained by the significant increase of RO_(x)production,resulting in the transition of environment from NOx saturation to radical saturation with respect to O_(3)production.The optimal O_(3)control strategy should be dynamically changed depending on the transition of control regime.Under NOx saturation condition,minimizing the proportion of NOx in reduction could lead to better achievement of O_(3)alleviation.Under mixed control regime,the cut percentage gets the top priority for the effectiveness of O_(3)control.Five VOCs sources were identified:temperature dependent source(28.1%),vehicular exhausts(19.9%),petrochemical industries(7.2%),solvent&gasoline usage(32.3%)and manufacturing industries(12.6%).The increase of temperature and radiation would enhance the evaporation related VOC emissions,resulting in the increase of VOC concentration and the change of RO_(x)circulation.Our results highlight determination of the optimal control strategies for O_(3)pollution in a typical YRD industrial city.
基金supported by the National Natural Science Foundation of China(Grant Nos.12441502,12122506,12204230,and 12404554)the National Science and Technology Major Project of the Ministry of Science and Technology of China(2024ZD0300404)+6 种基金Guangdong Basic and Applied Basic Research Foundation(Grant No.2021B1515020070)Shenzhen Science and Technology Program(Grant No.RCYX20200714114522109)China Postdoctoral Science Foundation(CPSF)(2024M762114)Postdoctoral Fellowship Program of CPSF(GZC20231727)supported by the National Natural Science Foundation of China(Grant Nos.92165206 and 11974330)Innovation Program for Quantum Science and Technology(Grant No.2021ZD0301603)the Fundamental Research Funds for the Central Universities。
文摘We present a robust quantum optimal control framework for implementing fast entangling gates on ion-trap quantum processors.The framework leverages tailored laser pulses to drive the multiple vibrational sidebands of the ions to create phonon-mediated entangling gates and,unlike the state of the art,requires neither weakcoupling Lamb-Dicke approximation nor perturbation treatment.With the application of gradient-based optimal control,it enables finding amplitude-and phase-modulated laser control protocols that work without the Lamb-Dicke approximation,promising gate speeds on the order of microseconds comparable to the characteristic trap frequencies.Also,robustness requirements on the temperature of the ions and initial optical phase can be conveniently included to pursue high-quality fast gates against experimental imperfections.Our approach represents a step in speeding up quantum gates to achieve larger quantum circuits for quantum computation and simulation,and thus can find applications in near-future experiments.
基金supported by the Innovation Program for Quantum Science and Technology(Grant No.2021ZD0302100)the National Natural Science Foundation of China(Grant Nos.12361131576,92265205,and 92476205).
文摘Quantum optimal control(QOC)relies on accurately modeling system dynamics and is often challenged by unknown or inaccessible interactions in real systems.Taking an unknown collective spin system as an example,this work introduces a machine-learning-based,data-driven scheme to overcome the challenges encountered,with a trained neural network(NN)assuming the role of a surrogate model that captures the system’s dynamics and subsequently enables QOC to be performed on the NN instead of on the real system.The trained NN surrogate proves effective for practical QOC tasks and is further demonstrated to be adaptable to different experimental conditions,remaining robust across varying system sizes and pulse durations.
基金co-supported by the National Natural Science Foundation of China(No.62003019)the Young Talents Support Program of Beihang University,China(No.YWF21-BJ-J-1180)。
文摘This paper addresses the Singular Optimal Control Problem(SOCP)for a surface-to-air missile with limited control,fully considering aerodynamic effects with a parabolic drag polar.This problem is an extension of the typical Goddard problem.First,the classical Legendre-Clebsch condition is applied to derive optimal conditions for the singular angle of attack,revealing that the missile turns by gravity along the singular arc.Second,the higher-order differentiation of the switching function provides the necessary conditions to determine the optimal thrust,expressed as linear functions of the costate variables.The vanishing coefficient determinant is then employed to decouple the control and costate variables,yielding the singular thrust solely dependent on state variables and identifying the singular surface.Moreover,the analytical singular control can be regarded as path constraints subject to the typical Optimal Control Problem(OCP),enabling the GPOPS-Ⅱ,a direct method framework that does not involve the singular condition,to solve the SOCP.Finally,three cases with different structures are presented to evaluate the performance of the proposed method.The results show that it takes a few steps to obtain the numerical optimal solution,which is consistent with the analytical solution derived from the calculus of variations,highlighting its great computational accuracy and effectiveness.
基金supported by NASA Oklahoma Established Program to Stimulate Competitive Research(EPSCoR)Infrastructure Development,“Machine Learning Ocean World Biosignature Detection from Mass Spec”(PI:BrettMcKinney),Grant No.80NSSC24M0109Tandy School of Computer Science,University of Tulsa.
文摘The co-infection of corona and influenza viruses has emerged as a significant threat to global public health due to their shared modes of transmission and overlapping clinical symptoms.This article presents a novel mathematical model that addresses the dynamics of this co-infection by extending the SEIR(Susceptible-Exposed-Infectious-Recovered)framework to incorporate treatment and hospitalization compartments.The population is divided into eight compartments,with infectious individuals further categorized into influenza infectious,corona infectious,and co-infection cases.The proposed mathematical model is constrained to adhere to fundamental epidemiological properties,such as non-negativity and boundedness within a feasible region.Additionally,the model is demonstrated to be well-posed with a unique solution.Equilibrium points,including the disease-free and endemic equilibria,are identified,and various properties related to these equilibrium points,such as the basic reproduction number,are determined.Local and global sensitivity analyses are performed to identify the parameters that highly influence disease dynamics and the reproduction number.Knowing the most influential parameters is crucial for understanding their impact on the co-infection’s spread and severity.Furthermore,an optimal control problem is defined to minimize disease transmission and to control strategy costs.The purpose of our study is to identify the most effective(optimal)control strategies for mitigating the spread of the co-infection with minimum cost of the controls.The results illustrate the effectiveness of the implemented control strategies in managing the co-infection’s impact on the population’s health.This mathematical modeling and control strategy framework provides valuable tools for understanding and combating the dual threat of corona and influenza co-infection,helping public health authorities and policymakers make informed decisions in the face of these intertwined epidemics.
文摘The electromagnetic levitation system(EMLS)serves as the most important part of any magnetic levitation system.However,its characteristics are defined by its highly nonlinear dynamics and instability.Furthermore,the uncertainties in the dynamics of an electromagnetic levitation system make the controller design more difficult.Therefore,it is necessary to design a robust control law that will ensure the system’s stability in the presence of these uncertainties.In this framework,the dynamics of an electromagnetic levitation system are addressed in terms of matched and unmatched uncertainties.The robust control problem is translated into the optimal control problem,where the uncertainties of the electromagnetic levitation system are directly reflected in the cost function.The optimal control method is used to solve the robust control problem.The solution to the optimal control problem for the electromagnetic levitation system is indeed a solution to the robust control problem of the electromagnetic levitation system under matched and unmatched uncertainties.The simulation and experimental results demonstrate the performance of the designed control scheme.The performance indices such as integral absolute error(IAE),integral square error(ISE),integral time absolute error(ITAE),and integral time square error(ITSE)are compared for both uncertainties to showcase the robustness of the designed control scheme.
基金supported in part by the Thai Nguyen University of Technology,Vietnam.
文摘This article presents an adaptive optimal control method for a semi-active suspension system.The model of the suspension system is built,in which the components of uncertain parameters and exogenous disturbance are described.The adaptive optimal control law consists of the sum of the optimal control component and the adaptive control component.First,the optimal control law is designed for the model of the suspension system after ignoring the components of uncertain parameters and exogenous disturbance caused by the road surface.The optimal control law expresses the desired dynamic characteristics of the suspension system.Next,the adaptive component is designed with the purpose of compensating for the effects caused by uncertain parameters and exogenous disturbance caused by the road surface;the adaptive component has adaptive parameter rules to estimate uncertain parameters and exogenous disturbance.When exogenous disturbances are eliminated,the system responds with an optimal controller designed.By separating theoretically the dynamic of a semi-active suspension system,this solution allows the design of two separate controllers easily and has reduced the computational burden and the use of too many tools,thus allowing for more convenient hardware implementation.The simulation results also show the effectiveness of damping oscillations of the proposed solution in this article.
基金supported in part by the Key Program of National Natural Science Foundation of China(62033014)the Application Projects of Integrated Standardization and New Paradigm for Intelligent Manufacturing from the Ministry of Industry and Information Technology of China in 2016the Fundamental Research Funds for the Central Universities of Central South University(2021zzts0700).
文摘In the industrial roller kiln,the time-delay characteristic in heat transfer causes the temperature field to be affected by both the current and historical temperature states.It presents a poor control performance and brings a significant challenge to the process precise control.Considering high complexity of precise modeling,a data-driven time-delay optimal control method for temperature field of roller kiln is proposed based on a large amount of process data.First,the control challenges and problem description brought by time-delay are demonstrated,where the cost function for the time-delay partial differential equation system is constructed.To obtain the optimal control law,the policy iteration in adaptive dynamic programming is adopted to design the time-delay temperature field controller,and neural network is used for the critic network in policy iteration to approximate the optimal time-delay cost function.The closed-loop system stability is proved by designing the Lyapunov function which contains the time-delay information.Finally,through establishing the time-delay temperature field model for roller kiln,the effectiveness and convergence of the proposed method is verified and proved.
基金supported by the Fundamental Research Funds for the Central Universities(No.3122025090)。
文摘In this paper,based on the SVIQR model we develop a stochastic epidemic model with multiple vaccinations and time delay.Firstly,we prove the existence and uniqueness of the global positive solution of the model,and construct suitable functions to obtain sufficient conditions for disease extinction.Secondly,in order to effectively control the spread of the disease,appropriate control strategies are formulated by using optimal control theory.Finally,the results are verified by numerical simulation.
基金co-supported by the National Natural Science Foundation of China(No.62103014)。
文摘An optimal feedback guidance law with disturbance rejection objective is proposed for endoatmospheric powered descent.This guidance law with an affine form is derived by solving a novel problem called Endoatmospheric Powered Descent Guidance with Disturbance Rejection(Endo-PDG-DR).The key idea of formulating the Endo-PDG-DR problem is dividing disturbances into two parts,modeled and unmodeled disturbances:the modeled disturbance is proactively exploited by augmenting it as a new state of a dynamics model;the unmodeled disturbance is reactively attenuated in terms of its effect on the guidance performance by adjoining a parameterized time-varying quadratic performance index in the proposed optimal guidance problem.A Pseudospectral Differential Dynamic Programming(PDDP)method is developed to solve the Endo-PDG-DR problem,and correspondingly a robust neighboring optimal state feedback law is obtained,which has two synergistic functionalities.One is adaptive optimal steering to accommodate the modeled disturbance,and the other is disturbance attenuation to compensate for the state perturbation effect induced by the unmodeled disturbance.Using the derived feedback guidance law,a disturbance rejection level is quantified,and is correspondingly optimized by designing a quadratic weighting parameter tuning law.The numerical computations of interest are performed within a pseudospectral setting,ensuring polynomial analytical solution,high computational efficiency,and reliable convergence.
基金Project supported the Natural Science Foundation of Zhejiang Province, China (Grant No. LQN25F030011)the Fundamental Research Project of Hangzhou Dianzi University (Grant No. KYS065624391)+1 种基金the National Natural Science Foundation of China (Grant No. 61573148)the Science and Technology Planning Project of Guangdong Province, China (Grant No. 2019A050520001)。
文摘This paper investigates a new SEIQR(susceptible–exposed–infected–quarantined–recovered) epidemic model with quarantine mechanism on heterogeneous complex networks. Firstly, the nonlinear SEIQR epidemic spreading dynamic differential coupling model is proposed. Then, by using mean-field theory and the next-generation matrix method, the equilibriums and basic reproduction number are derived. Theoretical results indicate that the basic reproduction number significantly relies on model parameters and topology of the underlying networks. In addition, the globally asymptotic stability of equilibrium and the permanence of the disease are proved in detail by the Routh–Hurwitz criterion, Lyapunov method and La Salle's invariance principle. Furthermore, we find that the quarantine mechanism, that is the quarantine rate(γ1, γ2), has a significant effect on epidemic spreading through sensitivity analysis of basic reproduction number and model parameters. Meanwhile, the optimal control model of quarantined rate and analysis method are proposed, which can optimize the government control strategies and reduce the number of infected individual. Finally, numerical simulations are given to verify the correctness of theoretical results and a practice application is proposed to predict and control the spreading of COVID-19.
基金supported by the National Natural Science Foundation of China(No.62003019)the Young Talents Support Program of Beihang University,China(No.YWF-21-BJ-J-1180).
文摘This paper proposes an optimal midcourse guidance method for dual pulse air-to-air missiles,which is based on the framework of the linear Gauss pseudospectral model predictive control method.Firstly,a multistage optimal control problem with unspecified terminal time is formulated.Secondly,the control and terminal time update formulas are derived analytically.In contrast to previous work,the derivation process fully considers the Hamiltonian function corresponding to the unspecified terminal time,which is coupled with control,state,and costate.On the assumption of small perturbation,a special algebraic equation is provided to represent the equivalent optimal condition for the terminal time.Also,using Gauss pseudospectral collocation,error propagation dynamical equations involving the first-order correction term of the terminal time are transformed into a set of algebraic equations.Furthermore,analytical modification formulas can be derived by associating those equations and optimal conditions to eliminate terminal error and approach nonlinear optimal control.Even with their mathematical complexity,these formulas produce more accurate control and terminal time corrections and remove reliance on task-related parameters.Finally,several numerical simulations,comparisons with typical methods,and Monte Carlo simulations have been done to verify its optimality,high convergence rate,great stability and robustness.
基金supported by the National Natural Science Foundation of China under 62173172。
文摘This paper investigates the problem of fuzzy adaptive finite-time inverse optimal control for active suspension systems(ASSs).The fuzzy logic systems(FLSs)are utilized to learn the unknown non-linear dynamics and an auxiliary system is established.Based on the finite-time stability theory and inverse optimal theory,a fuzzy adaptive inverse finite-time inverse optimal control method is proposed.It is proven that the formulated control approach guarantees the stability of the controlled systems,while ensuring that errors converge to a small neighborhood of zero within finite time.Moreover,the optimized control performance can be achieved.Eventually,the simulation results demonstrate the effectiveness of the proposed fuzzy adaptive finite-time inverse optimal control scheme.
基金supported by the National Science Foundation for Distinguished Young Scholars of China(No.52425212)National Key Research and Development Program of China(No.2021YFA0717100)National Natural Science Foundation of China(Nos.12072270,U2013206,and 52442214).
文摘To establish the optimal reference trajectory for a near-space vehicle under free terminal time,a time-optimal model predictive static programming method is proposed with adaptive fish swarm optimization.First,the model predictive static programming method is developed by incorporating neighboring terms and trust region,enabling rapid generation of precise optimal solutions.Next,an adaptive fish swarm optimization technique is employed to identify a sub-optimal solution,while a momentum gradient descent method with learning rate decay ensures the convergence to the global optimal solution.To validate the feasibility and accuracy of the proposed method,a near-space vehicle example is analyzed and simulated during its glide phase.The simulation results demonstrate that the proposed method aligns with theoretical derivations and outperforms existing methods in terms of convergence speed and accuracy.Therefore,the proposed method offers significant practical value for solving the fast trajectory optimization problem in near-space vehicle applications.
基金supported by the National Natural Science Foundation of China(Grant No.61773142).
文摘Intercepting high-maneuverability hypersonic targets in near-space environments poses significant challenges due to their extreme speeds and evasive capabilities.To address these challenges,this study presents an integrated approach that combines a Three-Dimensional Finite-Time Optimal Cooperative Guidance Law(FTOC)with an Information Fusion Anti-saturation Predefined-time Observer(IFAPO).The proposed FTOC guidance law employs a nonlinear,non-quadratic finite-time optimal control strategy designed for rapid convergence within the limited timeframes of near-space interceptions,avoiding the need for remaining flight time estimation or linear decoupling inherent in traditional methods.To complement the guidance strategy,the IFAPO leverages multi-source information fusion theory and incorporates anti-saturation mechanisms to enhance target maneuver estimation.This method ensures accurate and real-time prediction of target acceleration while maintaining predefined convergence performance,even under complex interception conditions.By integrating the FTOC guidance law and IFAPO,the approach optimizes cooperative missile positioning,improves interception success rates,and minimizes fuel consumption,addressing practical constraints in military applications.Simulation results and comparative analyses confirm the effectiveness of the integrated approach,demonstrating its capability to achieve cooperative interception of highly maneuvering targets with enhanced efficiency and reduced economic costs,aligning with realistic combat scenarios.
基金The National Natural Science Foundation of China(62173172).
文摘Inverse reinforcement learning optimal control is under the framework of learner-expert.The learner system can imitate the expert system's demonstrated behaviors and does not require the predefined cost function,so it can handle optimal control problems effectively.This paper proposes an inverse reinforcement learning optimal control method for Takagi-Sugeno(T-S)fuzzy systems.Based on learner systems,an expert system is constructed,where the learner system only knows the expert system's optimal control policy.To reconstruct the unknown cost function,we firstly develop a model-based inverse reinforcement learning algorithm for the case that systems dynamics are known.The developed model-based learning algorithm is consists of two learning stages:an inner reinforcement learning loop and an outer inverse optimal control loop.The inner loop desires to obtain optimal control policy via learner's cost function and the outer loop aims to update learner's state-penalty matrices via only using expert's optimal control policy.Then,to eliminate the requirement that the system dynamics must be known,a data-driven integral learning algorithm is presented.It is proved that the presented two algorithms are convergent and the developed inverse reinforcement learning optimal control scheme can ensure the controlled fuzzy learner systems to be asymptotically stable.Finally,we apply the proposed fuzzy optimal control to the truck-trailer system,and the computer simulation results verify the effectiveness of the presented approach.