An iterative optimization strategy is proposed and applied to the steady state optimizing control of the bio-dissimilation process of glycerol to 1,3-propanediol in the presence of model-plant mismatch and input const...An iterative optimization strategy is proposed and applied to the steady state optimizing control of the bio-dissimilation process of glycerol to 1,3-propanediol in the presence of model-plant mismatch and input constraints. The scheme is based on the Augmented Integrated System Optimization and Parameter Estimation (AI- SOPE) technique, but a linearization of some performance function in the modified model-based optimization problem of AISOPE is introduced to overcome the difficulty of determining an appropriate penalty parameter. When carrying out the iterative optimization, the penalty coefficient is set to a larger value at the current iteration than at the previous iteration, which can promote the evolution rate of the iterative optimization. Simulation studies illustrate the potential ofthe approach presented for the optimizing control of the bioTdissimilation process of glycerol to 1,3-propanediol. The effects of measurement noise, measured and unmeasured disturbances on the proposed algorithm are also investigated.展开更多
The aim of this work is to analyze and design a control system for vibration reduction in a rotor system using a shear mode magnetorheological fluid(MRF)damper.A dynamic model of the MRF damper-rotor system was built ...The aim of this work is to analyze and design a control system for vibration reduction in a rotor system using a shear mode magnetorheological fluid(MRF)damper.A dynamic model of the MRF damper-rotor system was built and simulated in Matlab/Simulink to analyze the rotor vibration characteristics and the vibration reduction effect of the MRF damper.Based on the numerical simulation analysis,an optimizing control strategy using pattern search method was proposed and designed.The control system was constructed on a test rotor bench and experiment validations on the effectiveness of the proposed control strategy were conducted.Experimental results show that rotor vibration caused by unbalance can be well controlled whether in resonance region(70%)or in non-resonance region(30%).An irregular vibration amplitude jump can be suppressed with the optimization strategy.Furthermore,it is found that the rapidity of transient response and efficiency of optimizing technique depend on the pattern search step.The presented strategies and control system can be extended to multi-span(more than two or three spans)rotor system.It provides a powerful technical support for the extension and application in target and control for shafting vibration.展开更多
Coal flotation is widely used to separate commercially valuable coal from the fine ore slurry, and is an industrial process with nonlinear, multivariable, time-varying and long time-delay characteristics. The online d...Coal flotation is widely used to separate commercially valuable coal from the fine ore slurry, and is an industrial process with nonlinear, multivariable, time-varying and long time-delay characteristics. The online detection of ash content of products as the operation performance evaluation in the flotation system is extraordinarily difficult because of the low solid content and numerous micro-bubbles in the slurry. Moreover, it is time-consuming by manual analysis. Consequently, the optimal separation is not usually maintained. A novel technique, called the neuro-immune algorithm (NIA) inspired by the biological nervous and immune systems, is presented in this paper for predicting the ash content of clean coal and performing the optimizing control to the coal flotation system. The proposed algorithm integrates the deeply-studied artificial neural network (ANN) and the developing artificial immune system (AIS). A two-layer back-propagation network was constructed offline based on the historical process data under the best system situation, using five parameters: the flow and the density of raw slurry, the input flows of water, the kerosene and the GF oil, as the inputs and the ash content of clean coal as the output. The immune cell of AIS is made up of six parameters above as the antigen. The cytokine based clone selection algorithm is used to produce the relative antibody. The detailed computation procedures about the hybrid neuro-immune algorithm are minutely discussed. The ash content of clean coal was predicted by NIA using the practical process data s: (308.6 174.7 146.1 43.6 4.0 9.4), and the absolute difference between the actual and computed ash content values was 0.0967%. The optimizing control on NIA was simulated considering two different situations where the ash content of clean coal was controlled downward from 10.00% or upward from 9.20% predicted by ANN to the target value 9.50%. The results indicate that the target ash content and the value of controlling parameters are obtained after several control cycles.展开更多
Green sand casting is still a main method in the world at present and it isvery significant to develop the technology of controlling green sand quality. A new concept, fromcontents test to contents control, is advance...Green sand casting is still a main method in the world at present and it isvery significant to develop the technology of controlling green sand quality. A new concept, fromcontents test to contents control, is advanced. In order to realize the new idea, a new method toon-line test active clay and moisture of green sand - double powers energizing alternately (DPEA)method is put forwards. The principle of the new method is to energize standard sand sample with ACand DC powers and to test the electric parameters, and then, to calculate active clay and moistureof green sand by using artificial neural network (ANN). Based on this new method, a directoptimizing system for controlling green sand quality is developed. Techniques about testing andcontrolling methods, hardware and software are discussed.展开更多
In this paper,we consider the maximal positive definite solution of the nonlinear matrix equation.By using the idea of Algorithm 2.1 in ZHANG(2013),a new inversion-free method with a stepsize parameter is proposed to ...In this paper,we consider the maximal positive definite solution of the nonlinear matrix equation.By using the idea of Algorithm 2.1 in ZHANG(2013),a new inversion-free method with a stepsize parameter is proposed to obtain the maximal positive definite solution of nonlinear matrix equation X+A^(*)X|^(-α)A=Q with the case 0<α≤1.Based on this method,a new iterative algorithm is developed,and its convergence proof is given.Finally,two numerical examples are provided to show the effectiveness of the proposed method.展开更多
Trajectory tracking for nonlinear robotic systems remains a fundamental yet challenging problem in control engineering,particularly when both precision and efficiency must be ensured.Conventional control methods are o...Trajectory tracking for nonlinear robotic systems remains a fundamental yet challenging problem in control engineering,particularly when both precision and efficiency must be ensured.Conventional control methods are often effective for stabilization but may not directly optimize long-term performance.To address this limitation,this study develops an integrated framework that combines optimal control principles with reinforcement learning for a single-link robotic manipulator.The proposed scheme adopts an actor–critic structure,where the critic network approximates the value function associated with the Hamilton–Jacobi–Bellman equation,and the actor network generates near-optimal control signals in real time.This dual adaptation enables the controller to refine its policy online without explicit system knowledge.Stability of the closed-loop system is analyzed through Lyapunov theory,ensuring boundedness of the tracking error.Numerical simulations on the single-link manipulator demonstrate that themethod achieves accurate trajectory followingwhile maintaining lowcontrol effort.The results further showthat the actor–critic learning mechanism accelerates convergence of the control policy compared with conventional optimization-based strategies.This work highlights the potential of reinforcement learning integrated with optimal control for robotic manipulators and provides a foundation for future extensions to more complex multi-degree-of-freedom systems.The proposed controller is further validated in a physics-based virtual Gazebo environment,demonstrating stable adaptation and real-time feasibility.展开更多
Dear Editor,In this letter,we focus on the algebraic relationship between the coefficient matrices and the solution of the stochastic algebraic Riccati equation.It is revealed that,if the coefficient matrices are in a...Dear Editor,In this letter,we focus on the algebraic relationship between the coefficient matrices and the solution of the stochastic algebraic Riccati equation.It is revealed that,if the coefficient matrices are in an algebra,then the solution(and also the control gain in many cases)is also in the same algebra.The main result is verified by a numerical simulation.展开更多
The augmented evolution equation is established under the framework of the Variation Evolving Method(VEM)that seeks optimal solutions by solving the transformed Initial-Value Problems(IVPs).To improve the numerical pe...The augmented evolution equation is established under the framework of the Variation Evolving Method(VEM)that seeks optimal solutions by solving the transformed Initial-Value Problems(IVPs).To improve the numerical performance,its compact form is developed herein.Through replacing the states and costates variation evolution with that of the controls,the dimension-reduced Evolution Partial Differential Equation(EPDE)only solves the control variables along the variation time to get the optimal solution,and the initial conditions for the definite solution may be arbitrary.With this equation,the scale of the resulting IVPs,obtained via the semi-discrete method,is significantly reduced and they may be solved with common Ordinary Differential Equation(ODE)integration methods conveniently.Meanwhile,the state and the costate dynamics share consistent stability in the numerical computation and this avoids the intrinsic numerical difficulty as in the indirect methods.Numerical examples are solved and it is shown that the compact form evolution equation outperforms the primary form in the precision,and the efficiency may be higher for the dense discretization.Actually,it is uncovered that the compact form of the augmented evolution equation is a continuous realization of the Newton type iteration mechanism.展开更多
Realizing optimal control performance for continuum robots(CRs) poses huge challenges on traditional modelbased optimal control approaches due to their high degrees of freedom,complex nonlinear dynamics and soft conti...Realizing optimal control performance for continuum robots(CRs) poses huge challenges on traditional modelbased optimal control approaches due to their high degrees of freedom,complex nonlinear dynamics and soft continuum morphologies which are difficult to explicitly model.This paper proposes a model-free adaptive optimal control algorithm(ADAPT)for CRs.In our strategy,we consider CRs as a class of nonlinear continuous-time dynamical systems in the state space,wherein the position of the end-effector is considered as the state and the input torque is mapped as the control input.Then,the optimized Hamilton-Jacobi-Bellman(HJB) equation is derived by optimal control principles,and subsequently solved by the proposed ADAPT algorithm without requiring knowledge of the original system dynamics.Under some mild assumptions,the global stability and convergence of the closed-loop control approach are guaranteed.Several simulation experiments are conducted on a magnetic CR(MCR) to demonstrate the practicality and effectiveness of the ADAPT algorithm.展开更多
The dense integration of residential distributed photovoltaic(PV)systems into three-phase,four-wire low-voltage(LV)distribution networks results in reverse power flow and three-phase imbalance,leading to voltage viola...The dense integration of residential distributed photovoltaic(PV)systems into three-phase,four-wire low-voltage(LV)distribution networks results in reverse power flow and three-phase imbalance,leading to voltage violations that hinder the growth of rural distributed PV systems.Traditional voltage droop-based control methods regulate PV power output solely based on local voltage measurements at the point of PV connection.Due to a lack of global coordination and optimization,their efficiency is often subpar.This paper presents a centralized coordinated active/reactive power control strategy for PV inverters in rural LV distribution feeders with high PV penetration.The strategy optimizes residential PV inverter reactive and active power control to enhance voltage quality.It uses sensitivity coefficients derived from the inverse Jacobian matrix to assign adjustment weights to individual PV units and iteratively optimize their power outputs.The control sequence prioritizes reactive power increases;if the coefficients are below average or the inverters reach capacity,active power is curtailed until voltage issues are resolved.A simulation based on a real 37-node rural distribution network shows that the proposed method significantly reduces PV curtailment.Typical daily results indicate a curtailment rate of 1.47%,which is significantly lower than the 15.4%observed with the voltage droop-based control method.The total daily PV power output(measured every 15 min)increases from 5.55 to 6.41 MW,improving PV hosting capacity.展开更多
Dear Editor,This letter proposes a reinforcement learning-based predictive learning algorithm for unknown continuous-time nonlinear systems with observation loss.Firstly,we construct a temporal nonzero-sum game over p...Dear Editor,This letter proposes a reinforcement learning-based predictive learning algorithm for unknown continuous-time nonlinear systems with observation loss.Firstly,we construct a temporal nonzero-sum game over predictive control input sequences,deriving multiple optimal predictive control input sequences from its solution.展开更多
To better complete various missions, it is necessary to plan an optimal trajectory or provide the optimal control law for the multirole missile according to the actual situation, including launch conditions and target...To better complete various missions, it is necessary to plan an optimal trajectory or provide the optimal control law for the multirole missile according to the actual situation, including launch conditions and target location. Since trajectory optimization struggles to meet real-time requirements, the emergence of data-based generation methods has become a significant focus in contemporary research. However, due to the large differences in the characteristics of the optimal control laws caused by the diversity of tasks, it is difficult to achieve good prediction results by modeling all data with one single model.Therefore, the modeling idea of the mixture of experts(MoE) is adopted. Firstly, the K-means clustering algorithm is used to partition the sample data set, and the corresponding neural network classification model is established as the gate switch of MoE. Then, the expert models, i.e., the mappings from the generation conditions to the optimal control law represented by the results of principal component analysis(PCA), are represented by Kriging models. Finally, multiple rounds of accuracy evaluation, sample supplementation, and model updating are conducted to improve the generation accuracy. The Monte Carlo simulation shows that the accuracy of the proposed model reaches 96% and the generation efficiency meets the real-time requirement.展开更多
Dear Editor,In this letter,a constrained networked predictive control strategy is proposed for the optimal control problem of complex nonlinear highorder fully actuated(HOFA)systems with noises.The method can effectiv...Dear Editor,In this letter,a constrained networked predictive control strategy is proposed for the optimal control problem of complex nonlinear highorder fully actuated(HOFA)systems with noises.The method can effectively deal with nonlinearities,constraints,and noises in the system,optimize the performance metric,and present an upper bound on the stable output of the system.展开更多
Assessing the impact of anthropogenic volatile organic compounds(VOCs)on ozone(O_(3))formation is vital for themanagement of emission reduction and pollution control.Continuousmeasurement of O_(3)and the major precurs...Assessing the impact of anthropogenic volatile organic compounds(VOCs)on ozone(O_(3))formation is vital for themanagement of emission reduction and pollution control.Continuousmeasurement of O_(3)and the major precursorswas conducted in a typical light industrial city in the YRD region from 1 May to 25 July in 2021.Alkanes were the most abundant VOC group,contributing to 55.0%of TVOCs concentration(56.43±21.10 ppb).OVOCs,aromatics,halides,alkenes,and alkynes contributed 18.7%,9.6%,9.3%,5.2%and 1.9%,respectively.The observational site shifted from a typical VOC control regime to a mixed regime from May to July,which can be explained by the significant increase of RO_(x)production,resulting in the transition of environment from NOx saturation to radical saturation with respect to O_(3)production.The optimal O_(3)control strategy should be dynamically changed depending on the transition of control regime.Under NOx saturation condition,minimizing the proportion of NOx in reduction could lead to better achievement of O_(3)alleviation.Under mixed control regime,the cut percentage gets the top priority for the effectiveness of O_(3)control.Five VOCs sources were identified:temperature dependent source(28.1%),vehicular exhausts(19.9%),petrochemical industries(7.2%),solvent&gasoline usage(32.3%)and manufacturing industries(12.6%).The increase of temperature and radiation would enhance the evaporation related VOC emissions,resulting in the increase of VOC concentration and the change of RO_(x)circulation.Our results highlight determination of the optimal control strategies for O_(3)pollution in a typical YRD industrial city.展开更多
We present a robust quantum optimal control framework for implementing fast entangling gates on ion-trap quantum processors.The framework leverages tailored laser pulses to drive the multiple vibrational sidebands of ...We present a robust quantum optimal control framework for implementing fast entangling gates on ion-trap quantum processors.The framework leverages tailored laser pulses to drive the multiple vibrational sidebands of the ions to create phonon-mediated entangling gates and,unlike the state of the art,requires neither weakcoupling Lamb-Dicke approximation nor perturbation treatment.With the application of gradient-based optimal control,it enables finding amplitude-and phase-modulated laser control protocols that work without the Lamb-Dicke approximation,promising gate speeds on the order of microseconds comparable to the characteristic trap frequencies.Also,robustness requirements on the temperature of the ions and initial optical phase can be conveniently included to pursue high-quality fast gates against experimental imperfections.Our approach represents a step in speeding up quantum gates to achieve larger quantum circuits for quantum computation and simulation,and thus can find applications in near-future experiments.展开更多
Quantum optimal control(QOC)relies on accurately modeling system dynamics and is often challenged by unknown or inaccessible interactions in real systems.Taking an unknown collective spin system as an example,this wor...Quantum optimal control(QOC)relies on accurately modeling system dynamics and is often challenged by unknown or inaccessible interactions in real systems.Taking an unknown collective spin system as an example,this work introduces a machine-learning-based,data-driven scheme to overcome the challenges encountered,with a trained neural network(NN)assuming the role of a surrogate model that captures the system’s dynamics and subsequently enables QOC to be performed on the NN instead of on the real system.The trained NN surrogate proves effective for practical QOC tasks and is further demonstrated to be adaptable to different experimental conditions,remaining robust across varying system sizes and pulse durations.展开更多
This paper addresses the Singular Optimal Control Problem(SOCP)for a surface-to-air missile with limited control,fully considering aerodynamic effects with a parabolic drag polar.This problem is an extension of the ty...This paper addresses the Singular Optimal Control Problem(SOCP)for a surface-to-air missile with limited control,fully considering aerodynamic effects with a parabolic drag polar.This problem is an extension of the typical Goddard problem.First,the classical Legendre-Clebsch condition is applied to derive optimal conditions for the singular angle of attack,revealing that the missile turns by gravity along the singular arc.Second,the higher-order differentiation of the switching function provides the necessary conditions to determine the optimal thrust,expressed as linear functions of the costate variables.The vanishing coefficient determinant is then employed to decouple the control and costate variables,yielding the singular thrust solely dependent on state variables and identifying the singular surface.Moreover,the analytical singular control can be regarded as path constraints subject to the typical Optimal Control Problem(OCP),enabling the GPOPS-Ⅱ,a direct method framework that does not involve the singular condition,to solve the SOCP.Finally,three cases with different structures are presented to evaluate the performance of the proposed method.The results show that it takes a few steps to obtain the numerical optimal solution,which is consistent with the analytical solution derived from the calculus of variations,highlighting its great computational accuracy and effectiveness.展开更多
The co-infection of corona and influenza viruses has emerged as a significant threat to global public health due to their shared modes of transmission and overlapping clinical symptoms.This article presents a novel ma...The co-infection of corona and influenza viruses has emerged as a significant threat to global public health due to their shared modes of transmission and overlapping clinical symptoms.This article presents a novel mathematical model that addresses the dynamics of this co-infection by extending the SEIR(Susceptible-Exposed-Infectious-Recovered)framework to incorporate treatment and hospitalization compartments.The population is divided into eight compartments,with infectious individuals further categorized into influenza infectious,corona infectious,and co-infection cases.The proposed mathematical model is constrained to adhere to fundamental epidemiological properties,such as non-negativity and boundedness within a feasible region.Additionally,the model is demonstrated to be well-posed with a unique solution.Equilibrium points,including the disease-free and endemic equilibria,are identified,and various properties related to these equilibrium points,such as the basic reproduction number,are determined.Local and global sensitivity analyses are performed to identify the parameters that highly influence disease dynamics and the reproduction number.Knowing the most influential parameters is crucial for understanding their impact on the co-infection’s spread and severity.Furthermore,an optimal control problem is defined to minimize disease transmission and to control strategy costs.The purpose of our study is to identify the most effective(optimal)control strategies for mitigating the spread of the co-infection with minimum cost of the controls.The results illustrate the effectiveness of the implemented control strategies in managing the co-infection’s impact on the population’s health.This mathematical modeling and control strategy framework provides valuable tools for understanding and combating the dual threat of corona and influenza co-infection,helping public health authorities and policymakers make informed decisions in the face of these intertwined epidemics.展开更多
The electromagnetic levitation system(EMLS)serves as the most important part of any magnetic levitation system.However,its characteristics are defined by its highly nonlinear dynamics and instability.Furthermore,the u...The electromagnetic levitation system(EMLS)serves as the most important part of any magnetic levitation system.However,its characteristics are defined by its highly nonlinear dynamics and instability.Furthermore,the uncertainties in the dynamics of an electromagnetic levitation system make the controller design more difficult.Therefore,it is necessary to design a robust control law that will ensure the system’s stability in the presence of these uncertainties.In this framework,the dynamics of an electromagnetic levitation system are addressed in terms of matched and unmatched uncertainties.The robust control problem is translated into the optimal control problem,where the uncertainties of the electromagnetic levitation system are directly reflected in the cost function.The optimal control method is used to solve the robust control problem.The solution to the optimal control problem for the electromagnetic levitation system is indeed a solution to the robust control problem of the electromagnetic levitation system under matched and unmatched uncertainties.The simulation and experimental results demonstrate the performance of the designed control scheme.The performance indices such as integral absolute error(IAE),integral square error(ISE),integral time absolute error(ITAE),and integral time square error(ITSE)are compared for both uncertainties to showcase the robustness of the designed control scheme.展开更多
基金the State Science and Technology Project of China (No.2001BA204B01).
文摘An iterative optimization strategy is proposed and applied to the steady state optimizing control of the bio-dissimilation process of glycerol to 1,3-propanediol in the presence of model-plant mismatch and input constraints. The scheme is based on the Augmented Integrated System Optimization and Parameter Estimation (AI- SOPE) technique, but a linearization of some performance function in the modified model-based optimization problem of AISOPE is introduced to overcome the difficulty of determining an appropriate penalty parameter. When carrying out the iterative optimization, the penalty coefficient is set to a larger value at the current iteration than at the previous iteration, which can promote the evolution rate of the iterative optimization. Simulation studies illustrate the potential ofthe approach presented for the optimizing control of the bioTdissimilation process of glycerol to 1,3-propanediol. The effects of measurement noise, measured and unmeasured disturbances on the proposed algorithm are also investigated.
基金Supported by the National Program on Key Basic Research Program(″973″Program)(2012CB026000)the Ph.D.Programs Foundation of Ministry of Education of China(20110010110009)
文摘The aim of this work is to analyze and design a control system for vibration reduction in a rotor system using a shear mode magnetorheological fluid(MRF)damper.A dynamic model of the MRF damper-rotor system was built and simulated in Matlab/Simulink to analyze the rotor vibration characteristics and the vibration reduction effect of the MRF damper.Based on the numerical simulation analysis,an optimizing control strategy using pattern search method was proposed and designed.The control system was constructed on a test rotor bench and experiment validations on the effectiveness of the proposed control strategy were conducted.Experimental results show that rotor vibration caused by unbalance can be well controlled whether in resonance region(70%)or in non-resonance region(30%).An irregular vibration amplitude jump can be suppressed with the optimization strategy.Furthermore,it is found that the rapidity of transient response and efficiency of optimizing technique depend on the pattern search step.The presented strategies and control system can be extended to multi-span(more than two or three spans)rotor system.It provides a powerful technical support for the extension and application in target and control for shafting vibration.
基金the financial support from the Fundamental Research Funds for the Central universities of China (No. 2009KH07)
文摘Coal flotation is widely used to separate commercially valuable coal from the fine ore slurry, and is an industrial process with nonlinear, multivariable, time-varying and long time-delay characteristics. The online detection of ash content of products as the operation performance evaluation in the flotation system is extraordinarily difficult because of the low solid content and numerous micro-bubbles in the slurry. Moreover, it is time-consuming by manual analysis. Consequently, the optimal separation is not usually maintained. A novel technique, called the neuro-immune algorithm (NIA) inspired by the biological nervous and immune systems, is presented in this paper for predicting the ash content of clean coal and performing the optimizing control to the coal flotation system. The proposed algorithm integrates the deeply-studied artificial neural network (ANN) and the developing artificial immune system (AIS). A two-layer back-propagation network was constructed offline based on the historical process data under the best system situation, using five parameters: the flow and the density of raw slurry, the input flows of water, the kerosene and the GF oil, as the inputs and the ash content of clean coal as the output. The immune cell of AIS is made up of six parameters above as the antigen. The cytokine based clone selection algorithm is used to produce the relative antibody. The detailed computation procedures about the hybrid neuro-immune algorithm are minutely discussed. The ash content of clean coal was predicted by NIA using the practical process data s: (308.6 174.7 146.1 43.6 4.0 9.4), and the absolute difference between the actual and computed ash content values was 0.0967%. The optimizing control on NIA was simulated considering two different situations where the ash content of clean coal was controlled downward from 10.00% or upward from 9.20% predicted by ANN to the target value 9.50%. The results indicate that the target ash content and the value of controlling parameters are obtained after several control cycles.
基金Provincial Outstanding Youth Foundation of Heilongjiang, China.
文摘Green sand casting is still a main method in the world at present and it isvery significant to develop the technology of controlling green sand quality. A new concept, fromcontents test to contents control, is advanced. In order to realize the new idea, a new method toon-line test active clay and moisture of green sand - double powers energizing alternately (DPEA)method is put forwards. The principle of the new method is to energize standard sand sample with ACand DC powers and to test the electric parameters, and then, to calculate active clay and moistureof green sand by using artificial neural network (ANN). Based on this new method, a directoptimizing system for controlling green sand quality is developed. Techniques about testing andcontrolling methods, hardware and software are discussed.
基金Supported in part by Natural Science Foundation of Guangxi(2023GXNSFAA026246)in part by the Central Government's Guide to Local Science and Technology Development Fund(GuikeZY23055044)in part by the National Natural Science Foundation of China(62363003)。
文摘In this paper,we consider the maximal positive definite solution of the nonlinear matrix equation.By using the idea of Algorithm 2.1 in ZHANG(2013),a new inversion-free method with a stepsize parameter is proposed to obtain the maximal positive definite solution of nonlinear matrix equation X+A^(*)X|^(-α)A=Q with the case 0<α≤1.Based on this method,a new iterative algorithm is developed,and its convergence proof is given.Finally,two numerical examples are provided to show the effectiveness of the proposed method.
基金supported in part by the National Science and Technology Council under Grant NSTC 114-2221-E-027-104.
文摘Trajectory tracking for nonlinear robotic systems remains a fundamental yet challenging problem in control engineering,particularly when both precision and efficiency must be ensured.Conventional control methods are often effective for stabilization but may not directly optimize long-term performance.To address this limitation,this study develops an integrated framework that combines optimal control principles with reinforcement learning for a single-link robotic manipulator.The proposed scheme adopts an actor–critic structure,where the critic network approximates the value function associated with the Hamilton–Jacobi–Bellman equation,and the actor network generates near-optimal control signals in real time.This dual adaptation enables the controller to refine its policy online without explicit system knowledge.Stability of the closed-loop system is analyzed through Lyapunov theory,ensuring boundedness of the tracking error.Numerical simulations on the single-link manipulator demonstrate that themethod achieves accurate trajectory followingwhile maintaining lowcontrol effort.The results further showthat the actor–critic learning mechanism accelerates convergence of the control policy compared with conventional optimization-based strategies.This work highlights the potential of reinforcement learning integrated with optimal control for robotic manipulators and provides a foundation for future extensions to more complex multi-degree-of-freedom systems.The proposed controller is further validated in a physics-based virtual Gazebo environment,demonstrating stable adaptation and real-time feasibility.
文摘Dear Editor,In this letter,we focus on the algebraic relationship between the coefficient matrices and the solution of the stochastic algebraic Riccati equation.It is revealed that,if the coefficient matrices are in an algebra,then the solution(and also the control gain in many cases)is also in the same algebra.The main result is verified by a numerical simulation.
基金supported by the National Nature Science Foundation of China under Grant No.11902332。
文摘The augmented evolution equation is established under the framework of the Variation Evolving Method(VEM)that seeks optimal solutions by solving the transformed Initial-Value Problems(IVPs).To improve the numerical performance,its compact form is developed herein.Through replacing the states and costates variation evolution with that of the controls,the dimension-reduced Evolution Partial Differential Equation(EPDE)only solves the control variables along the variation time to get the optimal solution,and the initial conditions for the definite solution may be arbitrary.With this equation,the scale of the resulting IVPs,obtained via the semi-discrete method,is significantly reduced and they may be solved with common Ordinary Differential Equation(ODE)integration methods conveniently.Meanwhile,the state and the costate dynamics share consistent stability in the numerical computation and this avoids the intrinsic numerical difficulty as in the indirect methods.Numerical examples are solved and it is shown that the compact form evolution equation outperforms the primary form in the precision,and the efficiency may be higher for the dense discretization.Actually,it is uncovered that the compact form of the augmented evolution equation is a continuous realization of the Newton type iteration mechanism.
基金supported in part by the Innovation and Technology Commission of Hong Kong,China(ITS/136/20,ITS/234/21,MHP/096/22,ITS/235/22)Multi-Scale Medical Robotics Center,InnoHK,China(8312051)+1 种基金Research Grants Council(RGC) of Hong Kong,China(CUHK 14217822,CUHK14207823,AoE/E-407/24-N)The Chinese University of Hong Kong(CUHK) Direct Grant。
文摘Realizing optimal control performance for continuum robots(CRs) poses huge challenges on traditional modelbased optimal control approaches due to their high degrees of freedom,complex nonlinear dynamics and soft continuum morphologies which are difficult to explicitly model.This paper proposes a model-free adaptive optimal control algorithm(ADAPT)for CRs.In our strategy,we consider CRs as a class of nonlinear continuous-time dynamical systems in the state space,wherein the position of the end-effector is considered as the state and the input torque is mapped as the control input.Then,the optimized Hamilton-Jacobi-Bellman(HJB) equation is derived by optimal control principles,and subsequently solved by the proposed ADAPT algorithm without requiring knowledge of the original system dynamics.Under some mild assumptions,the global stability and convergence of the closed-loop control approach are guaranteed.Several simulation experiments are conducted on a magnetic CR(MCR) to demonstrate the practicality and effectiveness of the ADAPT algorithm.
基金supported by the Provincial Industrial Science and Technology Project of State Grid Jiangsu Electric Power Co.,Ltd.of China,grant number JC2024118.
文摘The dense integration of residential distributed photovoltaic(PV)systems into three-phase,four-wire low-voltage(LV)distribution networks results in reverse power flow and three-phase imbalance,leading to voltage violations that hinder the growth of rural distributed PV systems.Traditional voltage droop-based control methods regulate PV power output solely based on local voltage measurements at the point of PV connection.Due to a lack of global coordination and optimization,their efficiency is often subpar.This paper presents a centralized coordinated active/reactive power control strategy for PV inverters in rural LV distribution feeders with high PV penetration.The strategy optimizes residential PV inverter reactive and active power control to enhance voltage quality.It uses sensitivity coefficients derived from the inverse Jacobian matrix to assign adjustment weights to individual PV units and iteratively optimize their power outputs.The control sequence prioritizes reactive power increases;if the coefficients are below average or the inverters reach capacity,active power is curtailed until voltage issues are resolved.A simulation based on a real 37-node rural distribution network shows that the proposed method significantly reduces PV curtailment.Typical daily results indicate a curtailment rate of 1.47%,which is significantly lower than the 15.4%observed with the voltage droop-based control method.The total daily PV power output(measured every 15 min)increases from 5.55 to 6.41 MW,improving PV hosting capacity.
基金supported by the National Natural Science Foundation of China(62433014,62373287,62573324,62333005,62273255)in part by the International Exchange Program for Graduate Students of Tongji University(4360143306)+3 种基金in part by the Fundamental Research Funds for Central Universities(22120230311)supported by DeutscheForschungsgemeinschaft(DFG,German Research Foundation)under Germany’s Excellence Strategy(EXC 2075390740016,468094890)support by the Stuttgart Center for Simulation Science(SimTech)the International Max Planck Research School for Intelligent Systems(IMPRS-IS)for supporting Y.Xie。
文摘Dear Editor,This letter proposes a reinforcement learning-based predictive learning algorithm for unknown continuous-time nonlinear systems with observation loss.Firstly,we construct a temporal nonzero-sum game over predictive control input sequences,deriving multiple optimal predictive control input sequences from its solution.
基金Defense Industrial Technology Development Program (JCKY2020204B016)National Natural Science Foundation of China (92471206)。
文摘To better complete various missions, it is necessary to plan an optimal trajectory or provide the optimal control law for the multirole missile according to the actual situation, including launch conditions and target location. Since trajectory optimization struggles to meet real-time requirements, the emergence of data-based generation methods has become a significant focus in contemporary research. However, due to the large differences in the characteristics of the optimal control laws caused by the diversity of tasks, it is difficult to achieve good prediction results by modeling all data with one single model.Therefore, the modeling idea of the mixture of experts(MoE) is adopted. Firstly, the K-means clustering algorithm is used to partition the sample data set, and the corresponding neural network classification model is established as the gate switch of MoE. Then, the expert models, i.e., the mappings from the generation conditions to the optimal control law represented by the results of principal component analysis(PCA), are represented by Kriging models. Finally, multiple rounds of accuracy evaluation, sample supplementation, and model updating are conducted to improve the generation accuracy. The Monte Carlo simulation shows that the accuracy of the proposed model reaches 96% and the generation efficiency meets the real-time requirement.
基金supported in part by the National Natural Science Foundation of China(62173255,62188101)Shenzhen Key Laboratory of Control Theory and Intelligent Systems(ZDSYS20220330161800001)
文摘Dear Editor,In this letter,a constrained networked predictive control strategy is proposed for the optimal control problem of complex nonlinear highorder fully actuated(HOFA)systems with noises.The method can effectively deal with nonlinearities,constraints,and noises in the system,optimize the performance metric,and present an upper bound on the stable output of the system.
基金supported by the National Natural Science Foundation of China(Nos.42005086,91844301,and 41805100)the National Key Research and Development Programof China(No.2022YFC3703500)+2 种基金China Postdoctoral Science Foundation(No.2023M733028)the Key Research and Development Program of Zhejiang Province(Nos.2021C03165 and 2022C03084)the Ecological and Environmental Scientific Research and Achievement Promotion Project of Zhejiang Province(No.2020HT0048).
文摘Assessing the impact of anthropogenic volatile organic compounds(VOCs)on ozone(O_(3))formation is vital for themanagement of emission reduction and pollution control.Continuousmeasurement of O_(3)and the major precursorswas conducted in a typical light industrial city in the YRD region from 1 May to 25 July in 2021.Alkanes were the most abundant VOC group,contributing to 55.0%of TVOCs concentration(56.43±21.10 ppb).OVOCs,aromatics,halides,alkenes,and alkynes contributed 18.7%,9.6%,9.3%,5.2%and 1.9%,respectively.The observational site shifted from a typical VOC control regime to a mixed regime from May to July,which can be explained by the significant increase of RO_(x)production,resulting in the transition of environment from NOx saturation to radical saturation with respect to O_(3)production.The optimal O_(3)control strategy should be dynamically changed depending on the transition of control regime.Under NOx saturation condition,minimizing the proportion of NOx in reduction could lead to better achievement of O_(3)alleviation.Under mixed control regime,the cut percentage gets the top priority for the effectiveness of O_(3)control.Five VOCs sources were identified:temperature dependent source(28.1%),vehicular exhausts(19.9%),petrochemical industries(7.2%),solvent&gasoline usage(32.3%)and manufacturing industries(12.6%).The increase of temperature and radiation would enhance the evaporation related VOC emissions,resulting in the increase of VOC concentration and the change of RO_(x)circulation.Our results highlight determination of the optimal control strategies for O_(3)pollution in a typical YRD industrial city.
基金supported by the National Natural Science Foundation of China(Grant Nos.12441502,12122506,12204230,and 12404554)the National Science and Technology Major Project of the Ministry of Science and Technology of China(2024ZD0300404)+6 种基金Guangdong Basic and Applied Basic Research Foundation(Grant No.2021B1515020070)Shenzhen Science and Technology Program(Grant No.RCYX20200714114522109)China Postdoctoral Science Foundation(CPSF)(2024M762114)Postdoctoral Fellowship Program of CPSF(GZC20231727)supported by the National Natural Science Foundation of China(Grant Nos.92165206 and 11974330)Innovation Program for Quantum Science and Technology(Grant No.2021ZD0301603)the Fundamental Research Funds for the Central Universities。
文摘We present a robust quantum optimal control framework for implementing fast entangling gates on ion-trap quantum processors.The framework leverages tailored laser pulses to drive the multiple vibrational sidebands of the ions to create phonon-mediated entangling gates and,unlike the state of the art,requires neither weakcoupling Lamb-Dicke approximation nor perturbation treatment.With the application of gradient-based optimal control,it enables finding amplitude-and phase-modulated laser control protocols that work without the Lamb-Dicke approximation,promising gate speeds on the order of microseconds comparable to the characteristic trap frequencies.Also,robustness requirements on the temperature of the ions and initial optical phase can be conveniently included to pursue high-quality fast gates against experimental imperfections.Our approach represents a step in speeding up quantum gates to achieve larger quantum circuits for quantum computation and simulation,and thus can find applications in near-future experiments.
基金supported by the Innovation Program for Quantum Science and Technology(Grant No.2021ZD0302100)the National Natural Science Foundation of China(Grant Nos.12361131576,92265205,and 92476205).
文摘Quantum optimal control(QOC)relies on accurately modeling system dynamics and is often challenged by unknown or inaccessible interactions in real systems.Taking an unknown collective spin system as an example,this work introduces a machine-learning-based,data-driven scheme to overcome the challenges encountered,with a trained neural network(NN)assuming the role of a surrogate model that captures the system’s dynamics and subsequently enables QOC to be performed on the NN instead of on the real system.The trained NN surrogate proves effective for practical QOC tasks and is further demonstrated to be adaptable to different experimental conditions,remaining robust across varying system sizes and pulse durations.
基金co-supported by the National Natural Science Foundation of China(No.62003019)the Young Talents Support Program of Beihang University,China(No.YWF21-BJ-J-1180)。
文摘This paper addresses the Singular Optimal Control Problem(SOCP)for a surface-to-air missile with limited control,fully considering aerodynamic effects with a parabolic drag polar.This problem is an extension of the typical Goddard problem.First,the classical Legendre-Clebsch condition is applied to derive optimal conditions for the singular angle of attack,revealing that the missile turns by gravity along the singular arc.Second,the higher-order differentiation of the switching function provides the necessary conditions to determine the optimal thrust,expressed as linear functions of the costate variables.The vanishing coefficient determinant is then employed to decouple the control and costate variables,yielding the singular thrust solely dependent on state variables and identifying the singular surface.Moreover,the analytical singular control can be regarded as path constraints subject to the typical Optimal Control Problem(OCP),enabling the GPOPS-Ⅱ,a direct method framework that does not involve the singular condition,to solve the SOCP.Finally,three cases with different structures are presented to evaluate the performance of the proposed method.The results show that it takes a few steps to obtain the numerical optimal solution,which is consistent with the analytical solution derived from the calculus of variations,highlighting its great computational accuracy and effectiveness.
基金supported by NASA Oklahoma Established Program to Stimulate Competitive Research(EPSCoR)Infrastructure Development,“Machine Learning Ocean World Biosignature Detection from Mass Spec”(PI:BrettMcKinney),Grant No.80NSSC24M0109Tandy School of Computer Science,University of Tulsa.
文摘The co-infection of corona and influenza viruses has emerged as a significant threat to global public health due to their shared modes of transmission and overlapping clinical symptoms.This article presents a novel mathematical model that addresses the dynamics of this co-infection by extending the SEIR(Susceptible-Exposed-Infectious-Recovered)framework to incorporate treatment and hospitalization compartments.The population is divided into eight compartments,with infectious individuals further categorized into influenza infectious,corona infectious,and co-infection cases.The proposed mathematical model is constrained to adhere to fundamental epidemiological properties,such as non-negativity and boundedness within a feasible region.Additionally,the model is demonstrated to be well-posed with a unique solution.Equilibrium points,including the disease-free and endemic equilibria,are identified,and various properties related to these equilibrium points,such as the basic reproduction number,are determined.Local and global sensitivity analyses are performed to identify the parameters that highly influence disease dynamics and the reproduction number.Knowing the most influential parameters is crucial for understanding their impact on the co-infection’s spread and severity.Furthermore,an optimal control problem is defined to minimize disease transmission and to control strategy costs.The purpose of our study is to identify the most effective(optimal)control strategies for mitigating the spread of the co-infection with minimum cost of the controls.The results illustrate the effectiveness of the implemented control strategies in managing the co-infection’s impact on the population’s health.This mathematical modeling and control strategy framework provides valuable tools for understanding and combating the dual threat of corona and influenza co-infection,helping public health authorities and policymakers make informed decisions in the face of these intertwined epidemics.
文摘The electromagnetic levitation system(EMLS)serves as the most important part of any magnetic levitation system.However,its characteristics are defined by its highly nonlinear dynamics and instability.Furthermore,the uncertainties in the dynamics of an electromagnetic levitation system make the controller design more difficult.Therefore,it is necessary to design a robust control law that will ensure the system’s stability in the presence of these uncertainties.In this framework,the dynamics of an electromagnetic levitation system are addressed in terms of matched and unmatched uncertainties.The robust control problem is translated into the optimal control problem,where the uncertainties of the electromagnetic levitation system are directly reflected in the cost function.The optimal control method is used to solve the robust control problem.The solution to the optimal control problem for the electromagnetic levitation system is indeed a solution to the robust control problem of the electromagnetic levitation system under matched and unmatched uncertainties.The simulation and experimental results demonstrate the performance of the designed control scheme.The performance indices such as integral absolute error(IAE),integral square error(ISE),integral time absolute error(ITAE),and integral time square error(ITSE)are compared for both uncertainties to showcase the robustness of the designed control scheme.