In this paper,we consider the maximal positive definite solution of the nonlinear matrix equation.By using the idea of Algorithm 2.1 in ZHANG(2013),a new inversion-free method with a stepsize parameter is proposed to ...In this paper,we consider the maximal positive definite solution of the nonlinear matrix equation.By using the idea of Algorithm 2.1 in ZHANG(2013),a new inversion-free method with a stepsize parameter is proposed to obtain the maximal positive definite solution of nonlinear matrix equation X+A^(*)X|^(-α)A=Q with the case 0<α≤1.Based on this method,a new iterative algorithm is developed,and its convergence proof is given.Finally,two numerical examples are provided to show the effectiveness of the proposed method.展开更多
In recent years,three-dimensional reconstruction technologies that employ multiple cameras have continued to evolve significantly,enabling remote collaboration among users in extended Reality(XR)environments.In additi...In recent years,three-dimensional reconstruction technologies that employ multiple cameras have continued to evolve significantly,enabling remote collaboration among users in extended Reality(XR)environments.In addition,methods for deploying multiple cameras for motion capture of users(e.g.,performers)are widely used in computer graphics.As the need to minimize and optimize the number of cameras grows to reduce costs,various technologies and research approaches focused on Optimal Camera Placement(OCP)are continually being proposed.However,as most existing studies assume homogeneous camera setups,there is a growing demand for studies on heterogeneous camera setups.For instance,technical demands keep emerging in scenarios with minimal camera configurations,especially regarding cost factors,the physical placement of cameras given the spatial structure,and image capture strategies for heterogeneous cameras,such as high-resolution RGB cameras and depth cameras.In this study,we propose a pre-visualization and simulation method for the optimal placement of heterogeneous cameras in XR environments,accounting for both the specifications of heterogeneous cameras(e.g.,field of view)and the physical configuration(e.g.,wall configuration)in real-world spaces.The proposed method performs a visibility analysis of cameras by considering each camera’s field-of-view volume,resolution,and unique characteristics,along with physicalspace constraints.This approach enables the optimal position and rotation of each camera to be recommended,along with the minimum number of cameras required.In the results of our study conducted in heterogeneous camera combinations,the proposed method achieved 81.7%~82.7%coverage of the target visual information using only 2~3 cameras.In contrast,single(or homogeneous)-typed cameras were required to use 11 cameras for 81.6%coverage.Accordingly,we found that camera deployment resources can be reduced with the proposed approaches.展开更多
Virtual power plant(VPP)integrates a variety of distributed renewable energy and energy storage to participate in electricity market transactions,promote the consumption of renewable energy,and improve economic effici...Virtual power plant(VPP)integrates a variety of distributed renewable energy and energy storage to participate in electricity market transactions,promote the consumption of renewable energy,and improve economic efficiency.In this paper,aiming at the uncertainty of distributed wind power and photovoltaic output,considering the coupling relationship between power,carbon trading,and green cardmarket,the optimal operationmodel and bidding scheme of VPP in spot market,carbon trading market,and green card market are established.On this basis,through the Shapley value and independent risk contribution theory in cooperative game theory,the quantitative analysis of the total income and risk contribution of various distributed resources in the virtual power plant is realized.Moreover,the scheduling strategies of virtual power plants under different risk preferences are systematically compared,and the feasibility and accuracy of the combination of Shapley value and independent risk contribution theory in ensuring fair income distribution and reasonable risk assessment are emphasized.A comprehensive solution for virtual power plants in the multi-market environment is constructed,which integrates operation strategy,income distribution mechanism,and risk control system into a unified analysis framework.Through the simulation of multi-scenario examples,the CPLEXsolver inMATLAB software is used to optimize themodel.The proposed joint optimization scheme can increase the profit of VPP participating in carbon trading and green certificate market by 29%.The total revenue of distributed resources managed by VPP is 9%higher than that of individual participation.展开更多
Trajectory tracking for nonlinear robotic systems remains a fundamental yet challenging problem in control engineering,particularly when both precision and efficiency must be ensured.Conventional control methods are o...Trajectory tracking for nonlinear robotic systems remains a fundamental yet challenging problem in control engineering,particularly when both precision and efficiency must be ensured.Conventional control methods are often effective for stabilization but may not directly optimize long-term performance.To address this limitation,this study develops an integrated framework that combines optimal control principles with reinforcement learning for a single-link robotic manipulator.The proposed scheme adopts an actor–critic structure,where the critic network approximates the value function associated with the Hamilton–Jacobi–Bellman equation,and the actor network generates near-optimal control signals in real time.This dual adaptation enables the controller to refine its policy online without explicit system knowledge.Stability of the closed-loop system is analyzed through Lyapunov theory,ensuring boundedness of the tracking error.Numerical simulations on the single-link manipulator demonstrate that themethod achieves accurate trajectory followingwhile maintaining lowcontrol effort.The results further showthat the actor–critic learning mechanism accelerates convergence of the control policy compared with conventional optimization-based strategies.This work highlights the potential of reinforcement learning integrated with optimal control for robotic manipulators and provides a foundation for future extensions to more complex multi-degree-of-freedom systems.The proposed controller is further validated in a physics-based virtual Gazebo environment,demonstrating stable adaptation and real-time feasibility.展开更多
In this paper,we propose and analyze two second-order accurate finite difference schemes for the one-dimensional heat equation with concentrated capacity on a computa-tional domain=[a,b].We first transform the target ...In this paper,we propose and analyze two second-order accurate finite difference schemes for the one-dimensional heat equation with concentrated capacity on a computa-tional domain=[a,b].We first transform the target equation into the standard heat equation on the domain excluding the singular point equipped with an inner interface matching(IIM)condition on the singular point x=ξ∈(a,b),then adopt Taylor’s ex-pansion to approximate the IIM condition at the singular point and apply second-order finite difference method to approximate the standard heat equation at the nonsingular points.This discrete procedure allows us to choose different grid sizes to partition the two sub-domains[a,ξ]and[ξ,b],which ensures that x=ξ is a grid point,and hence the pro-posed schemes can be generalized to the heat equation with more than one concentrated capacities.We prove that the two proposed schemes are uniquely solvable.And through in-depth analysis of the local truncation errors,we rigorously prove that the two schemes are second-order accurate both in temporal and spatial directions in the maximum norm without any constraint on the grid ratio.Numerical experiments are carried out to verify our theoretical conclusions.展开更多
This paper investigates the edge-based dynamic event-triggered inverse optimal formation control problem for multiple quadrotor unmanned aerial vehicles(QUAVs) with attitude constraints. To improve communication effic...This paper investigates the edge-based dynamic event-triggered inverse optimal formation control problem for multiple quadrotor unmanned aerial vehicles(QUAVs) with attitude constraints. To improve communication efficiency, an edge-based dynamic event-triggered mechanism is developed for the communication channels between neighboring QUAVs. However, this edge-based dynamic event-triggered communication(DETC) may cause discontinuities in the reference signals. To solve this problem, a distributed estimator is designed for each QUAV to obtain the leader's output signals. Considering the safety of QUAV formation flying, this paper designs a function transformation method that constrains the attitudes of the QUAVs to a strictly safe region. Furthermore, an inverse optimal control strategy is proposed based on the backstepping methodology. This scheme not only minimizes the cost function but also avoids the necessity of solving the Hamilton-Jacobi-Bellman equation. Finally, the stability of the QUAV systems is proven using Lyapunov theory, and the effectiveness of the proposed control method is verified through simulation.展开更多
We present a robust quantum optimal control framework for implementing fast entangling gates on ion-trap quantum processors.The framework leverages tailored laser pulses to drive the multiple vibrational sidebands of ...We present a robust quantum optimal control framework for implementing fast entangling gates on ion-trap quantum processors.The framework leverages tailored laser pulses to drive the multiple vibrational sidebands of the ions to create phonon-mediated entangling gates and,unlike the state of the art,requires neither weakcoupling Lamb-Dicke approximation nor perturbation treatment.With the application of gradient-based optimal control,it enables finding amplitude-and phase-modulated laser control protocols that work without the Lamb-Dicke approximation,promising gate speeds on the order of microseconds comparable to the characteristic trap frequencies.Also,robustness requirements on the temperature of the ions and initial optical phase can be conveniently included to pursue high-quality fast gates against experimental imperfections.Our approach represents a step in speeding up quantum gates to achieve larger quantum circuits for quantum computation and simulation,and thus can find applications in near-future experiments.展开更多
Quantum optimal control(QOC)relies on accurately modeling system dynamics and is often challenged by unknown or inaccessible interactions in real systems.Taking an unknown collective spin system as an example,this wor...Quantum optimal control(QOC)relies on accurately modeling system dynamics and is often challenged by unknown or inaccessible interactions in real systems.Taking an unknown collective spin system as an example,this work introduces a machine-learning-based,data-driven scheme to overcome the challenges encountered,with a trained neural network(NN)assuming the role of a surrogate model that captures the system’s dynamics and subsequently enables QOC to be performed on the NN instead of on the real system.The trained NN surrogate proves effective for practical QOC tasks and is further demonstrated to be adaptable to different experimental conditions,remaining robust across varying system sizes and pulse durations.展开更多
In the wireless energy transmission service composition optimization problem,a key challenge is accurately capturing users’preferences for service criteria under complex influencing factors,and optimally selecting a ...In the wireless energy transmission service composition optimization problem,a key challenge is accurately capturing users’preferences for service criteria under complex influencing factors,and optimally selecting a composition solution under their budget constraints.Existing studies typically evaluate satisfaction solely based on energy transmission capacity,while overlooking critical factors such as price and trustworthiness of the provider,leading to a mismatch between optimization outcomes and user needs.To address this gap,we construct a user satisfaction evaluation model for multi-user and multi-provider scenarios,systematically incorporating service price,transmission capacity,and trustworthiness into the satisfaction assessment framework.Furthermore,we propose a Budget-Aware Preference Adjustment Model that predicts users’baseline preference weights from historical data and dynamically adjusts them according to budget levels,thereby reflecting user preferences more realistically under varying budget constraints.In addition,to tackle the composition optimization problem,we develop a ReflectiveEvolutionary Large Language Model—Guided Ant Colony Optimization algorithm,which leverages the reflective evolution capability of large language models to iteratively generate and refine heuristic information that guides the search process.Experimental results demonstrate that the proposed framework effectively integrates personalized preferences with budget sensitivity,accurately predicts users’preferences,and significantly enhances their satisfaction under complex constraints.展开更多
Ensuring reliable power supply in urban distribution networks is a complex and critical task.To address the increased demand during extreme scenarios,this paper proposes an optimal dispatch strategy that considers the...Ensuring reliable power supply in urban distribution networks is a complex and critical task.To address the increased demand during extreme scenarios,this paper proposes an optimal dispatch strategy that considers the coordination with virtual power plants(VPPs).The proposed strategy improves systemflexibility and responsiveness by optimizing the power adjustment of flexible resources.In the proposed strategy,theGaussian Process Regression(GPR)is firstly employed to determine the adjustable range of aggregated power within the VPP,facilitating an assessment of its potential contribution to power supply support.Then,an optimal dispatch model based on a leader-follower game is developed to maximize the benefits of the VPP and flexible resources while guaranteeing the power balance at the same time.To solve the proposed optimal dispatch model efficiently,the constraints of the problem are reformulated and resolved using the Karush-Kuhn-Tucker(KKT)optimality conditions and linear programming duality theorem.The effectiveness of the strategy is illustrated through a detailed case study.展开更多
This paper addresses the Singular Optimal Control Problem(SOCP)for a surface-to-air missile with limited control,fully considering aerodynamic effects with a parabolic drag polar.This problem is an extension of the ty...This paper addresses the Singular Optimal Control Problem(SOCP)for a surface-to-air missile with limited control,fully considering aerodynamic effects with a parabolic drag polar.This problem is an extension of the typical Goddard problem.First,the classical Legendre-Clebsch condition is applied to derive optimal conditions for the singular angle of attack,revealing that the missile turns by gravity along the singular arc.Second,the higher-order differentiation of the switching function provides the necessary conditions to determine the optimal thrust,expressed as linear functions of the costate variables.The vanishing coefficient determinant is then employed to decouple the control and costate variables,yielding the singular thrust solely dependent on state variables and identifying the singular surface.Moreover,the analytical singular control can be regarded as path constraints subject to the typical Optimal Control Problem(OCP),enabling the GPOPS-Ⅱ,a direct method framework that does not involve the singular condition,to solve the SOCP.Finally,three cases with different structures are presented to evaluate the performance of the proposed method.The results show that it takes a few steps to obtain the numerical optimal solution,which is consistent with the analytical solution derived from the calculus of variations,highlighting its great computational accuracy and effectiveness.展开更多
Dear Editor,This letter considers the problem of achieving optimal formation control in multiple vertical take-off and landing(VTOL)unmanned aerial vehicles(UAVs).Specifically,the objective is to derive the vehicles t...Dear Editor,This letter considers the problem of achieving optimal formation control in multiple vertical take-off and landing(VTOL)unmanned aerial vehicles(UAVs).Specifically,the objective is to derive the vehicles to the desired formation shape while minimizing the total cost function.Leveraging the backstepping design,a distributed control strategy is proposed that incorporates a dynamic system for generating a reference trajectory and a trajectory tracking controller for each vehicle.展开更多
This paper proposes an optimal midcourse guidance method for dual pulse air-to-air missiles,which is based on the framework of the linear Gauss pseudospectral model predictive control method.Firstly,a multistage optim...This paper proposes an optimal midcourse guidance method for dual pulse air-to-air missiles,which is based on the framework of the linear Gauss pseudospectral model predictive control method.Firstly,a multistage optimal control problem with unspecified terminal time is formulated.Secondly,the control and terminal time update formulas are derived analytically.In contrast to previous work,the derivation process fully considers the Hamiltonian function corresponding to the unspecified terminal time,which is coupled with control,state,and costate.On the assumption of small perturbation,a special algebraic equation is provided to represent the equivalent optimal condition for the terminal time.Also,using Gauss pseudospectral collocation,error propagation dynamical equations involving the first-order correction term of the terminal time are transformed into a set of algebraic equations.Furthermore,analytical modification formulas can be derived by associating those equations and optimal conditions to eliminate terminal error and approach nonlinear optimal control.Even with their mathematical complexity,these formulas produce more accurate control and terminal time corrections and remove reliance on task-related parameters.Finally,several numerical simulations,comparisons with typical methods,and Monte Carlo simulations have been done to verify its optimality,high convergence rate,great stability and robustness.展开更多
Dear Editor,This letter investigates the optimal transmission scheduling problem in remote state estimation systems over an unknown wireless channel.We propose a partially observable Markov decision Process(POMDP)fram...Dear Editor,This letter investigates the optimal transmission scheduling problem in remote state estimation systems over an unknown wireless channel.We propose a partially observable Markov decision Process(POMDP)framework to model the sensor scheduling problem.By truncating and simplifying the POMDP problem,we have established the properties of the optimal solution under the POMDP model,through a fixed-point contraction method,and have shown that the threshold structure of the POMDP solution is not easily attainable.Subsequently,we obtained a suboptimal solution via Qlearning.Numerical simulations are used to demonstrate the efficacy of the proposed Q-learning approach.展开更多
This paper aims to study the optimal control and algorithm implementation of a generalized epidemic model governed by reaction-diffusion equations.Considering individual mobility,this paper first proposes a reaction-d...This paper aims to study the optimal control and algorithm implementation of a generalized epidemic model governed by reaction-diffusion equations.Considering individual mobility,this paper first proposes a reaction-diffusion epidemic model with two strains.Furthermore,applying vaccines as a control strategy in the model,an optimal control problem is proposed to increase the number of healthy individuals while reducing control costs.By applying the truncation function technique and the operator semigroup methods,we prove the existence and uniqueness of a globally positive strong solution for the control model.The existence of the optimal control strategy is proven by using functional analysis theory and minimum sequence methods.The first-order necessary condition satisfied by the optimal control is established by employing the dual techniques.Finally,a specific example and its algorithm are provided.展开更多
In this paper,a bilevel optimization model of an integrated energy operator(IEO)–load aggregator(LA)is constructed to address the coordinate optimization challenge of multiple stakeholder island integrated energy sys...In this paper,a bilevel optimization model of an integrated energy operator(IEO)–load aggregator(LA)is constructed to address the coordinate optimization challenge of multiple stakeholder island integrated energy system(IIES).The upper level represents the integrated energy operator,and the lower level is the electricity-heatgas load aggregator.Owing to the benefit conflict between the upper and lower levels of the IIES,a dynamic pricing mechanism for coordinating the interests of the upper and lower levels is proposed,combined with factors such as the carbon emissions of the IIES,as well as the lower load interruption power.The price of selling energy can be dynamically adjusted to the lower LA in the mechanism,according to the information on carbon emissions and load interruption power.Mutual benefits and win-win situations are achieved between the upper and lower multistakeholders.Finally,CPLEX is used to iteratively solve the bilevel optimization model.The optimal solution is selected according to the joint optimal discrimination mechanism.Thesimulation results indicate that the sourceload coordinate operation can reduce the upper and lower operation costs.Using the proposed pricingmechanism,the carbon emissions and load interruption power of IEO-LA are reduced by 9.78%and 70.19%,respectively,and the capture power of the carbon capture equipment is improved by 36.24%.The validity of the proposed model and method is verified.展开更多
Assessing the impact of anthropogenic volatile organic compounds(VOCs)on ozone(O_(3))formation is vital for themanagement of emission reduction and pollution control.Continuousmeasurement of O_(3)and the major precurs...Assessing the impact of anthropogenic volatile organic compounds(VOCs)on ozone(O_(3))formation is vital for themanagement of emission reduction and pollution control.Continuousmeasurement of O_(3)and the major precursorswas conducted in a typical light industrial city in the YRD region from 1 May to 25 July in 2021.Alkanes were the most abundant VOC group,contributing to 55.0%of TVOCs concentration(56.43±21.10 ppb).OVOCs,aromatics,halides,alkenes,and alkynes contributed 18.7%,9.6%,9.3%,5.2%and 1.9%,respectively.The observational site shifted from a typical VOC control regime to a mixed regime from May to July,which can be explained by the significant increase of RO_(x)production,resulting in the transition of environment from NOx saturation to radical saturation with respect to O_(3)production.The optimal O_(3)control strategy should be dynamically changed depending on the transition of control regime.Under NOx saturation condition,minimizing the proportion of NOx in reduction could lead to better achievement of O_(3)alleviation.Under mixed control regime,the cut percentage gets the top priority for the effectiveness of O_(3)control.Five VOCs sources were identified:temperature dependent source(28.1%),vehicular exhausts(19.9%),petrochemical industries(7.2%),solvent&gasoline usage(32.3%)and manufacturing industries(12.6%).The increase of temperature and radiation would enhance the evaporation related VOC emissions,resulting in the increase of VOC concentration and the change of RO_(x)circulation.Our results highlight determination of the optimal control strategies for O_(3)pollution in a typical YRD industrial city.展开更多
This article briefly reviews the topic of complex network synchronization,with its graph-theoretic criterion,showing that the homogeneous and symmetrical network structures are essential for optimal synchronization.Fu...This article briefly reviews the topic of complex network synchronization,with its graph-theoretic criterion,showing that the homogeneous and symmetrical network structures are essential for optimal synchronization.Furthermore,it briefly reviews the notion of higher-order network topologies and shows their promising potential in application to evaluating the optimality of network synchronizability.展开更多
To better complete various missions, it is necessary to plan an optimal trajectory or provide the optimal control law for the multirole missile according to the actual situation, including launch conditions and target...To better complete various missions, it is necessary to plan an optimal trajectory or provide the optimal control law for the multirole missile according to the actual situation, including launch conditions and target location. Since trajectory optimization struggles to meet real-time requirements, the emergence of data-based generation methods has become a significant focus in contemporary research. However, due to the large differences in the characteristics of the optimal control laws caused by the diversity of tasks, it is difficult to achieve good prediction results by modeling all data with one single model.Therefore, the modeling idea of the mixture of experts(MoE) is adopted. Firstly, the K-means clustering algorithm is used to partition the sample data set, and the corresponding neural network classification model is established as the gate switch of MoE. Then, the expert models, i.e., the mappings from the generation conditions to the optimal control law represented by the results of principal component analysis(PCA), are represented by Kriging models. Finally, multiple rounds of accuracy evaluation, sample supplementation, and model updating are conducted to improve the generation accuracy. The Monte Carlo simulation shows that the accuracy of the proposed model reaches 96% and the generation efficiency meets the real-time requirement.展开更多
In this paper,we study the optimal investment problem of an insurer whose surplus process follows the diffusion approximation of the classical Cramer-Lundberg model.Investment in the foreign markets is allowed,and the...In this paper,we study the optimal investment problem of an insurer whose surplus process follows the diffusion approximation of the classical Cramer-Lundberg model.Investment in the foreign markets is allowed,and therefore,the foreign exchange rate model is incorporated.Under the allowing of selling and borrowing,the problem of maximizing the expected exponential utility of terminal wealth is studied.By solving the corresponding Hamilton-Jacobi-Bellman equations,the optimal investment strategies and value functions are obtained.Finally,numerical analysis is presented.展开更多
基金Supported in part by Natural Science Foundation of Guangxi(2023GXNSFAA026246)in part by the Central Government's Guide to Local Science and Technology Development Fund(GuikeZY23055044)in part by the National Natural Science Foundation of China(62363003)。
文摘In this paper,we consider the maximal positive definite solution of the nonlinear matrix equation.By using the idea of Algorithm 2.1 in ZHANG(2013),a new inversion-free method with a stepsize parameter is proposed to obtain the maximal positive definite solution of nonlinear matrix equation X+A^(*)X|^(-α)A=Q with the case 0<α≤1.Based on this method,a new iterative algorithm is developed,and its convergence proof is given.Finally,two numerical examples are provided to show the effectiveness of the proposed method.
基金supported by the 2024 Research Fund of University of Ulsan.
文摘In recent years,three-dimensional reconstruction technologies that employ multiple cameras have continued to evolve significantly,enabling remote collaboration among users in extended Reality(XR)environments.In addition,methods for deploying multiple cameras for motion capture of users(e.g.,performers)are widely used in computer graphics.As the need to minimize and optimize the number of cameras grows to reduce costs,various technologies and research approaches focused on Optimal Camera Placement(OCP)are continually being proposed.However,as most existing studies assume homogeneous camera setups,there is a growing demand for studies on heterogeneous camera setups.For instance,technical demands keep emerging in scenarios with minimal camera configurations,especially regarding cost factors,the physical placement of cameras given the spatial structure,and image capture strategies for heterogeneous cameras,such as high-resolution RGB cameras and depth cameras.In this study,we propose a pre-visualization and simulation method for the optimal placement of heterogeneous cameras in XR environments,accounting for both the specifications of heterogeneous cameras(e.g.,field of view)and the physical configuration(e.g.,wall configuration)in real-world spaces.The proposed method performs a visibility analysis of cameras by considering each camera’s field-of-view volume,resolution,and unique characteristics,along with physicalspace constraints.This approach enables the optimal position and rotation of each camera to be recommended,along with the minimum number of cameras required.In the results of our study conducted in heterogeneous camera combinations,the proposed method achieved 81.7%~82.7%coverage of the target visual information using only 2~3 cameras.In contrast,single(or homogeneous)-typed cameras were required to use 11 cameras for 81.6%coverage.Accordingly,we found that camera deployment resources can be reduced with the proposed approaches.
基金funded by the Department of Education of Liaoning Province and was supported by the Basic Scientific Research Project of the Department of Education of Liaoning Province(Grant No.LJ222411632051)and(Grant No.LJKQZ2021085)Natural Science Foundation Project of Liaoning Province(Grant No.2022-BS-222).
文摘Virtual power plant(VPP)integrates a variety of distributed renewable energy and energy storage to participate in electricity market transactions,promote the consumption of renewable energy,and improve economic efficiency.In this paper,aiming at the uncertainty of distributed wind power and photovoltaic output,considering the coupling relationship between power,carbon trading,and green cardmarket,the optimal operationmodel and bidding scheme of VPP in spot market,carbon trading market,and green card market are established.On this basis,through the Shapley value and independent risk contribution theory in cooperative game theory,the quantitative analysis of the total income and risk contribution of various distributed resources in the virtual power plant is realized.Moreover,the scheduling strategies of virtual power plants under different risk preferences are systematically compared,and the feasibility and accuracy of the combination of Shapley value and independent risk contribution theory in ensuring fair income distribution and reasonable risk assessment are emphasized.A comprehensive solution for virtual power plants in the multi-market environment is constructed,which integrates operation strategy,income distribution mechanism,and risk control system into a unified analysis framework.Through the simulation of multi-scenario examples,the CPLEXsolver inMATLAB software is used to optimize themodel.The proposed joint optimization scheme can increase the profit of VPP participating in carbon trading and green certificate market by 29%.The total revenue of distributed resources managed by VPP is 9%higher than that of individual participation.
基金supported in part by the National Science and Technology Council under Grant NSTC 114-2221-E-027-104.
文摘Trajectory tracking for nonlinear robotic systems remains a fundamental yet challenging problem in control engineering,particularly when both precision and efficiency must be ensured.Conventional control methods are often effective for stabilization but may not directly optimize long-term performance.To address this limitation,this study develops an integrated framework that combines optimal control principles with reinforcement learning for a single-link robotic manipulator.The proposed scheme adopts an actor–critic structure,where the critic network approximates the value function associated with the Hamilton–Jacobi–Bellman equation,and the actor network generates near-optimal control signals in real time.This dual adaptation enables the controller to refine its policy online without explicit system knowledge.Stability of the closed-loop system is analyzed through Lyapunov theory,ensuring boundedness of the tracking error.Numerical simulations on the single-link manipulator demonstrate that themethod achieves accurate trajectory followingwhile maintaining lowcontrol effort.The results further showthat the actor–critic learning mechanism accelerates convergence of the control policy compared with conventional optimization-based strategies.This work highlights the potential of reinforcement learning integrated with optimal control for robotic manipulators and provides a foundation for future extensions to more complex multi-degree-of-freedom systems.The proposed controller is further validated in a physics-based virtual Gazebo environment,demonstrating stable adaptation and real-time feasibility.
基金supported by the National Natural Science Foundation of China(Grant No.11571181)by the Natural Science Foundation of Jiangsu Province(Grant No.BK20171454).
文摘In this paper,we propose and analyze two second-order accurate finite difference schemes for the one-dimensional heat equation with concentrated capacity on a computa-tional domain=[a,b].We first transform the target equation into the standard heat equation on the domain excluding the singular point equipped with an inner interface matching(IIM)condition on the singular point x=ξ∈(a,b),then adopt Taylor’s ex-pansion to approximate the IIM condition at the singular point and apply second-order finite difference method to approximate the standard heat equation at the nonsingular points.This discrete procedure allows us to choose different grid sizes to partition the two sub-domains[a,ξ]and[ξ,b],which ensures that x=ξ is a grid point,and hence the pro-posed schemes can be generalized to the heat equation with more than one concentrated capacities.We prove that the two proposed schemes are uniquely solvable.And through in-depth analysis of the local truncation errors,we rigorously prove that the two schemes are second-order accurate both in temporal and spatial directions in the maximum norm without any constraint on the grid ratio.Numerical experiments are carried out to verify our theoretical conclusions.
基金supported by the National Natural Science Foundation of China (Grant Nos.62573134,62473100,62433018)the Guangdong Basic and Applied Basic Research Foundation(Grant Nos.2025A1515060017,2025A1515011436,2025B1515020065,2025A1515011789)the Guangzhou Basic and Applied Basic Research Project (Grant No.2025A04J3534)。
文摘This paper investigates the edge-based dynamic event-triggered inverse optimal formation control problem for multiple quadrotor unmanned aerial vehicles(QUAVs) with attitude constraints. To improve communication efficiency, an edge-based dynamic event-triggered mechanism is developed for the communication channels between neighboring QUAVs. However, this edge-based dynamic event-triggered communication(DETC) may cause discontinuities in the reference signals. To solve this problem, a distributed estimator is designed for each QUAV to obtain the leader's output signals. Considering the safety of QUAV formation flying, this paper designs a function transformation method that constrains the attitudes of the QUAVs to a strictly safe region. Furthermore, an inverse optimal control strategy is proposed based on the backstepping methodology. This scheme not only minimizes the cost function but also avoids the necessity of solving the Hamilton-Jacobi-Bellman equation. Finally, the stability of the QUAV systems is proven using Lyapunov theory, and the effectiveness of the proposed control method is verified through simulation.
基金supported by the National Natural Science Foundation of China(Grant Nos.12441502,12122506,12204230,and 12404554)the National Science and Technology Major Project of the Ministry of Science and Technology of China(2024ZD0300404)+6 种基金Guangdong Basic and Applied Basic Research Foundation(Grant No.2021B1515020070)Shenzhen Science and Technology Program(Grant No.RCYX20200714114522109)China Postdoctoral Science Foundation(CPSF)(2024M762114)Postdoctoral Fellowship Program of CPSF(GZC20231727)supported by the National Natural Science Foundation of China(Grant Nos.92165206 and 11974330)Innovation Program for Quantum Science and Technology(Grant No.2021ZD0301603)the Fundamental Research Funds for the Central Universities。
文摘We present a robust quantum optimal control framework for implementing fast entangling gates on ion-trap quantum processors.The framework leverages tailored laser pulses to drive the multiple vibrational sidebands of the ions to create phonon-mediated entangling gates and,unlike the state of the art,requires neither weakcoupling Lamb-Dicke approximation nor perturbation treatment.With the application of gradient-based optimal control,it enables finding amplitude-and phase-modulated laser control protocols that work without the Lamb-Dicke approximation,promising gate speeds on the order of microseconds comparable to the characteristic trap frequencies.Also,robustness requirements on the temperature of the ions and initial optical phase can be conveniently included to pursue high-quality fast gates against experimental imperfections.Our approach represents a step in speeding up quantum gates to achieve larger quantum circuits for quantum computation and simulation,and thus can find applications in near-future experiments.
基金supported by the Innovation Program for Quantum Science and Technology(Grant No.2021ZD0302100)the National Natural Science Foundation of China(Grant Nos.12361131576,92265205,and 92476205).
文摘Quantum optimal control(QOC)relies on accurately modeling system dynamics and is often challenged by unknown or inaccessible interactions in real systems.Taking an unknown collective spin system as an example,this work introduces a machine-learning-based,data-driven scheme to overcome the challenges encountered,with a trained neural network(NN)assuming the role of a surrogate model that captures the system’s dynamics and subsequently enables QOC to be performed on the NN instead of on the real system.The trained NN surrogate proves effective for practical QOC tasks and is further demonstrated to be adaptable to different experimental conditions,remaining robust across varying system sizes and pulse durations.
基金supported by the National Natural Science Foundation of China under Grant 62472264the Natural Science Distinguished Youth Foundation of Shandong Province under Grant ZR2025QA13。
文摘In the wireless energy transmission service composition optimization problem,a key challenge is accurately capturing users’preferences for service criteria under complex influencing factors,and optimally selecting a composition solution under their budget constraints.Existing studies typically evaluate satisfaction solely based on energy transmission capacity,while overlooking critical factors such as price and trustworthiness of the provider,leading to a mismatch between optimization outcomes and user needs.To address this gap,we construct a user satisfaction evaluation model for multi-user and multi-provider scenarios,systematically incorporating service price,transmission capacity,and trustworthiness into the satisfaction assessment framework.Furthermore,we propose a Budget-Aware Preference Adjustment Model that predicts users’baseline preference weights from historical data and dynamically adjusts them according to budget levels,thereby reflecting user preferences more realistically under varying budget constraints.In addition,to tackle the composition optimization problem,we develop a ReflectiveEvolutionary Large Language Model—Guided Ant Colony Optimization algorithm,which leverages the reflective evolution capability of large language models to iteratively generate and refine heuristic information that guides the search process.Experimental results demonstrate that the proposed framework effectively integrates personalized preferences with budget sensitivity,accurately predicts users’preferences,and significantly enhances their satisfaction under complex constraints.
基金supported by the Science and Technology Project of Sichuan Electric Power Company“Power Supply Guarantee Strategy for Urban Distribution Networks Considering Coordination with Virtual Power Plant during Extreme Weather Event”(No.521920230003).
文摘Ensuring reliable power supply in urban distribution networks is a complex and critical task.To address the increased demand during extreme scenarios,this paper proposes an optimal dispatch strategy that considers the coordination with virtual power plants(VPPs).The proposed strategy improves systemflexibility and responsiveness by optimizing the power adjustment of flexible resources.In the proposed strategy,theGaussian Process Regression(GPR)is firstly employed to determine the adjustable range of aggregated power within the VPP,facilitating an assessment of its potential contribution to power supply support.Then,an optimal dispatch model based on a leader-follower game is developed to maximize the benefits of the VPP and flexible resources while guaranteeing the power balance at the same time.To solve the proposed optimal dispatch model efficiently,the constraints of the problem are reformulated and resolved using the Karush-Kuhn-Tucker(KKT)optimality conditions and linear programming duality theorem.The effectiveness of the strategy is illustrated through a detailed case study.
基金co-supported by the National Natural Science Foundation of China(No.62003019)the Young Talents Support Program of Beihang University,China(No.YWF21-BJ-J-1180)。
文摘This paper addresses the Singular Optimal Control Problem(SOCP)for a surface-to-air missile with limited control,fully considering aerodynamic effects with a parabolic drag polar.This problem is an extension of the typical Goddard problem.First,the classical Legendre-Clebsch condition is applied to derive optimal conditions for the singular angle of attack,revealing that the missile turns by gravity along the singular arc.Second,the higher-order differentiation of the switching function provides the necessary conditions to determine the optimal thrust,expressed as linear functions of the costate variables.The vanishing coefficient determinant is then employed to decouple the control and costate variables,yielding the singular thrust solely dependent on state variables and identifying the singular surface.Moreover,the analytical singular control can be regarded as path constraints subject to the typical Optimal Control Problem(OCP),enabling the GPOPS-Ⅱ,a direct method framework that does not involve the singular condition,to solve the SOCP.Finally,three cases with different structures are presented to evaluate the performance of the proposed method.The results show that it takes a few steps to obtain the numerical optimal solution,which is consistent with the analytical solution derived from the calculus of variations,highlighting its great computational accuracy and effectiveness.
基金supported by the National Natural Science Foundation of China(62003214)Guangdong Basic and Applied Basic Research Foundation(2024A1515012681)+1 种基金the Natural Science Foundation of Shanghai(22ZR1443600)Shanghai Pujiang Programme(23PJD064).
文摘Dear Editor,This letter considers the problem of achieving optimal formation control in multiple vertical take-off and landing(VTOL)unmanned aerial vehicles(UAVs).Specifically,the objective is to derive the vehicles to the desired formation shape while minimizing the total cost function.Leveraging the backstepping design,a distributed control strategy is proposed that incorporates a dynamic system for generating a reference trajectory and a trajectory tracking controller for each vehicle.
基金supported by the National Natural Science Foundation of China(No.62003019)the Young Talents Support Program of Beihang University,China(No.YWF-21-BJ-J-1180).
文摘This paper proposes an optimal midcourse guidance method for dual pulse air-to-air missiles,which is based on the framework of the linear Gauss pseudospectral model predictive control method.Firstly,a multistage optimal control problem with unspecified terminal time is formulated.Secondly,the control and terminal time update formulas are derived analytically.In contrast to previous work,the derivation process fully considers the Hamiltonian function corresponding to the unspecified terminal time,which is coupled with control,state,and costate.On the assumption of small perturbation,a special algebraic equation is provided to represent the equivalent optimal condition for the terminal time.Also,using Gauss pseudospectral collocation,error propagation dynamical equations involving the first-order correction term of the terminal time are transformed into a set of algebraic equations.Furthermore,analytical modification formulas can be derived by associating those equations and optimal conditions to eliminate terminal error and approach nonlinear optimal control.Even with their mathematical complexity,these formulas produce more accurate control and terminal time corrections and remove reliance on task-related parameters.Finally,several numerical simulations,comparisons with typical methods,and Monte Carlo simulations have been done to verify its optimality,high convergence rate,great stability and robustness.
基金supported in part by the Frontier Technology R&D Plan of Jiangsu Province(BF2024065)the Shenzhen Science and Technology Program(JCYJ20230807114609019)Postgraduate Research&Practice Innovation Program of Jiangsu Province(KYCX22_0236).
文摘Dear Editor,This letter investigates the optimal transmission scheduling problem in remote state estimation systems over an unknown wireless channel.We propose a partially observable Markov decision Process(POMDP)framework to model the sensor scheduling problem.By truncating and simplifying the POMDP problem,we have established the properties of the optimal solution under the POMDP model,through a fixed-point contraction method,and have shown that the threshold structure of the POMDP solution is not easily attainable.Subsequently,we obtained a suboptimal solution via Qlearning.Numerical simulations are used to demonstrate the efficacy of the proposed Q-learning approach.
基金Supported by the National Natural Science Foundation of China(Grant Nos.125610811246108612271147)。
文摘This paper aims to study the optimal control and algorithm implementation of a generalized epidemic model governed by reaction-diffusion equations.Considering individual mobility,this paper first proposes a reaction-diffusion epidemic model with two strains.Furthermore,applying vaccines as a control strategy in the model,an optimal control problem is proposed to increase the number of healthy individuals while reducing control costs.By applying the truncation function technique and the operator semigroup methods,we prove the existence and uniqueness of a globally positive strong solution for the control model.The existence of the optimal control strategy is proven by using functional analysis theory and minimum sequence methods.The first-order necessary condition satisfied by the optimal control is established by employing the dual techniques.Finally,a specific example and its algorithm are provided.
基金supported by the Central Government Guides Local Science and Technology Development Fund Project(2023ZY0020)Key R&D and Achievement Transformation Project in InnerMongolia Autonomous Region(2022YFHH0019)+3 种基金the Fundamental Research Funds for Inner Mongolia University of Science&Technology(2022053)Natural Science Foundation of Inner Mongolia(2022LHQN05002)National Natural Science Foundation of China(52067018)Metallurgical Engineering First-Class Discipline Construction Project in Inner Mongolia University of Science and Technology,Control Science and Engineering Quality Improvement and Cultivation Discipline Project in Inner Mongolia University of Science and Technology。
文摘In this paper,a bilevel optimization model of an integrated energy operator(IEO)–load aggregator(LA)is constructed to address the coordinate optimization challenge of multiple stakeholder island integrated energy system(IIES).The upper level represents the integrated energy operator,and the lower level is the electricity-heatgas load aggregator.Owing to the benefit conflict between the upper and lower levels of the IIES,a dynamic pricing mechanism for coordinating the interests of the upper and lower levels is proposed,combined with factors such as the carbon emissions of the IIES,as well as the lower load interruption power.The price of selling energy can be dynamically adjusted to the lower LA in the mechanism,according to the information on carbon emissions and load interruption power.Mutual benefits and win-win situations are achieved between the upper and lower multistakeholders.Finally,CPLEX is used to iteratively solve the bilevel optimization model.The optimal solution is selected according to the joint optimal discrimination mechanism.Thesimulation results indicate that the sourceload coordinate operation can reduce the upper and lower operation costs.Using the proposed pricingmechanism,the carbon emissions and load interruption power of IEO-LA are reduced by 9.78%and 70.19%,respectively,and the capture power of the carbon capture equipment is improved by 36.24%.The validity of the proposed model and method is verified.
基金supported by the National Natural Science Foundation of China(Nos.42005086,91844301,and 41805100)the National Key Research and Development Programof China(No.2022YFC3703500)+2 种基金China Postdoctoral Science Foundation(No.2023M733028)the Key Research and Development Program of Zhejiang Province(Nos.2021C03165 and 2022C03084)the Ecological and Environmental Scientific Research and Achievement Promotion Project of Zhejiang Province(No.2020HT0048).
文摘Assessing the impact of anthropogenic volatile organic compounds(VOCs)on ozone(O_(3))formation is vital for themanagement of emission reduction and pollution control.Continuousmeasurement of O_(3)and the major precursorswas conducted in a typical light industrial city in the YRD region from 1 May to 25 July in 2021.Alkanes were the most abundant VOC group,contributing to 55.0%of TVOCs concentration(56.43±21.10 ppb).OVOCs,aromatics,halides,alkenes,and alkynes contributed 18.7%,9.6%,9.3%,5.2%and 1.9%,respectively.The observational site shifted from a typical VOC control regime to a mixed regime from May to July,which can be explained by the significant increase of RO_(x)production,resulting in the transition of environment from NOx saturation to radical saturation with respect to O_(3)production.The optimal O_(3)control strategy should be dynamically changed depending on the transition of control regime.Under NOx saturation condition,minimizing the proportion of NOx in reduction could lead to better achievement of O_(3)alleviation.Under mixed control regime,the cut percentage gets the top priority for the effectiveness of O_(3)control.Five VOCs sources were identified:temperature dependent source(28.1%),vehicular exhausts(19.9%),petrochemical industries(7.2%),solvent&gasoline usage(32.3%)and manufacturing industries(12.6%).The increase of temperature and radiation would enhance the evaporation related VOC emissions,resulting in the increase of VOC concentration and the change of RO_(x)circulation.Our results highlight determination of the optimal control strategies for O_(3)pollution in a typical YRD industrial city.
基金Hong Kong Research Grants Council under the GRF(9043664).
文摘This article briefly reviews the topic of complex network synchronization,with its graph-theoretic criterion,showing that the homogeneous and symmetrical network structures are essential for optimal synchronization.Furthermore,it briefly reviews the notion of higher-order network topologies and shows their promising potential in application to evaluating the optimality of network synchronizability.
基金Defense Industrial Technology Development Program (JCKY2020204B016)National Natural Science Foundation of China (92471206)。
文摘To better complete various missions, it is necessary to plan an optimal trajectory or provide the optimal control law for the multirole missile according to the actual situation, including launch conditions and target location. Since trajectory optimization struggles to meet real-time requirements, the emergence of data-based generation methods has become a significant focus in contemporary research. However, due to the large differences in the characteristics of the optimal control laws caused by the diversity of tasks, it is difficult to achieve good prediction results by modeling all data with one single model.Therefore, the modeling idea of the mixture of experts(MoE) is adopted. Firstly, the K-means clustering algorithm is used to partition the sample data set, and the corresponding neural network classification model is established as the gate switch of MoE. Then, the expert models, i.e., the mappings from the generation conditions to the optimal control law represented by the results of principal component analysis(PCA), are represented by Kriging models. Finally, multiple rounds of accuracy evaluation, sample supplementation, and model updating are conducted to improve the generation accuracy. The Monte Carlo simulation shows that the accuracy of the proposed model reaches 96% and the generation efficiency meets the real-time requirement.
基金supported by the National Natural Science Foundation of China(Grant No.12301603).
文摘In this paper,we study the optimal investment problem of an insurer whose surplus process follows the diffusion approximation of the classical Cramer-Lundberg model.Investment in the foreign markets is allowed,and therefore,the foreign exchange rate model is incorporated.Under the allowing of selling and borrowing,the problem of maximizing the expected exponential utility of terminal wealth is studied.By solving the corresponding Hamilton-Jacobi-Bellman equations,the optimal investment strategies and value functions are obtained.Finally,numerical analysis is presented.