Optimal feedback control of nonlinear systems with free terminal time presents many challenges, including nonsmoothness in the value function and control laws, and the existence of multiple locally or even globally optimal trajectories. To mitigate these issues, the authors introduce an actor-critic method along with some enhancements. The authors demonstrate the algorithm's effectiveness on a prototypical example featuring each of the main pathological issues present in problems of this type, as well as on a higher-dimensional example to show that the solution method can scale.
Funding: Supported in part by the Air Force Office of Scientific Research (AFOSR), USA, under Grant No. FA9550-21-1-0113, and the National Science Foundation (NSF), USA, under Grant Nos. 2134235 and 2202668.