Learning-based methods have become mainstream for solving residential energy scheduling problems. To improve the learning efficiency of existing methods and increase the utilization of renewable energy, we propose the Dyna action-dependent heuristic dynamic programming (Dyna-ADHDP) method, which incorporates the ideas of learning and planning from the Dyna framework into action-dependent heuristic dynamic programming. This method defines a continuous action space for precise control of an energy storage system and allows online optimization of algorithm performance during the real-time operation of the residential energy model. In addition, a target network is introduced during training to make the training smoother and more efficient. We conducted experimental comparisons with the benchmark method using simulated and real data to verify its applicability and performance. The results confirm the method's strong performance and generalization capabilities, as well as its effectiveness in increasing renewable energy utilization and extending equipment life.
In this paper, a distributed adaptive dynamic programming (ADP) framework based on value iteration is proposed for multi-player differential games. In the game setting, players have no access to the information of others' system parameters or control laws. Each player adopts an on-policy value iteration algorithm as the basic learning framework. To deal with the incomplete information structure, players collect a period of system trajectory data to compensate for the lack of information. The policy updating step is implemented by a nonlinear optimization problem aiming to search for the proximal admissible policy. Theoretical analysis shows that by adopting proximal policy searching rules, the approximated policies can converge to a neighborhood of equilibrium policies. The efficacy of our method is illustrated by three examples, which also demonstrate that the proposed method can accelerate the learning process compared with the centralized learning framework.
Reinforcement learning (RL) has roots in dynamic programming and is called adaptive/approximate dynamic programming (ADP) within the control community. This paper reviews recent developments in ADP along with RL and its applications to various advanced control fields. First, the background of the development of ADP is described, emphasizing the significance of regulation and tracking control problems. Some effective offline and online algorithms for ADP/adaptive critic control are presented, where the main results for discrete-time systems and continuous-time systems are surveyed, respectively. Then, the research progress on adaptive critic control based on the event-triggered framework and under uncertain environments is discussed, respectively, where event-based design, robust stabilization, and game design are reviewed. Moreover, the extensions of ADP for addressing control problems under complex environments attract enormous attention. The ADP architecture is revisited from the perspective of data-driven and RL frameworks, showing how they promote ADP formulation significantly. Finally, several typical control applications with respect to RL and ADP are summarized, particularly in the fields of wastewater treatment processes and power systems, followed by some general prospects for future research. Overall, the comprehensive survey on ADP and RL for advanced control applications has demonstrated its remarkable potential within the artificial intelligence era. In addition, it also plays a vital role in promoting environmental protection and industrial intelligence.
In this paper, a novel adaptive Fault-Tolerant Control (FTC) strategy is proposed for non-minimum phase Hypersonic Vehicles (HSVs) that are affected by actuator faults and parameter uncertainties. The strategy is based on the output redefinition method and Adaptive Dynamic Programming (ADP). The intelligent FTC scheme consists of two main parts: a basic fault-tolerant and stable controller and an ADP-based supplementary controller. In the basic FTC part, an output redefinition approach is designed to make the zero dynamics stable with respect to the new output. Then, the Ideal Internal Dynamic (IID) is obtained using an optimal bounded inversion approach, and a tracking controller is designed for the new output to realize output tracking of the non-minimum phase HSV system. For the ADP-based compensation control part, an Action-Dependent Heuristic Dynamic Programming (ADHDP) scheme adopting an actor-critic learning structure is utilized to further optimize the tracking performance of the HSV control system. Finally, simulation results are provided to verify the effectiveness and efficiency of the proposed FTC algorithm.
To address the output feedback problem for linear discrete-time systems, this work proposes a new adaptive dynamic programming (ADP) technique based on the internal model principle (IMP). The proposed method, termed IMP-ADP, does not require complete state feedback, only the measurement of input and output data. More specifically, based on the IMP, the output control problem can first be converted into a stabilization problem. We then design an observer to reproduce the full state of the system by measuring the inputs and outputs. Moreover, this technique includes both a policy iteration algorithm and a value iteration algorithm to determine the optimal feedback gain without using a dynamic system model. Importantly, with this concept one does not need to solve the regulator equation. Finally, this control method was tested on a grid-connected LCL inverter system to demonstrate that the proposed method provides the desired performance in terms of both tracking and disturbance rejection.
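As a rough illustration of the value iteration idea mentioned in the abstract above, the following sketch computes an optimal feedback gain for a discrete-time linear quadratic problem by iterating the Riccati recursion. Note the assumption: unlike IMP-ADP, which is described as model-free and output-feedback based, this sketch is the classical model-based, full-state version, and the function name `lqr_value_iteration` and the example matrices are hypothetical.

```python
import numpy as np


def lqr_value_iteration(A, B, Q, R, iters=2000, tol=1e-10):
    """Model-based value iteration for discrete-time LQR (illustrative only).

    Iterates P <- Q + A'P(A - BK) with K = (R + B'PB)^{-1} B'PA,
    starting from P = 0, and returns (P, K) for the control law u = -K x.
    """
    P = np.zeros_like(Q, dtype=float)
    for _ in range(iters):
        BtP = B.T @ P
        # Greedy gain for the current value estimate x'Px.
        K = np.linalg.solve(R + BtP @ B, BtP @ A)
        # Value update (Riccati recursion).
        P_next = Q + A.T @ P @ (A - B @ K)
        converged = np.max(np.abs(P_next - P)) < tol
        P = P_next
        if converged:
            break
    # Gain associated with the final value matrix.
    K = np.linalg.solve(R + B.T @ P @ B, B.T @ P @ A)
    return P, K


if __name__ == "__main__":
    # Hypothetical double-integrator example, sampled at 0.1 s.
    A = np.array([[1.0, 0.1], [0.0, 1.0]])
    B = np.array([[0.0], [0.1]])
    Q = np.eye(2)
    R = np.array([[1.0]])
    P, K = lqr_value_iteration(A, B, Q, R)
    print("feedback gain K =", K)
```

A data-driven variant would replace the explicit use of `A` and `B` with quantities estimated from measured input and output trajectories; that step is not shown here.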
The use of dynamic programming (DP) algorithms to learn Bayesian network structures is limited by their high space complexity and difficulty in learning the structure of large-scale networks. Therefore, this study proposes a DP algorithm based on node block sequence constraints. The proposed algorithm constrains the traversal process of the parent graph by using the M-sequence matrix to considerably reduce the time consumption and space complexity by pruning the traversal process of the order graph using the node block sequence. Experimental results show that compared with existing DP algorithms, the proposed algorithm can obtain learning results more efficiently with less than 1% loss of accuracy, and can be used for learning larger-scale networks.
Stochastic unit commitment is one of the most powerful methods to address uncertainty. However, the existing scenario clustering technique for stochastic unit commitment cannot accurately select representative scenarios, which threatens the robustness of stochastic unit commitment and hinders its application. This paper provides a stochastic unit commitment with dynamic scenario clustering based on multi-parametric programming and Benders decomposition. The stochastic unit commitment is solved via the Benders decomposition, which decouples the primal problem into the master problem and two types of subproblems. In the master problem, the committed generator is determined, while the feasibility and optimality of generator output are checked in these two subproblems. Scenarios are dynamically clustered during the subproblem solution process through the multi-parametric programming with respect to the solution of the master problem. In other words, multiple scenarios are clustered into several representative scenarios after the subproblem is solved, and the Benders cut obtained by the representative scenario is generated for the master problem. Different from the conventional stochastic unit commitment, the proposed approach integrates scenario clustering into the Benders decomposition solution process. Such a clustering approach could accurately cluster representative scenarios that have impacts on the unit commitment. The proposed method is tested on a 6-bus system and the modified IEEE 118-bus system. Numerical results illustrate the effectiveness of the proposed method in clustering scenarios. Compared with the conventional clustering method, the proposed method can accurately select representative scenarios while mitigating computational burden, thus guaranteeing the robustness of unit commitment.
The helicopter Trailing-Edge Flaps (TEFs) technology is one of the recent hot topics in morphing wing research. By employing controlled deflection, TEFs can effectively reduce the vibration level of helicopters. Thus, designing specific vibration reduction control methods for helicopters equipped with trailing-edge flaps is of significant practical value. This paper studies the optimal control problem for helicopter-vibration systems with TEFs under the framework of adaptive dynamic programming combined with Reinforcement Learning (RL). Time-delay and disturbances, caused by the complexity of helicopter dynamics, inevitably deteriorate the control performance of vibration reduction. To solve this problem, a zero-sum game formulation with a linear quadratic form for reducing vibration of helicopter systems is presented with a virtual predictor. In this context, an off-policy reinforcement learning algorithm is developed to determine the optimal control policy. The algorithm utilizes only vertical vibration load data to achieve a policy that reduces vibration, attains Nash equilibrium, and addresses disturbances while compensating for time-delay without knowledge of the dynamics of the helicopter system. The effectiveness of the proposed method is demonstrated in a virtual platform.
It is of great scientific significance to construct a 3D dynamic structural color with a special color effect based on the microlens array. However, the problems of imperfect mechanisms and poor color quality need to be solved. A method of 3D structural color turning on periodic metasurfaces fabricated by the microlens array and self-assembly technology was proposed in this study. In the experiment, Polydimethylsiloxane (PDMS) flexible film was used as a substrate, and SiO2 microspheres were scraped into grooves of the PDMS film to form 3D photonic crystal structures. By adjusting the number of blade-coating passes and the microsphere concentration, high-saturation structural color micropatterns were obtained. These films were then matched with microlens arrays to produce dynamic graphics with iridescent effects. The results showed that blade coating twice with a SiO2 microsphere concentration of 50% gave the best conditions. This method demonstrates the potential for being widely applied in anticounterfeiting printing and ultra-high-resolution display.
The hot deformation behavior of Pt−10Ir alloy was studied under a wide range of deformation parameters. At a low deformation temperature (950−1150℃), the softening mechanism is primarily dynamic recovery. In addition, dynamic recrystallization by progressive lattice rotation near grain boundaries (DRX by LRGBs) and microshear bands assisted dynamic recrystallization (MSBs assisted DRX) coordinate the deformation. However, it is difficult for the dynamic softening to offset the strain hardening due to a limited amount of DRXed grains. At a high deformation temperature (1250−1350℃), three main DRX mechanisms associated with strain rates occur: DRX by LRGBs, DRX by a homogeneous increase in misorientation (HIM), and geometric DRX (GDRX). With increasing strain, DRX by LRGBs is enhanced gradually under high strain rates; the "pinch-off" effect is enhanced at low strain rates, which is conducive to the formation of a uniform and fine microstructure.
Over the last two decades, the dogma that cell fate is immutable has been increasingly challenged, with important implications for regenerative medicine. The breakthrough discovery that induced pluripotent stem cells could be generated from adult mouse fibroblasts is powerful proof that cell fate can be changed. An exciting extension of the discovery of cell fate impermanence is the direct cellular reprogramming hypothesis: that terminally differentiated cells can be reprogrammed into other adult cell fates without first passing through a stem cell state.
The brain's extracellular matrix (ECM), which is comprised of protein and glycosaminoglycan (GAG) scaffolds, constitutes 20%-40% of the human brain and is considered one of the largest influencers on brain cell functioning (Soles et al., 2023). Synthesized by neural and glial cells, the brain's ECM regulates a myriad of homeostatic cellular processes, including neuronal plasticity and firing (Miyata et al., 2012), cation buffering (Morawski et al., 2015), and glia-neuron interactions (Anderson et al., 2016). Considering the diversity of functions, dynamic remodeling of the brain's ECM indicates that this understudied medium is an active participant in both normal physiology and neurological diseases.
Metal Additive Manufacturing (MAM) technology has become an important means of rapid-prototyping precision manufacturing of special high dynamic heterogeneous complex parts. In response to micromechanical defects such as porosity, significant deformation, surface cracks, and difficult control of surface morphology encountered during the selective laser melting (SLM) additive manufacturing (AM) process of specialized Micro Electromechanical System (MEMS) components, multiparameter optimization and micro powder melt pool/macro-scale mechanical properties control simulation of specialized components are conducted. The optimal parameters obtained through high-precision preparation and machining of components and static/high dynamic verification are: laser power of 110 W, laser speed of 600 mm/s, laser diameter of 75 μm, and scanning spacing of 50 μm. With these parameters, the density of the components can reach 99.15%, the surface hardness can reach 51.9 HRA, the yield strength can reach 550 MPa, the maximum machining error of the components is 4.73%, and the average surface roughness is 0.45 μm. Through dynamic hammering and high dynamic firing verification, SLM components meet the requirements for overload resistance. The results show that MAM technology can provide a new means for the processing of MEMS components applied in high dynamic environments. The parameters obtained can provide a design basis for the additive preparation of MEMS components.
In dynamic scenarios, visual simultaneous localization and mapping (SLAM) algorithms often incorrectly incorporate dynamic points during camera pose computation, leading to reduced accuracy and robustness. This paper presents a dynamic SLAM algorithm that leverages object detection and regional dynamic probability. Firstly, a parallel thread employs the YOLOX object detection model to gather 2D semantic information and compensate for missed detections. Next, an improved K-means++ clustering algorithm clusters bounding box regions, adaptively determining the threshold for extracting dynamic object contours as dynamic points change. This process divides the image into low dynamic, suspicious dynamic, and high dynamic regions. In the tracking thread, the dynamic point removal module assigns dynamic probability weights to the feature points in these regions. Combined with geometric methods, it detects and removes the dynamic points. The final evaluation on the public TUM RGB-D dataset shows that the proposed dynamic SLAM algorithm surpasses most existing SLAM algorithms, providing better pose estimation accuracy and robustness in dynamic environments.
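For readers unfamiliar with the clustering step named in the abstract above, here is a minimal sketch of standard K-means++ seeding. It is an assumption-laden illustration, not the paper's "improved" variant, whose details the abstract does not give; the function name `kmeans_pp_init` and the sample points (imagined as feature-point coordinates inside a bounding box) are hypothetical.

```python
import numpy as np


def kmeans_pp_init(points, k, rng):
    """Standard K-means++ seeding: pick k initial centers from `points`.

    The first center is chosen uniformly at random; each later center is
    chosen with probability proportional to its squared distance from the
    nearest center selected so far.
    """
    points = np.asarray(points, dtype=float)
    n = points.shape[0]
    first = rng.integers(n)
    centers = [points[first]]
    # Squared distance of every point to its nearest chosen center.
    d2 = np.sum((points - points[first]) ** 2, axis=1)
    for _ in range(1, k):
        total = d2.sum()
        # Fall back to uniform sampling if all points coincide with a center.
        probs = d2 / total if total > 0 else np.full(n, 1.0 / n)
        idx = rng.choice(n, p=probs)
        centers.append(points[idx])
        d2 = np.minimum(d2, np.sum((points - points[idx]) ** 2, axis=1))
    return np.array(centers)


if __name__ == "__main__":
    rng = np.random.default_rng(0)
    # Hypothetical 2D feature-point coordinates (pixels).
    pts = np.array([[0.0, 0.0], [1.0, 0.0], [10.0, 10.0], [11.0, 10.0], [50.0, 5.0]])
    print(kmeans_pp_init(pts, 3, rng))
```

The seeds would normally be refined by ordinary K-means iterations afterwards; that refinement is omitted here.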
In this work, the flow behavior and dynamic recrystallization (DRX) mechanism of a low carbon martensitic stainless bearing steel, CSS-42L, were investigated using a thermomechanical simulator under the temperature and strain rate ranges of 900 to 1100℃ and 0.1 to 20 s⁻¹, respectively. The Arrhenius-type constitutive equation was established based on the flow stress curves. Moreover, the peak stress decreased with the increase in deformation temperature and the decrease in strain rate. There were two DRX mechanisms during hot deformation of the studied steel, the main one being the discontinuous dynamic recrystallization mechanism, acting through grain boundary bulging and migration, and the auxiliary one being the continuous dynamic recrystallization mechanism, working through the rotation of sub-grains. On the basis of microstructural characterizations, power dissipation maps, and flow instability maps, the optimized hot deformation parameters for CSS-42L bearing steel were determined as 1050℃/0.1 s⁻¹ and 1100℃/1 s⁻¹.
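For context, the Arrhenius-type constitutive equation referred to in the abstract above is commonly written in the hyperbolic-sine form shown below. This is the general textbook form, given here as an assumption; the abstract does not state which variant or which fitted constants the authors used, so no values for CSS-42L are implied.

```latex
% Commonly used hyperbolic-sine Arrhenius-type relation (general form, not fitted values):
%   \dot{\varepsilon} : strain rate (s^{-1}),  \sigma : flow stress (MPa),
%   T : absolute temperature (K),  R : gas constant,  Q : deformation activation energy,
%   A, \alpha, n : material constants obtained by fitting flow stress data.
\dot{\varepsilon} = A \left[ \sinh(\alpha \sigma) \right]^{n} \exp\!\left( -\frac{Q}{RT} \right),
\qquad
Z = \dot{\varepsilon} \, \exp\!\left( \frac{Q}{RT} \right) = A \left[ \sinh(\alpha \sigma) \right]^{n}
```

Here Z is the Zener-Hollomon parameter, which combines strain rate and temperature into a single variable. The relation is consistent with the trend reported in the abstract: at a fixed strain rate, raising T lowers Z and therefore the stress, and at a fixed temperature, lowering the strain rate does the same.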
Traumatic brain injury (TBI) is a public health problem with an undue economic burden that impacts nearly every age, ethnic, and gender group across the globe (Capizzi et al., 2020). TBIs are often sustained during a dynamic range of exposures to energetic environmental forces, and as such outcomes are typically heterogeneous regarding severity and pathology (Capizzi et al., 2020).
Revealing the structure evolution of interfacial active species during a dynamic catalytic process is a challenging but pivotal issue for the rational design of high-performance catalysts. Here, we successfully prepare sub-nanometric Pt clusters (~0.8 nm) encapsulated within the defects of CeO₂ nanorods via an in-situ defect engineering methodology. The as-prepared Pt@d-CeO₂ catalyst significantly boosts the activity and stability in the water-gas shift (WGS) reaction compared to other analogs. Based on controlled experiments and complementary (in-situ) spectroscopic studies, a reversible encapsulation induced by active site transformation between the Pt²⁺-terminal hydroxyl and Ptδ⁺-O vacancy species at the interface is revealed, which gives rise to the enhanced performance. Our findings not only offer practical guidance for the design of high-efficiency catalysts but also bring a new understanding of the exceptional performance of WGS in a holistic view, which shows great application potential in materials and catalysis.
Rockbursts, which mainly affect mining roadways, are dynamic disasters arising from the surrounding rock under high stress. Understanding the interaction between supports and the surrounding rock is necessary for effective rockburst control. In this study, the squeezing behavior of the surrounding rock is analyzed in rockburst roadways, and a mechanical model of rockbursts is established considering the dynamic support stress, thus deriving formulas and providing characteristic curves for describing the interaction between the support and surrounding rock. Design principles and parameters of supports for rockburst control are proposed. The results show that only when the geostress magnitude exceeds a critical value can it drive the formation of rockburst conditions. The main factors influencing the convergence response and rockburst occurrence around roadways are geostress, rock brittleness, uniaxial compressive strength, and roadway excavation size. Roadway support devices can play a role in controlling rockburst by suppressing the squeezing evolution of the surrounding rock towards instability points of rockburst. Further, the higher the strength and the longer the impact stroke of support devices with constant resistance, the more easily multiple balance points can be formed with the surrounding rock to control rockburst occurrence. Supports with long impact stroke allow adaptation to varying geostress levels around the roadway, aiding in rockburst control. The results offer a quantitative method for designing support systems for rockburst-prone roadways. The design criterion of supports is determined by the intersection between the convergence curve of the surrounding rock and the squeezing deformation curve of the support devices.
Generating dynamically feasible trajectories for fixed-wing Unmanned Aerial Vehicles (UAVs) in dense obstacle environments remains computationally intractable. This paper proposes a Safe Flight Corridor constrained Sequential Convex Programming (SFC-SCP) method to improve the computation efficiency and reliability of trajectory generation. SFC-SCP combines the front-end convex polyhedron SFC construction and back-end SCP-based trajectory optimization. A Sparse A* Search (SAS) driven SFC construction method is designed to efficiently generate polyhedron SFC according to the geometric relation among obstacles and collision-free waypoints. By transforming the nonconvex obstacle-avoidance constraints into linear inequality constraints, SFC can mitigate infeasibility of trajectory planning and reduce computation complexity. Then, SCP casts the nonlinear trajectory optimization subject to SFC into convex programming subproblems to decrease the problem complexity. In addition, a convex optimizer based on the interior point method is customized, where the search direction is calculated via successive elimination to further improve efficiency. Simulation experiments on dense obstacle scenarios show that SFC-SCP can generate dynamically feasible safe trajectories rapidly. Comparative studies with state-of-the-art SCP-based methods demonstrate the efficiency and reliability merits of SFC-SCP. Besides, the customized convex optimizer outperforms off-the-shelf optimizers in terms of computation time.
The β-solidified γ-TiAl alloy holds important application value in the aerospace industry, while its complex phase compositions and geometric structures pose challenges to its microstructure control during the thermal-mechanical process. The microstructure evolution of Ti-43Al-4Nb-1Mo-0.2B alloy at 1200℃/0.01 s⁻¹ was investigated to clarify the coupling role of dynamic recrystallization (DRX) and phase transformation. The results revealed that the rate of DRX in α2+γ lamellar colonies was slower than that in the βo+γ mixed structure, and was instead accompanied by intense lamellar kinking and rotation. The initiation and development rates of DRX in the α2, βo, and γ phases decreased sequentially. The asynchronous DRX of the various geometric structures and phase compositions resulted in an uneven deformed microstructure, and the dynamic softening induced by lamellar kinking and rotation was replaced by strengthened DRX as strain increased. Additionally, the blocky α2 phase and the terminals of α2 lamellae were the preferential DRX sites owing to the abundant activated slip systems. The α2→βo transformation within lamellar colonies facilitated DRX and fragmentation of α2 lamellae, while the α2→γ transformation promoted the decomposition of α2 lamellae and DRX of γ lamellae. Moreover, the varied βo+γ mixed structures underwent complicated evolution: (1) the γ→βo transformation occurred at boundaries of lamellar colonies, followed by simultaneous DRX of γ lamellar terminals and the neighboring βo phase; (2) DRX occurred earlier within the band-like βo phase, with delayed DRX in the enclosed γ phase; (3) DRX within the βo synapses and neighboring γ phase was accelerated owing to the generation of an elastic stress field; (4) dispersed βo particles triggered particle-stimulated nucleation (PSN) of the γ phase. Eventually, atomic diffusion along crystal defects in the βo and γ phases caused fracture of the band-like βo phase and formation of massive βo particles, impeding grain boundary migration and hindering DRXed grain growth of the γ phase.
Funding: supported in part by the National Key Research and Development Program of China (2024YFB4709100, 2021YFE0206100), the National Natural Science Foundation of China (62073321), the National Defense Basic Scientific Research Program (JCKY2019203C029), and the Science and Technology Development Fund, Macao SAR, China (0015/2020/AMJ).
Abstract: Learning-based methods have become mainstream for solving residential energy scheduling problems. To improve the learning efficiency of existing methods and increase the utilization of renewable energy, we propose the Dyna action-dependent heuristic dynamic programming (Dyna-ADHDP) method, which incorporates the ideas of learning and planning from the Dyna framework into action-dependent heuristic dynamic programming. This method defines a continuous action space for precise control of an energy storage system and allows online optimization of algorithm performance during the real-time operation of the residential energy model. In addition, a target network is introduced during training to make the training smoother and more efficient. We conducted experimental comparisons with the benchmark method using simulated and real data to verify its applicability and performance. The results confirm the method's strong performance and generalization capabilities, as well as its effectiveness in increasing renewable energy utilization and extending equipment life.
Funding: supported by the Aeronautical Science Foundation of China (20220001057001) and an Open Project of the National Key Laboratory of Air-based Information Perception and Fusion (202437).
Abstract: In this paper, a distributed adaptive dynamic programming (ADP) framework based on value iteration is proposed for multi-player differential games. In the game setting, players have no access to the information of others' system parameters or control laws. Each player adopts an on-policy value iteration algorithm as the basic learning framework. To deal with the incomplete information structure, players collect a period of system trajectory data to compensate for the lack of information. The policy updating step is implemented by a nonlinear optimization problem aiming to search for the proximal admissible policy. Theoretical analysis shows that by adopting proximal policy searching rules, the approximated policies can converge to a neighborhood of equilibrium policies. The efficacy of our method is illustrated by three examples, which also demonstrate that the proposed method can accelerate the learning process compared with the centralized learning framework.
Funding: supported in part by the National Natural Science Foundation of China (62222301, 62073085, 62073158, 61890930-5, 62021003), the National Key Research and Development Program of China (2021ZD0112302, 2021ZD0112301, 2018YFC1900800-5), and the Beijing Natural Science Foundation (JQ19013).
Abstract: Reinforcement learning (RL) has roots in dynamic programming and is called adaptive/approximate dynamic programming (ADP) within the control community. This paper reviews recent developments in ADP along with RL and its applications to various advanced control fields. First, the background of the development of ADP is described, emphasizing the significance of regulation and tracking control problems. Some effective offline and online algorithms for ADP/adaptive critic control are presented, where the main results for discrete-time systems and continuous-time systems are surveyed, respectively. Then, the research progress on adaptive critic control based on the event-triggered framework and under uncertain environments is discussed, respectively, where event-based design, robust stabilization, and game design are reviewed. Moreover, the extensions of ADP for addressing control problems under complex environments attract enormous attention. The ADP architecture is revisited from the perspective of data-driven and RL frameworks, showing how they promote ADP formulation significantly. Finally, several typical control applications with respect to RL and ADP are summarized, particularly in the fields of wastewater treatment processes and power systems, followed by some general prospects for future research. Overall, the comprehensive survey on ADP and RL for advanced control applications has demonstrated its remarkable potential within the artificial intelligence era. In addition, it also plays a vital role in promoting environmental protection and industrial intelligence.
Funding: Supported in part by the Science Center Program of the National Natural Science Foundation of China (62373189, 62188101, 62020106003) and the Research Fund of the State Key Laboratory of Mechanics and Control for Aerospace Structures, China.
Abstract: In this paper, a novel adaptive Fault-Tolerant Control (FTC) strategy is proposed for non-minimum phase Hypersonic Vehicles (HSVs) that are affected by actuator faults and parameter uncertainties. The strategy is based on the output redefinition method and Adaptive Dynamic Programming (ADP). The intelligent FTC scheme consists of two main parts: a basic fault-tolerant and stable controller, and an ADP-based supplementary controller. In the basic FTC part, an output redefinition approach is designed to make the zero dynamics stable with respect to the new output. Then, the Ideal Internal Dynamic (IID) is obtained using an optimal bounded inversion approach, and a tracking controller is designed for the new output to realize output tracking of the non-minimum phase HSV system. For the ADP-based compensation control part, Action-Dependent Heuristic Dynamic Programming (ADHDP) with an actor-critic learning structure is used to further optimize the tracking performance of the HSV control system. Finally, simulation results are provided to verify the effectiveness and efficiency of the proposed FTC algorithm.
Funding: Supported by the National Science Fund for Distinguished Young Scholars (62225303), the Fundamental Research Funds for the Central Universities (buctrc202201), the China Scholarship Council, and the High Performance Computing Platform, College of Information Science and Technology, Beijing University of Chemical Technology.
Abstract: To address the output feedback problem for linear discrete-time systems, this work proposes a new adaptive dynamic programming (ADP) technique based on the internal model principle (IMP). The proposed method, termed IMP-ADP, does not require complete state feedback, merely the measurement of input and output data. More specifically, based on the IMP, the output control problem is first converted into a stabilization problem. An observer is then designed to reconstruct the full state of the system from the measured inputs and outputs. Moreover, the technique includes both a policy iteration algorithm and a value iteration algorithm to determine the optimal feedback gain without using a dynamic system model. Importantly, with this approach one does not need to solve the regulator equation. Finally, the control method was tested on a grid-connected LCL inverter system to demonstrate that it provides the desired performance in terms of both tracking and disturbance rejection.
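The value-iteration idea mentioned in the abstract above can be sketched on the simplest possible case. This is only an illustration under strong assumptions: it is a model-based iteration for a scalar system with known parameters, whereas the abstract describes a model-free, output-feedback method. The function name and the numerical values of `a`, `b`, `q`, and `r` are hypothetical and do not come from the paper.

```python
def value_iteration_lqr(a, b, q, r, tol=1e-12, max_iter=10_000):
    """Value iteration for the scalar system x[k+1] = a*x[k] + b*u[k]
    with stage cost q*x^2 + r*u^2 (assumed known model, illustration only)."""
    p = 0.0  # value iteration may start from a zero value function
    for _ in range(max_iter):
        # One Riccati-type update of the quadratic value function V(x) = p*x^2
        p_next = q + a * a * p - (a * b * p) ** 2 / (r + b * b * p)
        converged = abs(p_next - p) < tol
        p = p_next
        if converged:
            break
    gain = a * b * p / (r + b * b * p)  # greedy policy is u = -gain * x
    return p, gain


# Hypothetical open-loop unstable plant (|a| > 1)
p, gain = value_iteration_lqr(a=1.2, b=1.0, q=1.0, r=1.0)
closed_loop = 1.2 - 1.0 * gain
```

Starting from a zero value function is what distinguishes value iteration from policy iteration, which needs an initial stabilizing gain.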
Funding: Shaanxi Science Fund for Distinguished Young Scholars, Grant/Award Number: 2024JC-JCQN-57; Xi'an Science and Technology Plan Project, Grant/Award Number: 2023JH-QCYJQ-0086; Scientific Research Program Funded by Education Department of Shaanxi Provincial Government, Grant/Award Number: P23JP071; Engineering Technology Research Center of Shaanxi Province for Intelligent Testing and Reliability Evaluation of Electronic Equipments, Grant/Award Number: 2023-ZC-GCZX-0047; 2022 Shaanxi University Youth Innovation Team Project.
Abstract: The use of dynamic programming (DP) algorithms to learn Bayesian network structures is limited by their high space complexity and the difficulty of learning the structure of large-scale networks. Therefore, this study proposes a DP algorithm based on node block sequence constraints. The proposed algorithm constrains the traversal of the parent graph using the M-sequence matrix, and considerably reduces time consumption and space complexity by pruning the traversal of the order graph using the node block sequence. Experimental results show that, compared with existing DP algorithms, the proposed algorithm obtains learning results more efficiently with less than 1% loss of accuracy, and can be used for learning larger-scale networks.
Funding: The Science and Technology Project of State Grid Corporation of China, Grant Number 5108-202304065A-1-1-ZN.
Abstract: Stochastic unit commitment is one of the most powerful methods to address uncertainty. However, the existing scenario clustering technique for stochastic unit commitment cannot accurately select representative scenarios, which threatens the robustness of stochastic unit commitment and hinders its application. This paper provides a stochastic unit commitment with dynamic scenario clustering based on multi-parametric programming and Benders decomposition. The stochastic unit commitment is solved via the Benders decomposition, which decouples the primal problem into the master problem and two types of subproblems. In the master problem, the committed generators are determined, while the feasibility and optimality of generator output are checked in the two types of subproblems. Scenarios are dynamically clustered during the subproblem solution process through multi-parametric programming with respect to the solution of the master problem. In other words, multiple scenarios are clustered into several representative scenarios after the subproblem is solved, and the Benders cut obtained from the representative scenario is generated for the master problem. Different from the conventional stochastic unit commitment, the proposed approach integrates scenario clustering into the Benders decomposition solution process. Such a clustering approach can accurately cluster representative scenarios that have impacts on the unit commitment. The proposed method is tested on a 6-bus system and the modified IEEE 118-bus system. Numerical results illustrate the effectiveness of the proposed method in clustering scenarios. Compared with the conventional clustering method, the proposed method can accurately select representative scenarios while mitigating the computational burden, thus guaranteeing the robustness of unit commitment.
Funding: Co-supported by the National Natural Science Foundation of China (Nos. 62022060, 62073234, 62073158, 62373268, 62373273) and the Basic Research Project of Education Department of Liaoning Province, China (No. LJKZ0401).
Abstract: Helicopter Trailing-Edge Flap (TEF) technology is one of the recent hot topics in morphing wing research. Through controlled deflection, TEFs can effectively reduce the vibration level of helicopters. Thus, designing specific vibration reduction control methods for helicopters equipped with trailing-edge flaps is of significant practical value. This paper studies the optimal control problem for helicopter-vibration systems with TEFs under the framework of adaptive dynamic programming combined with Reinforcement Learning (RL). Time delay and disturbances, caused by the complexity of helicopter dynamics, inevitably deteriorate the control performance of vibration reduction. To solve this problem, a zero-sum game formulation with a linear quadratic form for reducing the vibration of helicopter systems is presented with a virtual predictor. In this context, an off-policy reinforcement learning algorithm is developed to determine the optimal control policy. The algorithm uses only vertical vibration load data to obtain a policy that reduces vibration, attains Nash equilibrium, and addresses disturbances while compensating for time delay, without knowledge of the dynamics of the helicopter system. The effectiveness of the proposed method is demonstrated on a virtual platform.
Abstract: It is of great scientific significance to construct a 3D dynamic structural color with a special color effect based on the microlens array. However, the problems of imperfect mechanisms and poor color quality need to be solved. A method of 3D structural color turning on periodic metasurfaces fabricated by the microlens array and self-assembly technology was proposed in this study. In the experiment, a polydimethylsiloxane (PDMS) flexible film was used as a substrate, and SiO2 microspheres were scraped into grooves of the PDMS film to form 3D photonic crystal structures. By adjusting the number of blade-coating passes and the microsphere concentration, high-saturation structural color micropatterns were obtained. These films were then matched with microlens arrays to produce dynamic graphics with iridescent effects. The results showed that two blade-coating passes and a SiO2 microsphere concentration of 50% are the best conditions. This method demonstrates the potential for wide application in anti-counterfeiting printing and ultra-high-resolution display.
Funding: Financial support from the National Natural Science Foundation of China (Nos. 52161023, 51901204), the Science and Technology Project of Yunnan Precious Metal Laboratory, China (No. YPML-2023050208), the Yunnan Science and Technology Planning Project, China (Nos. 202201AU070010, 202301AT070276, 202302AB080008, 202303AA080001), and the Postgraduate Research and Innovation Foundation of Yunnan University, China (No. 2021Y338).
Abstract: The hot deformation behavior of Pt−10Ir alloy was studied under a wide range of deformation parameters. At low deformation temperatures (950−1150 ℃), the softening mechanism is primarily dynamic recovery. In addition, dynamic recrystallization by progressive lattice rotation near grain boundaries (DRX by LRGBs) and microshear-band-assisted dynamic recrystallization (MSBs-assisted DRX) coordinate the deformation. However, it is difficult for the dynamic softening to offset the strain hardening because of the limited amount of DRXed grains. At high deformation temperatures (1250−1350 ℃), three main DRX mechanisms associated with strain rate occur: DRX by LRGBs, DRX by a homogeneous increase in misorientation (HIM), and geometric DRX (GDRX). With increasing strain, DRX by LRGBs is enhanced gradually at high strain rates; the "pinch-off" effect is enhanced at low strain rates, which is conducive to the formation of a uniform and fine microstructure.
Funding: Supported by the Canada First Research Excellence Fund, Medicine by Design (to CMM).
Abstract: Over the last two decades, the dogma that cell fate is immutable has been increasingly challenged, with important implications for regenerative medicine. The breakthrough discovery that induced pluripotent stem cells could be generated from adult mouse fibroblasts is powerful proof that cell fate can be changed. An exciting extension of the discovery of cell fate impermanence is the direct cellular reprogramming hypothesis: that terminally differentiated cells can be reprogrammed into other adult cell fates without first passing through a stem cell state.
Funding: Supported by National Institute on Aging (NIH-NIA) grant R21 AG074152 (to KMA), National Institute of Allergy and Infectious Diseases (NIAID) grant DP2 AI171150 (to KMA), and Department of Defense (DoD) grant AZ210089 (to KMA).
Abstract: The brain's extracellular matrix (ECM), which is composed of protein and glycosaminoglycan (GAG) scaffolds, constitutes 20%-40% of the human brain and is considered one of the largest influencers of brain cell functioning (Soles et al., 2023). Synthesized by neural and glial cells, the brain's ECM regulates a myriad of homeostatic cellular processes, including neuronal plasticity and firing (Miyata et al., 2012), cation buffering (Morawski et al., 2015), and glia-neuron interactions (Anderson et al., 2016). Considering the diversity of functions, dynamic remodeling of the brain's ECM indicates that this understudied medium is an active participant in both normal physiology and neurological diseases.
Funding: Funded by the National Natural Science Foundation of China Youth Fund (Grant No. 62304022), the Science and Technology on Electromechanical Dynamic Control Laboratory (China, Grant No. 6142601012304), and the 2022-2024 China Association for Science and Technology Innovation Integration Association Youth Talent Support Project (Grant No. 2022QNRC001).
Abstract: Metal Additive Manufacturing (MAM) technology has become an important means of rapid prototyping and precision manufacturing of special high-dynamic, heterogeneous, complex parts. In response to micromechanical defects such as porosity, significant deformation, surface cracks, and hard-to-control surface morphology encountered during the selective laser melting (SLM) additive manufacturing (AM) of specialized Micro Electromechanical System (MEMS) components, multi-parameter optimization and simulation of micro-scale powder melt pool behavior and macro-scale mechanical property control are conducted for these components. The optimal parameters, obtained through high-precision preparation and machining of components and static/high-dynamic verification, are a laser power of 110 W, a laser speed of 600 mm/s, a laser diameter of 75 μm, and a scanning spacing of 50 μm. Under these parameters, the component density can reach 99.15%, the surface hardness can reach 51.9 HRA, the yield strength can reach 550 MPa, the maximum machining error of the components is 4.73%, and the average surface roughness is 0.45 μm. Dynamic hammering and high-dynamic firing verification show that the SLM components meet the requirements for overload resistance. The results show that MAM technology can provide a new means of processing MEMS components used in high-dynamic environments. The parameters obtained can provide a design basis for the additive preparation of MEMS components.
Funding: The National Natural Science Foundation of China (No. 62063006); the Guangxi Natural Science Foundation (Nos. 2023GXNSFAA026025, AA24010001); the Innovation Fund of Chinese Universities Industry-University-Research (ID: 2023RY018); the Special Guangxi Industry and Information Technology Department, Textile and Pharmaceutical Division (ID: 2021 No. 231); the Special Research Project of Hechi University (ID: 2021GCC028); and the Key Laboratory of AI and Information Processing, Education Department of Guangxi Zhuang Autonomous Region (Hechi University), No. 2024GXZDSY009.
Abstract: In dynamic scenarios, visual simultaneous localization and mapping (SLAM) algorithms often incorrectly incorporate dynamic points during camera pose computation, leading to reduced accuracy and robustness. This paper presents a dynamic SLAM algorithm that leverages object detection and regional dynamic probability. First, a parallel thread employs the YOLOX object detection model to gather 2D semantic information and compensate for missed detections. Next, an improved K-means++ clustering algorithm clusters bounding box regions, adaptively determining the threshold for extracting dynamic object contours as dynamic points change. This process divides the image into low dynamic, suspicious dynamic, and high dynamic regions. In the tracking thread, the dynamic point removal module assigns dynamic probability weights to the feature points in these regions. Combined with geometric methods, it detects and removes the dynamic points. The final evaluation on the public TUM RGB-D dataset shows that the proposed dynamic SLAM algorithm surpasses most existing SLAM algorithms, providing better pose estimation accuracy and robustness in dynamic environments.
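For readers unfamiliar with K-means++, the sketch below shows the standard seeding step only. It is not the improved variant the abstract refers to, whose details are not given here. The function name, the fixed random seed, and the sample points (standing in for bounding-box centres) are all hypothetical.

```python
import random


def kmeans_pp_seeds(points, k, seed=0):
    """Standard K-means++ seeding over 2D points (illustration only)."""
    rng = random.Random(seed)
    centers = [rng.choice(points)]  # first centre: uniform at random
    while len(centers) < k:
        # Squared distance from every point to its nearest chosen centre
        d2 = [
            min((px - cx) ** 2 + (py - cy) ** 2 for cx, cy in centers)
            for px, py in points
        ]
        if sum(d2) == 0:
            # Every point coincides with a centre; fall back to uniform choice
            centers.append(rng.choice(points))
            continue
        # Next centre: sampled with probability proportional to d2
        centers.append(rng.choices(points, weights=d2, k=1)[0])
    return centers


# Hypothetical bounding-box centres forming two well-separated groups
boxes = [(0, 0), (0, 1), (1, 0), (100, 100), (100, 101), (101, 100)]
seeds = kmeans_pp_seeds(boxes, k=2)
```

Weighting by squared distance makes far-apart points likely seeds, which is why this initialization tends to spread the initial centres across distinct regions.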
Funding: Financially supported by the Scientific Research Project of the Department of Education in Hunan Province, China (Grant No. 23B0533).
Abstract: In this work, the flow behavior and dynamic recrystallization (DRX) mechanism of a low carbon martensitic stainless bearing steel, CSS-42L, were investigated using a thermomechanical simulator at temperatures of 900 to 1100 ℃ and strain rates of 0.1 to 20 s^(−1). The Arrhenius-type constitutive equation was established based on the flow stress curves. Moreover, the peak stress decreased with increasing deformation temperature and decreasing strain rate. There were two DRX mechanisms during hot deformation of the studied steel: the main one was discontinuous dynamic recrystallization, acting through grain boundary bulging and migration, and the auxiliary one was continuous dynamic recrystallization, working through the rotation of sub-grains. On the basis of microstructural characterizations, power dissipation maps, and flow instability maps, the optimized hot deformation parameters for CSS-42L bearing steel were determined as 1050 ℃/0.1 s^(−1) and 1100 ℃/1 s^(−1).
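The trend reported above (peak stress falling with temperature and rising with strain rate) can be reproduced with a small sketch. It assumes the widely used hyperbolic-sine form of the Arrhenius-type law, strain_rate = A * sinh(alpha * sigma)^n * exp(-Q / (R*T)); the abstract does not state which form was fitted. The constants `A_CONST`, `ALPHA`, `N_EXP`, and `Q_ACT` are placeholders chosen for illustration, not the values fitted for CSS-42L.

```python
import math

R_GAS = 8.314      # universal gas constant, J/(mol*K)
A_CONST = 1.0e15   # hypothetical material constant, 1/s
ALPHA = 0.01       # hypothetical stress multiplier, 1/MPa
N_EXP = 5.0        # hypothetical stress exponent
Q_ACT = 400.0e3    # hypothetical activation energy, J/mol


def zener_hollomon(strain_rate, temp_c):
    """Temperature-compensated strain rate Z = rate * exp(Q / (R*T))."""
    return strain_rate * math.exp(Q_ACT / (R_GAS * (temp_c + 273.15)))


def peak_stress(strain_rate, temp_c):
    """Invert Z = A * sinh(alpha * sigma)^n for the stress sigma (MPa)."""
    z = zener_hollomon(strain_rate, temp_c)
    return math.asinh((z / A_CONST) ** (1.0 / N_EXP)) / ALPHA


# Corners of the tested window: 900-1100 degC and 0.1-20 1/s
stress_cold_slow = peak_stress(0.1, 900.0)
stress_hot_slow = peak_stress(0.1, 1100.0)
stress_hot_fast = peak_stress(20.0, 1100.0)
```

Because Z increases with strain rate and decreases with temperature, and the inverse relation is monotonic in Z, the qualitative trend holds for any positive choice of the constants.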
Abstract: Traumatic brain injury (TBI) is a public health problem with an undue economic burden that impacts nearly every age, ethnic, and gender group across the globe (Capizzi et al., 2020). TBIs are often sustained during a dynamic range of exposures to energetic environmental forces, and as such outcomes are typically heterogeneous in severity and pathology (Capizzi et al., 2020).
Abstract: Revealing the structural evolution of interfacial active species during a dynamic catalytic process is a challenging but pivotal issue for the rational design of high-performance catalysts. Here, we successfully prepare sub-nanometric Pt clusters (~0.8 nm) encapsulated within the defects of CeO_(2) nanorods via an in-situ defect engineering methodology. The as-prepared Pt@d-CeO_(2) catalyst shows significantly higher activity and stability in the water-gas shift (WGS) reaction than other analogs. Based on controlled experiments and complementary (in-situ) spectroscopic studies, a reversible encapsulation induced by active site transformation between the Pt^(2+)-terminal hydroxyl and Pt^(δ+)-O vacancy species at the interface is revealed, which gives rise to the enhanced performance. Our findings not only offer practical guidance for the design of high-efficiency catalysts but also bring a new, holistic understanding of the exceptional WGS performance, which shows great application potential in materials and catalysis.
Funding: Funded by the National Natural Science Foundation of China (No. 52304133), the National Key R&D Program of China (No. 2022YFC3004605), and the Department of Science and Technology of Liaoning Province (No. 2023-BS-083).
Abstract: Rockbursts, which mainly affect mining roadways, are dynamic disasters arising from the surrounding rock under high stress. Understanding the interaction between supports and the surrounding rock is necessary for effective rockburst control. In this study, the squeezing behavior of the surrounding rock in rockburst roadways is analyzed, and a mechanical model of rockbursts is established that considers the dynamic support stress, yielding formulas and characteristic curves that describe the interaction between the support and the surrounding rock. Design principles and parameters of supports for rockburst control are proposed. The results show that rockburst conditions can form only when the geostress magnitude exceeds a critical value. The main factors influencing the convergence response and rockburst occurrence around roadways are geostress, rock brittleness, uniaxial compressive strength, and roadway excavation size. Roadway support devices can help control rockbursts by suppressing the squeezing evolution of the surrounding rock towards the instability points of rockburst. Further, the higher the strength and the longer the impact stroke of support devices with constant resistance, the more easily multiple balance points can be formed with the surrounding rock to control rockburst occurrence. Supports with a long impact stroke can adapt to varying geostress levels around the roadway, aiding rockburst control. The results offer a quantitative method for designing support systems for rockburst-prone roadways. The design criterion of supports is determined by the intersection between the convergence curve of the surrounding rock and the squeezing deformation curve of the support devices.
Funding: Supported by the National Natural Science Foundation of China (No. 62203256).
Abstract: Generating dynamically feasible trajectories for fixed-wing Unmanned Aerial Vehicles (UAVs) in dense obstacle environments remains computationally intractable. This paper proposes Safe Flight Corridor constrained Sequential Convex Programming (SFC-SCP) to improve the computational efficiency and reliability of trajectory generation. SFC-SCP combines front-end convex polyhedron SFC construction with back-end SCP-based trajectory optimization. A Sparse A* Search (SAS) driven SFC construction method is designed to efficiently generate polyhedron SFCs according to the geometric relation among obstacles and collision-free waypoints. By transforming the nonconvex obstacle-avoidance constraints into linear inequality constraints, the SFC can mitigate the infeasibility of trajectory planning and reduce computational complexity. Then, SCP casts the nonlinear trajectory optimization subject to the SFC into convex programming subproblems to decrease the problem complexity. In addition, a convex optimizer based on the interior point method is customized, where the search direction is calculated via successive elimination to further improve efficiency. Simulation experiments on dense obstacle scenarios show that SFC-SCP can rapidly generate dynamically feasible, safe trajectories. Comparative studies with state-of-the-art SCP-based methods demonstrate the efficiency and reliability merits of SFC-SCP. In addition, the customized convex optimizer outperforms off-the-shelf optimizers in terms of computation time.
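The key simplification described above is that a convex polyhedral corridor turns obstacle avoidance into linear inequalities of the form n · x <= d. The sketch below illustrates only that membership test; it is not the paper's SFC construction or optimizer. The function name, the box-shaped corridor, and the waypoints are hypothetical.

```python
def inside_corridor(point, halfspaces, tol=1e-9):
    """Return True if `point` satisfies every half-space n . x <= d.

    `halfspaces` is a list of (normal, offset) pairs describing one convex
    polyhedron (illustration only; how the polyhedron is built is not shown).
    """
    return all(
        sum(n_i * x_i for n_i, x_i in zip(normal, point)) <= offset + tol
        for normal, offset in halfspaces
    )


# Hypothetical corridor segment: the box 0<=x<=10, 0<=y<=2, 0<=z<=3
corridor = [
    ((1.0, 0.0, 0.0), 10.0), ((-1.0, 0.0, 0.0), 0.0),
    ((0.0, 1.0, 0.0), 2.0), ((0.0, -1.0, 0.0), 0.0),
    ((0.0, 0.0, 1.0), 3.0), ((0.0, 0.0, -1.0), 0.0),
]

safe = inside_corridor((5.0, 1.0, 1.0), corridor)     # interior waypoint
unsafe = inside_corridor((5.0, 3.0, 1.0), corridor)   # violates y <= 2
```

Each inequality is linear in the waypoint coordinates, so a set of such constraints can be handed directly to a convex solver, unlike a "stay outside this obstacle" constraint, which is nonconvex.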
Funding: Financially supported by the National Key Research and Development Program of China (No. 2021YFB3702604), the National Natural Science Foundation of China (No. 52174377), and the Chongqing Natural Science Foundation Project (No. CSTB2023NSCQ-MSX0824). This work was also supported by the Shaanxi Materials Analysis & Research Center and the Analytical & Testing Center of NPU.
Abstract: The β-solidified γ-TiAl alloy holds important application value in the aerospace industry, while its complex phase compositions and geometric structures pose challenges to microstructure control during thermal-mechanical processing. The microstructure evolution of Ti-43Al-4Nb-1Mo-0.2B alloy at 1200 ℃/0.01 s^(−1) was investigated to clarify the coupling role of dynamic recrystallization (DRX) and phase transformation. The results revealed that DRX in α2+γ lamellar colonies was slower than that in the βo+γ mixed structure, and was instead accompanied by intense lamellar kinking and rotation. The initiation and development rates of DRX in the α2, βo, and γ phases decreased sequentially. The asynchronous DRX of the various geometric structures and phase compositions resulted in an uneven deformed microstructure, and the dynamic softening induced by lamellar kinking and rotation was replaced by strengthened DRX as strain increased. Additionally, the blocky α2 phase and the terminals of α2 lamellae were the preferential DRX sites owing to the abundant activated slip systems. The α2→βo transformation within lamellar colonies facilitated DRX and fragmentation of α2 lamellae, while the α2→γ transformation promoted the decomposition of α2 lamellae and DRX of γ lamellae. Moreover, the varied βo+γ mixed structures underwent complicated evolution: (1) the γ→βo transformation occurred at the boundaries of lamellar colonies, followed by simultaneous DRX of γ lamellar terminals and the neighboring βo phase; (2) DRX occurred earlier within the band-like βo phase, with delayed DRX in the enclosed γ phase; (3) DRX within the βo synapses and the neighboring γ phase was accelerated owing to the generation of an elastic stress field; (4) dispersed βo particles triggered particle stimulated nucleation (PSN) of the γ phase. Eventually, atomic diffusion along crystal defects in the βo and γ phases caused fracture of the band-like βo phase and formation of massive βo particles, impeding grain boundary migration and hindering DRXed grain growth of the γ phase.