397,983 articles found
Residential Energy Scheduling With Solar Energy Based on Dyna Adaptive Dynamic Programming
1
Authors: Kang Xiong, Qinglai Wei, Hongyang Li. Source: IEEE/CAA Journal of Automatica Sinica, 2025, Issue 2, pp. 403-413 (11 pages)
Learning-based methods have become mainstream for solving residential energy scheduling problems. To improve the learning efficiency of existing methods and increase the utilization of renewable energy, we propose the Dyna action-dependent heuristic dynamic programming (Dyna-ADHDP) method, which incorporates the ideas of learning and planning from the Dyna framework into action-dependent heuristic dynamic programming. This method defines a continuous action space for precise control of an energy storage system and allows online optimization of algorithm performance during the real-time operation of the residential energy model. Meanwhile, a target network is introduced during the training process to make the training smoother and more efficient. We conducted experimental comparisons with the benchmark method using simulated and real data to verify its applicability and performance. The results confirm the method's excellent performance and generalization capabilities, as well as its effectiveness in increasing renewable energy utilization and extending equipment life.
Keywords: Adaptive dynamic programming (ADP); dynamic residential scenarios; optimal residential energy management; smart grid
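The combination of learning and planning from the Dyna framework, named in the abstract above, can be illustrated with tabular Dyna-Q. This is a minimal sketch and not the Dyna-ADHDP method itself: the chain environment, the function name `dyna_q`, and every hyperparameter are hypothetical, and a small table stands in for the continuous action space and neural networks that the abstract describes.

```python
import random


def dyna_q(n_states=6, episodes=50, planning_steps=10,
           alpha=0.5, gamma=0.9, epsilon=0.1, seed=0):
    """Tabular Dyna-Q on a hypothetical chain of states 0..n_states-1.

    Actions: 0 = left, 1 = right. Reaching the last state gives reward 1
    and ends the episode.
    """
    rng = random.Random(seed)
    goal = n_states - 1
    q = [[0.0, 0.0] for _ in range(n_states)]
    model = {}  # learned model: (state, action) -> (reward, next_state)

    def step(s, a):
        s2 = max(0, s - 1) if a == 0 else min(goal, s + 1)
        return (1.0 if s2 == goal else 0.0), s2

    def update(s, a, r, s2):
        # One-step Q-learning backup; the goal state has no future value.
        target = r + (0.0 if s2 == goal else gamma * max(q[s2]))
        q[s][a] += alpha * (target - q[s][a])

    for _ in range(episodes):
        s = 0
        while s != goal:
            # Epsilon-greedy action choice, with random tie-breaking.
            if rng.random() < epsilon or q[s][0] == q[s][1]:
                a = rng.randrange(2)
            else:
                a = 0 if q[s][0] > q[s][1] else 1
            r, s2 = step(s, a)
            update(s, a, r, s2)        # learning from real experience
            model[(s, a)] = (r, s2)    # model learning
            for _ in range(planning_steps):
                # Planning: replay transitions sampled from the learned model.
                ps, pa = rng.choice(list(model))
                pr, ps2 = model[(ps, pa)]
                update(ps, pa, pr, ps2)
            s = s2
    return q
```

The planning loop reuses each real transition many times, which is the source of the sample-efficiency gain usually attributed to Dyna-style methods.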
Value Iteration-Based Distributed Adaptive Dynamic Programming for Multi-Player Differential Game With Incomplete Information
2
Authors: Yun Zhang, Yuqi Wang, Yunze Cai. Source: IEEE/CAA Journal of Automatica Sinica, 2025, Issue 2, pp. 436-447 (12 pages)
In this paper, a distributed adaptive dynamic programming (ADP) framework based on value iteration is proposed for multi-player differential games. In the game setting, players have no access to the information of others' system parameters or control laws. Each player adopts an on-policy value iteration algorithm as the basic learning framework. To deal with the incomplete information structure, players collect a period of system trajectory data to compensate for the lack of information. The policy updating step is implemented by a nonlinear optimization problem aiming to search for the proximal admissible policy. Theoretical analysis shows that by adopting proximal policy searching rules, the approximated policies can converge to a neighborhood of equilibrium policies. The efficacy of our method is illustrated by three examples, which also demonstrate that the proposed method can accelerate the learning process compared with the centralized learning framework.
Keywords: Distributed adaptive dynamic programming; incomplete information; multi-player differential game (MPDG); value iteration
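For readers unfamiliar with value iteration, the basic learning framework named in the abstract above, the following is a minimal tabular sketch for a single-agent, discrete-time Markov decision process. It is only an illustration of the underlying recursion: the distributed, multi-player, data-driven scheme of the paper is not reproduced, and the function name `value_iteration` and the two-state example are hypothetical.

```python
def value_iteration(transitions, gamma=0.9, tol=1e-10, max_iters=10_000):
    """Tabular value iteration.

    transitions[s][a] is a list of (probability, next_state, reward) tuples.
    Returns the value function and a greedy policy.
    """
    n = len(transitions)

    def q_value(v, s, a):
        return sum(p * (r + gamma * v[s2]) for p, s2, r in transitions[s][a])

    v = [0.0] * n
    for _ in range(max_iters):
        # Bellman optimality backup for every state.
        new_v = [max(q_value(v, s, a) for a in range(len(transitions[s])))
                 for s in range(n)]
        converged = max(abs(x - y) for x, y in zip(new_v, v)) < tol
        v = new_v
        if converged:
            break
    policy = [max(range(len(transitions[s])), key=lambda a: q_value(v, s, a))
              for s in range(n)]
    return v, policy


# Hypothetical example: in state 0, action 0 stays put (reward 0) and
# action 1 moves to state 1 (reward 1); state 1 is absorbing with reward 2.
mdp = [
    [[(1.0, 0, 0.0)], [(1.0, 1, 1.0)]],
    [[(1.0, 1, 2.0)]],
]
values, policy = value_iteration(mdp)
```

With a discount factor of 0.9 the absorbing state is worth 2 / (1 - 0.9) = 20, so moving there from state 0 is worth 1 + 0.9 × 20 = 19.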
Reprogramming induced neurons from olfactory ensheathing glial cells: A feasible approach for spinal cord injury repair
3
Authors: Javier Sierra, María Portela-Lomba, Diana Simón, M. Teresa Moreno-Flores. Source: Neural Regeneration Research, 2026, Issue 1, pp. 296-297 (2 pages)
Every year, around the world, between 250,000 and 500,000 people suffer a spinal cord injury (SCI). SCI is a devastating medical condition that arises from trauma or disease-induced damage to the spinal cord, disrupting the neural connections that allow communication between the brain and the rest of the body, which results in varying degrees of motor and sensory impairment. Disconnection in the spinal tracts is an irreversible condition owing to the poor capacity for spontaneous axonal regeneration in the affected neurons.
Keywords: INJURY; feasible; programming
Hepatocyte nuclear factors dynamically regulate triglyceride metabolic reprogramming in metabolic dysfunction-associated steatotic liver disease:Mechanisms and implications
4
Authors: Su-Qun Li, Jin-Hua Wu, Ying Zhou, Chen-Xi Wang, Li Xie, Si-Ying Liu, Yu-Zhi Su, Wei He, Huan Chen, Wei-Wei Zhong, Yi-Huai He. Source: World Journal of Hepatology, 2025, Issue 10, pp. 60-84 (25 pages)
Metabolic dysfunction-associated steatotic liver disease, characterized by pathological intracellular triglyceride (TG) accumulation, is mechanistically associated with the disrupted spatiotemporal regulation of hepatocyte nuclear factor (HNF)-dependent transcriptional programs. HNFs, including key members such as HNF-1α, HNF-4α, and HNF-6, constitute a liver-enriched family of transcription factors that govern hepatic lipid metabolism through hierarchical transcriptional regulatory networks. These networks critically regulate the dynamic equilibrium of TG metabolism, encompassing TG synthesis, storage, lipolysis, and lipoprotein-mediated export. This review comprehensively deciphers the molecular cascades through which HNF dysfunction exacerbates TG metabolic disorder in metabolic dysfunction-associated steatotic liver disease. Additionally, we evaluate emerging translational strategies targeting key HNF regulatory nodes and discuss current clinical challenges as well as potential solutions.
Keywords: Hepatocyte nuclear factors; Metabolic dysfunction-associated steatotic liver disease; Triglyceride metabolic imbalance; Dynamic dysregulation of transcriptional networks; Clinical translation
Quality-guaranteed Dubins Path Planning for USV Based on Mixed-integer Piecewise linear Programming for Addressing the Extended Minimum-time Intercept Problem
5
Authors: Xing Zhou, Kelin Zhu, Shuang Liu, Zhaoqing Li, Wenxin Zhang, Kang Du. Source: 《哈尔滨工程大学学报(英文版)》 (Journal of Harbin Engineering University, English Edition), 2026, Issue 1, pp. 216-227 (12 pages)
When robotics is used in applications such as antiterrorism or combat, a motion-constrained pursuer vehicle, such as a Dubins unmanned surface vehicle (USV), must get close enough (within a prescribed zero or positive distance) to a moving target as quickly as possible, resulting in the extended minimum-time intercept problem (EMTIP). Existing research has primarily focused on the zero-distance intercept problem, MTIP, establishing the necessary or sufficient conditions for MTIP optimality, and utilizing analytic algorithms, such as root-finding algorithms, to calculate the optimal solutions. However, these approaches depend heavily on the properties of the analytic algorithm, making them inapplicable when problem settings change, such as in the case of a positive effective range or complicated target motions outside uniform rectilinear motion. In this study, an approach employing a high-accuracy and quality-guaranteed mixed-integer piecewise-linear program (QG-PWL) is proposed for the EMTIP. This program can accommodate different effective interception ranges and complicated target motions (variable velocity or complicated trajectories). The high accuracy and quality guarantees of QG-PWL originate from elegant strategies such as piecewise linearization and other developed operation strategies. The approximation error in the intercept path length is proved to be bounded by h^2/(4√2), where h is the piecewise length.
Keywords: Minimum-time intercept problem; Dubins vehicle; Mixed-integer piecewise-linear program; LINEARIZATION; Approximate error; trigonometric function; USV
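The abstract above relies on piecewise linearization with an error that shrinks quadratically in the piece length h. As a minimal sketch of why such bounds arise, the code below measures the gap between a trigonometric function and its piecewise-linear interpolant and compares it with the classical interpolation bound max|f''| · h²/8. This is an assumption-laden illustration: the bound h^2/(4√2) in the abstract concerns the intercept path length and is not derived here, and the helper name `pwl_max_error` is hypothetical.

```python
import math


def pwl_max_error(f, a, b, n_pieces, samples_per_piece=200):
    """Largest sampled gap between f and its piecewise-linear interpolant.

    The interval [a, b] is split into n_pieces equal pieces of length
    h = (b - a) / n_pieces, and f is interpolated linearly on each piece.
    """
    h = (b - a) / n_pieces
    worst = 0.0
    for i in range(n_pieces):
        x0 = a + i * h
        y0, y1 = f(x0), f(x0 + h)
        for k in range(samples_per_piece + 1):
            t = k / samples_per_piece
            approx = (1 - t) * y0 + t * y1  # value on the chord
            worst = max(worst, abs(f(x0 + t * h) - approx))
    return worst


# For sin, |f''| <= 1, so the classical bound is h**2 / 8.
h = math.pi / 32
err = pwl_max_error(math.sin, 0.0, math.pi, 32)
print(err, h * h / 8)
```

Halving h should cut the measured error to roughly a quarter, which is the quadratic behaviour that makes fine piecewise-linear models accurate.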
Dynamic psychological vulnerability and adaptation in rheumatoid arthritis:Trajectories,predictors,and interventions
6
Authors: Xue-Meng Chen, Xian Cheng, Wei Wu. Source: World Journal of Psychiatry, 2026, Issue 1, pp. 32-46 (15 pages)
Rheumatoid arthritis (RA) patients face significant psychological challenges alongside physical symptoms, necessitating a comprehensive understanding of how psychological vulnerability and adaptation patterns evolve throughout the disease course. This review examined 95 studies (2000-2025) from the PubMed, Web of Science, and CNKI databases, including longitudinal cohorts, randomized controlled trials, and mixed-methods research, to characterize the complex interplay between biological, psychological, and social factors affecting RA patients' mental health. Findings revealed three distinct vulnerability trajectories (45% persistently low, 30% fluctuating improvement, 25% persistently high) and four adaptation stages, with critical intervention periods occurring 3-6 months post-diagnosis and during disease flares. Multiple factors significantly influence psychological outcomes, including gender (females showing 1.8-fold increased risk), age (younger patients experiencing 42% higher vulnerability), pain intensity, inflammatory markers, and neuroendocrine dysregulation (48% showing cortisol rhythm disruption). Early psychological intervention (within 3 months of diagnosis) demonstrated robust benefits, reducing depression incidence by 42% with effects persisting 24-36 months, while different modalities showed complementary advantages: cognitive behavioral therapy for depression (Cohen's d = 0.68), mindfulness for pain acceptance (38% improvement), and peer support for meaning reconstruction (25.6% increase). These findings underscore the importance of integrating routine psychological assessment into standard RA care, developing stage-appropriate interventions, and advancing research toward personalized biopsychosocial approaches that address the dynamic psychological dimensions of the disease.
Keywords: Rheumatoid arthritis; Psychological vulnerability; Disease adaptation ability; dynamic changes; Mental health
Lithium-Ion Dynamic Interface Engineering of Nano-Charged Composite Polymer Electrolytes for Solid-State Lithium-Metal Batteries
7
Authors: Shanshan Lv, Jingwen Wang, Yuanming Zhai, Yu Chen, Jiarui Yang, Zhiwei Zhu, Rui Peng, Xuewei Fu, Wei Yang, Yu Wang. Source: Nano-Micro Letters, 2026, Issue 2, pp. 288-305 (18 pages)
Composite polymer electrolytes (CPEs) offer a promising solution for all-solid-state lithium-metal batteries (ASSLMBs). However, conventional nanofillers with Lewis-acid-base surfaces make a limited contribution to improving the overall performance of CPEs because they struggle to achieve robust electrochemical and mechanical interfaces simultaneously. Here, by regulating the surface charge characteristics of halloysite nanotubes (HNTs), we propose a concept of lithium-ion dynamic interface (Li⁺-DI) engineering in nano-charged CPE (NCCPE). Results show that the surface charge characteristics of HNTs fundamentally change the Li⁺-DI, and thereby the mechanical and ion-conduction behaviors of the NCCPEs. In particular, the HNTs with a positively charged surface (HNTs⁺) lead to a higher Li⁺ transference number (0.86) than that of HNTs⁻ (0.73), but a lower toughness (102.13 MJ m⁻³ for HNTs⁺ and 159.69 MJ m⁻³ for HNTs⁻). Meanwhile, a strong interface compatibilization effect by Li⁺ is observed, especially for the HNTs⁺-involved Li⁺-DI, which improves the toughness by 2000% compared with the control. Moreover, HNTs⁺ are more effective than HNTs⁻ at weakening the Li⁺-solvation strength and facilitating the formation of a LiF-rich solid-electrolyte interphase on Li metal. The resultant Li|NCCPE|LiFePO₄ cell delivers a capacity of 144.9 mAh g⁻¹ after 400 cycles at 0.5 C and a capacity retention of 78.6%. This study provides deep insights into the roles of nanofiller surface charges in regulating the mechanical and electrochemical interfaces in ASSLMBs.
Keywords: Charged nanofillers; Nanocomposite polymer electrolyte; dynamic lithium ion interface; Solid ion-conductors; Solid-state lithium-metal battery
Mitochondrial dynamics dysfunction and neurodevelopmental disorders:From pathological mechanisms to clinical translation
8
Authors: Ziqi Yang, Yiran Luo, Zaiqi Yang, Zheng Liu, Meihua Li, Xiao Wu, Like Chen, Wenqiang Xin. Source: Neural Regeneration Research, 2026, Issue 5, pp. 1926-1946 (21 pages)
Mitochondrial dysfunction has emerged as a critical factor in the etiology of various neurodevelopmental disorders, including autism spectrum disorders, attention-deficit/hyperactivity disorder, and Rett syndrome. Although these conditions differ in clinical presentation, they share fundamental pathological features that may stem from abnormal mitochondrial dynamics and impaired autophagic clearance, which contribute to redox imbalance and oxidative stress in neurons. This review aimed to elucidate the relationship between mitochondrial dynamics dysfunction and neurodevelopmental disorders. Mitochondria are highly dynamic organelles that undergo continuous fusion and fission to meet the substantial energy demands of neural cells. Dysregulation of these processes, as observed in certain neurodevelopmental disorders, causes accumulation of damaged mitochondria, exacerbating oxidative damage and impairing neuronal function. The phosphatase and tensin homolog-induced putative kinase 1/E3 ubiquitin-protein ligase pathway is crucial for mitophagy, the process of selectively removing malfunctioning mitochondria. Mutations in genes encoding mitochondrial fusion proteins have been identified in autism spectrum disorders, linking disruptions in the fusion-fission equilibrium to neurodevelopmental impairments. Additionally, animal models of Rett syndrome have shown pronounced defects in mitophagy, reinforcing the notion that mitochondrial quality control is indispensable for neuronal health. Clinical studies have highlighted the importance of mitochondrial disturbances in neurodevelopmental disorders. In autism spectrum disorders, elevated oxidative stress markers and mitochondrial DNA deletions indicate compromised mitochondrial function. Attention-deficit/hyperactivity disorder has also been associated with cognitive deficits linked to mitochondrial dysfunction and oxidative stress. Moreover, induced pluripotent stem cell models derived from patients with Rett syndrome have shown impaired mitochondrial dynamics and heightened vulnerability to oxidative injury, suggesting the role of defective mitochondrial homeostasis in these disorders. From a translational standpoint, multiple therapeutic approaches targeting mitochondrial pathways show promise. Interventions aimed at preserving normal fusion-fission cycles or enhancing mitophagy can reduce oxidative damage by limiting the accumulation of defective mitochondria. Pharmacological modulation of mitochondrial permeability and upregulation of peroxisome proliferator-activated receptor gamma coactivator 1-alpha, an essential regulator of mitochondrial biogenesis, may also ameliorate cellular energy deficits. Identifying early biomarkers of mitochondrial impairment is crucial for precision medicine, since it can help clinicians tailor interventions to individual patient profiles and improve prognoses. Furthermore, integrating mitochondria-focused strategies with established therapies, such as antioxidants or behavioral interventions, may enhance treatment efficacy and yield better clinical outcomes. Leveraging these pathways could open avenues for regenerative strategies, given the influence of mitochondria on neuronal repair and plasticity. In conclusion, this review identifies mitochondrial homeostasis as a unifying therapeutic axis within neurodevelopmental pathophysiology. Disruptions in mitochondrial dynamics and autophagic clearance converge on oxidative stress, and researchers should prioritize validating these interventions in clinical settings to advance precision medicine and enhance outcomes for individuals affected by neurodevelopmental disorders.
Keywords: autophagic clearance; autism spectrum disorders; cellular homeostasis; fusion and fission; mitochondrial dynamics; MITOPHAGY; neural regeneration; neuronal energy metabolism; neurodevelopmental disorders; oxidative stress
Automated Segmentation of Left Ventricle Using Local and Global Intensity Based Active Contour and Dynamic Programming
9
Authors: G. Dharanibai, Anupama Chandrasekharan, Zachariah C. Alex. Source: International Journal of Automation and Computing (EI, CSCD), 2018, Issue 6, pp. 673-688 (16 pages). Cited by: 3
The aim of this work is to develop an improved method, based on a region-based active contour and dynamic programming, for accurate segmentation of the left ventricle (LV) from multi-slice cine short-axis cardiac magnetic resonance (MR) images. Intensity inhomogeneity and weak object boundaries present in MR images hinder the segmentation accuracy. The proposed active contour model, driven by a local Gaussian distribution fitting (LGDF) energy and an auxiliary global intensity fitting energy, improves the accuracy of endocardial boundary detection. The weight of the global energy fitting term is dynamically adjusted using a spatially varying weight function. The dynamic programming scheme proposed for the segmentation of the epicardium considers the myocardium probability map and a distance-weighted edge map in the cost matrix. A radial distance weighted technique and conical geometry are employed for segmenting the basal slices with the left ventricle outflow tract (LVOT) and the most apical slices. The proposed method is validated on a public dataset comprising 45 subjects from the Medical Image Computing and Computer Assisted Interventions (MICCAI) 2009 segmentation challenge. The average percentage of good endocardial and epicardial contours detected is about 99%, the average perpendicular distance of the detected good contours from the manual reference contours is 1.95 mm, and the Dice similarity coefficient between the detected contours and the reference contours is 0.91. The correlation coefficient and the coefficient of determination between the ejection fraction measurements from manual segmentation and the automated method are 0.9781 and 0.9567, respectively; for LV mass these values are 0.9249 and 0.8554. Statistical analysis of the results reveals a good agreement between the clinical parameters determined manually and those estimated using the automated method.
Keywords: Cardiovascular magnetic resonance; left ventricle; ENDOCARDIUM; EPICARDIUM; MYOCARDIUM; segmentation; active contour; dynamic programming
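The abstract above describes a dynamic programming scheme that traces the epicardial boundary through a cost matrix. A minimal sketch of that general idea follows: given a cost for each (radius, angle) cell, it finds the cheapest sequence of radii whose neighbouring values differ by at most a fixed jump. The polar layout, the smoothness constraint, the function name `min_cost_contour`, and the toy costs are all assumptions for illustration; the paper's actual cost terms and any contour-closure handling are not reproduced.

```python
def min_cost_contour(cost, max_jump=1):
    """Cheapest boundary through a cost matrix by dynamic programming.

    cost[r][t] is the cost of placing the boundary at radius index r for
    angle index t. Consecutive radii may differ by at most max_jump.
    Returns (total_cost, list_of_radius_indices).
    """
    n_r, n_t = len(cost), len(cost[0])
    best = [cost[r][0] for r in range(n_r)]  # best cost ending at (r, 0)
    back = []                                # back-pointers per angle
    for t in range(1, n_t):
        new_best, choice = [], []
        for r in range(n_r):
            lo, hi = max(0, r - max_jump), min(n_r - 1, r + max_jump)
            prev = min(range(lo, hi + 1), key=lambda p: best[p])
            new_best.append(best[prev] + cost[r][t])
            choice.append(prev)
        best = new_best
        back.append(choice)
    # Trace the optimal path backwards from the cheapest final radius.
    r = min(range(n_r), key=lambda i: best[i])
    total = best[r]
    path = [r]
    for choice in reversed(back):
        r = choice[r]
        path.append(r)
    path.reverse()
    return total, path


# Hypothetical 3-radius, 4-angle cost matrix (low cost = likely boundary).
toy_cost = [
    [9, 9, 1, 9],
    [1, 1, 9, 1],
    [9, 9, 9, 9],
]
print(min_cost_contour(toy_cost))  # (4, [1, 1, 0, 1])
```

The run time is proportional to the number of cells times the jump window, which is why this kind of search stays cheap even on dense cost maps.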
Novel algorithm for distributed replicas management based on dynamic programming
10
Authors: Wang Tao, Lu Xianliang, Hou Mengshu. Source: Journal of Systems Engineering and Electronics (SCIE, EI, CSCD), 2006, Issue 3, pp. 669-672 (4 pages). Cited by: 1
Replicas can improve data reliability in a distributed system. However, the traditional algorithms for replica management assume that all replicas have uniform reliability, which is inaccurate in some real systems. To address this problem, a novel algorithm based on dynamic programming is proposed to manage the number and distribution of replicas across different nodes. Using a Markov model, replica management is organized as a multi-phase process, and the recursion equations are provided. The algorithm takes into account the heterogeneity of nodes, the expense of maintaining replicas, and the space occupied. Under these constraints, the algorithm achieves high data reliability in a distributed system. The results of a case analysis demonstrate the feasibility of the algorithm.
Keywords: DISTRIBUTED; replicas; MARKOV; dynamic programming
A Hybrid Dynamic Programming Method for Concave Resource Allocation Problems
11
Authors: 姜计荣, 孙小玲. Source: Journal of Shanghai University (English Edition) (CAS), 2005, Issue 2, pp. 95-98 (4 pages)
The concave resource allocation problem is an integer programming problem of minimizing a nonincreasing concave function subject to a convex nondecreasing constraint and bounded integer variables. This class of problems is encountered in optimization models involving economies of scale. In this paper, a new hybrid dynamic programming method was proposed for solving concave resource allocation problems. A convex underestimating function was used to approximate the objective function, and the resulting convex subproblem was solved with a dynamic programming technique after transforming it into a 0-1 linear knapsack problem. To ensure convergence, monotonicity and domain cut techniques were employed to remove certain integer boxes and partition the revised domain into a union of integer boxes. Computational results were given to show the efficiency of the algorithm.
Keywords: nonlinear integer programming; resource allocation; linear underestimation; 0-1 linearization; dynamic programming
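The abstract above solves its convex subproblem by dynamic programming after a transformation into a 0-1 linear knapsack problem. The textbook dynamic program for that knapsack form is sketched below in its usual maximization form. The transformation from the resource allocation problem is not reproduced, and the function name `knapsack_01` and the sample data are hypothetical.

```python
def knapsack_01(values, weights, capacity):
    """Maximum total value of items, each used at most once, within capacity.

    Weights and capacity are non-negative integers. best[c] holds the best
    value achievable with total weight at most c.
    """
    best = [0] * (capacity + 1)
    for v, w in zip(values, weights):
        # Iterate capacities downwards so each item is counted only once.
        for c in range(capacity, w - 1, -1):
            best[c] = max(best[c], best[c - w] + v)
    return best[capacity]


print(knapsack_01([60, 100, 120], [10, 20, 30], 50))  # 220
```

The table has one entry per unit of capacity, so the run time is proportional to the number of items times the capacity.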
Shrek: a dynamic object-oriented programming language
12
Authors: 曹璟, 徐宝文, 周毓明. Source: Journal of Southeast University (English Edition) (EI, CAS), 2009, Issue 1, pp. 31-35 (5 pages). Cited by: 1
From the perspective of theoretical study, the models of existing object-oriented programming languages have some faults. For example, C# does not support metaclasses, and the primitive types of Java and C# are not objects. This paper therefore designs a programming language, Shrek, which integrates many language features and constructions in a compact and consistent model. The Shrek language is a class-based, purely object-oriented language. It has a dynamic strong type system and adopts a single-inheritance mechanism with Mixin as its complement. It has a consistent class instantiation and inheritance structure, and the ability of intercessive structural computational reflection, which enables it to support safe metaclass programming. It also supports multi-thread programming and automatic garbage collection, and strengthens its expressive power by adopting a native method mechanism. A prototype system of the Shrek language has been implemented, and the anticipated design goals have been achieved.
Keywords: dynamic typing; metaclass programming; computational reflection; native method; object-oriented programming language
Parallel Control for Optimal Tracking via Adaptive Dynamic Programming
13
Authors: Jingwei Lu, Qinglai Wei, Fei-Yue Wang. Source: IEEE/CAA Journal of Automatica Sinica (SCIE, EI, CSCD), 2020, Issue 6, pp. 1662-1674 (13 pages). Cited by: 26
This paper studies the problem of optimal parallel tracking control for continuous-time general nonlinear systems. Unlike existing optimal state feedback control, the control input of the optimal parallel control is introduced into the feedback system. However, because the control input is introduced into the feedback system, the optimal state feedback control methods cannot be applied directly. To address this problem, an augmented system and an augmented performance index function are first proposed. Thus, the general nonlinear system is transformed into an affine nonlinear system. The difference between the optimal parallel control and the optimal state feedback control is analyzed theoretically. It is proven that the optimal parallel control with the augmented performance index function can be seen as the suboptimal state feedback control with the traditional performance index function. Moreover, an adaptive dynamic programming (ADP) technique is utilized to implement the optimal parallel tracking control, using a critic neural network (NN) to approximate the value function online. The stability analysis of the closed-loop system is performed using Lyapunov theory, and the tracking error and NN weight errors are uniformly ultimately bounded (UUB). Also, the optimal parallel controller guarantees the continuity of the control input when there are finite jump discontinuities in the reference signals. Finally, the effectiveness of the developed optimal parallel control method is verified in two cases.
Keywords: Adaptive dynamic programming (ADP); nonlinear optimal control; parallel controller; parallel control theory; parallel system; tracking control; neural network (NN)
PDP: Parallel Dynamic Programming
14
Authors: Fei-Yue Wang, Jie Zhang, Qinglai Wei, Xinhu Zheng, Li Li. Source: IEEE/CAA Journal of Automatica Sinica (SCIE, EI, CSCD), 2017, Issue 1, pp. 1-5 (5 pages). Cited by: 15
Deep reinforcement learning is a focus research area in artificial intelligence. The principle of optimality in dynamic programming is a key to the success of reinforcement learning methods. The principle of adaptive dynamic programming (ADP) is first presented instead of direct dynamic programming (DP), and the inherent relationship between ADP and deep reinforcement learning is developed. Next, analytics intelligence, as the necessary requirement for real reinforcement learning, is discussed. Finally, the principle of parallel dynamic programming, which integrates dynamic programming and analytics intelligence, is presented as the future computational intelligence.
Keywords: Parallel dynamic programming; dynamic programming; Adaptive dynamic programming; Reinforcement learning; Deep learning; Neural networks; Artificial intelligence
Residential Energy Scheduling for Variable Weather Solar Energy Based on Adaptive Dynamic Programming
15
Authors: Derong Liu, Yancai Xu, Qinglai Wei, Xinliang Liu. Source: IEEE/CAA Journal of Automatica Sinica (SCIE, EI, CSCD), 2018, Issue 1, pp. 36-46 (11 pages). Cited by: 18
The residential energy scheduling of solar energy is an important research area of the smart grid. On the demand side, factors such as household loads, storage batteries, the outside public utility grid, and renewable energy resources combine into a nonlinear, time-varying, indefinite, and complex system, which is difficult to manage or optimize. Many nations have already applied residential real-time pricing to balance the burden on their grid. In order to enhance the electricity efficiency of the residential micro grid, this paper presents an action dependent heuristic dynamic programming (ADHDP) method to solve the residential energy scheduling problem. The highlights of this paper are listed below. First, weather-type classification is adopted to establish three types of programming models based on the features of the solar energy. In addition, the priorities of different energy resources are set to reduce the loss of electrical energy transmissions. Second, three ADHDP-based neural networks, which can update themselves during applications, are designed to manage the flows of electricity. Third, simulation results show that the proposed scheduling method has effectively reduced the total electricity cost and improved the load balancing process. The comparison with the particle swarm optimization algorithm further shows that the present method has a promising effect on energy management to save cost.
Keywords: Action dependent heuristic dynamic programming; adaptive dynamic programming; control strategy; residential energy management; smart grid
Recent Progress in Reinforcement Learning and Adaptive Dynamic Programming for Advanced Control Applications
16
Authors: Ding Wang, Ning Gao, Derong Liu, Jinna Li, Frank L. Lewis. Source: IEEE/CAA Journal of Automatica Sinica (SCIE, EI, CSCD), 2024, Issue 1, pp. 18-36 (19 pages). Cited by: 11
Reinforcement learning (RL) has roots in dynamic programming and is called adaptive/approximate dynamic programming (ADP) within the control community. This paper reviews recent developments in ADP along with RL and its applications to various advanced control fields. First, the background of the development of ADP is described, emphasizing the significance of regulation and tracking control problems. Some effective offline and online algorithms for ADP/adaptive critic control are presented, where the main results for discrete-time systems and continuous-time systems are surveyed, respectively. Then, the research progress on adaptive critic control based on the event-triggered framework and under uncertain environments is discussed, respectively, where event-based design, robust stabilization, and game design are reviewed. Moreover, the extensions of ADP for addressing control problems under complex environments attract enormous attention. The ADP architecture is revisited from the perspective of data-driven and RL frameworks, showing how they promote ADP formulation significantly. Finally, several typical control applications with respect to RL and ADP are summarized, particularly in the fields of wastewater treatment processes and power systems, followed by some general prospects for future research. Overall, the comprehensive survey on ADP and RL for advanced control applications has demonstrated its remarkable potential within the artificial intelligence era. In addition, it also plays a vital role in promoting environmental protection and industrial intelligence.
Keywords: Adaptive dynamic programming (ADP); advanced control; complex environment; data-driven control; event-triggered design; intelligent control; neural networks; nonlinear systems; optimal control; reinforcement learning (RL)
Adaptive event-triggered distributed optimal guidance design via adaptive dynamic programming 被引量:7
17
Authors: Teng LONG, Yan CAO, Jingliang SUN, Guangtong XU. Chinese Journal of Aeronautics (SCIE, EI, CAS, CSCD), 2022, Issue 7, pp. 113-127 (15 pages)
In this paper, the multi-missile cooperative guidance system is formulated as a general nonlinear multi-agent system. To save limited communication resources, an adaptive event-triggered optimal guidance law is proposed by designing a synchronization-error-driven triggering condition, which combines consensus control with the Adaptive Dynamic Programming (ADP) technique. The developed event-triggered distributed control law can then be obtained by finding an approximate solution of the event-triggered coupled Hamilton-Jacobi-Bellman (HJB) equation. To this end, a critic network architecture is constructed, in which an adaptive weight updating law is designed to estimate the cooperative optimal cost function online. The event-triggered closed-loop system is decomposed into two subsystems: one with flow dynamics and one with jump dynamics. Using the Lyapunov method, the stability of the closed-loop system is guaranteed and all signals are ensured to be Uniformly Ultimately Bounded (UUB). Furthermore, Zeno behavior is avoided. Simulation results demonstrate the effectiveness of the proposed method.
Keywords: Adaptive dynamic programming; Distributed control; Event-triggered; Guidance and control; Multi-agent system
Approach of service recovery decision-making based on Bellman dynamic programming
18
Authors: 何蕾, 任江春, 王志英. Journal of Southeast University (English Edition) (EI, CAS), 2008, Issue 3, pp. 377-380 (4 pages)
Based on service-oriented architecture (SOA), a Bellman-dynamic-programming-based approach to service recovery decision-making is proposed to make valid recovery decisions. As preparatory work, both the attributes and the processes of services in the controllable distributed information system are analyzed. Using the idea of service composition as a reference, the approach translates recovery decision-making into an artificial intelligence (AI) planning problem in two steps. The first is self-organization based on a logical view of the network, and the second is the definition of evaluation standards. By applying Bellman dynamic programming to solve the planning problem, the approach offers timely emergency response and optimal recovery source selection while meeting multiple quality of service (QoS) requirements. Experimental results demonstrate the rationality and optimality of the approach, and a theoretical analysis of its computational complexity, together with a comparison with conventional methods, shows its high efficiency.
Keywords: service recovery decision-making; Bellman dynamic programming; quality of service (QoS); service-oriented architecture (SOA)
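The abstract above says that Bellman dynamic programming is used to pick an optimal recovery source. The listing does not give the paper's model, so the following is only a minimal sketch of the Bellman recursion on a small acyclic graph. The node names, edge costs, and the `plan` function are hypothetical and invented purely for illustration.

```python
from functools import lru_cache

# Hypothetical recovery graph: node -> {successor: cost}.
# Names and costs are assumptions, not taken from the paper.
GRAPH = {
    "failed": {"replica_a": 4.0, "replica_b": 2.0},
    "replica_a": {"restored": 1.0},
    "replica_b": {"restored": 5.0},
    "restored": {},
}


def plan(graph, start, goal):
    """Return (total_cost, path) with minimal summed edge cost on an acyclic graph."""

    @lru_cache(maxsize=None)
    def best(node):
        # Bellman recursion: V(goal) = 0,
        # V(n) = min over successors s of cost(n, s) + V(s).
        if node == goal:
            return 0.0, (node,)
        options = []
        for succ, cost in graph[node].items():
            sub_cost, sub_path = best(succ)
            if sub_cost != float("inf"):
                options.append((cost + sub_cost, (node,) + sub_path))
        # A dead end (no route to the goal) gets infinite cost.
        return min(options) if options else (float("inf"), ())

    total, path = best(start)
    return total, list(path)


cost, path = plan(GRAPH, "failed", "restored")
```

Memoizing `best` means each node's cost-to-go is computed once, which is the property that makes the dynamic-programming formulation cheaper than enumerating every path.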
UAV flight strategy algorithm based on dynamic programming (cited by: 7)
19
Authors: ZHANG Zixuan, WU Qinhao, ZHANG Bo, YI Xiaodong, TANG Yuhua. Journal of Systems Engineering and Electronics (SCIE, EI, CSCD), 2018, Issue 6, pp. 1293-1299 (7 pages)
Unmanned aerial vehicles (UAVs) may play an important role in data collection and offloading in vast areas where wireless sensor networks are deployed, and a UAV's action strategy strongly affects both applicability and computational complexity. Dynamic programming (DP) is well suited to UAV path planning, but it has problems with applicability to special terrain environments and with algorithmic complexity. Based on an analysis of DP, this paper proposes a hierarchical directional DP (DDP) algorithm based on direction determination and a hierarchical model. We compare our method with Q-learning and the DP algorithm in experiments, and the results show that our method improves terrain applicability while greatly reducing computational complexity.
Keywords: motion state space; map stratification; computational complexity; dynamic programming (DP); environmental adaptability
Using approximate dynamic programming for multi-ESM scheduling to track ground moving targets (cited by: 6)
20
Authors: WAN Kaifang, GAO Xiaoguang, LI Bo, LI Fei. Journal of Systems Engineering and Electronics (SCIE, EI, CSCD), 2018, Issue 1, pp. 74-85 (12 pages)
This paper studies the adaptive scheduling of multiple electronic support measures (multi-ESM) in a ground moving radar target tracking application. This is a sequential decision-making problem in an uncertain environment. For adaptive selection of appropriate ESMs, we generalize an approximate dynamic programming (ADP) framework to the dynamic case and define the environment model and the agent model. To handle partial observability, we apply the unscented Kalman filter (UKF) algorithm for belief state estimation. To reduce the computational burden, a simulation-based rollout approach with a redesigned base policy is proposed to approximate the long-term cumulative reward, and Monte Carlo sampling is incorporated into the rollout to estimate the expected reward. Experiments indicate that our method outperforms other strategies, particularly in larger-scale problems.
Keywords: sensor scheduling; target tracking; approximate dynamic programming; non-myopic rollout; belief state
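The abstract above describes rollout with a base policy, using Monte Carlo sampling to estimate the expected cumulative reward. The paper's ESM and target models are not given in this listing, so the following is only a generic sketch on a toy one-dimensional problem. The dynamics in `step`, the `base_policy`, and all parameter values are assumptions made for illustration.

```python
import random


def step(x, a, rng):
    """Toy stochastic dynamics (assumed): move by a plus small noise; reward is -|x'|."""
    nxt = x + a + rng.uniform(-0.2, 0.2)
    return nxt, -abs(nxt)


def base_policy(x):
    """Simple heuristic base policy (assumed): always step toward zero."""
    return -1 if x > 0 else 1


def rollout_action(state, actions, step, base_policy, horizon, n_samples, rng):
    """Pick the first action whose Monte Carlo rollout return is highest."""

    def estimate(first_action):
        total = 0.0
        for _ in range(n_samples):
            s, a, ret = state, first_action, 0.0
            for _ in range(horizon):
                s, r = step(s, a, rng)
                ret += r
                # After the first step, follow the base policy.
                a = base_policy(s)
            total += ret
        # Sample mean approximates the expected cumulative reward.
        return total / n_samples

    return max(actions, key=estimate)


rng = random.Random(0)
best = rollout_action(5.0, (-1, 1), step, base_policy,
                      horizon=3, n_samples=20, rng=rng)
```

Starting from `x = 5.0`, stepping toward zero first gives a strictly better return than stepping away for every possible noise draw in this toy model, so the rollout selects `-1`. A realistic scheduler would replace `step` with a simulation of the belief-state update.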