期刊文献+
共找到5,980篇文章
< 1 2 250 >
每页显示 20 50 100
A Three-section Algorithm of Dynamic Programming Based on Three-stage Decomposition System Model for Grade Transition Trajectory Optimization Problems
1
作者 魏宇杰 江永亨 黄德先 《Chinese Journal of Chemical Engineering》 SCIE EI CAS CSCD 2014年第10期1122-1130,共9页
This paper introduces a practical solving scheme of gradetransition trajectory optimization(GTTO) problems under typical certificate-checking–updating framework. Due to complicated kinetics of polymerization,differen... This paper introduces a practical solving scheme of gradetransition trajectory optimization(GTTO) problems under typical certificate-checking–updating framework. Due to complicated kinetics of polymerization,differential/algebraic equations(DAEs) always cause great computational burden and system non-linearity usually makes GTTO non-convex bearing multiple optima. Therefore, coupled with the three-stage decomposition model, a three-section algorithm of dynamic programming(TSDP) is proposed based on the general iteration mechanism of iterative programming(IDP) and incorporated with adaptivegrid allocation scheme and heuristic modifications. The algorithm iteratively performs dynamic programming with heuristic modifications under constant calculation loads and adaptively allocates the valued computational resources to the regions that can further improve the optimality under the guidance of local error estimates. TSDP is finally compared with IDP and interior point method(IP) to verify its efficiency of computation. 展开更多
关键词 Gradetransition TRAJECTORY optimization Adaptivegrid ALLOCATION HEURISTIC modifications three-section dynamic programming Three-stage DECOMPOSITION model
在线阅读 下载PDF
Residential Energy Scheduling With Solar Energy Based on Dyna Adaptive Dynamic Programming
2
作者 Kang Xiong Qinglai Wei Hongyang Li 《IEEE/CAA Journal of Automatica Sinica》 2025年第2期403-413,共11页
Learning-based methods have become mainstream for solving residential energy scheduling problems. In order to improve the learning efficiency of existing methods and increase the utilization of renewable energy, we pr... Learning-based methods have become mainstream for solving residential energy scheduling problems. In order to improve the learning efficiency of existing methods and increase the utilization of renewable energy, we propose the Dyna actiondependent heuristic dynamic programming(Dyna-ADHDP)method, which incorporates the ideas of learning and planning from the Dyna framework in action-dependent heuristic dynamic programming. This method defines a continuous action space for precise control of an energy storage system and allows online optimization of algorithm performance during the real-time operation of the residential energy model. Meanwhile, the target network is introduced during the training process to make the training smoother and more efficient. We conducted experimental comparisons with the benchmark method using simulated and real data to verify its applicability and performance. The results confirm the method's excellent performance and generalization capabilities, as well as its excellence in increasing renewable energy utilization and extending equipment life. 展开更多
关键词 Adaptive dynamic programming(ADP) dynamic residential scenarios optimal residential energy management smart grid
在线阅读 下载PDF
Value Iteration-Based Distributed Adaptive Dynamic Programming for Multi-Player Differential Game With Incomplete Information
3
作者 Yun Zhang Yuqi Wang Yunze Cai 《IEEE/CAA Journal of Automatica Sinica》 2025年第2期436-447,共12页
In this paper,a distributed adaptive dynamic programming(ADP)framework based on value iteration is proposed for multi-player differential games.In the game setting,players have no access to the information of others&#... In this paper,a distributed adaptive dynamic programming(ADP)framework based on value iteration is proposed for multi-player differential games.In the game setting,players have no access to the information of others'system parameters or control laws.Each player adopts an on-policy value iteration algorithm as the basic learning framework.To deal with the incomplete information structure,players collect a period of system trajectory data to compensate for the lack of information.The policy updating step is implemented by a nonlinear optimization problem aiming to search for the proximal admissible policy.Theoretical analysis shows that by adopting proximal policy searching rules,the approximated policies can converge to a neighborhood of equilibrium policies.The efficacy of our method is illustrated by three examples,which also demonstrate that the proposed method can accelerate the learning process compared with the centralized learning framework. 展开更多
关键词 Distributed adaptive dynamic programming incomplete information multi-player differential game(MPDG) value iteration
在线阅读 下载PDF
Shrek:a dynamic object-oriented programming language 被引量:1
4
作者 曹璟 徐宝文 周毓明 《Journal of Southeast University(English Edition)》 EI CAS 2009年第1期31-35,共5页
From a perspective of theoretical study, there are some faults in the models of the existing object-oriented programming languages. For example, C# does not support metaclasses, the primitive types of Java and C# are ... From a perspective of theoretical study, there are some faults in the models of the existing object-oriented programming languages. For example, C# does not support metaclasses, the primitive types of Java and C# are not objects, etc. So, this paper designs a programming language, Shrek, which integrates many language features and constructions in a compact and consistent model. The Shrek language is a class-based purely object-oriented language. It has a dynamical strong type system, and adopts a single-inheritance mechanism with Mixin as its complement. It has a consistent class instantiation and inheritance structure, and the ability of intercessive structural computational reflection, which enables it to support safe metaclass programming. It also supports multi-thread programming and automatic garbage collection, and enforces its expressive power by adopting a native method mechanism. The prototype system of the Shrek language is implemented and anticipated design goals are achieved. 展开更多
关键词 dynamic typing metaclass programming computational reflection native method object-oriented programming language
在线阅读 下载PDF
Residential Energy Scheduling for Variable Weather Solar Energy Based on Adaptive Dynamic Programming 被引量:18
5
作者 Derong Liu Yancai Xu +1 位作者 Qinglai Wei Xinliang Liu 《IEEE/CAA Journal of Automatica Sinica》 SCIE EI CSCD 2018年第1期36-46,共11页
The residential energy scheduling of solar energy is an important research area of smart grid. On the demand side, factors such as household loads, storage batteries, the outside public utility grid and renewable ener... The residential energy scheduling of solar energy is an important research area of smart grid. On the demand side, factors such as household loads, storage batteries, the outside public utility grid and renewable energy resources, are combined together as a nonlinear, time-varying, indefinite and complex system, which is difficult to manage or optimize. Many nations have already applied the residential real-time pricing to balance the burden on their grid. In order to enhance electricity efficiency of the residential micro grid, this paper presents an action dependent heuristic dynamic programming(ADHDP) method to solve the residential energy scheduling problem. The highlights of this paper are listed below. First,the weather-type classification is adopted to establish three types of programming models based on the features of the solar energy. In addition, the priorities of different energy resources are set to reduce the loss of electrical energy transmissions.Second, three ADHDP-based neural networks, which can update themselves during applications, are designed to manage the flows of electricity. Third, simulation results show that the proposed scheduling method has effectively reduced the total electricity cost and improved load balancing process. The comparison with the particle swarm optimization algorithm further proves that the present method has a promising effect on energy management to save cost. 展开更多
关键词 Action dependent heuristic dynamic programming adaptive dynamic programming control strategy residential energy management smart grid
在线阅读 下载PDF
PDP:Parallel Dynamic Programming 被引量:15
6
作者 Fei-Yue Wang Jie Zhang +2 位作者 Qinglai Wei Xinhu Zheng Li Li 《IEEE/CAA Journal of Automatica Sinica》 SCIE EI CSCD 2017年第1期1-5,共5页
Deep reinforcement learning is a focus research area in artificial intelligence.The principle of optimality in dynamic programming is a key to the success of reinforcement learning methods.The principle of adaptive dy... Deep reinforcement learning is a focus research area in artificial intelligence.The principle of optimality in dynamic programming is a key to the success of reinforcement learning methods.The principle of adaptive dynamic programming ADP is first presented instead of direct dynamic programming DP,and the inherent relationship between ADP and deep reinforcement learning is developed.Next,analytics intelligence,as the necessary requirement,for the real reinforcement learning,is discussed.Finally,the principle of the parallel dynamic programming,which integrates dynamic programming and analytics intelligence,is presented as the future computational intelligence.©2014 Chinese Association of Automation. 展开更多
关键词 Parallel dynamic programming dynamic programming Adaptive dynamic programming Reinforcement learning Deep learning Neural networks Artificial intelligence
在线阅读 下载PDF
Using approximate dynamic programming for multi-ESM scheduling to track ground moving targets 被引量:6
7
作者 WAN Kaifang GAO Xiaoguang +1 位作者 LI Bo LI Fei 《Journal of Systems Engineering and Electronics》 SCIE EI CSCD 2018年第1期74-85,共12页
This paper researches the adaptive scheduling problem of multiple electronic support measures(multi-ESM) in a ground moving radar targets tracking application. It is a sequential decision-making problem in uncertain e... This paper researches the adaptive scheduling problem of multiple electronic support measures(multi-ESM) in a ground moving radar targets tracking application. It is a sequential decision-making problem in uncertain environment. For adaptive selection of appropriate ESMs, we generalize an approximate dynamic programming(ADP) framework to the dynamic case. We define the environment model and agent model, respectively. To handle the partially observable challenge, we apply the unsented Kalman filter(UKF) algorithm for belief state estimation. To reduce the computational burden, a simulation-based approach rollout with a redesigned base policy is proposed to approximate the long-term cumulative reward. Meanwhile, Monte Carlo sampling is combined into the rollout to estimate the expectation of the rewards. The experiments indicate that our method outperforms other strategies due to its better performance in larger-scale problems. 展开更多
关键词 sensor scheduling target tracking approximate dynamic programming non-myopic rollout belief state
在线阅读 下载PDF
UAV flight strategy algorithm based on dynamic programming 被引量:7
8
作者 ZHANG Zixuan WU Qinhao +2 位作者 ZHANG Bo YI Xiaodong TANG Yuhua 《Journal of Systems Engineering and Electronics》 SCIE EI CSCD 2018年第6期1293-1299,共7页
Unmanned aerial vehicles(UAVs) may play an important role in data collection and offloading in vast areas deploying wireless sensor networks, and the UAV’s action strategy has a vital influence on achieving applicabi... Unmanned aerial vehicles(UAVs) may play an important role in data collection and offloading in vast areas deploying wireless sensor networks, and the UAV’s action strategy has a vital influence on achieving applicability and computational complexity. Dynamic programming(DP) has a good application in the path planning of UAV, but there are problems in the applicability of special terrain environment and the complexity of the algorithm.Based on the analysis of DP, this paper proposes a hierarchical directional DP(DDP) algorithm based on direction determination and hierarchical model. We compare our methods with Q-learning and DP algorithm by experiments, and the results show that our method can improve the terrain applicability, meanwhile greatly reduce the computational complexity. 展开更多
关键词 motion state space map stratification computational complexity dynamic programming(DP) envirommental adaptability
在线阅读 下载PDF
Genetic programming for predictions of effectiveness of rolling dynamic compaction with dynamic cone penetrometer test results 被引量:3
9
作者 R.A.T.M.Ranasinghe M.B.Jaksa +1 位作者 F.Pooya Nejad Y.L.Kuo 《Journal of Rock Mechanics and Geotechnical Engineering》 SCIE CSCD 2019年第4期815-823,共9页
Rolling dynamic compaction (RDC),which employs non-circular module towed behind a tractor,is an innovative soil compaction method that has proven to be successful in many ground improvement applications.RDC involves r... Rolling dynamic compaction (RDC),which employs non-circular module towed behind a tractor,is an innovative soil compaction method that has proven to be successful in many ground improvement applications.RDC involves repeatedly delivering high-energy impact blows onto the ground surface,which improves soil density and thus soil strength and stiffness.However,there exists a lack of methods to predict the effectiveness of RDC in different ground conditions,which has become a major obstacle to its adoption.For this,in this context,a prediction model is developed based on linear genetic programming (LGP),which is one of the common approaches in application of artificial intelligence for nonlinear forecasting.The model is based on in situ density-related data in terms of dynamic cone penetrometer (DCP) results obtained from several projects that have employed the 4-sided,8-t impact roller (BH-1300).It is shown that the model is accurate and reliable over a range of soil types.Furthermore,a series of parametric studies confirms its robustness in generalizing data.In addition,the results of the comparative study indicate that the optimal LGP model has a better predictive performance than the existing artificial neural network (ANN) model developed earlier by the authors. 展开更多
关键词 Ground improvement ROLLING dynamic compaction (RDC) Linear genetic programming (LGP) dynamic cone PENETROMETER (DCP) test
在线阅读 下载PDF
Implementable Strategy Research of Brake Energy Recovery Based on Dynamic Programming Algorithm for a Parallel Hydraulic Hybrid Bus 被引量:6
10
作者 Zhong-Liang Zhang Jie Chen 《International Journal of Automation and computing》 EI CSCD 2014年第3期249-255,共7页
The purpose of this paper is to develop an implementable strategy of brake energy recovery for a parallel hydraulic hybrid bus. Based on brake process analysis, a dynamic programming algorithm of brake energy recovery... The purpose of this paper is to develop an implementable strategy of brake energy recovery for a parallel hydraulic hybrid bus. Based on brake process analysis, a dynamic programming algorithm of brake energy recovery is established. And then an implementable strategy of brake energy recovery is proposed by the constraint variable trajectories analysis of the dynamic programming algorithm in the typical urban bus cycle. The simulation results indicate the brake energy recovery efficiency of the accumulator can reach 60% in the dynamic programming algorithm. And the hydraulic hybrid system can output braking torque as much as possible.Moreover, the accumulator has almost equal efficiency of brake energy recovery between the implementable strategy and the dynamic programming algorithm. Therefore, the implementable strategy is very effective in improving the efficiency of brake energy recovery.The road tests show the fuel economy of the hydraulic hybrid bus improves by 22.6% compared with the conventional bus. 展开更多
关键词 Implementable strategy brake energy recovery dynamic programming parallel hydraulic hybrid bus shifting schedule pump/motor displacement.
原文传递
Policy iteration optimal tracking control for chaotic systems by using an adaptive dynamic programming approach 被引量:2
11
作者 魏庆来 刘德荣 徐延才 《Chinese Physics B》 SCIE EI CAS CSCD 2015年第3期87-94,共8页
A policy iteration algorithm of adaptive dynamic programming(ADP) is developed to solve the optimal tracking control for a class of discrete-time chaotic systems. By system transformations, the optimal tracking prob... A policy iteration algorithm of adaptive dynamic programming(ADP) is developed to solve the optimal tracking control for a class of discrete-time chaotic systems. By system transformations, the optimal tracking problem is transformed into an optimal regulation one. The policy iteration algorithm for discrete-time chaotic systems is first described. Then,the convergence and admissibility properties of the developed policy iteration algorithm are presented, which show that the transformed chaotic system can be stabilized under an arbitrary iterative control law and the iterative performance index function simultaneously converges to the optimum. By implementing the policy iteration algorithm via neural networks,the developed optimal tracking control scheme for chaotic systems is verified by a simulation. 展开更多
关键词 adaptive critic designs adaptive dynamic programming approximate dynamic programming neuro-dynamic programming
原文传递
Optimal Control for a Class of Complex Singular System Based on Adaptive Dynamic Programming 被引量:6
12
作者 Zhan Shi Zhanshan Wang 《IEEE/CAA Journal of Automatica Sinica》 EI CSCD 2019年第1期188-197,共10页
This paper presents a new design approach to achieve decentralized optimal control of high-dimension complex singular systems with dynamic uncertainties. Based on robust adaptive dynamic programming(robust ADP) method... This paper presents a new design approach to achieve decentralized optimal control of high-dimension complex singular systems with dynamic uncertainties. Based on robust adaptive dynamic programming(robust ADP) method, controllers for solving the singular systems optimal control problem are designed. The proposed algorithm can work well when the system model is not exactly known but the input and output data can be measured. The policy iteration of each controller only uses their own states and input information for learning,and do not need to know the whole system dynamics. Simulation results on the New England 10-machine 39-bus test system show the effectiveness of the designed controller. 展开更多
关键词 Adaptive dynamic programming (ADP) DECENTRALIZED CONTROL frequency CONTROL power system SINGULAR systems
在线阅读 下载PDF
A COMBINED PARAMETRIC QUADRATIC PROGRAMMING AND PRECISE INTEGRATION METHOD BASED DYNAMIC ANALYSIS OF ELASTIC-PLASTIC HARDENING/SOFTENING PROBLEMS 被引量:3
13
作者 张洪武 张新伟 《Acta Mechanica Sinica》 SCIE EI CAS CSCD 2002年第6期638-648,共11页
The objective of the paper is to develop a new algorithm for numerical solution of dynamic elastic-plastic strain hardening/softening problems. The gradient dependent model is adopted in the numerical model to overcom... The objective of the paper is to develop a new algorithm for numerical solution of dynamic elastic-plastic strain hardening/softening problems. The gradient dependent model is adopted in the numerical model to overcome the result mesh-sensitivity problem in the dynamic strain softening or strain localization analysis. The equations for the dynamic elastic-plastic problems are derived in terms of the parametric variational principle, which is valid for associated, non-associated and strain softening plastic constitutive models in the finite element analysis. The precise integration method, which has been widely used for discretization in time domain of the linear problems, is introduced for the solution of dynamic nonlinear equations. The new algorithm proposed is based on the combination of the parametric quadratic programming method and the precise integration method and has all the advantages in both of the algorithms. Results of numerical examples demonstrate not only the validity, but also the advantages of the algorithm proposed for the numerical solution of nonlinear dynamic problems. 展开更多
关键词 precise integration method parametric quadratic programming method strain localization strain softening dynamic response
在线阅读 下载PDF
Adaptive event-triggered distributed optimal guidance design via adaptive dynamic programming 被引量:7
14
作者 Teng LONG Yan CAO +1 位作者 Jingliang SUN Guangtong XU 《Chinese Journal of Aeronautics》 SCIE EI CAS CSCD 2022年第7期113-127,共15页
In this paper,the multi-missile cooperative guidance system is formulated as a general nonlinear multi-agent system.To save the limited communication resources,an adaptive eventtriggered optimal guidance law is propos... In this paper,the multi-missile cooperative guidance system is formulated as a general nonlinear multi-agent system.To save the limited communication resources,an adaptive eventtriggered optimal guidance law is proposed by designing a synchronization-error-driven triggering condition,which brings together the consensus control with Adaptive Dynamic Programming(ADP)technique.Then,the developed event-triggered distributed control law can be employed by finding an approximate solution of event-triggered coupled Hamilton-Jacobi-Bellman(HJB)equation.To address this issue,the critic network architecture is constructed,in which an adaptive weight updating law is designed for estimating the cooperative optimal cost function online.Therefore,the event-triggered closed-loop system is decomposed into two subsystems:the system with flow dynamics and the system with jump dynamics.By using Lyapunov method,the stability of this closed-loop system is guaranteed and all signals are ensured to be Uniformly Ultimately Bounded(UUB).Furthermore,the Zeno behavior is avoided.Simulation results are finally provided to demonstrate the effectiveness of the proposed method. 展开更多
关键词 Adaptive dynamic programming Distributed control Event-triggered Guidance and control Multi-agent system
原文传递
Parallel Control for Optimal Tracking via Adaptive Dynamic Programming 被引量:26
15
作者 Jingwei Lu Qinglai Wei Fei-Yue Wang 《IEEE/CAA Journal of Automatica Sinica》 SCIE EI CSCD 2020年第6期1662-1674,共13页
This paper studies the problem of optimal parallel tracking control for continuous-time general nonlinear systems.Unlike existing optimal state feedback control,the control input of the optimal parallel control is int... This paper studies the problem of optimal parallel tracking control for continuous-time general nonlinear systems.Unlike existing optimal state feedback control,the control input of the optimal parallel control is introduced into the feedback system.However,due to the introduction of control input into the feedback system,the optimal state feedback control methods can not be applied directly.To address this problem,an augmented system and an augmented performance index function are proposed firstly.Thus,the general nonlinear system is transformed into an affine nonlinear system.The difference between the optimal parallel control and the optimal state feedback control is analyzed theoretically.It is proven that the optimal parallel control with the augmented performance index function can be seen as the suboptimal state feedback control with the traditional performance index function.Moreover,an adaptive dynamic programming(ADP)technique is utilized to implement the optimal parallel tracking control using a critic neural network(NN)to approximate the value function online.The stability analysis of the closed-loop system is performed using the Lyapunov theory,and the tracking error and NN weights errors are uniformly ultimately bounded(UUB).Also,the optimal parallel controller guarantees the continuity of the control input under the circumstance that there are finite jump discontinuities in the reference signals.Finally,the effectiveness of the developed optimal parallel control method is verified in two cases. 展开更多
关键词 Adaptive dynamic programming(ADP) nonlinear optimal control parallel controller parallel control theory parallel system tracking control neural network(NN)
在线阅读 下载PDF
Automated Segmentation of Left Ventricle Using Local and Global Intensity Based Active Contour and Dynamic Programming 被引量:3
16
作者 G.Dharanibai Anupama Chandrasekharan Zachariah C.Alex 《International Journal of Automation and computing》 EI CSCD 2018年第6期673-688,共16页
The aim of this work is to develop an improved region based active contour and dynamic programming based method for accurate segmentation of left ventricle (LV) from multi-slice cine short axis cardiac magnetic reso... The aim of this work is to develop an improved region based active contour and dynamic programming based method for accurate segmentation of left ventricle (LV) from multi-slice cine short axis cardiac magnetic resonance (MR) images. Intensity inhomogeneity and weak object boundaries present in MR images hinder the segmentation accuracy. The proposed active contour model driven by a local Gaussian distribution fitting (LGDF) energy and an auxiliary global intensity fitting energy improves the accuracy of endocardial boundary detection. The weightage of the global energy fitting term is dynamically adjusted using a spatially varying weight function. Dynamic programming scheme proposed for the segmentation of epicardium considers the myocardium probability map and a distance weighted edge map in the cost matrix. Radial distance weighted technique and conical geometry are employed for segmenting the basal slices with left ventricle outflow tract (LVOT) and most apical slices. The proposed method is validated on a public dataset comprising 45 subjects from medical image computing and computer assisted interventions (MICCAI) 2009 segmentation challenge. The average percentage of good endocardial and epicardial contours detected is about 99%, average perpendicular distance of the detected good contours from the manual reference contours is 1.95 mm, and the dice similarity coefficient between the detected contours and the reference contours is 0.91. Correlation coefficient and the coefficient of determination between the ejection fraction measurements from manual segmentation and the automated method are respectively 0.9781 and 0.9567, for LV mass these values are 0.9249 and 0.8554. Statistical analysis of the results reveals a good agreement between the clinical parameters determined manually and those estimated using the automated method. 展开更多
关键词 Cardiovascular magnetic resonance left ventricle ENDOCARDIUM EPICARDIUM MYOCARDIUM segmentation active contour dynamic programming.
原文传递
Dynamic programming methodology for multi-criteria group decision-making under ordinal preferences 被引量:3
17
作者 Wu Li Guanqi Guo +1 位作者 Chaoyuan Yue Yong Zhao 《Journal of Systems Engineering and Electronics》 SCIE EI CSCD 2010年第6期975-980,共6页
A method of minimizing rankings inconsistency is proposed for a decision-making problem with rankings of alternatives given by multiple decision makers according to multiple criteria. For each criteria, at first, the ... A method of minimizing rankings inconsistency is proposed for a decision-making problem with rankings of alternatives given by multiple decision makers according to multiple criteria. For each criteria, at first, the total inconsistency between the rankings of all alternatives for the group and the ones for every decision maker is defined after the decision maker weights in respect to the criteria are considered. Similarly, the total inconsistency between their final rankings for the group and the ones under every criteria is determined after the criteria weights are taken into account. Then two nonlinear integer programming models minimizing respectively the two total inconsistencies above are developed and then transformed to two dynamic programming models to obtain separately the rankings of all alternatives for the group with respect to each criteria and their final rankings. A supplier selection case illustrated the proposed method, and some discussions on the results verified its effectiveness. This work develops a new measurement of ordinal preferences’ inconsistency in multi-criteria group decision-making (MCGDM) and extends the cook-seiford social selection function to MCGDM considering weights of criteria and decision makers and can obtain unique ranking result. 展开更多
关键词 multi-criteria group decision-making (MCGDM) ordinal preference minimum deviation method dynamic programming Cook-Seiford social selection function.
在线阅读 下载PDF
Approximate Dynamic Programming for Stochastic Resource Allocation Problems 被引量:4
18
作者 Ali Forootani Raffaele Iervolino +1 位作者 Massimo Tipaldi Joshua Neilson 《IEEE/CAA Journal of Automatica Sinica》 SCIE EI CSCD 2020年第4期975-990,共16页
A stochastic resource allocation model, based on the principles of Markov decision processes(MDPs), is proposed in this paper. In particular, a general-purpose framework is developed, which takes into account resource... A stochastic resource allocation model, based on the principles of Markov decision processes(MDPs), is proposed in this paper. In particular, a general-purpose framework is developed, which takes into account resource requests for both instant and future needs. The considered framework can handle two types of reservations(i.e., specified and unspecified time interval reservation requests), and implement an overbooking business strategy to further increase business revenues. The resulting dynamic pricing problems can be regarded as sequential decision-making problems under uncertainty, which is solved by means of stochastic dynamic programming(DP) based algorithms. In this regard, Bellman’s backward principle of optimality is exploited in order to provide all the implementation mechanisms for the proposed reservation pricing algorithm. The curse of dimensionality, as the inevitable issue of the DP both for instant resource requests and future resource reservations,occurs. In particular, an approximate dynamic programming(ADP) technique based on linear function approximations is applied to solve such scalability issues. Several examples are provided to show the effectiveness of the proposed approach. 展开更多
关键词 Approximate dynamic programming(ADP) dynamic programming(DP) Markov decision processes(MDPs) resource allocation problem
在线阅读 下载PDF
Smooth free-cycle dynamic soaring in unspecified shear wind via quadratic programming 被引量:4
19
作者 Haichao HONG Benedikt GRüTER +1 位作者 Patrick PIPREK Florian HOLZAPFEL 《Chinese Journal of Aeronautics》 SCIE EI CAS CSCD 2022年第7期19-29,共11页
Harvesting wind energy is promising for extending long-endurance flights,which can be greatly facilitated by a flight technique called dynamic soaring.The presented study is concerned with generating model-based traje... Harvesting wind energy is promising for extending long-endurance flights,which can be greatly facilitated by a flight technique called dynamic soaring.The presented study is concerned with generating model-based trajectories with smooth control histories for dynamic soaring maneuvers exploiting wind gradients.The desired smoothness is achieved by introducing a trigonometric series parameterization for the controls,which are formulated with respect to the normalized time.Specifically,the periodicity of the trigonometric functions is leveraged to facilitate the connection of cycles and streamline the problem formulation.Without relying on a specified wind profile,a freefinal-time quadratic programming-based control strategy is developed for the online correction of the flight trajectory,which requires only the instant wind information.Offline and online numerical studies show the trade-off to achieve the smoothness and demonstrate the effectiveness of the proposed method in a varying wind field. 展开更多
关键词 dynamic soaring Optimal control Quadratic programming Trajectory generation Trigonometric series
原文传递
Approach of service recovery decision-making based on Bellman dynamic programming
20
作者 何蕾 任江春 王志英 《Journal of Southeast University(English Edition)》 EI CAS 2008年第3期377-380,共4页
Based on service-oriented architecture(SOA),a Bellman-dynamic-programming-based approach of service recovery decision-making is proposed to make valid recovery decisions.Both the attribute and the process of service... Based on service-oriented architecture(SOA),a Bellman-dynamic-programming-based approach of service recovery decision-making is proposed to make valid recovery decisions.Both the attribute and the process of services in the controllable distributed information system are analyzed as the preparatory work.Using the idea of service composition as a reference,the approach translates the recovery decision-making into a planning problem regarding artificial intelligence (AI) through two steps.The first is the self-organization based on a logical view of the network,and the second is the definition of evaluation standards.Applying Bellman dynamic programming to solve the planning problem,the approach offers timely emergency response and optimal recovery source selection,meeting multiple QoS (quality of service)requirements.Experimental results demonstrate the rationality and optimality of the approach,and the theoretical analysis of its computational complexity and the comparison with conventional methods exhibit its high efficiency. 展开更多
关键词 service recovery decision-making Bellman dynamic programming quality of service (QoS) service-oriented architecture(SOA)
在线阅读 下载PDF
上一页 1 2 250 下一页 到第
使用帮助 返回顶部