期刊文献+
共找到11篇文章
< 1 >
每页显示 20 50 100
HEURISTIC MODELING FOR A DYNAMIC AND GOAL PROGRAMMING IN PRODUCTION PLANNING OF CONTINUOUS MANUFACTURING SYSTEMS 被引量:2
1
作者 JAHAN A ABDOLSHAH M 《Chinese Journal of Mechanical Engineering》 SCIE EI CAS CSCD 2007年第5期110-113,共4页
At the first sight it seems that advanced operation research is not used enough in continuous production systems as comparison with mass production, batch production and job shop systems, but really in a comprehensive... At the first sight it seems that advanced operation research is not used enough in continuous production systems as comparison with mass production, batch production and job shop systems, but really in a comprehensive evaluation the advanced operation research techniques can be used in continuous production systems in developing countries very widely, because of initial inadequate plant layout, stage by stage development of production lines, the purchase of second hand machineries from various countries, plurality of customers. A case of production system planning is proposed for a chemical company in which the above mentioned conditions are almost presented. The goals and constraints in this issue are as follows: (1) Minimizing deviation of customer's requirements. (2) Maximizing the profit. (3) Minimizing the frequencies of changes in formula production. (4) Minimizing the inventory of final products. (5) Balancing the production sections with regard to rate in production. (6) Limitation in inventory of raw material. The present situation is in such a way that various techniques such as goal programming, linear programming and dynamic programming can be used. But dynamic production programming issues are divided into two categories, at first one with limitation in production capacity and another with unlimited production capacity. For the first category, a systematic and acceptable solution has not been presented yet. Therefore an innovative method is used to convert the dynamic situation to a zero- one model. At last this issue is changed to a goal programming model with non-linear limitations with the use of GRG algorithm and that's how it is solved. 展开更多
关键词 heuristic model Dynamic programming Goal programming production planning
在线阅读 下载PDF
Heuristic dynamic programming-based learning control for discrete-time disturbed multi-agent systems
2
作者 Yao Zhang Chaoxu Mu +1 位作者 Yong Zhang Yanghe Feng 《Control Theory and Technology》 EI CSCD 2021年第3期339-353,共15页
Owing to extensive applications in many fields,the synchronization problem has been widely investigated in multi-agent systems.The synchronization for multi-agent systems is a pivotal issue,which means that under the ... Owing to extensive applications in many fields,the synchronization problem has been widely investigated in multi-agent systems.The synchronization for multi-agent systems is a pivotal issue,which means that under the designed control policy,the output of systems or the state of each agent can be consistent with the leader.The purpose of this paper is to investigate a heuristic dynamic programming(HDP)-based learning tracking control for discrete-time multi-agent systems to achieve synchronization while considering disturbances in systems.Besides,due to the difficulty of solving the coupled Hamilton–Jacobi–Bellman equation analytically,an improved HDP learning control algorithm is proposed to realize the synchronization between the leader and all following agents,which is executed by an action-critic neural network.The action and critic neural network are utilized to learn the optimal control policy and cost function,respectively,by means of introducing an auxiliary action network.Finally,two numerical examples and a practical application of mobile robots are presented to demonstrate the control performance of the HDP-based learning control algorithm. 展开更多
关键词 Multi-agent systems heuristic dynamic programming(HDP) Learning control Neural network SYNCHRONIZATION
原文传递
迁移增量启发式动态规划及污水处理应用
3
作者 王鼎 李鑫 《北京工业大学学报》 北大核心 2025年第3期277-283,共7页
针对污水处理系统中的溶解氧(dissolved oxygen,DO)质量浓度控制问题,提出一种迁移增量启发式动态规划(transferable incremental heuristic dynamic programming,TI-HDP)算法。针对污水处理过程的特性,该算法通过将控制变量的更新方式... 针对污水处理系统中的溶解氧(dissolved oxygen,DO)质量浓度控制问题,提出一种迁移增量启发式动态规划(transferable incremental heuristic dynamic programming,TI-HDP)算法。针对污水处理过程的特性,该算法通过将控制变量的更新方式改进为增量形式,提升了算法的抗干扰能力,并弱化了与增量式比例-积分-微分(proportional-integral-derivative,PID)算法之间的结构差异。基于数据驱动的思想,通过利用PID算法所产生的历史数据,成功地将传统控制领域中的专家经验迁移到TI-HDP算法框架中,保证了TI-HDP算法前期控制策略的稳定性。仿真结果表明:与PID算法和传统的启发式动态规划算法相比,所提算法对DO质量浓度具有更高的控制精度。 展开更多
关键词 启发式动态规划(heuristic dynamic programming HDP) 智能控制 知识迁移 非线性系统 神经网络 污水处理
在线阅读 下载PDF
Data-based neural controls for an unknown continuous-time multi-input system with integral reinforcement
4
作者 Yongfeng Lv Jun Zhao +1 位作者 Wan Zhang Huimin Chang 《Control Theory and Technology》 2025年第1期118-130,共13页
Integral reinforcement learning(IRL)is an effective tool for solving optimal control problems of nonlinear systems,and it has been widely utilized in optimal controller design for solving discrete-time nonlinearity.Ho... Integral reinforcement learning(IRL)is an effective tool for solving optimal control problems of nonlinear systems,and it has been widely utilized in optimal controller design for solving discrete-time nonlinearity.However,solving the Hamilton-Jacobi-Bellman(HJB)equations for nonlinear systems requires precise and complicated dynamics.Moreover,the research and application of IRL in continuous-time(CT)systems must be further improved.To develop the IRL of a CT nonlinear system,a data-based adaptive neural dynamic programming(ANDP)method is proposed to investigate the optimal control problem of uncertain CT multi-input systems such that the knowledge of the dynamics in the HJB equation is unnecessary.First,the multi-input model is approximated using a neural network(NN),which can be utilized to design an integral reinforcement signal.Subsequently,two criterion networks and one action network are constructed based on the integral reinforcement signal.A nonzero-sum Nash equilibrium can be reached by learning the optimal strategies of the multi-input model.In this scheme,the NN weights are constantly updated using an adaptive algorithm.The weight convergence and the system stability are analyzed in detail.The optimal control problem of a multi-input nonlinear CT system is effectively solved using the ANDP scheme,and the results are verified by a simulation study. 展开更多
关键词 Adaptive dynamic programming Integral reinforcement Neural networks heuristic dynamic programming Multi-input system
原文传递
Residential Energy Scheduling for Variable Weather Solar Energy Based on Adaptive Dynamic Programming 被引量:18
5
作者 Derong Liu Yancai Xu +1 位作者 Qinglai Wei Xinliang Liu 《IEEE/CAA Journal of Automatica Sinica》 SCIE EI CSCD 2018年第1期36-46,共11页
The residential energy scheduling of solar energy is an important research area of smart grid. On the demand side, factors such as household loads, storage batteries, the outside public utility grid and renewable ener... The residential energy scheduling of solar energy is an important research area of smart grid. On the demand side, factors such as household loads, storage batteries, the outside public utility grid and renewable energy resources, are combined together as a nonlinear, time-varying, indefinite and complex system, which is difficult to manage or optimize. Many nations have already applied the residential real-time pricing to balance the burden on their grid. In order to enhance electricity efficiency of the residential micro grid, this paper presents an action dependent heuristic dynamic programming(ADHDP) method to solve the residential energy scheduling problem. The highlights of this paper are listed below. First,the weather-type classification is adopted to establish three types of programming models based on the features of the solar energy. In addition, the priorities of different energy resources are set to reduce the loss of electrical energy transmissions.Second, three ADHDP-based neural networks, which can update themselves during applications, are designed to manage the flows of electricity. Third, simulation results show that the proposed scheduling method has effectively reduced the total electricity cost and improved load balancing process. The comparison with the particle swarm optimization algorithm further proves that the present method has a promising effect on energy management to save cost. 展开更多
关键词 Action dependent heuristic dynamic programming adaptive dynamic programming control strategy residential energy management smart grid
在线阅读 下载PDF
Learning control of fermentation process with an improved DHP algorithm
6
作者 Dazi Li Ningjia Meng Tianheng Song 《Chinese Journal of Chemical Engineering》 SCIE EI CAS CSCD 2016年第10期1399-1405,共7页
Control of the fed-batch ethanol fermentation processes to produce maximum product ethanol is one of the key issues in the bioreactor system.However,ethanol fermentation processes exhibit complex behavior and nonlinea... Control of the fed-batch ethanol fermentation processes to produce maximum product ethanol is one of the key issues in the bioreactor system.However,ethanol fermentation processes exhibit complex behavior and nonlinear dynamics with respect to the cell mass,substrate,feed-rate,etc.An improved dual heuristic programming algorithm based on the least squares temporal difference with gradient correction(LSTDC) algorithm(LSTDC-DHP) is proposed to solve the learning control problem of a fed-batch ethanol fermentation process.As a new algorithm of adaptive critic designs,LSTDC-DHP is used to realize online learning control of chemical dynamical plants,where LSTDC is commonly employed to approximate the value functions.Application of the LSTDC-DHP algorithm to ethanol fermentation process can realize efficient online learning control in continuous spaces.Simulation results demonstrate the effectiveness of LSTDC-DHP,and show that LSTDC-DHP can obtain the near-optimal feed rate trajectory faster than other-based algorithms. 展开更多
关键词 Dual heuristic programming Batch process Ethanol fermentation process Learning control
在线阅读 下载PDF
Adaptive Multi-Step Evaluation Design With Stability Guarantee for Discrete-Time Optimal Learning Control 被引量:7
7
作者 Ding Wang Jiangyu Wang +2 位作者 Mingming Zhao Peng Xin Junfei Qiao 《IEEE/CAA Journal of Automatica Sinica》 SCIE EI CSCD 2023年第9期1797-1809,共13页
This paper is concerned with a novel integrated multi-step heuristic dynamic programming(MsHDP)algorithm for solving optimal control problems.It is shown that,initialized by the zero cost function,MsHDP can converge t... This paper is concerned with a novel integrated multi-step heuristic dynamic programming(MsHDP)algorithm for solving optimal control problems.It is shown that,initialized by the zero cost function,MsHDP can converge to the optimal solution of the Hamilton-Jacobi-Bellman(HJB)equation.Then,the stability of the system is analyzed using control policies generated by MsHDP.Also,a general stability criterion is designed to determine the admissibility of the current control policy.That is,the criterion is applicable not only to traditional value iteration and policy iteration but also to MsHDP.Further,based on the convergence and the stability criterion,the integrated MsHDP algorithm using immature control policies is developed to accelerate learning efficiency greatly.Besides,actor-critic is utilized to implement the integrated MsHDP scheme,where neural networks are used to evaluate and improve the iterative policy as the parameter architecture.Finally,two simulation examples are given to demonstrate that the learning effectiveness of the integrated MsHDP scheme surpasses those of other fixed or integrated methods. 展开更多
关键词 Adaptive critic artificial neural networks Hamilton-Jacobi-Bellman(HJB)equation multi-step heuristic dynamic programming multi-step reinforcement learning optimal control
在线阅读 下载PDF
Robotic Knee Tracking Control to Mimic the Intact Human Knee Profile Based on Actor-Critic Reinforcement Learning 被引量:2
8
作者 Ruofan Wu Zhikai Yao +1 位作者 Jennie Si He(Helen)Huang 《IEEE/CAA Journal of Automatica Sinica》 SCIE EI CSCD 2022年第1期19-30,共12页
We address a state-of-the-art reinforcement learning(RL)control approach to automatically configure robotic pros-thesis impedance parameters to enable end-to-end,continuous locomotion intended for transfemoral amputee... We address a state-of-the-art reinforcement learning(RL)control approach to automatically configure robotic pros-thesis impedance parameters to enable end-to-end,continuous locomotion intended for transfemoral amputee subjects.Specifically,our actor-critic based RL provides tracking control of a robotic knee prosthesis to mimic the intact knee profile.This is a significant advance from our previous RL based automatic tuning of prosthesis control parameters which have centered on regulation control with a designer prescribed robotic knee profile as the target.In addition to presenting the tracking control algorithm based on direct heuristic dynamic programming(dHDP),we provide a control performance guarantee including the case of constrained inputs.We show that our proposed tracking control possesses several important properties,such as weight convergence of the learning networks,Bellman(sub)optimality of the cost-to-go value function and control input,and practical stability of the human-robot system.We further provide a systematic simulation of the proposed tracking control using a realistic human-robot system simulator,the OpenSim,to emulate how the dHDP enables level ground walking,walking on different terrains and at different paces.These results show that our proposed dHDP based tracking control is not only theoretically suitable,but also practically useful. 展开更多
关键词 Automatic tracking of intact knee configuration of robotic knee prosthesis direct heuristic dynamic programming(dHDP) reinforcement learning control
在线阅读 下载PDF
Multi-period mine planning with multi-process routes 被引量:7
9
作者 Mustafa Kumral 《International Journal of Mining Science and Technology》 SCIE EI 2013年第3期317-321,共5页
This paper attempts to optimize optimal capacities, block routing and mine sequencing problems in a mining system. The solution approach is based on a heuristics and the mixed integer programming (MIP). Unlike previou... This paper attempts to optimize optimal capacities, block routing and mine sequencing problems in a mining system. The solution approach is based on a heuristics and the mixed integer programming (MIP). Unlike previous sequential solution approaches, the problems are herein solved at the same time. Furthermore, the proposed approach guarantees practical solutions because it considers ore material distribution within orebody. The paper has two main contributions: (a) the proposed approach generates production rates in a manner that the capacities are satisfied; (b) the proposed approach does not use pre-defined marginal cut-off grades. Thus, idle capacity problem is eliminated and different scheduling combinations are allowed. To see the performance of the approach proposed, a case study is carried out using a gold data. The schedule generated shows that the approach can determine optimal production rates, block destination and sequencing effectively. 展开更多
关键词 Mine planning Multi-route Sequencing Production rates heuristics Mixed integer programming
在线阅读 下载PDF
A novel adaptive heuristic dynamic programming-based algorithm for aircraft confrontation games 被引量:1
10
作者 Yi Mao Zhijie Chen +1 位作者 Yi Yang Yuxin Hu 《Fundamental Research》 CAS 2021年第6期792-799,共8页
Intelligent confrontation has become a vital technology for future air combats.Confrontation games between a penetrating aircraft and an intercepting aircraft are essential for modern air combats.In addition,the perfo... Intelligent confrontation has become a vital technology for future air combats.Confrontation games between a penetrating aircraft and an intercepting aircraft are essential for modern air combats.In addition,the perfor-mance indexes of both the interceptor and penetrator must be considered.Traditional methods only solve one side’s guidance problem without considering the intelligence of the opponent.In this paper,an adaptive heuristic dynamic programming-based algorithm is proposed for aircraft confrontation games.This algorithm constructs a heuristic dynamic programming model for both confrontation aircraft and then updates the critical and ac-tion network parameters using the dynamic confrontation state information.Numerical simulations indicate that the proposed algorithm can optimize the guidance law for both the interceptor and penetrator and is therefore superior to traditional proportional navigation methods. 展开更多
关键词 Aircraft confrontation Optimal control Neural network Adaptive heuristic dynamic programming Flight simulation
原文传递
On the stability of multicast flow aggregation in IP over optical network for IPTV delivery 被引量:3
11
作者 罗萱 金耀辉 +3 位作者 曾庆济 孙卫强 郭薇 胡卫生 《Chinese Optics Letters》 SCIE EI CAS CSCD 2008年第8期553-557,共5页
The stable multicast flow aggregation (MFA) problem in internet protocol (IP) over optical network under the dynamical scenario is studied. Given an optical network topology, there is a set of head ends and access... The stable multicast flow aggregation (MFA) problem in internet protocol (IP) over optical network under the dynamical scenario is studied. Given an optical network topology, there is a set of head ends and access touters attached to the optical network, in which each head end can provide a set of programs (IP multicasting flows) and each access router requests a set of programs, we find a set of stable light-trees to accommodate the optimally aggregated multicast IP flows if the requests of access touters changed dynamically. We introduce a program correlation matrix to describe the preference of end users' requests. As the original MFA problem is NP-complete, a heuristic approach, named most correlated program first (MCPF), is presented and compared with the extended least tree first (ELTF) algorithm which is topology- aware. Simulation results show that MCPF can achieve better performance than ELTF in terms of stability with negligible increment of network resource usage. 展开更多
关键词 AGGLOMERATION Electric network topology Fiber optic networks heuristic algorithms heuristic programming Internet Mobile telecommunication systems MULTICASTING Nuclear propulsion Optical data processing Optical materials ROUTERS Topology Trees (mathematics)
原文传递
上一页 1 下一页 到第
使用帮助 返回顶部