Disassembly sequence planning of multipart equipment in hydropower stations based on reinforcement learning


Abstract: Mainstream metaheuristic algorithms solve the disassembly sequence planning (DSP) problem inefficiently and with poor stability when applied to the disassembly and maintenance of large electromechanical equipment. To address this, reinforcement learning was combined with a hierarchical strategy, and a Q-learning (QL) algorithm suited to multipart DSP was proposed. First, a DSP data model and a spatial constraint model were constructed, and the equipment components were decomposed, following the hierarchical strategy, into multiple subsets that each contain a small number of parts. Then, an initial R table was built for each subset from the direct assembly constraints between every pair of parts. A sequence-evaluation reward and penalty function, built from a disassembly workload index, was used to update the initial R table and generate the final R table. Using the QL algorithm, each subset was trained iteratively on its final R table until the results converged, yielding a Q table for optimal path decisions. Finally, a hydropower station spherical valve, a bushing extraction device, and a main servomotor were selected as virtual disassembly test objects to verify the method. The results show that the QL algorithm outperforms the genetic algorithm (GA) and the gravitational search algorithm (GSA) in convergence speed, optimization efficiency, and stability. Across the three test objects, its running time was reduced relative to GA and GSA by 97.3% and 98.4%, 87.1% and 94.9%, and 93.4% and 95.0%, respectively. The resulting disassembly sequences are of high quality and meet expectations, which verifies the effectiveness of the method.
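The training loop outlined in the abstract (constraints decide which removals are allowed, a workload-based reward scores each removal, and iteration continues until a Q table supports a greedy path decision) can be illustrated with a minimal tabular Q-learning sketch. Everything concrete here is an assumption for illustration, not the authors' implementation: the four-part subset, the precedence sets in `PRED`, the pairwise workload values in `COST`, the hyperparameters, and the choice to key the Q table by (removed-parts bitmask, last part, next part). The abstract does not specify how the paper lays out its R and Q tables.

```python
import random

# Hypothetical 4-part subset. PRED[p] lists the parts that must be removed before p.
PRED = {0: set(), 1: {0}, 2: {0}, 3: {1, 2}}
# Hypothetical workload of removing part b immediately after part a
# (for example, a tool or direction change). Plays the role of a reward table.
COST = [
    [0, 1, 3, 2],
    [1, 0, 1, 2],
    [3, 1, 0, 1],
    [2, 2, 1, 0],
]
N = len(PRED)
FULL = (1 << N) - 1  # bitmask with every part removed


def feasible(mask, part):
    """A part can be removed if it is still present and all its predecessors are gone."""
    if mask & (1 << part):
        return False
    return all(mask & (1 << q) for q in PRED[part])


def allowed(mask):
    """All parts that may be removed next from the state described by mask."""
    return [p for p in range(N) if feasible(mask, p)]


def train(episodes=2000, alpha=0.5, gamma=0.9, eps=0.2, seed=0):
    """Tabular Q-learning with an epsilon-greedy policy over feasible removals."""
    rng = random.Random(seed)
    q = {}  # (mask, last_part, action) -> estimated return
    for _ in range(episodes):
        mask, last = 0, None
        while mask != FULL:
            actions = allowed(mask)
            if rng.random() < eps:
                a = rng.choice(actions)  # explore
            else:
                a = max(actions, key=lambda p: q.get((mask, last, p), 0.0))  # exploit
            # Reward is the negative workload; the first removal has no predecessor.
            reward = 0.0 if last is None else -COST[last][a]
            nmask = mask | (1 << a)
            best_next = max(
                (q.get((nmask, a, p), 0.0) for p in allowed(nmask)), default=0.0
            )
            old = q.get((mask, last, a), 0.0)
            # Standard update: Q <- Q + alpha * (r + gamma * max Q' - Q)
            q[(mask, last, a)] = old + alpha * (reward + gamma * best_next - old)
            mask, last = nmask, a
    return q


def greedy_sequence(q):
    """Read a disassembly order off the learned Q table."""
    mask, last, seq = 0, None, []
    while mask != FULL:
        a = max(allowed(mask), key=lambda p: q.get((mask, last, p), 0.0))
        seq.append(a)
        mask, last = mask | (1 << a), a
    return seq


sequence = greedy_sequence(train())
print(sequence)
```

With these toy values only two orders satisfy the precedence sets: 0, 1, 2, 3 (total workload 3) and 0, 2, 1, 3 (total workload 6). Keeping each subset small, as the hierarchical strategy in the abstract does, keeps a table of this kind small enough to train quickly.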

Authors: YANG Guicheng; LIU Haitao; WANG Keyuan; WU Yuechao; SU Jizhi; WANG Zhuoyu

Affiliations: Power China Huadong Engineering Corporation Limited, Hangzhou 310014, China; State Grid Xinyuan Group Co., Ltd., Beijing 100052, China; State Grid Electric Power Engineering Research Institute, Beijing 100073, China; Economic and Technological Research Institute of State Grid Hebei Electric Power Co., Ltd., Shijiazhuang 050000, China

Source: Journal of Mechanical & Electrical Engineering (机电工程; Peking University Core Journal), 2025, No. 10, pp. 2001-2009 (9 pages)

Funding: Headquarters Science and Technology Project of State Grid Corporation of China (5200-202356477A-3-2-ZN).

Keywords: equipment maintenance; disassembly sequence planning (DSP); Q-learning (QL) algorithm; genetic algorithm (GA); gravitational search algorithm (GSA); hierarchical strategy

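Classification codes: TH16 [Mechanical Engineering: Mechanical Manufacturing and Automation]; TP18 [Automation and Computer Technology: Control Theory and Control Engineering]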


