分布式强化学习驱动的量子编译自动调优方法

A Distributed Reinforcement Learning Driven Approach to Automatic Tuning of Quantum Compilation

下载PDF

导出

摘要针对强化学习模型应用到量子编译自动调优领域开销大的问题,提出一种分布式强化学习(DRL)驱动的量子编译自动调优方法,通过将经验生成与智能体训练解耦,基于分布式集群实现了并行经验生成。该方法通过建立具有稠密奖励特性的量子编译马尔可夫决策过程(MDP)模型,设计经验生成与智能体训练的解耦机制,结合动态经验加载策略,在保证优化效果的同时提升训练效率。实验结果表明,分布式训练框架训练耗时减少54.6%;优化性能方面,智能体在测试集77.3%的量子线路上表现优于Qiskit-O3编译器,对未见过的Shor算法线路平均减少17.4%量子门数量。 Aiming at the problem of high overhead when applying reinforcement learning models to the field of automatic tuning of quantum compilation,a distributed reinforcement learning(DRL)driven au-tomatic tuning method for quantum compilation is proposed.By decoupling experience generation from agent training,parallel experience generation is achieved based on a distributed cluster.In this method,a markov decision process(MDP)model of quantum compilation is established with the char-acteristic of dense rewards,a decoupling mechanism for experience generation and agent trainingis is designed,and a dynamic experience loading strategy is combined to improve the training efficiency while ensuring the optimization effect.Experiments demonstrate a 54.6%reduction in training time compared to baseline methods.In terms of optimization performance,the agent performs better than the Qiskit-O3 compiler on 77.3%of the quantum circuits in the test set,and the number of quantum gates of the unseen Shor algorithm circuits is reduced by an average of 17.4%.

作者刘毅朱雨许瑾晨杜启明连航涂政 LIU Yi;ZHU Yu;XU Jinchen;DU Qiming;LIAN Hang;TU Zheng(Information Engineering University,Zhengzhou 450001,China)

机构地区信息工程大学

出处《信息工程大学学报》 2025年第4期462-469,共8页 Journal of Information Engineering University

关键词强化学习分布式系统深度Q网络量子编译优化量子编译自动调优 reinforcement learning distributed system deep Q-network quantum compilation opti-mization auto-tuning quantum compilation

分类号 TP311.5 [自动化与计算机技术—计算机软件与理论]

引文网络
相关文献

参考文献2

1朱棣,申涓,李丹,兰巨龙.一种面向网络节能的VNF容量调整方法[J].信息工程大学学报,2023,24(2):183-189. 被引量：1
2陈博,孙鹏浩,兰巨龙,王雨薇,崔鹏帅,申涓.基于多智能体强化学习的域间多链路路由优化[J].信息工程大学学报,2022,23(6):641-647. 被引量：1

二级参考文献3

1张俊,沈苏彬.一种基于SDN的多管理域路由机制[J].计算机技术与发展,2018,28(8):86-90. 被引量：3
2何晓明,刘宁芳,陈文华.基于SDN的互联网域间路由研究[J].通信技术,2020,53(5):1146-1150. 被引量：4
3丛培壮,张宇超,田野,王文东,李丹.跨域场景下的联邦路由机制设计[J].电信科学,2020,36(10):29-36. 被引量：2

信息工程大学学报

2025年第4期

浏览历史

内容加载中请稍等...

分布式强化学习驱动的量子编译自动调优方法

参考文献2

二级参考文献3

相关作者

相关机构

相关主题

浏览历史