Abstract
Stochastic variance reduction methods have achieved great success on large-scale machine learning problems in recent years, and adaptive stepsize schemes further alleviate their parameter-tuning burden. For SVRG-BB, a variance reduction algorithm with an adaptive stepsize, we point out that its design gives rise to a trade-off between progress and the effectiveness of the adaptive stepsize. To better handle this trade-off, we introduce Katyusha momentum and prove the linear convergence of the resulting SVRG-BB-Katyusha algorithm under the strong convexity assumption. Based on a greedy idea, we further propose the SVRG-BB-Katyusha-SPARSE algorithm, which applies Katyusha momentum sparsely in the inner iterations. Numerical experiments on public datasets show that the two proposed algorithms have consistent advantages over SVRG-BB, in the sense that, at a given number of outer iterations, their optimality gaps are smaller than that of SVRG-BB by several orders of magnitude.
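To make the setting concrete, the SVRG-BB baseline that the abstract builds on can be sketched as follows. This is a minimal illustration in Python, not the authors' implementation: the least-squares objective, the function names (`svrg_bb`, `grad_i`, `full_grad`), and all parameter values are assumptions chosen for the example. The outer loop recomputes the full gradient at a snapshot and sets the stepsize by the Barzilai-Borwein (BB) rule; the inner loop takes variance-reduced stochastic gradient steps.

```python
import numpy as np

# Illustrative least-squares problem: f(w) = (1/2n) * ||A w - b||^2.
rng = np.random.default_rng(0)
n, d = 50, 5                          # number of samples, dimension
A = rng.standard_normal((n, d))
w_star = rng.standard_normal(d)
b = A @ w_star                        # consistent system: minimizer is w_star

def full_grad(w):
    """Full gradient over all n samples."""
    return A.T @ (A @ w - b) / n

def grad_i(w, i):
    """Stochastic gradient of the i-th component function."""
    return A[i] * (A[i] @ w - b[i])

def svrg_bb(w0, m=100, eta0=0.01, outer_iters=50):
    """SVRG with a Barzilai-Borwein stepsize (sketch)."""
    w_tilde, eta = w0.copy(), eta0
    prev_w = prev_g = None
    for _ in range(outer_iters):
        g = full_grad(w_tilde)                    # full gradient at snapshot
        if prev_w is not None:                    # BB stepsize from snapshots
            s, y = w_tilde - prev_w, g - prev_g
            eta = (s @ s) / (m * abs(s @ y) + 1e-12)
        prev_w, prev_g = w_tilde.copy(), g.copy()
        w = w_tilde.copy()
        for _ in range(m):                        # variance-reduced inner loop
            i = rng.integers(n)
            w -= eta * (grad_i(w, i) - grad_i(w_tilde, i) + g)
        w_tilde = w                               # last iterate becomes snapshot
    return w_tilde

w_out = svrg_bb(np.zeros(d))
```

The trade-off discussed in the abstract arises because the BB stepsize is computed from the difference of consecutive snapshots: faster inner-loop progress changes what the snapshot difference measures, which can degrade the quality of the resulting stepsize. The paper's contribution is to add Katyusha momentum (densely, or sparsely in the SPARSE variant) on top of this baseline.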
Authors
刘海 (LIU Hai); 郭田德 (GUO Tiande); 韩丛英 (HAN Congying)
School of Mathematical Sciences, University of Chinese Academy of Sciences, Beijing 100049, China
Source
Journal of University of Chinese Academy of Sciences (《中国科学院大学学报(中英文)》)
Indexed in: CAS, CSCD, Peking University Core Journals (北大核心)
2024, No. 5, pp. 577-588 (12 pages)
Funding
Supported by the National Key R&D Program of China (2021YFA1000403), the National Natural Science Foundation of China (11991022, U23B2012), and the Fundamental Research Funds for the Central Universities (E1E40104X2).