Rethinking Deep Generalization Mechanisms: Establishment of Uniform Convergence Bounds Under Overparameterization and High-dimensional Noise Perturbations


Abstract: Deep neural networks combine strong expressive power with excellent generalization performance, which conflicts fundamentally with the classical statistical learning tenet that "model complexity harms generalization" and leaves the analysis of deep generalization mechanisms under traditional frameworks at an impasse. Classical uniform convergence bounds depend on the dimensionality of the parameter space and ignore the implicit bias of the learning algorithm, so they do not directly fit the core characteristics of deep networks. To address this theoretical gap, this paper constructs a new statistical learning framework that incorporates key features of deep models and recasts how uniform convergence theory explains deep generalization mechanisms. By building a surrogate linear model that preserves the overparameterized structure and high-dimensional noise-perturbation features of deep networks, it derives, for the first time, an effective uniform convergence bound and reveals the benign effect of noise perturbations in high-dimensional feature spaces on generalization performance, going beyond the limits of traditional low-dimensional learning theory. Building on this generalization mechanism, it constructs a regularized training procedure that is sensitive to data scale and shows that the uniform convergence bound and the generalization error decay in step as sample complexity grows, confirming that uniform convergence theory can explain the generalization mechanism of deep models. Supported by both theoretical and experimental evidence, the work breaks through the adaptability bottleneck of uniform convergence generalization bounds and reopens a door that was about to close: the use of uniform convergence theory to analyze the generalization of deep models.

Authors: LI Pengqi; DING Lizhong; ZHANG Chunhui; FU Jiarun (School of Computer Science, Beijing Institute of Technology, Beijing 100081, China)

Affiliation: School of Computer Science, Beijing Institute of Technology

Source: Computer Science (《计算机科学》), Peking University Core Journal, 2026, No. 4, pp. 33-39 (7 pages)

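Funding: National Key Research and Development Program of China (2022YFB2703100); National Natural Science Foundation of China (62376028, U22A2099); National Natural Science Foundation of China Excellent Young Scientists Fund (Overseas).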

Keywords: Generalization error; Uniform convergence bound; Pruned hypothesis space; High-dimensional probability; Generalization mechanism

Classification code: TP391 [Automation and Computer Technology: Computer Application Technology]

