2 articles found
LOSS SPIKE IN TRAINING NEURAL NETWORKS
1
Authors: Xiaolong Li, Zhi-Qin John Xu, Zhongwang Zhang. Journal of Computational Mathematics, 2026, No. 2, pp. 369-393 (25 pages)
In this work, we investigate the mechanism underlying loss spikes observed during neural network training. When training enters a region with a lower-loss-as-sharper structure, it becomes unstable, and the loss increases exponentially once the loss landscape is too sharp, producing the rapid ascent of the loss spike. Training stabilizes when it finds a flat region. From a frequency perspective, we explain the rapid descent in loss as being primarily influenced by low-frequency components. We observe a deviation in the first eigendirection, which can be reasonably explained by the frequency principle: low-frequency information is captured rapidly, leading to the rapid descent. Inspired by our analysis of loss spikes, we revisit the link between the maximum eigenvalue of the loss Hessian (λ_max), flatness, and generalization. We suggest that λ_max is a good measure of sharpness but not a good measure of generalization. Furthermore, we experimentally observe that loss spikes can facilitate condensation, causing input weights to evolve towards the same direction. Our experiments also show a correlation (similar trend) between λ_max and condensation. This observation may provide valuable insights for further theoretical research on the relationship between loss spikes, λ_max, and generalization.
Keywords: neural network; loss spike; frequency principle; maximum eigenvalue; flatness; generalization; condensation
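The abstract's link between sharpness (λ_max) and instability can be illustrated with a toy quadratic loss. The sketch below is not the paper's method; it relies on the standard fact that gradient descent on a quadratic with Hessian H is stable only when the learning rate is below 2/λ_max. The matrix `H`, the helper names `power_iteration_lambda_max` and `run_gd`, and all numeric settings are assumptions chosen for illustration.

```python
import numpy as np

def power_iteration_lambda_max(hvp, dim, iters=200, seed=0):
    """Estimate the largest eigenvalue of a symmetric PSD operator
    using only Hessian-vector products (hypothetical helper)."""
    rng = np.random.default_rng(seed)
    v = rng.standard_normal(dim)
    v /= np.linalg.norm(v)
    lam = 0.0
    for _ in range(iters):
        w = hvp(v)
        lam = float(v @ w)          # Rayleigh quotient for the current unit vector
        norm = np.linalg.norm(w)
        if norm == 0.0:
            break
        v = w / norm
    return lam

# Toy quadratic loss L(x) = 0.5 * x^T H x (illustrative Hessian, not from the paper).
H = np.diag([10.0, 1.0, 0.1])
lam_max = power_iteration_lambda_max(lambda v: H @ v, dim=3)

def run_gd(lr, steps=50):
    """Run plain gradient descent from x = (1, 1, 1) and return the final loss."""
    x = np.ones(3)
    for _ in range(steps):
        x = x - lr * (H @ x)        # gradient of the quadratic is H x
    return 0.5 * float(x @ H @ x)

initial_loss = 0.5 * float(np.ones(3) @ H @ np.ones(3))
stable_loss = run_gd(lr=0.9 * 2.0 / lam_max)    # just below the 2/lambda_max threshold
unstable_loss = run_gd(lr=1.1 * 2.0 / lam_max)  # just above it: loss grows geometrically
print(lam_max, initial_loss, stable_loss, unstable_loss)
```

With the learning rate just above 2/λ_max, the component along the top eigendirection is multiplied by a factor of magnitude greater than 1 at every step, which is the toy analogue of the exponential ascent the abstract describes.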
Fault Location Detection of Transmission Lines in Noise Environments Based on Random Matrix Theory (Cited by: 1)
2
Authors: Jun An, Zihan Deng, (+1 more author), Haipeng Chen, Gang Mu. CSEE Journal of Power and Energy Systems (SCIE, EI, CSCD), 2022, No. 4, pp. 1233-1241 (9 pages)
Fault detection and location are critically significant applications of a supervisory control system in a smart grid. Methods based on random matrix theory (RMT) have been applied to measurements to detect short-circuit faults occurring on transmission lines. However, the diagnostic accuracy is influenced by the noise signal in the measurements. This paper examines the relationship between the mean eigenvalue of a random matrix and noise, and theoretically determines the defects of the Mean Spectral Radius (MSR) as a fault-detection indicator, along with a novel indicator, the shifting degree of the maximum eigenvalue, and its threshold. By comparing the indicator with the threshold, the occurrence of a fault can be assessed. Finally, an augmented matrix is constructed to locate the fault area. The proposed method can effectively achieve fault detection via RMT without any influence of noise, and does not depend on system models. The experimental results are based on the IEEE 39-bus system. Actual provincial grid data is also applied to validate the effectiveness of the proposed method.
Keywords: fault detection; maximum eigenvalue; noise; random matrix theory; smart grid
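The general idea of comparing a maximum eigenvalue against a noise-only bound can be sketched as follows. This is not the paper's indicator or threshold, which the abstract does not specify; it substitutes the well-known Marchenko-Pastur upper edge, σ²(1 + √(N/T))², for the largest eigenvalue of a sample covariance built from i.i.d. noise. The matrix sizes, the step of 5.0 added to one row to mimic a fault, and the helper names `max_sample_eigenvalue` and `mp_upper_edge` are all assumptions for illustration.

```python
import numpy as np

def max_sample_eigenvalue(X):
    """Largest eigenvalue of the (uncentered) sample covariance (1/T) X X^T."""
    n_cols = X.shape[1]
    S = (X @ X.T) / n_cols
    return float(np.linalg.eigvalsh(S)[-1])  # eigvalsh returns ascending eigenvalues

def mp_upper_edge(n_rows, n_cols, sigma2=1.0):
    """Marchenko-Pastur upper edge for i.i.d. entries with variance sigma2."""
    c = n_rows / n_cols
    return sigma2 * (1.0 + np.sqrt(c)) ** 2

rng = np.random.default_rng(0)
N, T = 20, 2000                      # 20 measurement channels, 2000 samples (illustrative)
noise = rng.standard_normal((N, T))  # noise-only measurements

faulty = noise.copy()
faulty[3, T // 2:] += 5.0            # hypothetical fault: step change on channel 3

edge = mp_upper_edge(N, T)
lam_noise = max_sample_eigenvalue(noise)
lam_fault = max_sample_eigenvalue(faulty)
print(edge, lam_noise, lam_fault)
```

For noise-only data the largest eigenvalue stays close to the Marchenko-Pastur edge, while the injected step pushes it far beyond the edge, which is the kind of shift a maximum-eigenvalue indicator is meant to expose.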