摘要
针对传统核模糊聚类(KFCM)算法无法克服边界噪声数据影响且对初始聚类中心敏感的不足,提出一种基于样本密度和最大类间方差法相结合的KFCM算法。该算法在传统的KFCM算法中引入样本分布密度作为权重,克服噪声及边界数据对分类中心的影响,使样本的聚类效果更好,同时还可以分析各样本对聚类的贡献程度。此外利用最大类间方差法对样本密度进行分割,得到各类中心点并以此作为KFCM算法的初始聚类中心,克服了传统算法对初始值敏感的不足。对各种实际数据集的测试结果均显示出新算法的优良性能。最后利用新算法对轴承故障进行诊断,试验结果表明新算法的诊断率优于传统的聚类算法。
To solve the problem of sensibility of traditional kernel fuzzy C-means algorithm (KFCM) to outliers and noises in training sets, a novel kernel fuzzy C-means algorithm based on distribution density around samples combined with maximum variance between clusters was proposed. In the proposed method, the value of distribution density around samples was used as weighted values according to the feature of sample distribution to overcome the shortcomings of sensibility to and outliers noises. The maximum variance between clusters was applied to segment the samples’ distribution density vector, and the segmentation results were used as the initial centers of the proposed KFCM algorithm, which overcomes the problem of sensibility to initial values. The proposed method can be applied to analyze the samples’ contribution to clustering performance. The experimental results with various real data sets illustrate the effectiveness of the algorithm. The proposed method can be also utilized in fault diagnosis field which outperforms traditional cluster methods.
出处
《振动与冲击》
EI
CSCD
北大核心
2009年第8期61-64,83,共5页
Journal of Vibration and Shock
基金
黑龙江省博士后科学基金资助项目(LBH-Z08227)
黑龙江省教育厅基金项目(11544049)
关键词
核模糊聚类
样本密度
最大类间方差法
故障诊断
kernel fuzzy c-means algorithm
distribution density
maximum variance between clusters
fault diagnosis