期刊文献+

图正则化的模糊局部坐标编码概念分解模型

Fuzzy local coordinate concept factorization with graph regularization
在线阅读 下载PDF
导出
摘要 现有的基于矩阵分解聚类模型训练过程大多需要两个独立的步骤,一是通过自身的模型对数据集进行训练获得系数矩阵,二是对得到的系数矩阵进一步使用K-means方法来获得最终的聚类结果.这种两阶段模式一方面增加了计算消耗,也会因为K-means对初始聚类中心的敏感,会对聚类效果产生一定的影响.针对此问题,本文提出了一种图正则化的模糊局部坐标编码概念分解模型.该模型通过对系数矩阵添加约束使得系数矩阵行和为1,从而避免了再次使用K-means方法进行二次训练,而直接由系数矩阵获得聚类结果.另外,由于此系数矩阵的约束.该模型实现了模糊聚类,增强了聚类结果的可解释性.本文通过对人工合成数据的测试,验证了该模型的模糊性与可解释性;同时在常用的标准数据集上,通过与现有的聚类方法相比较,同样获得了较好的聚类效果. Matrix Factorization is an effective and efficient method to solve clustering problems in machine learning.However,for most traditional which factorization based models in clustering,there are two necessary steps to get the final assignments.First,original data can be decomposed to a basis matrix and a coefficient matrix through a certain model.Second,the learned coefficient matrix is fed into K-means to make discretization.This two-step paradigm causes extra computational burden and may have some side effect on the final results due to the sensitivity to initialization of K-means.To this end,a novel model termed fuzzy local coordinate concept factorization with graph regularizer(FLCCF-G)is proposed.Which avoids using K-means by enforcing the sum of each row of the non-negative coefficient matrix to equal to one.Then the final clustering results can obtained directly by checking the maximum value of each row of the coefficient matrix.In addition,through this constraint,our proposed model changes is a fuzzy clustering model rather than hard clustering,indicating that the model has better interpretability to data points in boundaries of different clusters.Extensive experimental results on synthetic and Benchmark data sets indicate the better performance of FLCCF-G on data clustering.
作者 张怿恺 彭勇 孔万增 文益民 ZHANG Yikai;PENG Yong;KONG Wanzeng;WEN Yimin(School of Computer Science and Technology,Hangzhou Dianzi University,Hangzhou 310018,China;School of Computer Science and Information Security,Guilin University of Electronic Technology,Guilin 541000,China)
出处 《中国科学技术大学学报》 CAS CSCD 北大核心 2020年第7期993-1002,共10页 JUSTC
基金 国家自然科学基金(61971173,61602140) 浙江省科技计划(2017C33049) 中国博士后科学基金(2017M620470) 浙江省新苗人才计划(2019R407030)资助
关键词 概念分解 局部坐标编码 模糊聚类 图正则化 concept factorization local coordinate coding fuzzy clustering graph regularizer
  • 相关文献

参考文献4

二级参考文献36

  • 1张敏,于剑.基于划分的模糊聚类算法[J].软件学报,2004,15(6):858-868. 被引量:179
  • 2邢宗义 ,贾利民 ,张永 ,胡维礼 ,秦勇 .一类基于数据的解释性模糊建模方法的研究[J].自动化学报,2005,31(6):815-824. 被引量:12
  • 3冯征.一种基于粗糙集的K-Means聚类算法[J].计算机工程与应用,2006,42(20):141-142. 被引量:16
  • 4宋晓峰,亢金龙,王宏.进化算法的发展与应用[J].现代电子技术,2006,29(20):66-68. 被引量:4
  • 5Setnes M. Supervised Fuzzy Clustering for Rule Extraction. IEEE Trans on Fuzzy Systems, 2000, 8(4) : 416 -424.
  • 6Nascimento S, Mirkin B, Moura-Pires F. Modeling Proportional Membership in Fuzzy Clustering. IEEE Trans on Fuzzy Systems, 2003, 11(2): 173-186.
  • 7Pedrycz W. Collaborative Fuzzy Clustering. Pattern Recognition Letters, 2002, 23(14) : 1675 - 1686.
  • 8Gorinevsky D. An Approach to Parametric Nonlinear Least Square Optimization and Application to Task-Level Learning Control. IEEE Trans on Automatic Control, 1997, 42(7) : 912 -927.
  • 9Wasito l, Mirkin B. Nearest Neighbours in Least-Squares Data Imputation Algorithms with Different Missing Patterns. Computational Statistics & Data Analysis, 2006, 50(4) : 926 -949.
  • 10Nascimento S. Fuzzy Clustering via Proportional Membership Model. Amsterdam, Netherlands: I05 Press, 2005.

共引文献56

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部