期刊文献+

数据挖掘中关联弱化问题的解决方法分析 被引量:3

Research on Solution to Association Weakening Problem in Data Mining
在线阅读 下载PDF
导出
摘要 当前的支持向量机和均值聚类等数据挖掘算法中,几乎都是依靠数据之间的关联性来完成数据匹配。一旦数据库中含有大量的冗余数据,将造成数据之间的相关性降低,关联性被破坏,导致传统的数据挖掘算法效率降低。为了避免上述缺陷,提出了一种弱化关联规则修补挖掘算法。利用弱聚类方法,在数据选择过程中,不将所有的元素都进行初始分类处理,只计算某一元素属于某一个类别的概率,确定多个弱聚类中心,计算不同数据之间的弱聚类关联性,从而实现关联规则较弱的冗余环境下准确的数据挖掘。实验结果表明,这种算法能够有效提高海量冗余环境下的数据挖掘效率,取得了令人满意的效果。 The support vector machine (SVM) and mean cluster data mining algorithm, almost all rely on the correla- tion between data, complete data matching. Once the database contains a large amount of redundancy data, the correla- tion between data will be reduced, and relevance is destroyed, resulting in traditional data mining algorithm efficiency lower. In order to avoid the above defects, this paper proposed a weakening association rules repair mining algorithm. In the data selection process, the method will not make initial classification processing for all elements only calculates proba- bility that one element belongs to a category, and determines multiple weak clustering center, calculates weak clustering relevance between different data, so as to realize the association rules weaker redundancy environment accurate data mining. The experimental results show that this algorithm can effectively improve the massive redundant environment data mining efficiency, has made the satisfactory effect.
出处 《计算机科学》 CSCD 北大核心 2013年第8期220-222,共3页 Computer Science
基金 国家自然科学基金(11171112)资助
关键词 海量冗余 数据挖掘 关联规则 Mass redundancy Data mining Association rules
  • 相关文献

参考文献8

  • 1崔建,李强,杨龙坡.基于垂直数据分布的大型稠密数据库快速关联规则挖掘算法[J].计算机科学,2011,38(4):216-220. 被引量:24
  • 2Tojanovic Z, Dahanayake A. Service-Oriented Software System Engineering Challenges and Practices [J]. Idea Group Publi- shing,2011 : 1-47.
  • 3Tasi T,Zhang D,Chen Y,et al. A software reliability model for Web services [C]// 8th IASTED International Conference on Software Engineering and Applications. Cambridge, MA, USA, 2011:144-149.
  • 4穆肇南,张健.数据挖掘技术在经济预测中的应用[J].计算机仿真,2012,29(6):347-350. 被引量:10
  • 5王晟,赵壁芳.基于模糊数据挖掘和遗传算法的网络入侵检测技术[J].计算机测量与控制,2012,20(3):660-663. 被引量:28
  • 6Xu Yue, Li Yue-feng. Mining non-redundant association rules based on concise bases[J]. International Journal of Pattern Re- cognition and Artificial Intelligence, 2007,21(4) : 659-675.
  • 7Loglisci C, Malerba D. Mining multiple level non-redundant as- sociation rules through two-fold pruning of redundancies[C]// Proceedings of MLDM. 2009 : 251-265.
  • 8Cheng J, ke Y P, Ng W. Effective elimination of redundant asso- ciation rules[J]. Data mining and knowledge discovery, 2008,16 (2) : 221-249.

二级参考文献32

  • 1陈飞,高铁梅.结构时间序列模型在经济预测方面的应用研究[J].数量经济技术经济研究,2005,22(2):95-103. 被引量:28
  • 2陈德军,张玉民,陈绵云.系统云灰色宏观调控预测模型及其应用研究[J].控制与决策,2005,20(5):553-556. 被引量:4
  • 3丁艳辉,王洪国,高明,谷建军.一种基于矩阵的关联规则挖掘新算法[J].计算机科学,2006,33(4):188-189. 被引量:13
  • 4肖健华,林健,刘晋.区域经济中长期预测的支持向量回归方法[J].系统工程理论与实践,2006,26(4):97-103. 被引量:19
  • 5Agrawa R, Imielinski T, Swami A. Mining association rules between sets of items in large databases[C].//Proc, of ACM SIGMOD International Conference on Management of Date. Washington DC,1993 : 207-216.
  • 6Park J S, Ming-Syan C, Philip S Y. An Effective Hash Based Algorithm for Mining Association Rules[C].// Proc of ACMSIGMOD. 1995 : 175-185.
  • 7Brin S, Motwai R, Ullman J D, et al. Dynamic Itemset Counting and Implication Rules for Market BasketData [C].//Proc. of ACM SIGMOD Conference on Management of Data. 1997:265-276.
  • 8Agrawal R, Srikant R. Fast Algorithms for Mining Association Rules in Large Databaes[C].//Proc. of 1994 International Conference on Very Large Databases. 1994:487-499.
  • 9Savasere S, Omiecinski E, Navathe S. An Efficient Algorithm for Mining Association Rules in Large Databases[C].//Proc. of 21^St VLDB. 1995 : 432-444.
  • 10Dunkel B, Soparkar N. Data Organization and Access for Efficient Data Mining[C].//Proc. of 15th IEEE Intl. Conf. on Data Engineering. 1999 : 522-529.

共引文献57

同被引文献15

引证文献3

二级引证文献18

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部