期刊文献+

超越支持度-置信度框架的负相关对规则挖掘 被引量:6

Mining Negative Correlation Rules Beyond Support-Confidence Framework
在线阅读 下载PDF
导出
摘要 相关规则比传统的关联规则更具有实际意义。但现存的相关规则挖掘算法均需利用apriori类似算法挖掘具有高支持度的项集,再对获得的项集进行相关性测试而获取相关规则,这导致低支持度-高相关度的规则不易被发现。直接挖掘相关规则的困难在于候选相关项不能利用apriori类似性质进行剪枝,导致搜索空间爆炸性增长。本文提出的算法MNI利用Phi相关系数的下界来产生候选负相关项,从而缩小负相关项搜索空间,并证明了该算法的完全性和正确性。在负相关项对基础上利用规则可靠度产生负相关规则时,提出将负相关对计数统一转化为正相关对计数的方法。在真实数据集上的实验结果表明,该算法MNI能有效提高负相关项对的挖掘速度。 High correlation rules are more practical than traditional association rules,but existed correlation rule mining algorithms are almost apriori-based. This results in the difficulty of finding correlation rules with low support but high correlation. In this paper a new algorithm called MNI is introduced to use the lower bound of Phi correlation coefficient to generate all candidate negative correlation items and reduce explosive search space. Both the completeness and correctness of MNI are proved. Negative correlation rules are mined using reliability measure without directly counting the number of negative correlation pairs. Experiments on real datasets show that the algorithm is quite efficient in negative correlation items mining.
出处 《计算机科学》 CSCD 北大核心 2005年第10期124-127,163,共5页 Computer Science
基金 高等学校博士学科点专项科研基金 基于浓缩数据立方的联机分析处理与梯度挖掘(项目编号20030487032)
关键词 关联规则 相关规则 Phi相关系数 规则可靠度 规则挖掘 负相关 APRIORI 置信度 挖掘算法 相关规则 框架 搜索空间 实际意义 Association rules, Correlation rules, Phi correlation coefficient, Rule reliability measure
  • 相关文献

参考文献10

  • 1Agrawal R,Srikant R. Fast algorithm for mining association rules in large databases. In: Proc. of 20th Int' l Conference on Very Large Databases(VLDB1994),1994. 487~499
  • 2Han J,Pei J, Yin Y. Mining Frequent patterns without candidate generation. ACM SIGMOD, Dallas, Texas, 2000
  • 3Brin S,Motwani R,Silverstein C. Beyond Market Basket: Generalizing Association Rules to Correlation. In: Proc. 1997 ACMSIGMOD Int'l Conf. Management of Data, 1998. 265~276
  • 4Savasere A,Omiecinski E, Navathe S. Mining for Strong Negative Associations in a Large Database of Customer Transactions. In:Proc. of the Fourteenth Intl. Conf. on Data Engineering (ICDE1998), 1998. 494 ~ 502
  • 5Ahmed K M, EI-Makky N M, Taha Y. A note on "Beyond Market Baskets: Generalizing Association Rules to Correlations".ACM SIGKDD Explorations, 2000 ,1(2): 46~48
  • 6Wu X, Zhang C, Zhang S. Mining both positive and negative association rules. In: Proc. of 19th Intl. Conf. on Machine Learning (ICML2002), 2002. 658~665
  • 7Antonie M-L, Zaiane O R. An Associative Classifier based on Positive and Negative Rules. In: Proc. of the 9th ACM SIGMOD Workshop on Research Issues in Data Mining and Knowledge Discovery, 2004
  • 8Xiong H,Shekhar S, Tan P-N, et al. Exploiting A Support-based Upper Bound of Phi′s Correlation Coefficient for Identifying Strongly Correlated Pairs. In: Proc. of the tenth ACM SIGKDD conference, 2004
  • 9http:∥www. almaden. ibm. com/software/quest/Resources/ index. shtml
  • 10http:∥www. ics. uci. edu/~mlearn/MLRepository. html

同被引文献50

引证文献6

二级引证文献29

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部