期刊文献+

基于频繁模式矩阵的最大频繁项目集挖掘算法 被引量:7

AN ALGORITHM FOR MINING MAXIMUM FREQUENT ITEMSETS BASED ON FP-ARRAY
在线阅读 下载PDF
导出
摘要 提出了一种基于频繁模式矩阵FP-array的挖掘最大频繁项目集的算法。算法基本思想:①只扫描事务数据库一遍,把该数据库转换成一个矩阵FP-array,并且保留了所有事务数据库中项目间的关联信息,然后对该矩阵进行挖掘。②在FP-array中只存放逻辑型数据,节省了存储空间。③直接在FP-array上挖掘而不需要递归创建大量条件模式矩阵,挖掘过程采用逻辑运算,在效率上有独特的优势。通过实验验证了算法的有效性。 A new algorithm for mining maximum frequent itemsets based on FP-array is presented. The main idea of this algorithm is that ① it can convert a transaction database into a FP-array through scanning the database only once. Then it does the mining work of the FP-array that includes all information of items in database. ② FP-array is better in memory because it stores logic data only. ③ There is no need to build conditional arrays in the FP-array mining process. Logic operaion is adopted so that the algorithm has predominance in efficiency. An experiment is carried out to verify the mining effectiveness.
出处 《计算机应用与软件》 CSCD 北大核心 2007年第7期45-46,53,共3页 Computer Applications and Software
基金 天津市科技发展计划资助项目(04310941R) 天津市应用基础研究计划资助项目(05YFJMJC11700) 河北省科技研究与发展指导计划项目(0621355)。
关键词 数据挖掘 频繁模式矩阵 最大频繁项目集 算法 Data mining FP-array Maximum frequent itemsets Algorithm
  • 相关文献

参考文献3

二级参考文献16

  • 1冯志新,钟诚.基于FP-tree的最大频繁模式挖掘算法[J].计算机工程,2004,30(11):123-124. 被引量:18
  • 2Lin Dao I,Proc the 6th European Conference on Extending Database Technology,1998年,105页
  • 3Agrawal R,Proc the 11th Inter Conference on Data Engineering,1995年,3页
  • 4AGRAWA LR, IMIELINSKI T, SWAMI A.Mining association rules between sets of items in large databases[A]. BUNEMAN P, JAJODIA S, ed. Proc. Of the ACM SIGMOD Conf. On Management of Data (SIGMOD'93)[C]. New York: ACM Press, 1993. 207-216.
  • 5AGRAWA LR, SRIKANT R. Fast algorithms for mining association rules in large databases[A]. BOCCA JB, JARKE M, ZANIOLO C, ed. Proc.of the 20th Int'l Conf. On Very Large Data Bases[C]. Santiago: Morgan Kaufmann, 1994. 478-499.
  • 6ALY HH, TAHA Y, AMR AA. Fast mining of association rules in large-scale problems[A]. ABDEL-WAHAB H, JEFFAY K, ed. Proc. of the 6th IEEE Symp. on Computers and Communications (ISCC 2001)[C]. New York: IEEE Computer Society Press, 2001. 107-113.
  • 7TSAI CF, LIN YC, CHEN CP. A new fast algorithms for mining association rules in large databases[A].KAMEL AE, MELLOULI K, BORNE P, ed. Proc. Of the 2002 IEEE Int'l Conf. On Systems, Man and Cybernetics (SMC 2002)[C]. IEEE Computer Society Press, 2002.251-256.
  • 8HAN J, PEI J, YIN Y. Mining frequent patterns without candidate generation[A]. CHEN WD, NAUGHTON J, BERNSTEIN PA, ed. Proc. Of the 2000 ACM SIGMOD Int'l Conf. On Management of Data (SIGMOD 2000)[C]. New York: ACM Press, 2000. 1~12.
  • 9BAYARDO R.Efficiently mining long patterns from databases[A]. HAAS LM, ed. Proceedings of the ACM SIGMOD International Conference on Management of Data[C]. New York: ACM Press, 1998. 85-93.
  • 10LIN DI, KEDEM ZM.Pincer-Search: A new algorithm for discovering the maximum frequent set[A]. SCHEK HJ, ed. Proceedings of the 6th European Conference on Extending Database Technology[C]. Heidelberg: Springer-Verlag, 1998. 105-119.

共引文献124

同被引文献45

引证文献7

二级引证文献28

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部