基于向量矩阵的频繁项集挖掘算法被引量：2

A Frequent Itemsets Mining Algorithm Based on Vector Matrix

下载PDF

导出

摘要为减少冗余候选项集的产生,提出了一种基于向量矩阵的频繁项集挖掘算法FIS-Miner.在该算法中,将所有频繁1-项集按支持度升序进行排序并存储其对应的二进制位向量,将这些二进制位向量映射到向量矩阵进行分析找出所有的频繁项集,既实现了数据库的一次扫描又避免了大量候选项集的产生.探讨了该算法的实现步骤,并给出实例验证了该算法的有效性. A new algorithm FIS-Miner （Frequent Item Sets Miner） is presented for discovering frequent item sets to decrease candidate generation based on vector matrix. All frequent 1-itemsets are put into an array in support degree ascending order, storing corresponding binary vector. All frequent item sets are found by analyzing these binary vectors and mapping vector matrix, therefore database needs to scan only one time while avoiding many candidates. This algorithm steps are explored and its validity is examined.

作者田宏董爱杰

机构地区大连交通大学软件学院

出处《大连交通大学学报》 CAS 2008年第3期74-77,共4页 Journal of Dalian Jiaotong University

关键词二进制位向量向量矩阵频繁项集最大频繁项集 binary vector vector matrix frequent item sets maximum frequent item sets

分类号 TP311.13 [自动化与计算机技术—计算机软件与理论]

引文网络
相关文献

参考文献5

1贾彩燕倪现君.关联规则挖掘研究述评[J].计算机科学,2003,30(4):145-148.
2陈耿,朱玉全,杨鹤标,陆介平,宋余庆,孙志挥.关联规则挖掘中若干关键技术的研究[J].计算机研究与发展,2005,42(10):1785-1789. 被引量：62
3HAN J, PEIJ, YIN Y. Mining Frequent Patterns without Candidate Generation [C]//Proc. 2000 ACM-SIG-MOD Int. Conf. Management of Data, Dallas, TX, 2000: 1-12.
4陈明,史忠植,王文杰.一种有效的基于图的关联规则挖掘算法[J].计算机应用,2006,26(11):2654-2656. 被引量：10
5郑玲霞,李大学,马万里.基于有向图的关联规则算法[J].重庆邮电学院学报（自然科学版）,2005,17(4):495-498. 被引量：5

二级参考文献31

1贾彩燕倪现君.关联规则挖掘研究述评[J].计算机科学,2003,30(4):145-148.
2R. Agrawal, T. Imielinski, A. Swami. Mining association rules between sets of items in large databases. ACM SIGMOD Int'l Conf. Management of Data, Washington, D. C., 1993.
3Han J, Kamber. MData Mining: Concepts and Techniques.Beijing: High Education Press, 2001.
4B. Goethals. Survey of frequent pattern mining. Helsinki Institute for Information Technology, Technical Report, 2003.
5R. Agrawal, R. Srikant. Fast algorithm for mining association rules. The 20th Int'l Conf. VLDB, Santiago, Chile, 1994.
6M. Houtsma, A. Swami. Set-oriented mining for association rules in relational databases. In: Yu P., Chen A, eds. Proc. Int'l Conf. Data Engineering. Los Alamitos, CA: IEEE Computer Society Press, 1995. 25～33.
7A. Savasere, E. Omiecinski, S. Navathe. An efficient algorithm for mining association rules. The 21st Int' l Conf. VLDB, Zurich,Switzerland, 1995.
8J. Han, Y. Fu. Discovery of multiple-level association rules from large databases. The 21st Int'l Conf. VLDB, Zurich,Switzerland, 1995.
9R. Bayardo. Efficiently mining long patterns from databases. In:L. M. Haas, A. Tiwary, eds. Proc. ACM SIGMOD Int'l Conf.Management of Data. New York: ACM Press, 1998. 85～93.
10Lin, Dao-I, Z. M. Kedem. Pincer-Search: A new algorithm for discovering the maximum frequent set. In: H. J. Schek, F.Saltor, I. Ramos et al. eds. Proc. 6th European Conf.Extending Database Technology. Berlin: Springer-Veriag, 1998.105～119.

共引文献83

1李华.二进制挖掘算法在Web使用挖掘中的应用[J].计算机时代,2009(4):56-57.
2郭云峰,张集祥.对关联规则挖掘中Apriori算法的一种改进[J].杭州电子科技大学学报（自然科学版）,2009,29(2):60-63. 被引量：4
3王盛,董黎刚,李群.一种基于逆序编码的关联规则挖掘研究[J].杭州电子科技大学学报（自然科学版）,2010,30(5):169-172. 被引量：1
4董彩云,杜韬,郭春燕,曲守宁.聚类后的关联规则快速更新算法研究[J].计算机应用研究,2004,21(11):30-32. 被引量：3
5胡蓉.一种基于串与运算的关联规则挖掘算法[J].湘潭师范学院学报（自然科学版）,2005,27(1):23-24.
6曲守宁,董彩云,徐德军,吴桐.关联规则算法研究及其在教学系统中的应用[J].计算机系统应用,2005,14(4):20-23. 被引量：5
7胡蓉,陈文.一种基于串与运算的关联规则挖掘算法[J].东北电力学院学报,2005,25(2):12-15.
8郑伯川,张征.特征间互补性结合使用的3D模型检索技术[J].重庆邮电学院学报（自然科学版）,2006,18(1):119-122. 被引量：4
9贾彩燕,陆汝钤.关联规则挖掘的取样误差量化模型和快速估计算法[J].计算机学报,2006,29(4):625-634. 被引量：7
10陈明,史忠植,王文杰.一种有效的基于图的关联规则挖掘算法[J].计算机应用,2006,26(11):2654-2656. 被引量：10

同被引文献18

1陈耿,朱玉全,杨鹤标,陆介平,宋余庆,孙志挥.关联规则挖掘中若干关键技术的研究[J].计算机研究与发展,2005,42(10):1785-1789. 被引量：62
2陈晓云,陈袆,王雷,李荣陆,胡运发.基于分类规则树的频繁模式文本分类[J].软件学报,2006,17(5):1017-1025. 被引量：19
3HAN Jia-wei,PEI Jian,YIN Yi-wen.Mining frequent patterns without candidate generation:a frequent pattern tree approach[J].Data Mining and Knowledge Discovery,2004,8(1):53-87.
4FU A W C,KWONG R W W,TANG J.Mining N-most interesting itemsets[C] //Proceedings of 2000 ISMIS.Berlin:Springer,2000:59-67.
5BODON F.A survey on frequent itemset mining[C] //Proceedings of the ACM SIGKDD Workshop on OSDM'04.Chicago,USA:[s.n.] ,2004:523-531.
6HAJJ M E,ZAIANE O R.Inverted matrix:Efficient discovery of frequent items in large datasets in the context of interactive mining[C] //2003 Int'l Conf on Data Mining and Knowledge Discovery(ACM SIGKDD).Califomia,USA:[s.n.] ,2003:109-118.
7HAJJ M E,ZAIANE O R.Non recursive generation of frequent k-itemsets from frequent pattern tree representations[C] //Proceedings of 5th International Conference on Data Warehousing and Knowledge Discovery.Melbourne:Australia,2003:371-380.
8RACZ B.NonordFP:an FP-growth variation without rebuilding the FP-tree[C] //Proceedings of the IEEE ICDM Workshop on FIMI'04.Brighton,UK:[s.n.] ,2004:1089-1097.
9LIU Gui-mei,LU Hong-jun.AFOPT:an efficient implemefitation of pattern growth approach[C] //Proceedings of the IEEE ICDM Workshop on FIMI'04.Brighton,UK:[s.n.] ,2004:2056-2067.
10WU Fan,CHIANG S W,LINJ R.A new approach to mine frequent patterns using item-transformation methods[J].Information Systems,2007,32(7):1056-1072.

引证文献2

1朱颢东,李红婵.关于Top-N最频繁项集挖掘的研究[J].电子科技大学学报,2010,39(5):757-761.
2孙序.基于倒排表和集合的TOP-N最频繁项集挖掘算法[J].科技信息,2011(27).

1王丽珍,周丽华.生成频繁项目集的一种贪心算法[J].计算机工程与应用,2001,37(13):86-88. 被引量：6
2丁邦旭,黄永青.矩阵与前缀树方法挖掘频繁项集[J].计算机工程与应用,2015,51(22):154-157. 被引量：2

大连交通大学学报

2008年第3期

浏览历史

内容加载中请稍等...

基于向量矩阵的频繁项集挖掘算法被引量：2

参考文献5

二级参考文献31

共引文献83

同被引文献18

引证文献2

相关作者

相关机构

相关主题

浏览历史

基于向量矩阵的频繁项集挖掘算法 被引量：2

参考文献5

二级参考文献31

共引文献83

同被引文献18

引证文献2

相关作者

相关机构

相关主题

浏览历史

基于向量矩阵的频繁项集挖掘算法被引量：2