窗口模式下在线数据流中频繁项集的挖掘被引量：1

Online data stream mining of recent frequent itemsets based on sliding window model

下载PDF

导出

摘要拟采用一种基于滑动窗模式的单遍挖掘算法,专注于处理近期数据;为了减少处理时间和占用的内存,设计了一种新的事务表示方法。通过处理这个事务的表达式,频繁项集可以被高效输出,并解决了使用基于Apriori理论的算法时,由候选频繁1-项集生成频繁2-项集时数据项顺序判断不准确问题。该算法称为MRFI-SW算法。 This paper proposed a one-pass data stream mining algorithm to mine the recent frequent itemsets in data streams with a sliding window based on transactions.To reduce the cost of time and memory needed to slide the windows,denoted each items a bit-sequence representations. Basing on dealing with the representations,can find frequent patterns in data streams efficiently,and the sequent of frequent 2-items is correct.This paper named the method MRFI-SW（mining recent frequent itemsets by sliding window）algorithm.

作者李可任家东

机构地区燕山大学信息科学与工程学院

出处《计算机应用研究》 CSCD 北大核心 2010年第5期1711-1713,共3页 Application Research of Computers

基金国家"863"计划资助项目(2009AA01Z433) 河北省自然科学基金资助项目(F2008000888)

关键词在线数据流频繁项集滑动窗 online data stream frequent items sliding window

分类号 TP311.13 [自动化与计算机技术—计算机软件与理论]

引文网络
相关文献

参考文献8

1MANKU G S,MOTWANI R.Approximate frequency counts over data streams[C]//Proc of the 28th International Conference on Very Large Data Bases.2002:346-357.
2JIANG N,GRUENWALD L.Research issues in data stream association rule mining[J].ACM SIGMOD Record,2006,35(1):14-19.
3AGRAWAL R,SRIKANT R.Fast algorithms for mining association rules[C]//Proc of the 20th International Conference on Very Large Data Bases.1994:484-499.
4CHANG J,LEE W.A sliding window method for finding recently frequent itemsets over online data streams[J].Journal of Information Science and Engineering,2004,20(4):175-184.
5SAVERAERS A,OMIECINSKI E,NAVATHE S.An efficient algorithm for mining association rules in large databases[C]//Proc of the 21st International Conference on Very Large Data Bases.San Francisco:Morgan Kaufmann Publisher,1995:432-444.
6LIN C H,CHIU D Y,WU Y H,et al.Mining frequent itemsets from data streams with a time-sensitive sliding window[C]//Proc of SIAM International Conference on Data Mining.2005.
7YU J X,CHONG Z,LU H,et al.False positive or false negative:mining frequent itemsets from high speed transactional data streams[C]//Proc of the 30th International Conference on Very Large Data Bases.2004:204-215.
8LI H F,LEE S Y,SHAN M K.An efficient algorithm for mining frequent itemsets over the entire history of data streams[C]//Proc of the 1st International Workshop on Knowledge Discovery in Data Streams.2004:287-291.

同被引文献9

1牛小飞,石冰,卢军,吴科.挖掘关联规则的高效ABM算法[J].计算机工程,2004,30(11):118-120. 被引量：16
2BABCOCK B,BABU S,DATAR M, et al. Models and issues in data stream systems [ C ]//Proc of the 21 st ACM SIGMOD-SIGART Sympo- sium on Principles of Database System. New York:ACM Press,2002: 1-16.
3GAROFALAKIS M, GEHRKE J. Querying and mining data streams: you only get one look a tutorial[ C]//Proc of ACM SIGMOD Interna- tional Conference on Management of Data. New York: ACM Press, 2002:635.
4LEE D, LEE W. Finding maximal frequent itemsets over online data streams adaptively [ C ]//Proc of the 5th IEEE International Confe- rence on Daia Mining. Washington DC : IEEE Computer Society,2005 : 266 - 273.
5LI Hua-fu, LEE S, SHAN M. Online mining maximal frequent itemsets over data streams[ C]//Proc of the 15th International Workshops on Research Issues in Data Engineering: Stream Data Mining and Appli- cations. 2005 : 11 - 18.
6MAO Guo-jun, WU Xin-dong, ZHU Xing-quan, et al. Mining maximal frequent itemsets from data streams[ J]. Journal of Information Sci- ence,2007,33(3 ) :251-262.
7GIANNELLA C, HAN Jia-wei, PEI Jian, et al. Mining frequent pat- terns in data streams at multiple time granularities [ M ]//Next Gene- ration Data Mining. Cambridge : MIT Press ,2005 : 191 - 212.
8BORGELT C. Keeping things simple:finding frequent itemsets by re- cursive elimination [ C ]//Proc of the 1 st International Workshop on Open Source Data Mining. New York :ACM Press,2005:66-70.
9AGRAWAL R, SRIKANT R. Fast algorithms for mining association rules[ C]//Proc of the 20th International Conference on Very Large Databases. San Francisco: Morgan Kaufmann Publishers, 1994:487- 499.

引证文献1

1徐嘉莉,陈佳,胡庆,黄波,郭红霞.基于向量的数据流滑动窗口中最大频繁项集挖掘[J].计算机应用研究,2012,29(3):837-840. 被引量：7

二级引证文献7

1徐嘉莉,杨洪军,赵茂娟,樊云.一种基于位运算的频繁闭项集挖掘算法[J].计算机应用研究,2013,30(11):3280-3282. 被引量：3
2尹绍宏,范桂丹.基于矩阵的数据流Top-k频繁项集挖掘算法[J].计算机工程,2014,40(3):55-58. 被引量：3
3尹绍宏,单坤玉,范桂丹.滑动窗口中数据流最大频繁项集挖掘算法研究[J].计算机工程与应用,2015,51(22):145-149. 被引量：7
4丁邦旭,黄永青.矩阵与前缀树方法挖掘频繁项集[J].计算机工程与应用,2015,51(22):154-157. 被引量：2
5马连灯,王占刚.基于滑动窗口模型的数据流加权频繁模式挖掘算法[J].软件工程,2016,19(10):15-17. 被引量：1
6王红梅,李芬田,王泽儒.基于滑动窗口数据流频繁项集挖掘模型综述[J].长春工业大学学报,2017,38(5):484-490. 被引量：4
7岳帅,尹绍宏.基于有序FP树和二维列表的频繁模式挖掘算法[J].哈尔滨商业大学学报（自然科学版）,2018,34(6):692-697. 被引量：3

1李俊奎,王元珍.可重写循环滑动窗口:面向高效的在线数据流处理[J].计算机科学,2007,34(12):51-55. 被引量：6
2大嘴鸭.3D窗口切换，Windows 7更酷更方便[J].软件指南,2010(2):39-39.
3汪栎,朱海东,涂时亮.窗口模式下高性能图形的设计与实现[J].计算机工程,2001,27(4):172-173. 被引量：1
4牟晓东.快捷方式的窗口模式[J].家庭电子,2003(7):51-51.
5冷雪.老爷机也能播放DVDrip[J].网友世界,2004(12):26-26.
6詹志飞.基于圈和树的频繁项集挖掘算法[J].电脑知识与技术,2010,6(5):3502-3504.
7钟斯伟.通过观察电脑启动顺序判断电脑常见故障[J].电脑知识与技术（过刊）,2015,21(7X):222-223. 被引量：1
8杨扬.基于.NET技术的数据库技术与应用[J].信息与电脑,2016,28(1):127-128. 被引量：1
9切换KMPlayer到全屏时会黑屏[J].电脑爱好者（普及版）,2010(A02):41-41.
10找不到千千静听播放进度条[J].电脑爱好者（普及版）,2010(A02):45-45.

计算机应用研究

2010年第5期

浏览历史

内容加载中请稍等...

窗口模式下在线数据流中频繁项集的挖掘被引量：1

参考文献8

同被引文献9

引证文献1

二级引证文献7

相关作者

相关机构

相关主题

浏览历史

窗口模式下在线数据流中频繁项集的挖掘 被引量：1

参考文献8

同被引文献9

引证文献1

二级引证文献7

相关作者

相关机构

相关主题

浏览历史

窗口模式下在线数据流中频繁项集的挖掘被引量：1