期刊文献+

窗口模式下在线数据流中频繁项集的挖掘 被引量:1

Online data stream mining of recent frequent itemsets based on sliding window model
在线阅读 下载PDF
导出
摘要 拟采用一种基于滑动窗模式的单遍挖掘算法,专注于处理近期数据;为了减少处理时间和占用的内存,设计了一种新的事务表示方法。通过处理这个事务的表达式,频繁项集可以被高效输出,并解决了使用基于Apriori理论的算法时,由候选频繁1-项集生成频繁2-项集时数据项顺序判断不准确问题。该算法称为MRFI-SW算法。 This paper proposed a one-pass data stream mining algorithm to mine the recent frequent itemsets in data streams with a sliding window based on transactions.To reduce the cost of time and memory needed to slide the windows,denoted each items a bit-sequence representations. Basing on dealing with the representations,can find frequent patterns in data streams efficiently,and the sequent of frequent 2-items is correct.This paper named the method MRFI-SW(mining recent frequent itemsets by sliding window)algorithm.
作者 李可 任家东
出处 《计算机应用研究》 CSCD 北大核心 2010年第5期1711-1713,共3页 Application Research of Computers
基金 国家"863"计划资助项目(2009AA01Z433) 河北省自然科学基金资助项目(F2008000888)
关键词 在线数据流 频繁项集 滑动窗 online data stream frequent items sliding window
  • 相关文献

参考文献8

  • 1MANKU G S,MOTWANI R.Approximate frequency counts over data streams[C]//Proc of the 28th International Conference on Very Large Data Bases.2002:346-357.
  • 2JIANG N,GRUENWALD L.Research issues in data stream association rule mining[J].ACM SIGMOD Record,2006,35(1):14-19.
  • 3AGRAWAL R,SRIKANT R.Fast algorithms for mining association rules[C]//Proc of the 20th International Conference on Very Large Data Bases.1994:484-499.
  • 4CHANG J,LEE W.A sliding window method for finding recently frequent itemsets over online data streams[J].Journal of Information Science and Engineering,2004,20(4):175-184.
  • 5SAVERAERS A,OMIECINSKI E,NAVATHE S.An efficient algorithm for mining association rules in large databases[C]//Proc of the 21st International Conference on Very Large Data Bases.San Francisco:Morgan Kaufmann Publisher,1995:432-444.
  • 6LIN C H,CHIU D Y,WU Y H,et al.Mining frequent itemsets from data streams with a time-sensitive sliding window[C]//Proc of SIAM International Conference on Data Mining.2005.
  • 7YU J X,CHONG Z,LU H,et al.False positive or false negative:mining frequent itemsets from high speed transactional data streams[C]//Proc of the 30th International Conference on Very Large Data Bases.2004:204-215.
  • 8LI H F,LEE S Y,SHAN M K.An efficient algorithm for mining frequent itemsets over the entire history of data streams[C]//Proc of the 1st International Workshop on Knowledge Discovery in Data Streams.2004:287-291.

同被引文献9

  • 1牛小飞,石冰,卢军,吴科.挖掘关联规则的高效ABM算法[J].计算机工程,2004,30(11):118-120. 被引量:16
  • 2BABCOCK B,BABU S,DATAR M, et al. Models and issues in data stream systems [ C ]//Proc of the 21 st ACM SIGMOD-SIGART Sympo- sium on Principles of Database System. New York:ACM Press,2002: 1-16.
  • 3GAROFALAKIS M, GEHRKE J. Querying and mining data streams: you only get one look a tutorial[ C]//Proc of ACM SIGMOD Interna- tional Conference on Management of Data. New York: ACM Press, 2002:635.
  • 4LEE D, LEE W. Finding maximal frequent itemsets over online data streams adaptively [ C ]//Proc of the 5th IEEE International Confe- rence on Daia Mining. Washington DC : IEEE Computer Society,2005 : 266 - 273.
  • 5LI Hua-fu, LEE S, SHAN M. Online mining maximal frequent itemsets over data streams[ C]//Proc of the 15th International Workshops on Research Issues in Data Engineering: Stream Data Mining and Appli- cations. 2005 : 11 - 18.
  • 6MAO Guo-jun, WU Xin-dong, ZHU Xing-quan, et al. Mining maximal frequent itemsets from data streams[ J]. Journal of Information Sci- ence,2007,33(3 ) :251-262.
  • 7GIANNELLA C, HAN Jia-wei, PEI Jian, et al. Mining frequent pat- terns in data streams at multiple time granularities [ M ]//Next Gene- ration Data Mining. Cambridge : MIT Press ,2005 : 191 - 212.
  • 8BORGELT C. Keeping things simple:finding frequent itemsets by re- cursive elimination [ C ]//Proc of the 1 st International Workshop on Open Source Data Mining. New York :ACM Press,2005:66-70.
  • 9AGRAWAL R, SRIKANT R. Fast algorithms for mining association rules[ C]//Proc of the 20th International Conference on Very Large Databases. San Francisco: Morgan Kaufmann Publishers, 1994:487- 499.

引证文献1

二级引证文献7

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部