期刊文献+

基于BD结构的历史数据流存储与查询 被引量:2

STORING AND QUERYING HISTORICAL DATA STREAMS BASED ON BD STRUCTURE
在线阅读 下载PDF
导出
摘要 实际应用中,人们往往不仅需要近期数据流,还需要结合大量历史数据流来共同解决问题。研究表明,处理大量历史数据流时,传统数据库索引技术(如B+树)不能提供高的存储利用率和查询效率。针对任意时间段历史数据流的存储查询问题,提出一种基于BD结构的存储与查询方法。该方法将BDTree和BDHash相结合,能有效降低BDTree的高度,减小索引项的规模,同时可以避免数据结点规模过大。在此基础上,研究了"部分扩充"策略以解决数据插入失败问题。理论分析和实验结果表明,该方法能提高存储空间利用率和查询效率,可以有效应用于历史数据流的存储和查询。 In actual applications,people require recent data streams as well as massive historical data streams together to jointly resolve problems.Research indicates that traditional database indexing techniques,such as B+ tree,can't provide high storage utilization and retrieval efficiency when handling massive historical data streams.Considering the issue of storage and query of historical data streams in any time period,this paper proposes a novel storage and query approach based upon BD structure.Combining BDTree with BDHash,this approach effectively reduces the height of BDTree and the scale of index entries.Meanwhile,it can prevent the size of data nodes from being too enormous.Based on this,we study the "partial expansion" strategy to tackle the problem of data insertion failure.Theoretical analysis and experimental results show that this approach can improve the utilization and retrieval efficiency of storage space and can be effectively used to store and query historical data streams.
出处 《计算机应用与软件》 CSCD 2011年第2期76-79,共4页 Computer Applications and Software
基金 江苏省自然科学基金项目(BK2006557)
关键词 历史数据流 BD结构 部分扩充 Historical data stream BD structure Partial expansion
  • 相关文献

参考文献7

  • 1Terry D,Goldberg D,Nichols D,et al.Continuous queries over appendonly database[C].SIGMOD 1992:321-330.
  • 2Babcock B,Babu S,Datar M,et al.Models and issues in data stream systems[C].PODS 2002:1-16.
  • 3Charu Aggarwal.A Framework for Clustering Massive-Domain Data Streams[C].ICDE 2009:102-113.
  • 4Vassilis A,Panagiotis P,Michalis P,et al.Approximate embeddingbased subsequence matching of time series[C].SIGMOD 2008:365-378.
  • 5Zhang D,Gunopules D,Tsotras V J,et al.Temporal aggregation over data streams usingmultiple granularities[C].EDBT 2002:646-663.
  • 6张冬冬,李建中,王伟平,郭龙江.数据流历史数据的存储与聚集查询处理算法[J].软件学报,2005,16(12):2089-2098. 被引量:17
  • 7David B.Lomet,et al.A Simple Bounded Disorder File Organization with Good Performance[J].ACM Transaction on Database Systems (TDDS),1988,13(4):525-551.

二级参考文献12

  • 1Guha S, Koudas N. Approximating a data stream for querying and estimation: Algorithms and performance evaluation. In: Stefano C, Christoph F, Pat S, eds. Proc. of the 18th Int'l Conf. on Data Engineering San Jose: IEEE Computer Society, 2002. 567-576.
  • 2Madden S, Shah M, Hellerstein JM, Raman V. Continuously adaptive continuous queries over streams. In: Franklin MJ, Moon B,Ailamaki A, eds. Proc. of the 2002 ACM SIGMOD Int'l Conf. on Management of Data Madison: ACM, 2002.49-60.
  • 3Gehrke J, Korn F, Srivastava D. On computing correlated aggregates over continual data streams. In: Afef WG, ed. Proc. of the2001 ACM SIGMOD Int'l Conf. on Management of Data Santa Barbara: ACM, 2001. 13-24.
  • 4Dobra A, Gehrke J, Garofalakis M, Rastogi R. Processing complex aggregate queries over data streams. In: Franklin MJ, Moon B,Ailamaki A, eds. Proc. of the 2002 ACM SIGMOD Int'l Conf. on Management of Data Madison: ACM, 2002. 61-72.
  • 5Chen Y, Dong G, Han J, Wah BW, Wang J. Multi-Dimensional regression analysis of time-series data streams. In: Bernstein PA,Loannidis YE, Ramakrishnan R, eds. Proc. of the 28th Int'l Conf. on Very Large Data Bases Hong Kong: Morgan Kaufmann Publishers, 2002. 323-334.
  • 6Zhang D, Gunopulos D, Tsotras V J, Seeger B. Temporal aggregation over data streams using multiple granularities. In: Jensen CS,Jeffery KG, eds. Proc. of the 8th Int'l Conf. on Extending Database Technology LNCS, 2002. 646-663.
  • 7Olken F. Random Sampling from Databases [Ph.D. Thesis]. Berkeley, University of California, 1993.
  • 8Transaction Processing Performance Council. TPC Benchmark H (Decision Support) Standard Specification. TPC, 2002.http://www.tpc.org/tpch/default.asp
  • 9Chandraskearan S, Franklin MJ. Streaming queries over streaming data. In: Bernstein PA, Loannidis YE, Ramakrishnan R, eds.Proc. of the 28th Int'l Conf. on Very Large Data Bases Hong Kong: Morgan Kaufmann Publishers, 2002. 203-214.
  • 10Araru A, Babu S, Widom J. An abstract semantics and concrete language for continuous queries over streams and relations.Technical Report, Stanford University Database Group, 2002.Available at http://dbpubs.stanford.edu/pub/2002-57

共引文献16

同被引文献14

引证文献2

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部