期刊文献+

针对XML流数据的复杂Twig Pattern查询处理 被引量:9

Complex Twig Pattern Query Processing over XML Streams
在线阅读 下载PDF
导出
摘要 XML流数据处理在研究领域引起了研究者的广泛兴趣.针对XML流数据的、具有嵌套AND/OR谓词的复杂Twig Pattern查询处理,提出一种新方法.为了提高查询处理性能,将所有Twig Pattern合并为一个共享前缀的查询树,其中,AND/OR谓词被表示为单独的抽象语法树,因而能够以文档顺序、单遍地处理复杂Twig Pattern的匹配,并避免了YFilter中对嵌套谓词进行后置处理所产生的中间结果.实验结果表明,该方法能够有效改善Twig Pattern的处理性能,尤其是在处理大文档的情况下.基于已有的研究工作,讨论如何利用DTD(document type definition)的结构和约束信息优化Twig Pattern,即这种优化是在系统运行前进行的预处理. The problem of processing streaming XML data is gaining widespread attention from the research community. In this paper, a novel approach for processing complex Twig Pattern with OR-predicates and AND-predicates over XML documents stream is presented. For the improvement of the processing performance of Twig Patterns, all the Twig Patterns are combined into a single prefix query tree that represents such queries by sharing their common prefixes. Its OR-predicates and AND-predicates of a node are represented as a separate abstract syntax tree associated with the node. Consequently, all the Twig Patterns are evaluated in a single, document-order pass over the input document stream for avoiding the interim results produced by the post-processing nested paths of YFilter. Compared with the existing approach, experimental results show that it can significantly improve the performance for matching complex Twig Patterns over XML document stream, especially for large size XML documents. Based on the prior works, the optimization of twig patters under DTD (document type definition) by using structural and constraint information of DTD is also addressed, which is static, namely, it is processed before the runtime of stream processing.
出处 《软件学报》 EI CSCD 北大核心 2007年第4期893-904,共12页 Journal of Software
基金 SupportedbytheNationalGrandFundamentalResearch973ProgramofChinaunderGrantNo.2005CB321905(国家重点基础研究发展规划(973))
关键词 XML文档流 xPam TWIG PATTERN 查询树 DTD(document type definition) XML document stream Xpath Twig Pattern query tree DTD (document type definition)
  • 相关文献

参考文献1

二级参考文献12

  • 1高军,杨冬青,唐世渭,王腾蛟.一种基于DTD的XPath逻辑优化方法[J].软件学报,2004,15(12):1860-1868. 被引量:17
  • 2Gupta AK, Suciu D. Stream processing of XPath queries with predicates. In: Halevy AY, Ives ZG, Doan AH, eds. Proc of the 2003ACM SIGMOD Int'l Conf on Management of Data. ACM, 2003.419-430.
  • 3Nguyen B, Abiteboul S, Cobena G, Preda M. Monitoring XML data on the Web. In: Aref WG, ed. Proc of the ACM/SIGMOD Conf on Management of Data. 2001. 437--448.
  • 4Chen J, Dewitt D, Tian F, Wang Y. NiagaraCQ: A scalable continuous query system for internet databases. In: Chen WD,Naughton JF, Bernstein PA, eds. Proc of the ACM/SIGMOD Conf Management of Data. ACM, 2000. 379-390.
  • 5Clark J. XML Path language (XPath). 1999. Available from the W3C, http://www.w3.org/TR/XPath.
  • 6Milo T, Suciu D, Vianu V. Typechecking for XML Transformers. In: Proc of the PODS 2000. ACM, 2000. 11-22.
  • 7Miklau G, Suciu D. Containment and equivalence for an XPath fragment. In: Popa L, ed. Proc of the 21 Symp. on Principle of Database Systems. ACM, 2002.65-76.
  • 8Neven F. Automata, logic, and XML. In: Proc of the 16th Int'l Workshop Computer Science Logic. CSL, 2002.2-26.
  • 9NASA's Astronomical Data Center. ADC XML Resource Page. http://xml.gsfc.nasa.gov.
  • 10Diao Y, Fischer P. YFilter: Efficient and scalable filtering of XML documents. In: Proc of the 18th Int'l Conf on Data Engineering. 2002. 341-345.

共引文献32

同被引文献63

引证文献9

二级引证文献22

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部