期刊文献+

数据集成中XML数据查询语义重写 被引量:9

SEMANTIC QUERY REWRITING FOR XML DATA IN DATA INTEGRATION
在线阅读 下载PDF
导出
摘要 查询重写是数据库研究的一个基本问题,它和查询优化、数据仓库、数据集成、语义缓存等数据库问题密切相关.为提高集成系统的查询效率,系统选择提交频率较高的XML查询物化为中间层视图.用户提交查询后,系统尽可能利用中间视图层中视图,而不是访问数据源来回答查询,这个问题实际可以归结为半结构化查询重写问题.考虑到中间视图层空间的有限性,已有视图应当尽可能回答更多的查询.传统查询重写方法没有考虑半结构化数据之间的约束,而根据约束可以等价变换查询,从而提高中间视图层中的表达能力.提出了一种新的半结构化查询重写的方法,该方法在保证算法正确性和完备性的基础上,利用了半结构化数据中的约束,尤其是XML文档中的路径依赖,来增强中间层物化视图的表达能力.理论分析和初步原型实验证明方法的有效性. Query rewriting is a basic problem in database research. It is relevant in many aspects of database management, including query optimization, datawarehouse, data integration, semantic cache etc. In order to improve the efficiency of integration system, the frequently submitted XML views are selected to materialize in mediator layer. After end users submit a query, the system tries to use views in mediator layer rather than the access to the data source to answer the query. Such a problem could be reduced to the semi-structured query rewriting. Due to the limited space in the mediator layer, the existing views should answer as more queries as possible. Traditional methods do not take into account the constraints in the semi-structured data, while the constraints play an important role in the transformation of queries hence enhancing the expressive ability of existing views. In this paper, a novel method is explored to make use of semantic constraints, especially path constraints in XML documents to enhance the expressive ability of materialized view in the mediator layer. The soundness and completeness of the algorithm are maintained. The theoretical analysis and prototype preliminary experiment results prove the validation of the method.
出处 《计算机研究与发展》 EI CSCD 北大核心 2002年第4期435-442,共8页 Journal of Computer Research and Development
基金 国家"九七三"重点基础研究发展规划(G1999032705) 北京大学-IBM创新研究院基金项目资助
关键词 数据集成 XML 数据查询 查询重写 数据库 半结构化数据 query rewriting, constraint, query containment, semi-structured data, XML
  • 相关文献

参考文献1

二级参考文献12

  • 1[1]A Y Levy, A O Mendelzon, Y sagiv et al. Answering queries using views. In: Proc of the 16th ACM SIGACT SIGMOD SIGART Symp on Principles of Database Systems (PODS'95). San Jose, California, 1995. 95~104
  • 2[2]Rajaraman, Y Sagiv, J D Ullman. Answering queries using templates with binding patterns. In: The 16th ACM SIGACT SIGMOD SIGART Symp on Principles of Database Systems (PODS'95). San Jose, California, 1995
  • 3[3]Rachel Pottinger, Alon levy. A scalable algorithm for answering queries using views. In: Proc of the 26th Int'l Conf on Very Large Data Bases (VLDB). Vairo, Egypt, 2000
  • 4[4]S Cohen, W Nutt, A Serebrenik. Rewering aggregate queries using views. In: Proc of the 18th ACM SIGACT SIGMOD SIGART Symp on Principles of Database Systems (PODS'99). Philadelphia, Pennsylvania, 1999
  • 5[5]D Srivastava, S Dar, H V Jagadish et al . Answering queries with aggregation using views. In: Proc of the 22nd Int'l Conf on Very Large Data Bases (VLDB'96). Bombay, India, 1996. 318~329
  • 6[6]Duschka, M R Genesereth. Answering recursive queries using views. In: Proc of the 16th ACM SIGACT SIGMOD SIGART Symp on Principles of Database Systems (PODS'97). Tucson, Arizona, 1997
  • 7[7]Yannis Papakonstantinou, Vasilis Vassalos. Query rewriting for semi-structured data. In: Proc of ACM SIGMOD Conf on Management of Data. Philadephia, Pennsylvania, 1999
  • 8[8]Vasillis Vassalos, Yannis Papakonstantinou. Describing and using query capabilities of heterogeneous sources. In: Proc of the Conf on Very Large Data Bases (VLDB). Athens, Greece, 1997. 256~265
  • 9[9]D Calvanese, G De Giacomo, M lenzerini et al. Rewriting of regular expressions and regular path queries. In: Proc of the ACM SIGACT SIGMOD SIGART Symp on Principles of Database Systems (PODS'99). Philadelphia, Pennsylvania, 1999
  • 10[10]Gio Wiederhold. Mediators in the architecture of future information systems. IEEE Computer, 1992, 25(3): 38~49

共引文献15

同被引文献61

引证文献9

二级引证文献16

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部