In the XML community, exact queries allow users to specify exactly what they want to check and/or retrieve in an XML document. When they are applied to a semi-structured document or to a document with an overly complex model, the absence of an explicit document model (Document Type Definition (DTD), Schema, etc.), or the user's unfamiliarity with it, increases the risk of obtaining an empty result set when the query is too specific, or an overly large result set when it is too vague (e.g. when it contains wildcards such as “*”). The reason is that in both cases users write queries according to the document model they have in mind; this can be very far from the one that can actually be extracted from the document. In contrast to exact queries, preference queries are more flexible and can be relaxed to expand the search space during their evaluation. Indeed, during evaluation, certain constraints (the preferences they contain) can be relaxed if necessary, precisely to avoid empty results; moreover, the returned answers can be filtered to retain only the best ones. This paper presents an algorithm for evaluating such queries, inspired by the TreeMatch algorithm proposed by Yao et al. for exact queries. In the proposed algorithm, the best answers are obtained by adapting the Skyline operator (defined for relational databases) to documents (trees), so as to incrementally filter the set of partial solutions and keep those that satisfy the largest number of preferential constraints. The only restriction imposed on documents is No-Self-Containment.
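The Skyline-style filtering described in this abstract can be illustrated with a minimal sketch. It is not the paper's algorithm: it assumes, hypothetically, that each partial solution is summarized by a tuple of 0/1 flags, one per preference constraint (1 meaning satisfied), and it keeps the solutions that no other solution dominates. The names `dominates` and `skyline` are illustrative only.

```python
from typing import List, Sequence, Tuple

# Hypothetical encoding: one 0/1 flag per preference constraint.
Flags = Tuple[int, ...]


def dominates(a: Sequence[int], b: Sequence[int]) -> bool:
    """True if a is at least as good as b everywhere and strictly better somewhere."""
    at_least_as_good = all(x >= y for x, y in zip(a, b))
    strictly_better = any(x > y for x, y in zip(a, b))
    return at_least_as_good and strictly_better


def skyline(candidates: List[Flags]) -> List[Flags]:
    """Keep the candidates that no other candidate dominates (input order preserved)."""
    return [c for c in candidates
            if not any(dominates(other, c) for other in candidates)]


# Four assumed partial solutions scored against three preferences.
partial_solutions = [(1, 0, 1), (1, 1, 0), (0, 0, 1), (1, 0, 0)]
best = skyline(partial_solutions)
```

Here `(0, 0, 1)` and `(1, 0, 0)` are both dominated by `(1, 0, 1)`, so only the two incomparable solutions remain. This quadratic filter is the simplest form of the operator; an incremental variant would apply the same dominance test each time a new partial solution is produced.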
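As the core operation of XML (extensible markup language) data querying, tree pattern queries have broad application prospects, and their study is of great significance. For the extended tree pattern GTP++ (generalized tree pattern), a tree pattern description language, XTPL (XML tree pattern language), is proposed, and its complete semantics is given in denotational style, so that formal methods can be used to analyze the behavioral characteristics of tree pattern queries. This helps verify the correctness of XML queries and improve the reliability and robustness of query processing methods. In addition, taking path expressions as an example, an algorithm for extracting tree patterns from path expressions is given in denotational-semantics style.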
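The idea of extracting a tree pattern from a path expression can be sketched as follows. This is not XTPL or the GTP++ definition from the paper: it is a simplified illustration that assumes a path made only of `/` (child) and `//` (descendant) steps, with no predicates, and it produces a single chain of nodes. The names `PatternNode` and `path_to_pattern` are hypothetical.

```python
import re
from dataclasses import dataclass, field
from typing import List, Optional


@dataclass
class PatternNode:
    label: str
    axis: str  # "child" for a "/" step, "descendant" for a "//" step
    children: List["PatternNode"] = field(default_factory=list)


def path_to_pattern(path: str) -> Optional[PatternNode]:
    """Turn a simple path such as /a//b/c into a linear tree pattern."""
    # Each step is a separator ("//" is tried before "/") followed by a name.
    steps = re.findall(r"(//|/)([^/]+)", path)
    root: Optional[PatternNode] = None
    current: Optional[PatternNode] = None
    for separator, label in steps:
        axis = "descendant" if separator == "//" else "child"
        node = PatternNode(label, axis)
        if current is None:
            root = node          # first step becomes the pattern root
        else:
            current.children.append(node)
        current = node
    return root


pattern = path_to_pattern("/book//section/title")
```

The resulting pattern has `book` as its root, `section` attached by a descendant edge, and `title` attached to `section` by a child edge. Supporting predicates would add branching, which is where a pattern stops being a chain and becomes a tree.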