Multidatabase systems are designed to achieve schema integration and data interoperation among distributed and heterogeneous database systems. But data model heterogeneity and schema heterogeneity make this a challeng...Multidatabase systems are designed to achieve schema integration and data interoperation among distributed and heterogeneous database systems. But data model heterogeneity and schema heterogeneity make this a challenging task. A multidatabase common data model is firstly introduced based on XML, named XML-based Integration Data Model (XIDM), which is suitable for integrating different types of schemas. Then an approach of schema mappings based on XIDM in multidatabase systems has been presented. The mappings include global mappings, dealing with horizontal and vertical partitioning between global schemas and export schemas, and local mappings, processing the transformation between export schemas and local schemas. Finally, the illustration and implementation of schema mappings in a multidatabase prototype - Panorama system are also discussed. The implementation results demonstrate that the XIDM is an efficient model for managing multiple heterogeneous data sources and the approaches of schema mapping based on XIDM behave very well when integrating relational, object-oriented database systems and other file systems.展开更多
Symbol portrayal is an important function of GIS. Sharing symbolic information in different GIS platforms is necessary for GIS applications and users. This paper discusses the necessity, possibility and solution techn...Symbol portrayal is an important function of GIS. Sharing symbolic information in different GIS platforms is necessary for GIS applications and users. This paper discusses the necessity, possibility and solution technique of sharing a symbol library in different GIS platforms. The route map is designed as follows: first, to set up a general data model for the symbol library, then to design a standard exchange format, and finally to call on the GIS manufacturer to provide the interchange tools for their symbol library for the standard exchange format. This paper analyzes the general characteristics of GIS symbolic library, gives a symbol library model and a draft of XML schema of the symbol library exchange format.展开更多
We propose a three-step technique to achieve this purpose. First, we utilize a collection of XML namespaces organized into hierarchical structure as a medium for expressing data semantics. Second, we define the format...We propose a three-step technique to achieve this purpose. First, we utilize a collection of XML namespaces organized into hierarchical structure as a medium for expressing data semantics. Second, we define the format of resource descriptor for the information source discovery scheme so that we can dynamically register and/or deregister the Web data sources on the fly. Third, we employ an inverted-index mechanism to identify the subset of information sources that are relevant to a particular user query. We describe the design, architecture, and implementation of our approach—IWDS, and illustrate its use through case examples. Key words integration - heterogeneity - Web data source - XML namespace CLC number TP 311.13 Foundation item: Supported by the National Key Technologies R&D Program of China(2002BA103A04)Biography: WU Wei (1975-), male, Ph.D candidate, research direction: information integration, distribute computing展开更多
In the course of network supported collaborative design,the data processing plays a very vital role.Much effort has been spent in this area,and many kinds of approaches have been proposed.Based on the correlative mate...In the course of network supported collaborative design,the data processing plays a very vital role.Much effort has been spent in this area,and many kinds of approaches have been proposed.Based on the correlative materials,this paper presents extensible markup language(XML)based strategy for several important problems of data processing in network supported collaborative design,such as the representation of standard for the exchange of product model data(STEP)with XML in the product information expression and the management of XML documents using relational database.The paper gives a detailed exposition on how to clarify the mapping between XML structure and the relationship database structure and how XML-QL queries can be translated into structured query language(SQL)queries.Finally,the structure of data processing system based on XML is presented.展开更多
为了实现 Web 内部分布、异构数据之间的互操作和全局操作,必须对不同数据源进行集成。在分析了各集成模式的优缺点之后,提出了一种基于 XML 的虚拟化的 Web 数据集成方法。该方法采用 XML 作为集成数据的公共数据格式,通过在不同的数...为了实现 Web 内部分布、异构数据之间的互操作和全局操作,必须对不同数据源进行集成。在分析了各集成模式的优缺点之后,提出了一种基于 XML 的虚拟化的 Web 数据集成方法。该方法采用 XML 作为集成数据的公共数据格式,通过在不同的数据源和 XML 文档数据模型之间建立映射,实现了一种虚拟化的数据集成方法。这种数据集成方法简化了 Web 数据集成的实现。最后通过一个实例方案验证了方法的可行性和有效性。展开更多
The paper advances a system framework of Web data mining based on XML. This system framework inte-grates Information Retrieval with Information Extraction, and utilizes traditional data mining methods to completeWeb d...The paper advances a system framework of Web data mining based on XML. This system framework inte-grates Information Retrieval with Information Extraction, and utilizes traditional data mining methods to completeWeb data mining through XML.展开更多
文摘Multidatabase systems are designed to achieve schema integration and data interoperation among distributed and heterogeneous database systems. But data model heterogeneity and schema heterogeneity make this a challenging task. A multidatabase common data model is firstly introduced based on XML, named XML-based Integration Data Model (XIDM), which is suitable for integrating different types of schemas. Then an approach of schema mappings based on XIDM in multidatabase systems has been presented. The mappings include global mappings, dealing with horizontal and vertical partitioning between global schemas and export schemas, and local mappings, processing the transformation between export schemas and local schemas. Finally, the illustration and implementation of schema mappings in a multidatabase prototype - Panorama system are also discussed. The implementation results demonstrate that the XIDM is an efficient model for managing multiple heterogeneous data sources and the approaches of schema mapping based on XIDM behave very well when integrating relational, object-oriented database systems and other file systems.
基金Supported by the Spatial Information Engineering Key Laboratory Found of Chinese National Surveying and Mapping Bureau.(No.200722)
文摘Symbol portrayal is an important function of GIS. Sharing symbolic information in different GIS platforms is necessary for GIS applications and users. This paper discusses the necessity, possibility and solution technique of sharing a symbol library in different GIS platforms. The route map is designed as follows: first, to set up a general data model for the symbol library, then to design a standard exchange format, and finally to call on the GIS manufacturer to provide the interchange tools for their symbol library for the standard exchange format. This paper analyzes the general characteristics of GIS symbolic library, gives a symbol library model and a draft of XML schema of the symbol library exchange format.
文摘We propose a three-step technique to achieve this purpose. First, we utilize a collection of XML namespaces organized into hierarchical structure as a medium for expressing data semantics. Second, we define the format of resource descriptor for the information source discovery scheme so that we can dynamically register and/or deregister the Web data sources on the fly. Third, we employ an inverted-index mechanism to identify the subset of information sources that are relevant to a particular user query. We describe the design, architecture, and implementation of our approach—IWDS, and illustrate its use through case examples. Key words integration - heterogeneity - Web data source - XML namespace CLC number TP 311.13 Foundation item: Supported by the National Key Technologies R&D Program of China(2002BA103A04)Biography: WU Wei (1975-), male, Ph.D candidate, research direction: information integration, distribute computing
基金supported by National High Technology Research and Development Program of China(863 Program)(No.AA420060)
文摘In the course of network supported collaborative design,the data processing plays a very vital role.Much effort has been spent in this area,and many kinds of approaches have been proposed.Based on the correlative materials,this paper presents extensible markup language(XML)based strategy for several important problems of data processing in network supported collaborative design,such as the representation of standard for the exchange of product model data(STEP)with XML in the product information expression and the management of XML documents using relational database.The paper gives a detailed exposition on how to clarify the mapping between XML structure and the relationship database structure and how XML-QL queries can be translated into structured query language(SQL)queries.Finally,the structure of data processing system based on XML is presented.
文摘为了实现 Web 内部分布、异构数据之间的互操作和全局操作,必须对不同数据源进行集成。在分析了各集成模式的优缺点之后,提出了一种基于 XML 的虚拟化的 Web 数据集成方法。该方法采用 XML 作为集成数据的公共数据格式,通过在不同的数据源和 XML 文档数据模型之间建立映射,实现了一种虚拟化的数据集成方法。这种数据集成方法简化了 Web 数据集成的实现。最后通过一个实例方案验证了方法的可行性和有效性。
文摘The paper advances a system framework of Web data mining based on XML. This system framework inte-grates Information Retrieval with Information Extraction, and utilizes traditional data mining methods to completeWeb data mining through XML.