Abstract
Web information integration involves heterogeneous information sources of many different types. Its goal is a highly flexible, comprehensive integration approach that analyzes and consolidates these heterogeneous sources into a single, consistent data set. This is of direct practical significance both for improving knowledge-based decision-making and for increasing the reuse of information. To address the integration of such heterogeneous sources effectively, this paper divides the overall Web information integration process into three phases: data extraction, data integration, and data output. For the data extraction phase, the paper studies the problem of schema extraction from relational databases.
Source
《信息技术》
2009, No. 8, pp. 117-120 (4 pages)
Information Technology