摘要
提出了一种数据空间中的命名实体集成模型(NEIM)及其在异质异构数据源中的集成方法。命名实体模型描述了数据源、实体与实体描述间的关系,能够实现从其中任意一个息查询到其它相关信息。命名实体的集成架构指出了数据空间中命名实体集成要完成的主要任务,包括命名实体的识别、实体的集成映射和实体的统一。集成算法描述了数据空间中异构数据源包含的命名实体及其描述信息的集成方法。针对结构化半结构化数据,它采取构建映射规则,使系统可以在后期持续集成这些数据源中的实体信息,实验验证了集成方法的构建映射规则的有效性。
A named entity integration model(NEIM) was proposed for Dataspace,as well as integration methods for named entities of heterogeneous data sources.Named entity integration model describes the relations among data source,named entity and the descriptions of entity.It supports any inquires from one of them to the other relevant information.The framework of named entity integration points out that the main works of the integration are named entity and its information recognition,entity integration and mapping,and entity resolution.The integrated algorithm represents the integration methods of named entity and its information in heterogeneous data sources.Especially,for the structural and semi-structured data,it constructs mapping rules,makes the system can continuous integration.The experiment validates the mapping rules.
出处
《计算机科学》
CSCD
北大核心
2012年第10期170-173,186,共5页
Computer Science
基金
福建省科技计划重大项目(2011H6016)
福建省科技计划重点项目(2011H0028)资助
关键词
数据空间
命名实体
集成
Dataspace
Named entity
Integration