期刊文献+

面向多源知识融合的扩展主题图相似性算法 被引量:10

Novel Similarity Algorithm of Extended Topic Maps for Multi-Resource Knowledge Fusion
在线阅读 下载PDF
导出
摘要 针对基于元数据或传统主题图的知识组织模式没有实现知识的多层次多粒度表示,以及知识融合过程中相似性算法准确性不高而影响融合质量的问题,结合全信息理论与扩展主题图结构特点及语义信息,提出了面向多源知识融合的扩展主题图相似性算法(ETMSC)和阈值选取的相关性、层次对应和实验确定三原则.该算法综合了语法、语义和语用的相似性,扩展了主题图元素间组成结构上的相似性,同时充分考虑了涵义及所处语境的相似性.主题图相似性的判别准则与阈值有关,阈值的确定与数据集相关.实验结果表明,ETMSC算法与单纯基于语法或语义的相似性算法相比,准确性提高了9.2%~11.1%. A novel similarity algorithm of extended topic map called ETMSC for multi-resource knowledge fusion is proposed to improve the drawbacks that the knowledge organization model based on metadata or traditional topic map can not represent knowledge multi-level and multigranularity, and the low accuracy of existing similarity algorithms. Three principles of the correlation, levels corresponding, and the experimental determination in selecting threshold are presented. The algorithm combines the comprehensive information theory with the structure and semantic information of extended topic map. The syntactic matching, semantic matching, and pragmatic matching are comprehensively considered, in which not only the structural similarity of topic map elements are extended, but also the meaning and relevance in linguistic contexts are thoroughly taken into account. Topic map similarity criterions are related to a threshold, and the determination of the threshold is associated with the data sets. Experimental results and comparisons with the traditional algorithms that are purely based on the syntactic or semantic similarity show that the F-measure of ETMSC is improved by 9.2%-11.1%.
出处 《西安交通大学学报》 EI CAS CSCD 北大核心 2010年第2期20-24,共5页 Journal of Xi'an Jiaotong University
基金 国家高技术研究发展计划资助项目(2008AA01Z131) 国家自然科学基金资助项目(60803162)
关键词 知识融合 主题图 相似性算法 knowledge fusion topic map similarity algorithm
  • 相关文献

参考文献8

  • 1田磊,覃征,衡星辰,邵利平.基于本体的多源异构XML数据近似查询方法[J].西安交通大学学报,2007,41(6):702-706. 被引量:5
  • 2鲁慧民,冯博琴,赵英良,郑庆华,刘均.一种基于扩展主题图的分布式知识融合[J].吉林大学学报(理学版),2009,47(3):543-547. 被引量:7
  • 3MAICHER L, WITSCHEL H F. Merging of distributed topic maps based on the subject identity measure (SIM) approach [M].Leipzig, Germany: LIT, 2004 : 1-11.
  • 4KIM J M, SHIN H, KIM H J. Schema and constraints-based matching and merging of topic maps [J]. Information Processing and Management, 2007, 43(4) : 930-945.
  • 5吴笑凡,周良,张磊,丁秋林.分布式主题地图合并中的TOM算法[J].武汉大学学报(工学版),2006,39(5):131-136. 被引量:9
  • 6PEPPER S. The TAO of topic maps [EB/OL]. [2008- 12-20]. http: //www. gca. org/papers/xmleurope2000/.
  • 7LU Huimin, FENG Boqin, ZHAO Yingliang, et al. A new model for distributed knowledge organization management [C]//Proceedings of the 7th International Conference on Grid and Cooperative Computing. Los Alamitos, CA, USA: IEEE Computer Society, 2008: 261-265.
  • 8钟义信.自然语言理解的全信息方法论[J].北京邮电大学学报,2004,27(4):1-12. 被引量:42

二级参考文献34

  • 1朱良兵,纪希禹.基于Topic Maps的叙词表再工程[J].现代图书情报技术,2006(9):81-84. 被引量:20
  • 2吴笑凡,周良,张磊,丁秋林.分布式主题地图合并中的TOM算法[J].武汉大学学报(工学版),2006,39(5):131-136. 被引量:9
  • 3Rubin S H.On the Fusion and Transference of Knowledge (Ⅰ-Ⅱ)[C]//Proc 2003 IEEE Int'l Conf on Information Reuse and Integration (IRI'03).Las Vegas:IEEE Systems,Man,and Cybernetics Society,2003:144-159.
  • 4XIE Neng-fu,CAO Cun-gen,GUO Hong-yu.A Knowledge Fusion Model for Web Information[C]//Proc 2005 IEEE/WIC/ACM Int'l Conf on Web Intelligence (WI'05).Washington:IEEE Computer Society,2005:67-72.
  • 5Andres F,Naito M.Dynamic Topic Mapping Using Latent Semantic Indexing[C]//Proc Third Int'l Conf on Information Technology and Applications (ICITA'05).Washington:IEEE Computer Society,2005,2:220-225.
  • 6Chan P T,Rad A B,Tsang K M.Optimization of Fused Fuzzy Systems via Genetic Algorithms[J].IEEE Transactions on Industrial Electronics,2002,49(3):685-692.
  • 7LIU Xiao-qiang.Towards Aggregate Knowledge Services System:a Distributed Cognition Framework[C]//Proc 2nd Int'l Conf on Pervasive Computing and Applications (ICPCA'07).Washington:IEEE Computer Society,2007:582-587.
  • 8Pepper S.The TAO of Topic Maps[EB/OL].2000-06-12.http://www.gca.org/papers/xmleurope2000/.
  • 9Maicher L,Witschel H F.Merging of Distributed Topic Maps Based on the Subject Identity Measure (SIM) Approach[C]//Proc LIT'04.Leipzig,Germany:Berliner XML Tags,2004:301-307.
  • 10Kim J M,Shin H,Kim H J.Schema and Constraints-based Matching and Merging of Topic Maps[J].Information Processing and Management,2007,43(4):930-945.

共引文献57

同被引文献126

引证文献10

二级引证文献53

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部