
Design and Practice of an Automatic Author Name Disambiguation Framework for Institutional Repositories (cited by: 5)

Disambiguating Author Names Automatically for Institutional Repository
Abstract [Objective] To automate author name disambiguation in institutional repositories to a high degree, while providing a mechanism for human intervention at appropriate points. [Methods] We analyzed the distinctive characteristics of author name disambiguation in institutional repositories and the features available for it, then used them to build and deploy a general disambiguation framework. [Results] In practical use the framework performed well, with precision above 99%. [Limitations] Author names lacking affiliation information were not processed, and author aliases and institution aliases may have exceptions. [Conclusions] The framework effectively addresses author name disambiguation in institutional repositories and provides a basis for building more precise value-added services.
Authors Zhang Wangqiang; Zhu Zhongming; Li Yamei; Lu Linong; Liu Wei (Lanzhou Information Center, Chinese Academy of Sciences, Lanzhou 730000, China; ShanghaiTech University Library, Shanghai 201210, China)
Source Data Analysis and Knowledge Discovery (indexed in CSSCI, CSCD, and the Peking University Core Journals list), 2019, Issue 6, pp. 92-98 (7 pages)
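Funding One of the research outputs of the Chinese Academy of Sciences special project on literature and information capability building, "Sustained Operation and Development of Institutional Repositories and the Open Research Knowledge Cloud" (Project No. Y8ZG051001)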
Keywords Institutional Repository; Name Disambiguation; Author Name Disambiguation; CSpace
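
The abstract names three ingredients: author aliases, institution aliases, and a hand-off to human review. It also notes that names without affiliation information are left unprocessed. The following is a minimal Python sketch of how such a rule could be wired together. It is not the authors' framework. Every name in it (`AUTHOR_ALIASES`, `INSTITUTION_ALIASES`, `STAFF_INSTITUTION`, `disambiguate`, the staff and institution identifiers, and the sample names) is a hypothetical placeholder invented for illustration.

```python
from dataclasses import dataclass
from typing import Optional

# Hypothetical alias tables (made-up data, not from the paper).
# One name variant may belong to several people, hence the sets.
AUTHOR_ALIASES = {
    "li ming": {"staff-001", "staff-002"},
    "li m": {"staff-001", "staff-002"},
    "ming li": {"staff-001"},
}
INSTITUTION_ALIASES = {
    "example institute": "inst-a",
    "example inst": "inst-a",
    "sample university library": "inst-b",
}
STAFF_INSTITUTION = {"staff-001": "inst-a", "staff-002": "inst-b"}


def normalize(text: str) -> str:
    """Lowercase, drop commas and periods, and collapse whitespace."""
    return " ".join(text.lower().replace(",", " ").replace(".", " ").split())


@dataclass
class Decision:
    status: str  # "matched", "needs_review", or "skipped"
    staff_id: Optional[str] = None


def disambiguate(author_name: str, affiliation: Optional[str]) -> Decision:
    # Names without affiliation information are not processed.
    if not affiliation:
        return Decision("skipped")

    candidates = AUTHOR_ALIASES.get(normalize(author_name), set())
    institution = INSTITUTION_ALIASES.get(normalize(affiliation))

    # Keep only candidates whose registered institution matches the byline.
    matches = {c for c in candidates if STAFF_INSTITUTION.get(c) == institution}

    if institution is not None and len(matches) == 1:
        return Decision("matched", next(iter(matches)))
    # Anything that is not a unique match goes to a human reviewer.
    return Decision("needs_review")
```

For example, `disambiguate("Li Ming", "Example Inst.")` resolves to the single candidate registered at that institution, while a byline with an unrecognized affiliation falls through to manual review. Sending every non-unique case to a reviewer is a conservative assumption made for this sketch.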
