摘要
【目的】实现对机构知识库作者名消歧的高度自动化处理,并在适当的时机提供人工介入机制。【方法】分析机构知识库作者名消歧的特殊性与消歧特征项,依此构建机构知识库作者名通用消歧框架并实践部署。【结果】该框架在实际应用中取得良好的成效,准确率达到99%以上。【局限】对缺失单位信息的作者名未进行处理;作者别名与机构别名可能存在例外情况。【结论】该框架能够有效地解决机构知识库作者名消歧的难题,在此基础上可构建更多的精准增值服务。
[Objective] This paper tries to automatically finish the disambiguation of author names in institutional repositories,and then provide human intervention mechanism at the right time.[Methods] First,we analyzed the unqiue features of the author name disambiguation.Then,we constructed a general disambiguation framework for the institutional repository.[Results] Our framework achieved good results in practice with more than 99% of precision.[Limitations] We did not examine the author names without affiliation addresses,and there may be exceptions in the alias of authors and institutions.[Conclusions] This framework could effectively disambiguate author names in institutional repositories,which helps us provide more value-added services.
作者
张旺强
祝忠明
李雅梅
卢利农
刘巍
Zhang Wangqiang;Zhu Zhongming;Li Yamei;Lu Linong;Liu Wei(Lanzhou Information Center,Chinese Academy of Sciences,Lanzhou 730000,China;ShanghaiTech University Library,Shanghai 201210,China)
出处
《数据分析与知识发现》
CSSCI
CSCD
北大核心
2019年第6期92-98,共7页
Data Analysis and Knowledge Discovery
基金
中国科学院文献情报能力建设专项“机构知识库持续运行建设及开放科研知识云”(项目编号:Y8ZG051001)的研究成果之一
关键词
机构知识库
人名消歧
作者名消歧
CSpace
Institutional Repository
Name Disambiguation
Author Name
Disambiguation CSpace