Abstract
The advent of big data has brought new opportunities and challenges to e-government, and it offers a new perspective on organization codes, which are one of the information resources of e-government. The data resources that governments use in decision-making are still not fully unified: data structures and types differ markedly across sources. To let these isolated data be shared and to fuse data held in different information sources, this paper analyzes the characteristics that organization codes and big data have in common and, on that basis, proposes a data fusion method based on multi-source organization code information. The method fuses information from different sources using three kinds of information: organization codes, legal person information, and organization names. Experiments show that the method achieves a fusion rate of 97% and an accuracy of 87.4%.
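The abstract names the three fields used for fusion but not the matching rules. The following is a minimal sketch of how records from several sources might be grouped on those fields, not the paper's algorithm. The `Record` type, the `norm`, `same_entity`, and `fuse` helpers, and the rule "match on organization code when both records have one, otherwise require both legal person and organization name to agree" are all assumptions made for illustration.

```python
from dataclasses import dataclass
from typing import List, Optional


@dataclass
class Record:
    """One organization record from one source (hypothetical schema)."""
    source: str
    org_code: Optional[str]
    legal_person: Optional[str]
    org_name: Optional[str]


def norm(value: Optional[str]) -> Optional[str]:
    """Normalize a field for comparison: drop spaces and hyphens, uppercase."""
    if not value:
        return None
    cleaned = "".join(ch for ch in value if ch not in " -\u3000").upper()
    return cleaned or None


def same_entity(a: Record, b: Record) -> bool:
    """Assumed rule: compare codes if both exist, else legal person plus name."""
    code_a, code_b = norm(a.org_code), norm(b.org_code)
    if code_a and code_b:
        return code_a == code_b
    person_a, person_b = norm(a.legal_person), norm(b.legal_person)
    name_a, name_b = norm(a.org_name), norm(b.org_name)
    return bool(person_a and name_a and person_a == person_b and name_a == name_b)


def fuse(records: List[Record]) -> List[List[Record]]:
    """Greedily place each record in the first group containing a match."""
    groups: List[List[Record]] = []
    for rec in records:
        for group in groups:
            if any(same_entity(rec, other) for other in group):
                group.append(rec)
                break
        else:
            # No existing group matched, so this record starts a new one.
            groups.append([rec])
    return groups
```

Calling `fuse` on a list of `Record` objects returns one list per inferred organization. The greedy pass keeps the sketch short; it does not merge two groups that a later record would link, which a union-find structure would handle.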
Source
《测绘科学》
CSCD
Peking University Core Journals
2014, No. 5, pp. 76-79, 64 (5 pages in total)
Science of Surveying and Mapping
Funding
National High Technology Research and Development Program of China (G1213)
National Key Technology R&D Program of China (2012BAH24B02, 2012BAK15B04)
Keywords
organization codes
multi-source
data fusion
big data