期刊文献+

基于马尔科夫逻辑网络的实体解析改进算法 被引量:10

Improvement of Entity Resolution Based on Markov Logic Networks
在线阅读 下载PDF
导出
摘要 实体解析(Entity Resolution,ER)是数据挖掘过程中关键而又费时的一个步骤。华盛顿大学的Domingos和Singla提出了基于马尔科夫逻辑网络(Markov Logic Networks,MLNs)的ER算法。基于此算法,在原有的MLNs体系中,引入了一个可变权重的规则,试图解决原有系统无法处理的实体二义性问题。实验证明,新算法能够有效缓解数据记录的二义性问题,并且在一定程度上提高了原始算法的精度。 Entity Resolution is a crucial and expensive step in the data mining process. Domingos and Singla of University of Washington proposed of well-founded, integrated solution to the entity resolution problem based on Markov Logic. This paper tried to improve Domingos and Singla's solution by adding a formula with a changeable weight to it, to handle the problem of ambiguity of entities that the original system cannot distinguish. The new algorithm can effectively handle ambiguity of entities, and improve accuracy compared with the original algorithm, which is proved by experiments.
出处 《计算机科学》 CSCD 北大核心 2010年第8期243-247,共5页 Computer Science
基金 国家自然科学基金(No.60970081) 国家863计划专题课题(No.2007AA01Z197) "十一五"国防预研项目资助
关键词 ER MLNs 可变权重 ER, MLNs, Changeable weight
  • 相关文献

参考文献36

  • 1McCallum A,Tejada S,Quass D.Mining knowledge from text using information extraction[C] ∥Proc.of the KDD-2003 Workshop on Data Cleaning,RecordLinkage,and Object Consolidation.ACM Press,2003.
  • 2Gehrke J,Ginsparg P,Kleinberg J.KDD cup 2003[EB/OL].http://www.cs.cornell.edu/projects/kddcup.
  • 3Newcombe H,Kennedy J,Axford S,et al.Automatic linkage of vital records[J].Science,1959,130:954-959.
  • 4Fellegi I,Sunter A.A theory for record linkage[J].American Statistical Association,1969,64:1183-1210.
  • 5Agresti A.Categorical Data Analysis[M].Wiley,New York,NY,1990.
  • 6Hernandez M,Stolfo S.The merge/purge problem for large databases[C] ∥Proc.SIGMOD-95.1995:127-138.
  • 7Monge A,Elkan C.An efficient domain-independent algorithm for detecting approximately duplicate database records[C] ∥Proc.SIGMOD-97 DMKD Workshop.1997.
  • 8McCallum A,Nigam K,Ungar L.Efficient clustering of high-dimensional data sets with application to reference matching[C] ∥Proc.KDD-00.2000:169-178.
  • 9Cohen W,Richman J.Learning to match and cluster large high-dimensional data sets for data integration[C] ∥Proc.KDD-02.2002:475-480.
  • 10Tejada S,Knoblock C,Minton S.Learning object identification rules for information integration[J].Information Systems,2001,26(8):607-633.

二级参考文献87

  • 1Koller D. Probabilistic relational models [G]// LNAI 1634: Proe of the 9th Int Workshop on Inductive Logie Programming (ILP-99). Berlin: Springer, 1999:3-13.
  • 2Getoor L. Learning statistical models from relational data [D]. Stanford, California: Stanford University, 2001.
  • 3Sanghai S, Domingos P, Weld D. Dynamic probabilistic relational models [C] //Proc of the 18th Int Joint Conf on Artificial Intelligence. San Francisco: Morgan Kaufmann, 2003.
  • 4Hsu W, Joehanes R. Relational decision networks [C]//Proc of the ICML-2004 Workshop on Statistical Relational Learning and Connections to Other Fields (SRL-2004). Banff, Canada: IMLS, 2004:61-67.
  • 5Xu Z, Tresp V, Yu K, et al. Infinite hidden relational models [C] //Proc of the 22nd Conf on Uncertainty in Artificial Intelligence. Arlington, Virginia.. AUAI Press, 2006.
  • 6Friedman N, Getoor L, Koller D, et al. Learning probabilistic relational models [C] //Proc of the 16th Int Joint Conf on Artificial Intelligence (IJCAI-99). San Francisco: Morgan Kaufmann, 1999:1300-1309.
  • 7Getoor L, Koller D, Taskar B, et al. Learning probabilistie relational models with struetural uncertainty [C] //Proc of the AAAI-2000 Workshop on Learning Statistieal Models from Relational Data. Menlo Park, CA: AAAI, 2000:13-20.
  • 8Getoor L, Friedman N, Koller D, et al. Learning probabilistic models of relational structure [C] //Proc of the 18th Int Conf on Machine Learning (ICML-01). San Francisco: Morgan Kaufmann, 2001(1/2) : 170-177.
  • 9Ngo I., Haddawy P. Answering queries from contextsensitive probabilistic knowledge bases [J]. Theoretical Computer Science, 1997, 171(1/2): 147-177.
  • 10Kersting K, Raedt I. De. Adaptive Bayesian logic programs [C] //Proc of the 11th Conf on Inducive Logic Programming. Berlin: Springer, 2001:104-117.

共引文献12

同被引文献131

引证文献10

二级引证文献36

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部