期刊文献+

信息抽取技术研究与探讨 被引量:1

在线阅读 下载PDF
导出
摘要 对信息抽取技术的发展背景、概念进行了概述。详细介绍了信息抽取中研究的四个关键技术:命名实体识别、实体关系抽取、指代消解及事件探测。根据采用模型的不同,对信息抽取进行了分类介绍,分别指出了各类抽取方法的优点、缺点及研究难点。最后,对国内外在信息抽取领域中的研究现状及应用状况进行了分析,进一步说明了信息抽取技术的发展趋势。
出处 《福建电脑》 2010年第4期55-55,65,共2页 Journal of Fujian Computer
  • 相关文献

参考文献6

  • 1Ping Zhong , Jinlin Chen. A Generalized Hidden Markov Model Approach for Web Information Extraction[C]. Proceedings of the 2006 IEEE/ WIC/ACM International Conference on Web Intelligence. December 18- 22, 2006: 709-718.
  • 2Weiwei Sun , Hongzhan Li , Zhifang Sui, The integration of dependency relation classification and semantic role labeling using bilayer maximum entropy Markov models [C]. Proceedings of the Twelfth Conference on Computational Natural Language Learning. Manchester, United Kingdora August 16-17, 2008: 243-247.
  • 3Xiao Li, Ye-Yi Wang, Alex Accro. Extracting structured information from user queries with semi-supervised conditional random fields [C]. Proceedings of the 32nd international ACM SIGIR. confcrcncc on Research and dcvclopmcnt in information retrieval. Boston, MA, USA. July 19-23, 2009: 572-579.
  • 4Ching Hoi Andy Hong, Jesse Prabawa Gozali, Min-Yen Kan. FireCite: lightweight real-time reference string extraction from webpages [C]. Proceedings of the 15th ACM SIGKDD international conference on Knowledge discovery and data mining. Paris, France.2009: 189-198.
  • 5ASHRAF Fafma,OZYER Tame,ALHAJJ Reda Employing Clustering Techniques for Automatic Information Extraction From HTML Documents [C]. IEEE transactiom on systems, man and cybernetics. Part C, Applicatious and reviews.2008,38(5): 660-673.
  • 6张铭,银平,邓志鸿,杨冬青.SVM+BiHMM:基于统计方法的元数据抽取混合模型[J].软件学报,2008,19(2):358-368. 被引量:27

二级参考文献22

  • 1Morville P, Rosenfeld L. Information Architecture for the World Wide Web: Designing Large-Scale Web Site. 3rd ed., Sebastopol: 0'Reilly&Associates, 2006.
  • 2Chidlovskii B Wrapping web information providers by transducer induction. In: Racdt L, Flach P, eds. Proc of the 12th Int'l of European Conf. on Machine Learning (ECML 2001). LNCS 2167, Heidelberg: Springer-Verlag, 2001.61-72.
  • 3Hitchcock S, Carr L, Jiao Z, Bergmark D, Hall W, Lagoze C, Harnad S. Developing services for open eprint archives: Globalisation, integration and the impact of links. In: Proc. of the 5th ACM Conf. on Digital Libraries (ACMDL 2000). New York: ACM Press, 2000. 143-151.
  • 4Klink S, Dengel A, Kieninger T. Rule-Based document structure understanding with a fuzzy combination of layout and textual features. Int'l Journal on Document Analysis and Recognition, 2001,4( 1): 18-26.
  • 5Kim J, Le DX, Thoma GR. Automated labeling algorithms for biomedical document images. In: Proc. of the 7th World Multiconference on Systemics, Cybernetics and Informatics. Orlando: ⅢS, 2003. 352-357.
  • 6Zhang M, Yang DQ, Deng ZH, Feng Y, Wang WQ, Zhao PX, Wu S, Wang SA, Tang SW. PKUSpace: A collaborative platform for scientific researching. In: Liu WY, Shi YC, Li Q, eds. Proc of the Int'l Conf. of Web-based Learning (ICWL 2004). LNCS 3143, Heidelberg: Springer-Verlag, 2004. 120-127.
  • 7Zhao PX, Zhang M, Yang DQ, Tang SW. Automatic extraction of metadata from digital documents. Computer Science, 2003, 30(10):217-204
  • 8Bikel DM, Miller S, Schwartz R, Weischedel R. Nymble: A high performance learning name finder. In: Proc. of the 5th Conf. on Applied Natural Language Processing (ANLC'97). San Francisco: Morgan Kaufmann Publishers, 1997. 194-201.
  • 9Seymore K, McCallum A, Rosenreid R. Learning hidden Markov model structure for information extraction. In: Califf ME, Freitag D, Kushmerick N, Muslea I, eds. Proc. of the AAAI'99 Workshop on Machine Learning for Information Extraction. Cambridge: MIT Press, 1999.37-42.
  • 10Borkar VR, Deshmukh K, Sarawagi S. Automatic segmentation of text into structured records. In: Aref WG, ed. Proc. of the ACM-SIGMOD Int'l Conf. Management of Data (SIGMOD 2001). New York: ACM Press, 2001. 175-186.

共引文献26

同被引文献7

引证文献1

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部