期刊文献+

从社会性标签中进行语义关系抽取--一种元数据生成方法 被引量:7

Semantic Relation Extraction from Socially-generated Tags:A Methodology for Metadata Generation
在线阅读 下载PDF
导出
摘要 标签形式的社会性语义越来越占据主导地位,使元数据界在这种新形式的信息内容表达和检索方面面临机遇和挑战。其中,主要的挑战是与标签相关的语境信息的缺失。以Flickr标签为例,对如何利用社会性语义资源来丰富主题元数据进行了实验。实验过程包含4个步骤:收集Flickr标签样本;通过共有信息计算标签间的同现情况;通过Google检索结果来追踪标签对的语境信息;用自然语言处理和机器学习技术来抽取标签间的语义关系。本实验能够利用Google搜索结果构建语境库,并且以自然语言处理和机器学习算法对这些语句进行处理。这种新方法对于赋予标签对以一定语义关系有相当高的准确率。也探讨该方法在利用社会性语义丰富的主题元数据方面的意义。 The growing predominance of social semantics in the form of tagging presents the metadata communfty with both opportunities and challenges as for leveraging this new form of information content representation and for retrieval. One key challenge is the absence of contextual information associated with these tags. This paper presents an experiment working with Flickr tags as an example of utilizing social semantics sources for enriching subject metadata. The procedure included four steps : 1 ) Collecting a sample of Flickr tags, 2) Calculating cooccurrences between tags through mutual information, 3) Trac- ing contextual information of tag pairs via Google search results,4) Applying natural language processing and machine learn- ing techniques to extract semantic relations between tags. The experiment helped us to build a context sentence collection from the Google search results, which was then processed by natural language processing and machine learning algorithms. This new approach achieved a reasonably good rate of accuracy in assigning semantic relations to tag pairs. This paper also explores the implications of this approach for using social semantics to enrich subject metadata.
出处 《现代图书情报技术》 CSSCI 北大核心 2009年第3期38-45,共8页 New Technology of Library and Information Service
关键词 关系抽取 标签 搜索引擎 社会性语义 元数据 Relation extraction Tags Search engine Social semantics Metadata
  • 相关文献

参考文献21

  • 1Agichtein,Eugene,and Luis Gravano.(2000).Snowball:Extracting Relations from Large Plain-text Collections.In Kenneth M.Anderson,et al.(Ed.),Proceedings of the 5th ACM Conference on Digital Libraries,(pp.85-94).New York:Association for Computing Machinery.
  • 2Brin,Sergey.(1998).Extracting Patterns and Relations from the World Wide Web.In Paolo Atzeni et al.(Ed.),Selected Papers from the International Workshop on the World Wide Web and Databases,(pp.172-183).London:Springer.
  • 3Bunescu,Razvan C.,and Raymond J.Mooney.(2007).Extracting Relations from Text from Word Sequences to Dependency Paths.In Anne Kao,et al.(Ed.),Text Mining and Natural Language Processing,(pp.29-44).London:Springer.
  • 4Culotta,Aron,and Jeffrey Serensen.(2004).Dependency Tree Kernels for Relation Extraction.Proceedings of the 42nd Annual Meeting on Association for Computational Linguistics.Retrieved April 13,2008,from http://acl.ldc.upenn,edu/P/P04/P04-1054.pdf.
  • 5Culotta,Aron,Andrew McCallum,and Jonathan Betz.(2006).Integrating Probabilistic Extraction Models and Data Mining to Discover Relations and Patterns in Text.Proceedings of the Main Conference on Human Language Technology Conference of the North American Chapter of the Association of Computational Linguistics,(pp.296-303).
  • 6Guy,Marieke,and Emma Tonkin.(2006).Folksonomies:Tidying up tags? D-Lib Magazine,12(1).Retrieved April 13,2008,from http://www.dlib.org/dlib/january06/guy/01 guy.html.
  • 7Heymann,Paul,and Hector Garcia-Molina.(2006).Collaborative Creation of Communal Hierarchical Taxonomies in Social Tagging Systems.Technical Report 2006-10.Department of Computer Science,Stanford University.Retrieved April 13,2008,from http://labs.rightnow.com/colloquium/papers/tag_hier_mining.pdf.
  • 8Iria,Jose,and Fabio Ciravegua.(2005).Relation Extraction for Mining the Semantic Web.Dagstuhl Seminar on Machine Learning for the Semantic Web.Retrieved April 13,2008,from http://tyne.shef.ac.uk/t-rex/pdocs/dagstuhl,pdf.
  • 9Liu,Hugo and Pattie Maes.(2007).Introduction to the Semantics of People & Culture (Editorial preface).International Journal on Semantic Web and Information Systems,Special Issue on Semantics of People and Culture,3 (1).Retrieved March 28,2008,from http://larifari.org/writing/IJSWIS2007-SPC-EditorialPreface.paf.
  • 10Mathes,Adam.(2004).Folksonomies-Cooperative Classification and Communication Through Shared Metadata.Unpublished manuscript.Retrieved April 13,2008,from http://www,adammathes.com/academic/computer-mediated-communication/folksonomies.html.

同被引文献68

引证文献7

二级引证文献16

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部