期刊文献+

一种基于WWW的Ontology属性值自动提取方法 被引量:1

Automatic Extraction of Ontology Attribute Value Based on WWW
在线阅读 下载PDF
导出
摘要 属性值是描述Ontology中类的重要信息,但是当前关于属性值的自动提取的研究并不多。该文提出一种基于WWW的Ontology属性值自动提取方法。论文首先提出了一种在小规模属性值种子集的基础上,包含属性值的句子的选择与属性值提取互动的方法。这种方法利用互联网信息的冗余性,自动抽取并扩充目标属性值集合。然后,为避免人工构造属性值种子集,提出种子集自动生成的方法。我们设计实验来计算提取结果的正确率和召回率,此外,我们还通过将填充后的Ontology信息用于网页正文提取任务来展示Ontology自动扩充结果的有效性。 Attributes value is among the most important information to describe Ontology. However, few researches have been done about attribute values extraction so far. This paper proposes a method of extracting Ontology attribute values automatically based on WWW. Firstly, an interactive method is described to unilize the interaction between the attribute-val ue-related sentence selection and the attribute values extraction. This method can expand the target attribute value set from a seed set by the redundancy of WWW. Secondly, we present a method to construct the seed automatically. Experiments are conducted to examine the method in terms of precision and recall. In addition, automatically enriched Ontology informa tion is applied in webpage content extraction to test the usefulness of our approach.
出处 《中文信息学报》 CSCD 北大核心 2008年第6期69-74,共6页 Journal of Chinese Information Processing
基金 国家自然科学基金资助项目(60503071) 国家973资助项目(2004CB318102)
关键词 计算机应用 中文信息处理 因特网 互动方法 属性值提取 computer application Chinese information processing WWW interactive method attribute value extraction
  • 相关文献

参考文献9

  • 1刘耀.领域Ontology自动构建研究[D].北京大学博士后出站报告,2007.
  • 2Maedche A. Ontology Learning for the Semantic Web [M]. Boston: Kluwer Academic Publishers, 2002.
  • 3P. Cimiano, A. Hotho and S. Staab. I.earning con cept hierarchies from text corpora using formal concept analysis[J]. J. Artificial Intelligence Research 2005, 24, 305-339.
  • 4Kavalec M, Svdtek V. A study on automated relation labelling in ontology learning[J]. Buitelaar P, Cimiano P, Magnini B, eds. Ontology Learning from Text: Methods, Evaluation and Applications. Amsterdam: IOS Press, 2005.
  • 5SUI Zhifang, CHEN Yirong, HU Junfeng, WU Yun-fang, YU Shiwen. The Research on the Automatic Term Extraction in the Domain of Information Science and Technology[C]//The second East Asia Forum of Terminology Proceedings, 2002.
  • 6Delphine Bernhard. Multilingual Term Extraction from Domain-specific Corpora Using Morphological Struc- ture[C]//The Association for Computational Linguistics, Trento Italy: 2006.
  • 7Agirre, E. , Ansa, O. , Hovy, E. , and Martinez, D. 2000. Enriching very large ontologies using the www [C]//Proceedings of the Ontology Learning Workshop, ECAI 2000. Berlin, Germany: 2000.
  • 8SATOSHI SATO and YASUHIRO SASAKI. Automatic collection of related terms from the web[C]//fPSJ SIG Notes, 2003(4): 57-64.
  • 9昝红英,胡俊峰,穗志方,俞士汶.信息科学与技术领域中的术语分类研究[C]..第五届东亚术语论坛.中国海口,2002.191-197.

同被引文献9

  • 1Yoshinaga N,Torisawa K.Open-domain attribute-value acquisition from semi-structured texts[C] //LNCS 4825:Proc of the 6th ISWC Workshop OntoLex07-From Text to Knowledge:The Lexicon/Ontology Interface.Berlin:Springer,2007:55-66.
  • 2Maedche A.Ontology Learning for the Semantic Web[M].Amsterdam,Netherlands:Kluwer Academic Publishers,2002.
  • 3Agirre E,Ansa O,Hovy E.et al.Enriching very large ontology using the WWW[OL].[2010-09-25].http://www.informatik.uni-trier.de/~ley/db/conf/ecai/ecai2000ol.html.
  • 4Takahashi T.Computation of semantic equivalence for question answering[D].Nara:Nara Institute of Science and Technology,2005.
  • 5Etzioni O,Cafarella M,Downey D,et al.Web-scale information extraction in KnowItAll (Preliminary results)[C]//Proc of WWW.New York:ACM,2004:100-110.
  • 6Brin S.Extracting patterns and relations from the World Wide Web[C]//Proc of the WebDB Workshop at EDBT.Berlin:Springer,1998:172-183.
  • 7Liu Yiqun,Zhang Min,Ma Shaoping,et al.Automatic search engine performance evaluation with click-through data analysis[C]//Proc of WWW.New York:ACM,2007:1133-1134.
  • 8Saaty T L.Decision making with the analytic hierarchy process[J].International Journal of Services Sciences,2008,1(1):83-98.
  • 9田国刚.受限中文语料的自监督文本知识获取研究[D].北京:中国科学院计算技术研究所,2007.

引证文献1

二级引证文献5

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部