期刊文献+

基于TF/IDF多因素改进算法的知识单元抽取研究 被引量:1

Knowledge Unit Extracting Research Based on Improved TF/IDF Multi-Factor Algorithm
在线阅读 下载PDF
导出
摘要 深入分析知识研究的基本知识单元,对知识单元的概念、特性、载体及抽取过程做详细阐述,提出知识计量研究中的知识单元的定义与特性,对知识单元的独立性、组合性、链接性、多维性、外显性、可测性进行详细说明。根据知识单元特性以及中文文献特点,提出一种基于词长和位置考虑的TF/IDF多因素改进算法,以《半导体光电》期刊1999—2006年数据为实例,对比分析了传统TF/IDF特征词抽取方法与改进后特征词抽取算法,分析结果表明,基于词长和位置的TF/IDF多因素改进算法显著提高了知识单元抽取效率和准确性。 Based on the depth analysis of the basic knowledge unit in knowledge research, the concept, the characteristics, the carries and the extraction process of knowledge unit are expounded in detail. Explored definition and properties of knowledge unit in knowledge metrics study, explained the independence, combination, links, muhidimension, explicit and measurable. Based on the characteristics of knowledge unit and the specialization of Chinese documents, we proposed and improved TF/IDF multifactor algorithm based on the consideration of both length of word and word location. Then took the data from 1999-2006 in Semiconductor Optoelectronies journal as an example, Analyze the differences between traditional method and the improved algorithm. The results showed that the algorithm we proposed significantly increased the efficiency and precision in knowledge unit extraction.
出处 《情报学报》 CSSCI 北大核心 2011年第10期1037-1043,共7页 Journal of the China Society for Scientific and Technical Information
基金 本文得到国家社会科学基金(08BTQ025)的资助.
关键词 知识计量 知识单元 知识单元抽取 TF/IDF knowledge metrics, knowledge unit, knowledge unit extracting, TF/IDF
  • 相关文献

参考文献19

二级参考文献100

共引文献328

同被引文献24

  • 1王知津,张国华.知识组织概念模型及相关问题[J].中国图书馆学报,2004,30(4):5-9. 被引量:10
  • 2MAI J-E. Actors, domains, and constraints in the design and construction of controlled vocabularies [ J ]. Knowledge Or- ganization, 2008, 35 ( 1 ) : 16-30.
  • 3PASTO S, et al. Advantages of thesaurus representation using the simple knowledge organization system compared with pro- posed alternatives [ J]. Information Research, 2009, 14 (4) : 1-16.
  • 4袁翰青.现代文献工作基本概念[J].图书馆,1964(2):25-31.
  • 5SIGEL A. The knowledge organization on internet [EB/OL ]. [2011-02-23 ]. http://www, isko. org/wiss-org, faq. html.
  • 6HODGE G. Systems of kowledge organization for digital librar- ies : beyond traditional authority files [ M ]. Washington DC : The Digital Library Federation Council on Library and Informa- tion Resources, 2009.
  • 7ABBAS J. Structures for organizing knowledge: exploring taxon- omies, ontologies, and other schema [ M]. 100 William St., Suite 2004, New York, NY 10038 ; 2004 (212) : 925-8650.
  • 8MENZEL C. Knowledge representation, the World Wide Web, and the evolution of logic [ J]. Synthese, 2011, 182 (2) : 269-295.
  • 9西安交通大学.一种面向文本的知识单元关联关系挖掘方法:中国,CN201110312882.1[P].2012-5-2.
  • 10王曰芬,熊铭辉,吴鹏.面向个性化服务的知识组织机制研究[J].情报理论与实践,2008,31(1):7-11. 被引量:18

引证文献1

二级引证文献7

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部