期刊文献+

基于上下文的查询扩展 被引量:32

Context-Sensitive Query Expansion
在线阅读 下载PDF
导出
摘要 针对信息检索查询所使用的词可能与文档集中使用的词不匹配从而影响检索效果这一信息检索关键问题,提出了一种基于上下文的查询扩展方法,该方法根据查询的上下文信息对扩展词进行选择,同时考虑到扩展词与整个查询句以及与查询词的位置关系.在TREC信息检索测试集上进行的实验表明,相对于通常简单的语言模型,方法取得了5%~19%的提高.与流行的基于伪反馈的查询扩展方法相比,提出的方法也具有相当的平均准确率. The effectiveness of information retrieval fIR) systems is influenced by the degree of term overlap between user queries and relevant documents. Query-document term mismatch, whether partial or total, is a fact that must be dealt with by IR systems, query expansion (QE) is one method for dealing with term mismatch. Classical query expansion techniques such as the local context analysis make use of term co-occurrence statistics to incorporate additional contextual terms for enhancing passage retrieval. However, relevant contextual terms do not always co-occur frequently with the query terms and vice versa. Hence the use of such methods often brings in noise, which leads to reduced precision. On the basis of analyzing the process of producing query, the authors propose a new method of query expansion on the basis of context and global information. At the same time, the expansion terms are selected according to their relation with the whole query. Additionally, the position information between terms is considered. The experiment result on TREC data collection shows that the method proposed outperforms the language model without expansion by 5%-19%. Compared with the popular approach of query expansion, pseudo feedback, the method has the competitive average precision.
出处 《计算机研究与发展》 EI CSCD 北大核心 2010年第2期300-304,共5页 Journal of Computer Research and Development
基金 国家自然科学基金重点项目(60736044) 国家"八六三"高技术研究发展计划基金项目(2006AA01Z150) 云南省应用基础研究面上项目(2009ZC032M)~~
关键词 信息检索 查询扩展 上下文 语言模型 伪反馈 information retrieval query expansion context language model pseudo feedback
  • 相关文献

参考文献17

  • 1Ponte J, Croft W. A language modeling approach to information retrieval [C] //Proc of the 21st ACM Conf on Research and Development in Information Retrieval (SIGIR'98). New York: ACM, 1998:222-229.
  • 2Richardson R, Smeaton A. Using Wordnet in a knowledgebased approach to information retrieval, ca-0395 [R]. Dublin: Trinity College Dublin, 1995.
  • 3Lin D-K, Zhao S-J. Identifying synonyms among distributionally similar words [C]//Proc of Int Joint Conf of Artificial Intelligence (IJCAI2003). Acapuleo: Elsevier, 2003:Ⅰ492-Ⅰ493.
  • 4丁国栋,白硕,王斌.一种基于局部共现的查询扩展方法[J].中文信息学报,2006,20(3):84-91. 被引量:44
  • 5Xu J, Croft W. Query expansion using local and global document analysis [C] //Proc of the 19th Annual Int ACM SIGIR Conf on Research and Development in Information Retrieval. New York: ACM, 1996:4-11.
  • 6张敏,宋睿华,马少平.基于语义关系查询扩展的文档重构方法[J].计算机学报,2004,27(10):1395-1401. 被引量:55
  • 7Li Dekang. Dependency-based evaluation of MINIPAR [C] // Proc of the Workshop on the Evaluation of Parsing Systems. Granada: ELAR, 1998:298-312.
  • 8Peat H, Willett P. The limitations of term co-occurrence data for query expansion in document retrieval systems [J]. Journal of the American Society for Information Science, 1991, 42(5) : 378-383.
  • 9Voorhees E. Query expansion using lexical semantic relations[C]//Proe of ACM Conf on Research and Development in Information Retrieval 1994. New York: ACM, 1994:61-69.
  • 10Qiu Y, Frei H. Concept based query expansion [C]//Proc of ACM Conf on Research and Development in Information Retrieval 1993. New York: ACM, 1993:160-169.

二级参考文献64

  • 1S.K.M.Wong,Y.Y.Yao.On modeling information retrieval with probabilistic inference.ACM Trans.Information Systems,1995,13(1):69~99
  • 2J.M.Ponte,W.B.Croft.A language modeling approach to information retrieval.The 21st Annual Int'l ACM SIGIR Conf.Research and Development in Information Retrieval,Melbourne,1998
  • 3David R.Miller,Tim Leek,Richard M.Schwartz.A hidden Markov model information retrieval system.The 22nd Annual Int'l ACM SIGIR Conf.Research and Development in Information Retrieval,Berkeley,1999
  • 4D.Hiemstra,W.Kraaij.Twenty-one at TREC-7:Ad-hoc and cross-language track.The 7th Text Retrieval Conference,Gaithersburg,1999
  • 5J.Xu,R.Weischedel,C.Nguyen.Evaluating a probabilistic model for cross-lingual information retrieval.The 24th Annual Int'l ACM SIGIR Conf.Research and Development in Information Retrieval,New Orleans,2001
  • 6J.Xu,W.B.Croft.Cluster-based language models for distributed retrieval.The 22nd Annual Int'l ACM SIGIR Conf.Research and Development in Information Retrieval,Berkeley,1999
  • 7X.Liu,W.B.Croft.Passage retrieval based on language models.The 11th Int'l Conf.Information and Knowledge Management,McLean,2002
  • 8F.Jelinek.Statistical Methods for Speech Recognition.Cambirdge:MIT Press,1998
  • 9R.Rosenfeld.Two decades of statistical language modeling:Where do we go from here? Proc.IEEE,2000,88(8):1270~1278
  • 10S.F.Chen,J.T.Goodman.An empirical study of smoothing techniques for language modeling.Harvard University,Tech Rep:TR-10-98,1998

共引文献110

同被引文献278

引证文献32

二级引证文献96

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部