摘要
根据HTML文档不同标签域的分布特征和对文档内容的代表能力不同,本文提出了一种改进的向量模型;针对Web信息检索的特点,给出了一种多关键字查询向量的构建方法。最后通过文档向量与查询向量之间的相关度对检索结果进行优化,提高查准率。
According to the text term distribution and content representing ability of different fields of HTML document, an improved Vector Space Model is proposed in this paper. Then an onstructing method of query vector based on Multi-keyword is introduced. At last we can optimize the search results by the similarity between document vector and query vector.
出处
《重庆职业技术学院学报》
2006年第3期151-153,共3页
Journal of Chongqing Vocational& Technical Institute