Optimization and Implementation of Lucene Ranking Pages Algorithm Based on PageRank (cited by: 12)
Abstract: With the rapid development of Web technologies, search engines that provide personalized services have attracted wide attention from users, and page ranking is one of their key technologies. This paper uses the PageRank algorithm to improve Lucene's original page ranking, and designs and implements a personalized search engine for mobile phone information. Experimental results show that the improved ranking algorithm raises the accuracy of information retrieval and gives users a better search experience than Lucene's built-in ranking.
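Source: Computer Engineering & Science (《计算机工程与科学》), indexed in CSCD and the Peking University Core Journals list, 2012, No. 10, pp. 123-127 (5 pages)
Funding: National Natural Science Foundation of China (61075059); Hubei University of Technology teaching research project (2011006)
Keywords: Lucene; PageRank; personalized search engine; sorting optimization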
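The abstract says that PageRank is used to improve Lucene's ranking but does not state how the two scores are combined. As a rough illustration only, the sketch below computes PageRank by power iteration and then blends it with a per-document text relevance score. The `rerank` function, its `weight` parameter, the weighted-sum formula, and the sample link graph and scores are all assumptions made for this example, not the paper's method and not part of Lucene's API.

```python
def pagerank(links, damping=0.85, iterations=50):
    """Power-iteration PageRank; links maps each page to its outgoing links."""
    pages = set(links)
    for targets in links.values():
        pages.update(targets)
    n = len(pages)
    rank = {p: 1.0 / n for p in pages}
    for _ in range(iterations):
        # Rank held by pages without outgoing links is spread over all pages.
        dangling = sum(rank[p] for p in pages if not links.get(p))
        new_rank = {p: (1.0 - damping) / n + damping * dangling / n for p in pages}
        for page, targets in links.items():
            if targets:
                share = damping * rank[page] / len(targets)
                for target in targets:
                    new_rank[target] += share
        rank = new_rank
    return rank


def rerank(text_scores, rank, weight=0.5):
    """Hypothetical blend: weighted sum of text score and normalized PageRank."""
    top = max(rank.values())
    blended = {
        doc: (1.0 - weight) * score + weight * rank.get(doc, 0.0) / top
        for doc, score in text_scores.items()
    }
    return sorted(blended, key=blended.get, reverse=True)


# Made-up link graph and text relevance scores, for illustration only.
links = {"a": ["b", "c"], "b": ["c"], "c": ["a"], "d": ["c"]}
rank = pagerank(links)
text_scores = {"a": 0.60, "b": 0.62, "c": 0.58, "d": 0.61}
order = rerank(text_scores, rank)
```

With near-identical text scores, the link structure decides the final order in this toy example: the heavily linked page `c` moves ahead of `b` and `d`, which text scoring alone would place first. In practice the text score would come from the search library and the weight would need tuning against relevance judgments.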