期刊文献+

基于超链接信息的搜索引擎检索结果聚类方法研究 被引量:1

Research Clustering the Searching Results of Search Engines based on Hyperlink
在线阅读 下载PDF
导出
摘要 针对目前搜索引擎返回候选信息过多从而使用户不能准确查找与主题有关结果的问题,提出了基于超链接信息的搜索引擎检索结果聚类方法,通过对网页的超链接锚文档和网页文档内容挖掘,最终将网页聚成不同的子类别。这种方法在依据网页内容进行聚类的同时,充分利用了Web结构和超链接信息,比传统的结构挖掘方法更能体现网站文档的内容特点,从而提高了聚类的准确性。 As for the problem that users spend much time to select the results from that returned from search engine and can not get the exact one, this paper presents a new approach for clustering the search engine searching results based on hyperlink information. Through mining the hyperlink anchor file of web page ,this method clusters the web page into the different little kinds, in which the contents of web page are clustered and Web structure and hyperlink information are made a best use,giving a more characteristics of Web document contents than the traditional structure mining method and improving the veracity of clustering.
作者 夏斌 徐彬
出处 《电脑开发与应用》 2007年第5期16-17,20,共3页 Computer Development & Applications
关键词 搜索引擎 超链接 结构挖掘 聚类 search engines ,hyperlink, structure mining, clustering
  • 相关文献

参考文献6

  • 1MEHMEDKANTARDZIC.数据挖掘-概念、模型、方法和算法[M].北京:清华大学出版社,2003..
  • 2韩家炜,Katuber M.数据挖掘:概念与技术[M].北京:机械工业出版社,2002.
  • 3张健沛,刘洋,杨静,代坤.搜索引擎结果聚类算法研究[J].计算机工程,2004,30(5):95-97. 被引量:11
  • 4Y C Fang, S Parthasarathy, F Schwartz. Using Clustering to Boost Text Classification [J]. In:proceedings of the IEEE ICDM Workshop on Text Mining Maebashi City ,Japan, 2002:58-68.
  • 5Dell Zhang. Semantic, Hierarchical,Online Clustering of Web Search Results [A]. In:proceedings of. The 6th Asia Pacific Web Conference (APWEB)[C].HangZhou, China, April 2004 :162-165.
  • 6李锁花,孙志挥,周晓云.基于特征向量的分布式聚类算法[J].计算机应用,2006,26(2):379-382. 被引量:6

二级参考文献10

  • 1[1]Zamir O, Etzioni O. Web Document Clustering : A Feasibility Demos- tration. SIGIR'98,1998
  • 2[2]Zamir O, Etzioni O. Grouper: A Dynamic Clustering Interface to Web Search Results. In: Proceedings of 8th WWW Conference, Toronto Canada, 1999
  • 3AGRAWAL R, GEHRKE J, GUNOPULOS D, et al. Automatic subspace clustering of high dimensional data for data mining application[A]. Proceedings of the 1994 ACM SIGMOD International Conference on Management of Data[C]. New York: ACM Press,1998.94 - 105.
  • 4HINNEBURG A, KEIN DA. An efficient approach to clustering inlarge multimedia databases with noise[A]. Proceedings. of the 4th International Conference oa Knowledge Discovery and Data Mining (KDD'98) [C]. New York: AAAI Press, 1998.58-65.
  • 5ESTER M, KRIEGEL H-P, SANDER J, et al. A density-based algorithm for discovering clusters in large spatial databases with noise [A]. Proceedings of the 2nd International Conference On Knowledge Discovery and Data Mining (KDD'96) [C]. Portland: AAAI Press,1996. 226-231.
  • 6JANUZAJ E, KRIEGEL H-P, PFEIFLE M. Density Based Distributed Clustering[A]. Proceedings. of the 9th International Conference on Extending Database Technology[C]. Springer, 2004.88 - 105.
  • 7JANUZAJ E, KRIEGEL H - P, PFEIFLE M. Scalable Density-Based Distributed Clustering[A]. Proceedings of 8th European Conference on Principles and Practice of Knowledge Discovery in Databases(PKDD) [C]. Springer, 2004.231 - 244.
  • 8BECKMANN N, KRIEGEL H-P, SCHNEIDER R, et al. The R * -tree: An Efficient and Robust Access Method for Vector and Rectangles[A]. Proceedings ACM SIGMOD International Conference on Management of Data (SIGMOD'90) [C]. Atlantic City, NJ, New York: ACM Press, 1990. 322-331.
  • 9Dataset Generator(DatGen) [CP/OL]. http://www. data. setgenerator. com/, 2005.
  • 10马帅,王腾蛟,唐世渭,杨冬青,高军.一种基于参考点和密度的快速聚类算法[J].软件学报,2003,14(6):1089-1095. 被引量:108

共引文献28

同被引文献50

  • 1ZHAI C, COHEN W, LAFFERTY J. Beyond independent 1- evance: methods and evaluation metrics for subtopic retrieval [ C ] //Proceedings of the 26th Annual Intemational ACM SI- GIR Conference on Research and Development in Information Retrieval. New York: ACM, 2003: 10-17.
  • 2AGRAWAL R, GOLLAPUDI S, HALVERSON A, et al. Di- versifying search results [ C ] //Proceedings of the Second ACM International Conference on Web Search and Data Min- ing. New York: ACM, 2009: 5-14.
  • 3ZAMIR O E. Clustering web documents : a phrase-based method for grouping search engine results [ D ]. Washington : Univer- sity of Washington, 1999.
  • 4ZAMIR O, ETZIONI O. Web document clustering: a feasibility demonstration [ C ] //Proceedings of the 21st Annual Interna- tional ACM SIGIR Conference on Research and Development in Information Retrieval. New York: ACM, 1998: 46-54.
  • 5XU Rui. Survey of clustering algorithms [ J ]. Neural Net- works. New York: IEEE, 2005, 16 (3) : 645-678.
  • 6ROKACH L. A survey of clustering algorithms [ M ] // MAIMON O, ROKACH L. Data mining and knowledge discov- ery handbook. Springer US, 2010: 969-298.
  • 7CARPINETO C, OSIN-SKI S, ROMANO G, et al. A survey of web clustering engines [ I]. New York: ACM Computing Sur- veys (CSUR), 2009, 41 (3): 17.
  • 8OSINSKI S, STEFANOWSKI J, WEISS D. Lingo: search re- sults clustering algorithm based on singular value decomposition [ M ]// MIECZYSLAW A, SLAWOMIR W, KRZYSZTOF T. Intelligent information processing and web mining. Berlin: Springer-Verlag, 2004: 359-368.
  • 9WANG Y, KITSUREGAWA M. Use link-based clustering to im- prove web search results [ C ] //Proceedings of the Second In- ternational Conference on. New York: IEEE, 2001 : 115-124.
  • 10MAITI S, SAMANTA D. Clustering web search resuhs to iden- tify information domain [M]// SABNAM S, KUNAL D, GI- TOSREE K. Emerging Trends in Computing and Communica- tion. Springer, 2014: 291-303.

引证文献1

二级引证文献2

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部