期刊文献+

一种基于大数据的有效搜索方法 被引量:12

A Kind of Efficient Search Method Based on Big Data
在线阅读 下载PDF
导出
摘要 针对大数据查询效率低下的问题,提出了一种有效的搜索方法。将共享的历史查询结果作为中间结果集,在新的查询请求到达时,首先与历史查询进行匹配,若能实现匹配,则直接将匹配部分的历史查询结果直接作为新查询请求结果的一部分。这减少了大量的对历史查询的重复计算,节省了搜索时间,提高了查询效率。实验对比分析表明,新的基于大数据的查询方法能较好地提高查询效率。 This paper proposed an efficient search method to the problem of low efficiency for large dada queries. Using shared history query results as a set of intermediate results, when a new query request arrives, the first match for histor- ical inquiry is directly added to the matching portion of the historical results for directly as part of the new query result of the request if achieving matching. It can reduce the large number of double counting query history, save search time and improve query efficiency. By experimental comparison and analysis show that data based query methods can improve query efficiency.
出处 《计算机科学》 CSCD 北大核心 2013年第6期183-186,共4页 Computer Science
基金 国家973计划项目(2011CB302302)资助
关键词 大数据 搜索 查询网 云数据库 Big data, Search, Query network, Cloud database
  • 相关文献

参考文献9

  • 1Dean J,Ghemawat S.MapReduce:Simplified data processing on large clusters[C]//Brewer E,Chen P,eds.Proc.of the OSDI.California:USENIX Association,2004:137-150.
  • 2Ekanayake J,Li Hui,Zhang Bing-jing,et al.Twister..A Runtime for Iterative MapReduce[C]//The First International Workshop on MapReduce and its Applications (MAPREDUCE'10).2010:110-119.
  • 3Bu Y Y,Howe B,Balazinska M,et al.HaLoop:Efficient iterarive data processing on large clusters[J].PVLDB2010,2010,3(1/2):285-296.
  • 4Isard M,Budiu M,Yu Y,et al.Dryad:Distributed data-parallel programs from sequential building blocks[J].ACM SIGOPS Operating Systems Review,2007,41 (3):59-72.
  • 5Zaharia M,Chowdhury M,Franklin M J,et al.Spark:Cluster Computing withWorking Sets[R].Technology report of UC Berkeley.2011.
  • 6Dittrich J,Quian'e-Ruiz J A,Jindal A,et al.Hadoop++:Making a yellow elephant run like a cheetah (without it even noticing)[J].PVLDB,2010,3(1/2):518-529.
  • 7陈国华,汤庸,彭泽武,李建国.基于学术社区的学术搜索引擎设计[J].计算机科学,2011,38(8):171-175. 被引量:13
  • 8殷哲,曹炬.带差商信息的云搜索优化算法及其收敛性分析[J].计算机科学,2012,39(1):252-255. 被引量:6
  • 9杨艺,周元.基于用户查询意图识别的Web搜索优化模型[J].计算机科学,2012,39(1):264-267. 被引量:17

二级参考文献36

  • 1凌波,周水庚,周傲英.P2P信息检索系统的查询结果排序与合并策略[J].计算机学报,2007,30(3):405-414. 被引量:13
  • 2张光卫,康建初,李鹤松,李德毅.基于云模型的全局最优化算法[J].北京航空航天大学学报,2007,33(4):486-490. 被引量:37
  • 3戴朝华,朱云芳,陈维荣,林建辉.云遗传算法及其应用[J].电子学报,2007,35(7):1419-1424. 被引量:84
  • 4Broder A. A taxonomy of Web search[C]//SIGIR Forum. New York, N Y, USA: ACM Press, 2002 : 3-10.
  • 5Rose D E, Levinson D. Understanding user goals in web search [C] //WWW ' 04 : Proceedings of the 13the international confe- rence on World Wide Web. New York, N Y, USA: ACM Press, 2004: 13-19.
  • 6Jansen B J,Booth D L,Spink A. Determining the user intent of Web search engine queries[C] // Williamson CL, Zurko ME, Patel-Schneider PF,et al. , eds. Proc. of the 16th Int'l Conf. on World Wide Web. New York: ACM Press, 2007:1149-1150.
  • 7Ricardo A, Liliana C B, Cristina N. The intention behind Webqueries[C]//Crestani F, Ferragina P, Sanderson M, eds. Proc. of the 13th Int'l Conf. on String Processing and Information Re- trieval (SPIRE 2006 ). Berlin, Heidelberg: Springer-Verlag, 2006 :98-109.
  • 8Qi G, Eugene A. Exploring mouse movements for inferring que- ry intent[-C]//Myaeng SH, Oard DW, Sebastianj F, et al. , eds. Proc. of the 31st Annual Int' 1 ACM SIGIR Conf. on Research and Development in Information Retrieval. 2008:707-708.
  • 9Holland J. Adaptation in Natural and Artificial Systems [M]. Ann Arbor,MI:Univ. of Michigan Press,1975:1-9.
  • 10Goldberg D E. Genetic Algorithms in .Search, Optimization, and Machine Learning [M]. New York: Addison-Wesley, 19 8 9.

共引文献30

同被引文献86

  • 1黄中华.水电机组稳定性试验中几个主要问题的探讨[J].青海电力,2004,23(4):1-2. 被引量:1
  • 2陈亮,屠成宇.基于TCAM的大容量文本搜索[J].计算机工程,2005,31(5):210-212. 被引量:2
  • 3刘卫昌,马增良.企业综合自动化系统中实时数据库系统设计[J].计算机应用研究,2005,22(8):146-149. 被引量:7
  • 4冯乃勤,邱玉辉,王芳.一种提高神经网络泛化能力的新方法[J].计算机科学,2006,33(2):201-204. 被引量:5
  • 5中文分词.http://baike.baidu.com/view/19109.htm.
  • 6Dung X L,Berti E L,Srivastava D.Truth discovery and copying detection in a dynamic world [J].Proceedings of the VLDB En-dowment,2009,2(1):562-573.
  • 7Kopeke H,Thor A,Rahm E.Evaluation of entity resolution ap-proaches on real-world match problems [J].Proceedings of the VLDBEndowment,2010,3(1/2):484-493.
  • 8Fan W F,Geerts F.Capturing missing tuples and missing value [A].Proc of the 29th ACM SIGMOD slGAcT-SIGART Symp c Principles of Database Systems [C].New York:ACM,2010:169-178.
  • 9Li M J,Ng M K,et al.Agglomerative fuzzy K-means clustering algo-rithm with selection of number of clusters [J].IEEE Transactions on Knowledge and Data Engineering,2008,20(11):1519-1534.
  • 10Frank A,Asuncion A.UCI machine learning repository [EB/0L].[2012-05-20]http://archive.ics.uci.edu/mI.

引证文献12

二级引证文献83

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部