期刊文献+

开源关系数据库集群的并行空间连接算法实现 被引量:3

Research and Realization on Parallel Spatial Join Query Algorithm Based on Open Source RDBMS Cluster
在线阅读 下载PDF
导出
摘要 当前对并行空间连接查询的研究主要集中在算法设计上,缺少在并行关系数据库管理系统上的应用实现研究.通过分析并行空间连接算法流程,利用开源并行关系数据库集群项目PL/Proxy,提出了混合式计算迁移模式并扩展了对空间操作的支持,并在其上实现了可扩展的基于空间划分的并行空间连接算法.通过真实数据的实验表明:设计实现的并行空间连接算法在空间数据划分负载均衡的情况下,可实现近线性的加速比;而在空间划分产生数据倾斜严重的情况下,仍具有一定的加速比,同时具备针对空间划分方案改进的可扩展能力.算法的实现方式为进行并行空间数据管理研究提供了一种可行的解决方案. Existing studies on parallel spatial join query mostly focus on algorithm process. Few of them pay enough attentions on implementation and application research. After analyzing existing algorithm process, parallel spatial join algorithms are divided into four phases which are parallel candidate tasks generating, assignment, executing and results collection. Each of them is designed and implemented on a parallel RDBMS cluster which is built on a open-source project named ‘PL/Proxy'. In addition, to implement parallel computation, mixed computation migration method is proposed.Spatial extension function is implemented to support the spatial data sets operation on the cluster. Result of experiments using real data sets shows that, the implemented algorithm gains near linear speedup when data declustering scheme is optimal. Moreover, speedup is also gained while significant data skew caused by data declustering exists. The data declustering scheme is replaceable for improving the algorithm performance. A practicable solution for parallel spatial data sets management is provided in this paper.
出处 《计算机系统应用》 2016年第10期233-239,共7页 Computer Systems & Applications
基金 国家高技术研究计划(863)(2013AA12A403)
关键词 空间数据划分 并行空间连接查询 计算迁移 并行关系数据库 spatial data partition parallel spatial join query computation migration parallel RDBMS
  • 相关文献

参考文献4

二级参考文献53

  • 1陈虎,张云泉,柳锴,李玉成.基于机群架构的并行数据库中间件系统改进研究[J].计算机研究与发展,2007,44(z3):142-146. 被引量:1
  • 2王永杰,洪新兰,孙伟.基于并行空间数据库和GML的WebGIS系统的研究与实现[J].测绘科学,2007,32(4):131-132. 被引量:4
  • 3[1]Dewitt D J, Kabra N, Lou J. Client-server paradise [A]. Proc 20th Int Conf on Very Large Databases[C]. Santiago: Chi le, 1994. 558-569.
  • 4[2]Hoel E, Samet H. Performance and data-parallel spatial operations [A]. Proc 20th Int Conf on Very Large Databases [C]. Santi ago: Chile, 1994. 156-167.
  • 5[3]Brinkhoff T, Kriegel H-P, Seeger B. Parallel processing of spatial joins using R-trees [A]. Int Conf on Data Engineering [C ]. New Orleans, USA: [s.n.], 1996. 67-79.
  • 6[4]Guttman A. R-tree: a dynamic index structure for spatial search [ A]. Proc ACM SIGMOD Int Conf on Management of data [C]. Boston: MA, 1984. 47-57.
  • 7[5]Theodoridis Y, Stefanakis E, Sellis T. Efficient cost models for sp a tial queries using R-trees [J]. IEEE Transactions on Knowledge and Dat a Engineering, 2000, 12(1): 105-121.
  • 8Kriegel H P,Brinkhoff T,Schneider R.Efficient spatial query processing in geographic database systems.Data Engineering Bulletin,1993,16:10-15.
  • 9DeWitt D,Gray J.Parallel database systems:the future of high performance database systems.Communications of the ACM,1992,35:85-98.
  • 10Dittrich J,Seeger B.Data redundancy and duplicate detection in spatial join processing.In:Proceedings of the 16th International Conference on Data Engineering,San Diego,CA,USA,2000.535-546.

共引文献37

同被引文献37

引证文献3

二级引证文献19

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部