期刊文献+

基于GPGPU的并行影像匹配算法 被引量:45

Parallel Image Matching Algorithm Based on GPGPU
在线阅读 下载PDF
导出
摘要 提出一种基于GPGPU的CUDA架构快速影像匹配并行算法,它能够在SIMT模式下完成高性能并行计算。并行算法根据GPU的并行结构和硬件特点,采用执行配置技术、高速存储技术和全局存储技术三种加速技术,优化数据存储结构,提高数据访问效率。实验结果表明,并行算法充分利用GPU的并行处理能力,在处理1280×1024分辨率的8位灰度图像时可达到最高多处理器warp占有率,速度是基于CPU实现的7倍。CUDA在高运算强度数据处理中呈现出的实时处理能力和计算能力,为进一步加速影像匹配性能和GPU通用计算提供了新的方法和思路。 With the development of satellite remote sensing technology, it is the key issue in remote sensing field to transform mossive data into user information in short time. The traditional image matching algorithms for optimization and implementation which were designed for common processor CPU, could not be effectively applied on graphics processing unit (GPU). Afast image matching parallel algorithm is presented based on general-purpose compu- ting on graphics processing units (GPGPU) which support Compute Unified Device Architecture (CUDA). The algorithm can execute high performance parallel computing in Single Instruction Multiple Thread (SlMT) Pattern. Qn the basis of the parallel architecture and hardware characteristic of GPU, the parallel algorithm introduces three speedup methods to improve the implementation performance: execution configuration technology, high-speed storage technology and global storage technology optimizes the data storage structure and improves the data access efficiency. The experiment result shows that GPU can with high efficiency implement the parallel algorithm and processing efficiency of 8-bit 1 280× 1 024 pictures can be up to the highest Multiprocessor Warp Occupancy, processing speed is 7 times faster than CPU-based implementation. The comparison between CUDA and CPU in image matching algorithms shows the advance of the CUDA in high arithmetic intensity real-time processing and computing data processing and this provides new methods and ideas to optimize image matching performance and GPGPU.
作者 肖汉 张祖勋
出处 《测绘学报》 EI CSCD 北大核心 2010年第1期46-51,共6页 Acta Geodaetica et Cartographica Sinica
基金 国家自然科学基金(40771177) 国家863计划(2006AA12Z136) 河南省重点科技攻关项目(072102360026)
关键词 细粒度并行计算 图形处理器的通用计算 统一计算设备架构 影像匹配 单指令多线程 fine-grained parallel computing GPGPU CUDA image matching SlMT
  • 相关文献

参考文献16

  • 1摩尔的预言:唯有CU-DA才是终极的CPU(二)[EB/OL].[2008-07-28].http://space.itpub.net/14741601/viewspace-410810.
  • 2GPU是并行计算的高手[EB/OL].[2008-10-24].http:∥www.expreview.com/review/1224821886d10275_2.html.
  • 3宗亮,邬延辉.基于集群系统的并行图像灰度匹配[J].宁波大学学报(理工版),2009,22(1):74-77. 被引量:3
  • 4NVIDIA. CUDA 2.0 for WINDOWS CUDA 2.0 Program ming Guide [EB/OL]. [2008-06-07]. http://developer.download.nvidia.com/compute/cuda/2_0/docs/NVIDIA_CUDA_Programming_Guide_2.0. pdf. 20.
  • 5PODLOZHNYUK V. Image Convolution with CUDA [EB/ OL]. [2007-01-06]. http://www. nvidia.com/object/cuda_ home. html.
  • 6HARRIS M. Optimizing Parallel Reduction in CUDA [EB/ OL]. [2007-11-08]. http://www. nvidia. com/object/cuda _home. html.
  • 7STONE J E, PHILLIPS J C, FREDDOLINO P L, et al. Accelerating Molecular Modeling Applications with Graphics Processors [J]. Journal of Computational Chemistry, 2007, 28(16):2618-2640.
  • 8张祖勋,张剑清.数字摄影测量学[M].武汉:武汉大学出版社,2007:167.
  • 9NVIDIA. CUDA 2.0 for WINDOWS CUDA 2.0 Reference Manual [EB/OL]. [2008-06-12]. http://developer. download. nvidia.com/compute/euda/2_0/docs/CudaReferenceManual_2.0. pdf.
  • 10MANAVSKI S A, VALLE G. CUDA Compatible GPU Cards as Efficient Hardware Accelerators for Smith-Waterman Sequence Alignment [J]. BMC Bioinformatics, 2008, 9(2).

二级参考文献5

共引文献18

同被引文献380

引证文献45

二级引证文献301

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部