Abstract: In recent years, the widespread adoption of parallel computing, especially on multi-core processors and in high-performance computing environments, brought substantial gains in efficiency and speed. This trend was particularly noteworthy in image processing. This project explored parallel image processing, focusing on the grayscale conversion of color images. We integrated OpenMP into our framework and used it to distribute the conversion workload across multiple threads, with the primary objectives of reducing computation time and improving overall efficiency. Concurrent processing across multiple cores significantly reduced execution times. The speedup values measured for various image sizes highlighted the efficacy of parallel processing, especially for large images. However, a detailed examination revealed a potential decline in parallelization efficiency as the number of cores increased, which underscored the importance of a carefully optimized parallelization strategy that accounts for load balancing and minimizes communication overhead. Despite these challenges, the overall scalability and efficiency achieved underscored OpenMP's effectiveness in accelerating image manipulation tasks.
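The row-wise decomposition described in this abstract can be sketched as follows. This is a minimal illustration in Python, not the authors' OpenMP code: the luma weights (ITU-R BT.601), the function names, and the use of a thread pool as a stand-in for an OpenMP parallel loop are all assumptions. In CPython the threads mainly illustrate how the work is split; they are not expected to reproduce the reported speedups.

```python
from concurrent.futures import ThreadPoolExecutor

# Assumed luma weights (ITU-R BT.601); the abstract does not state which were used.
WEIGHTS = (0.299, 0.587, 0.114)


def gray_rows(rows):
    """Convert rows of (r, g, b) tuples to rows of integer gray levels."""
    wr, wg, wb = WEIGHTS
    return [[round(wr * r + wg * g + wb * b) for (r, g, b) in row] for row in rows]


def to_grayscale(image, workers=4):
    """Split the image into contiguous row bands and convert each band as one task."""
    if not image:
        return []
    band = max(1, -(-len(image) // workers))  # ceiling division: rows per band
    bands = [image[i:i + band] for i in range(0, len(image), band)]
    with ThreadPoolExecutor(max_workers=workers) as pool:
        converted = list(pool.map(gray_rows, bands))  # map keeps band order
    return [row for part in converted for row in part]


def speedup_and_efficiency(t_serial, t_parallel, cores):
    """Speedup is T_serial / T_parallel; efficiency is speedup per core."""
    speedup = t_serial / t_parallel
    return speedup, speedup / cores
```

Equal-sized row bands correspond to a static schedule. The efficiency helper makes the abstract's observation concrete: if doubling the cores less than doubles the speedup, efficiency falls.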
Abstract: A new method is applied to characterize defects in GaAs material (e.g., the absorption of EL2 centres). The method consists of transmitting a laser beam (λ = 1.1-1.5 μm) through a GaAs wafer of 4 to 8 mm thickness and 50 mm diameter. The image is received by a TOSHIBA 8844 camera and entered into the DATASUD computer image processing system. The image is displayed on a monitor, which makes it possible to observe the inhomogeneity (such as crosses, cells, and volutes) of the EL2 and dislocation defects. This paper introduces image processing software designed for GaAs material, called ZHIMAG (ZHang IMAGe), and its application to GaAs wafers. The software can also be applied to other types of image processing.
Funding: the National Basic Research Program (973) of China (No. 2010CB834300); the Biomedical Engineering Cross-Research Fund of Shanghai Jiao Tong University (Nos. YG2011MS49 and YG2013MS65)
Abstract: Compared with conventional X-ray absorption imaging, X-ray phase-contrast imaging shows higher contrast on samples with low attenuation coefficients, such as blood vessels and soft tissues. Among the modalities of phase-contrast imaging, grating-based phase-contrast imaging has been widely accepted because it supports a wide range of samples and does not require a coherent source. The downside is the substantially larger amount of data generated by the phase-stepping method, which slows down the reconstruction process. A graphics processing unit (GPU) supports parallel computing, which is very useful for processing large quantities of data. In this paper, a compute unified device architecture (CUDA) C program running on a GPU is introduced to accelerate the phase retrieval and filtered back projection (FBP) algorithms for grating-based tomography. The CUDA C program shows different amounts of speed-up over the standard C program on the same Visual Studio 2010 platform depending on the size of the data, and the speed-up ratio increases as the data size increases.
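To illustrate the per-pixel work that phase retrieval involves, here is a minimal Python sketch of a common Fourier-based approach to phase-stepping data. It assumes each pixel's stepping curve follows I_k = a0 + a1·cos(2πk/N + φ) with N ≥ 3 equally spaced steps over one grating period. The abstract does not say which retrieval formula the authors used, so the model, sign convention, and function names here are assumptions.

```python
import cmath
import math


def stepping_phase(intensities):
    """Phase of the first Fourier component of one pixel's phase-stepping curve."""
    n = len(intensities)
    if n < 3:
        raise ValueError("need at least 3 phase steps")
    first = sum(v * cmath.exp(-2j * math.pi * k / n) for k, v in enumerate(intensities))
    return cmath.phase(first)


def differential_phase(sample_curve, reference_curve):
    """Phase shift introduced by the sample, wrapped to [-pi, pi]."""
    d = stepping_phase(sample_curve) - stepping_phase(reference_curve)
    return math.atan2(math.sin(d), math.cos(d))
```

Each pixel is processed independently of every other pixel, which is the property that makes this step a natural fit for one-thread-per-pixel GPU execution.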
Funding: Supported by the National High Technology Research and Development Program ("863" Program) of China (No. 863-306-ZD13-03-06)
Abstract: Mutual information (MI)-based image registration is effective in registering medical images, but it is computationally expensive. This paper accelerates MI-based image registration by dividing the computation of mutual information into spatial transformation and histogram-based calculation, and performing the 3D spatial transformation and trilinear interpolation on a graphics processing unit (GPU). The 3D floating image is loaded onto the GPU as a flat 3D texture, and then fetched and interpolated for each new voxel location in a fragment shader. The transformed results are rendered to textures using the frame buffer object (FBO) extension, and then read back to main memory for the remaining computation on the CPU. Experimental results show that the GPU-accelerated method achieves a speedup of about an order of magnitude, with a better registration result, compared with the software implementation on a single-core CPU.
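The histogram-based half of the computation, which this paper leaves on the CPU, can be sketched as below. This is a minimal Python illustration of the standard mutual information definition computed from a joint histogram; it assumes the two images have already been resampled onto the same grid and quantized into intensity bins, and the function name is hypothetical.

```python
import math
from collections import Counter


def mutual_information(fixed, moving):
    """Mutual information (in nats) of two equal-length sequences of binned intensities."""
    if len(fixed) != len(moving) or not fixed:
        raise ValueError("inputs must be non-empty and of equal length")
    n = len(fixed)
    joint = Counter(zip(fixed, moving))  # joint histogram
    count_f = Counter(fixed)             # marginal histogram of the fixed image
    count_m = Counter(moving)            # marginal histogram of the floating image
    mi = 0.0
    for (a, b), c in joint.items():
        # p(a, b) / (p(a) * p(b)) simplifies to c * n / (count_f[a] * count_m[b])
        mi += (c / n) * math.log(c * n / (count_f[a] * count_m[b]))
    return mi
```

A registration loop would call this once per candidate transform, which is why moving the resampling step to the GPU pays off: the transform and interpolation run for every voxel on every iteration.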
Funding: the High Technology Research and Development Programme of China.
Abstract: The sonar image processing system is an important intelligent system of an Autonomous Underwater Vehicle. Based on the TMS320C30 high-speed DSP, it is used to perform sonar image compression and underwater object detection, including obstacle recognition, in real time. In this paper, the software and hardware designs of this system are introduced and the experimental results are given.
Funding: supported by the National Natural Science Foundation of China (61471343); the National Key Technology Research and Development Program of the Ministry of Science and Technology of China (2014BAK14B03)
Funding: supported by the Opening Foundation of the Agile and Intelligence Computing Key Laboratory of Sichuan Province under Grant No. H23004; the Chengdu Municipal Science and Technology Bureau Technological Innovation R&D Project (Key Project) under Grant No. 2024-YF08-00106-GX.
Abstract: To address the bottleneck in electromagnetic scattering simulation for scenes with extremely large-scale seas and ships, a high-frequency method using graphics processing unit (GPU) parallel acceleration is proposed. For the implementation of the electromagnetic methods of physical optics (PO), shooting and bouncing ray (SBR), and physical theory of diffraction (PTD), a CPU-GPU parallel computing scheme is realized to balance computing tasks. A multi-GPU framework is further proposed to overcome the computational difficulty caused by the massive number of ray tubes in the ray tracing process. Using the established simulation platform, signals of ships in different seas are simulated and their radar images are obtained as well. The results show that higher sea states degrade the averaged peak signal-to-noise ratio (PSNR) of the radar image.
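The PSNR metric mentioned in this abstract follows the standard definition PSNR = 10·log10(peak² / MSE). A minimal Python sketch is given below. The 8-bit peak value of 255, the flattened-list image representation, and the choice to average over a set of image pairs are assumptions, since the abstract does not say how the average was taken.

```python
import math


def psnr(reference, test, peak=255.0):
    """Peak signal-to-noise ratio in dB between two equal-length pixel sequences."""
    if len(reference) != len(test) or not reference:
        raise ValueError("inputs must be non-empty and of equal length")
    mse = sum((r - t) ** 2 for r, t in zip(reference, test)) / len(reference)
    if mse == 0:
        return math.inf  # identical images
    return 10.0 * math.log10(peak ** 2 / mse)


def average_psnr(pairs, peak=255.0):
    """Arithmetic mean of PSNR over (reference, test) pairs (assumed averaging rule)."""
    values = [psnr(r, t, peak) for r, t in pairs]
    return sum(values) / len(values)
```

A lower value means the test image deviates more from the reference, which matches the reported trend of PSNR falling as the sea state rises.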
Funding: supported in part by the Career Catalyst Research Grant from the Susan G. Komen Foundation; the Clinical and Translational Science Pilot Study Award from the National Institutes of Health.
Abstract: Three-dimensional (3D) image reconstruction involves computation over an extensive amount of data, which leads to long processing times. Optimization is therefore needed to improve performance and efficiency. With the widespread use of graphics processing units (GPUs), parallel computing is transforming this arduous reconstruction process for numerous imaging modalities, and photoacoustic computed tomography (PACT) is no exception. Existing works have investigated GPU-based optimization of photoacoustic microscopy (PAM) and PACT reconstruction using compute unified device architecture (CUDA) in either C++ or MATLAB only. Our study, however, is the first to use cross-platform GPU computation. It maintains the simplicity of MATLAB while improving speed through CUDA/C++-based functions converted for MATLAB, called MEXCUDA. Compared with a purely MATLAB-with-GPU approach, our cross-platform method improves speed fivefold. Because MATLAB is widely used in PAM and PACT, this study will open up new avenues for photoacoustic image reconstruction and relevant real-time imaging applications.
Funding: the National Nature Science Foundation of China (Nos. 61601128, 61762053); the Science and Technology Plan Funding of Jiangxi Province of China (No. 20151BBE50076); the Research Foundations of Education Bureau of Jiangxi Province (Nos. GJJ151001, GJJ150984); the Open Project Funding of Key Laboratory of Jiangxi Province for Numerical Simulation and Emulation Techniques, China.
Abstract: Recently, analog visual transmission has attracted considerable attention owing to its graceful performance degradation over various wireless channels. In this study, we propose a novel analog visual communications system, named DVCast, in which an image denoising algorithm is used to fully utilize spatial correlation; moreover, a variable block size Discrete Cosine Transform (DCT) is used to preserve more correlation information in an image. There is a tradeoff between system performance and computing complexity. Therefore, to improve the real-time performance of the proposed system, Block Matching with 3D filtering (BM3D) and the DCT are implemented on Graphics Processing Units (GPUs). According to the DCT block size, i.e., 8×8, 16×16, and 32×32, the schemes DVCast8, DVCast16, and DVCast32, respectively, are designed and implemented. Simulations show that DVCast with a larger block size achieves better gain and visual quality than reference schemes. Moreover, it requires less computing time. DVCast32 outperforms conventional digital schemes by approximately 3.51 dB and achieves a 1.12 dB gain over state-of-the-art reference schemes. Furthermore, the analysis shows that DVCast can reduce overhead by at least 75%.
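The block DCT with a selectable block size can be sketched as follows. This is a minimal pure-Python illustration of the orthonormal 2D DCT-II applied tile by tile; it is not the DVCast GPU implementation, and the function names and the requirement that image dimensions be multiples of the block size are assumptions made for brevity.

```python
import math


def dct_1d(x):
    """Orthonormal DCT-II of a sequence."""
    n = len(x)
    out = []
    for k in range(n):
        scale = math.sqrt(1.0 / n) if k == 0 else math.sqrt(2.0 / n)
        total = sum(v * math.cos(math.pi * (2 * i + 1) * k / (2 * n)) for i, v in enumerate(x))
        out.append(scale * total)
    return out


def dct_2d(block):
    """Separable 2D DCT: transform every row, then every column."""
    rows = [dct_1d(r) for r in block]
    cols = [dct_1d(list(c)) for c in zip(*rows)]
    return [list(r) for r in zip(*cols)]  # transpose back to row-major order


def block_dct(image, size):
    """Apply dct_2d to each size x size tile of a 2D list of pixel values."""
    h, w = len(image), len(image[0])
    if h % size or w % size:
        raise ValueError("image dimensions must be multiples of the block size")
    out = [[0.0] * w for _ in range(h)]
    for y in range(0, h, size):
        for x in range(0, w, size):
            tile = [row[x:x + size] for row in image[y:y + size]]
            for dy, coeff_row in enumerate(dct_2d(tile)):
                out[y + dy][x:x + size] = coeff_row
    return out
```

Calling `block_dct(image, 8)`, `block_dct(image, 16)`, or `block_dct(image, 32)` corresponds to the three block sizes named in the abstract. Tiles are independent of one another, so each can be handled by a separate GPU thread block.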
Abstract: Software for the management of computer-aided design (CAD) drawings was developed. It can be used to classify and register information, including geometric features of the drawings, and to search for appropriate reference drawings that match the given conditions of a new design. Thumbnails of the drawings can be displayed in order of preference and used to zoom in on and access the PDF file or CAD data of a selected drawing. The thumbnails and the data structure are the two key elements of the software.
Abstract: The main problems of traditional software talent training are that the knowledge learned is out of touch with industry, that the ability to solve complex engineering problems is weak, and that graduates' overall quality cannot meet the requirements of enterprises. Taking the course on digital image processing as an example, this paper analyzes the key steps in the training process in detail and proposes a multi-mode collaborative training mechanism for software engineering talent, which brings the key elements of teaching materials, teachers, students, enterprises, and subjects into a unified training mechanism, so as to provide new ideas for software talent teaching.