Accelerating the RTTOV-7 IASI and AMSU-A radiative transfer models on graphics processing units: evaluating central processing unit/graphics processing unit-hybrid and pure-graphics processing unit approaches

Abstract The F-statistic is a detection statistic used widely in searches for continuous gravitational waves with terrestrial, long-baseline interferometers. A new implementation of the F-statistic is presented which accelerates the existing "resampling" algorithm using graphics processing units (GPUs). The new implementation runs between 10 and 100 times faster than the existing implementation on central processing units without sacrificing numerical accuracy. The utility of the GPU implementation is demonstrated on a pilot narrowband search for four newly discovered millisecond pulsars in the globular cluster Omega Centauri using data from the second Laser Interferometer Gravitational-Wave Observatory observing run. The computational cost is 17:2 GPU-hours using the new implementation, compared to 1092 core-hours with the existing implementation.

Download Full-text

Paralelização do Algoritmo Floyd-Warshall usando GPU

10.5753/wscad.2013.16769 ◽

2013 ◽

Author(s):

Roussian R. A. Gaioso ◽

Walid A. R. Jradi ◽

Lauro C. M. de Paula ◽

Wanderley De S. Alencar ◽

Wellington S. Martins ◽

...

Keyword(s):

Graphics Processing Unit ◽

Central Processing Unit ◽

Processing Unit ◽

Central Processing ◽

Graphics Processing

Este artigo apresenta uma implementação paralela baseada em Graphics Processing Unit (GPU) para o problema da identiﬁcação dos caminhos mínimos entre todos os pares de vértices em um grafo. A implementação é baseada no algoritmo Floyd-Warshall e tira o máximo proveito da arquitetura multithreaded das GPUs atuais. Nossa solução reduz a comunicação entre a Central Processing Unit (CPU) e a GPU, melhora a utilização dos Streaming Multiprocessors (SMs) e faz um uso intensivo de acesso aglutinado em memória para otimizar o acesso de dados do grafo. A vantagem da implementação proposta é demonstrada por vários grafos gerados aleatoriamente utilizando a ferramenta GTgraph. Grafos contendo milhares de vértices foram gerados e utilizados nos experimentos. Os resultados mostraram um excelente desempenho em diversos grafos, alcançando ganhos de até 149x, quando comparado com uma implementação sequencial, e superando implementações tradicionais por um fator de quase quatro vezes. Nossos resultados conﬁrmam que implementações baseadas em GPU podem ser viáveis mesmo para algoritmos de grafos cujo acessos à memória e distribuição de trabalho são irregulares e causam dependência de dados.

Download Full-text

Evaluating the computing efficiencies (specificity and sensitivity) of graphics processing unit (GPU)-accelerated DNA sequence alignment tools against central processing unit (CPU) alignment tool

Journal of Bioinformatics and Sequence Analysis ◽

10.5897/jbsa2018.0109 ◽

2018 ◽

Vol 9 (2) ◽

pp. 10-14 ◽

Cited By ~ 1

Author(s):

Pawar Shrikant ◽

Stanam Aditya ◽

Zhu Ying

Keyword(s):

Dna Sequence ◽

Sequence Alignment ◽

Graphics Processing Unit ◽

Central Processing Unit ◽

Processing Unit ◽

Central Processing ◽

Dna Sequence Alignment ◽

Specificity And Sensitivity ◽

Alignment Tool ◽

Graphics Processing

Download Full-text

Power analysis for decoding of the digital audio encoding format MP3: Decoding the central processing unit and the graphics processing unit.

The Journal of the Acoustical Society of America ◽

10.1121/1.3385354 ◽

2010 ◽

Vol 127 (3) ◽

pp. 2037-2037

Author(s):

Seung Gu Kang ◽

Pil Joung Sun ◽

Cheol Hong Kim ◽

Jong‐Myon Kim

Keyword(s):

Power Analysis ◽

Graphics Processing Unit ◽

Central Processing Unit ◽

Processing Unit ◽

Digital Audio ◽

Central Processing ◽

Graphics Processing

Download Full-text

Ray-based modeling and imaging in viscoelastic media using graphics processing units

Geophysics ◽

10.1190/geo2018-0510.1 ◽

2019 ◽

Vol 84 (5) ◽

pp. S425-S436

Author(s):

Martin Sarajaervi ◽

Henk Keers

Keyword(s):

Seismic Data ◽

Graphics Processing Units ◽

Graphics Processing Unit ◽

Parallel Implementation ◽

Processing Unit ◽

Central Processing ◽

Imaging Results ◽

Viscoelastic Modeling ◽

Graphics Processing ◽

Complex Valued

In seismic data processing, the amplitude loss caused by attenuation should be taken into account. The basis for this is provided by a 3D attenuation model described by the quality factor [Formula: see text], which is used in viscoelastic modeling and imaging. We have accomplished viscoelastic modeling and imaging using ray theory and the ray-Born approximation. This makes it possible to take [Formula: see text] into account using complex-valued and frequency-dependent traveltimes. We have developed a unified parallel implementation for modeling and imaging in the frequency domain and carried out the numerical integration on a graphics processing unit. A central part of the implementation is an efficient technique for computing large integrals. We applied the integration method to the 3D SEG/EAGE overthrust model to generate synthetic seismograms and imaging results. The attenuation effects are accurately modeled in the seismograms and compensated for in the imaging algorithm. The results indicate a significant improvement in computational efficiency compared to a parallel central processing unit baseline.

Download Full-text

GPU accelerated computation of fast spectral transforms

Facta universitatis - series Electronics and Energetics ◽

10.2298/fuee1103483g ◽

2011 ◽

Vol 24 (3) ◽

pp. 483-499

Author(s):

Dusan Gajic ◽

Radomir Stankovic

Keyword(s):

Graphics Processing Units ◽

Fast Algorithms ◽

Central Processing Unit ◽

Processing Unit ◽

Memory Transfer ◽

Simple Arithmetic ◽

Central Processing ◽

Graphics Processing ◽

Spectral Transforms ◽

Gpu Implementation

This paper discusses techniques for accelerated computation of several fast spectral transforms on graphics processing units (GPUs) using the Open Computing Language (OpenCL). We present a reformulation of fast algorithms which takes into account peculiar properties of transforms to make them suitable for the GPU implementation. A special attention is paid to the organization of computations, memory transfer reductions, impact of integer and Boolean arithmetic, different structure of algorithms, etc. Performance of the GPU implementations is compared with the classical C/C++ implementations for the central processing unit (CPU). Experiments confirm that, even though the spectral transforms considered involve only simple arithmetic, significant speedups are achieved by implementing the algorithms in OpenCL and performing them on the GPU.

Download Full-text

History and Evolution of GPU Architecture

Advances in Systems Analysis, Software Engineering, and High Performance Computing - Emerging Research Surrounding Power Consumption and Performance Issues in Utility Computing ◽

10.4018/978-1-4666-8853-7.ch006 ◽

2016 ◽

pp. 109-135

Author(s):

Prashanta Kumar Das ◽

Ganesh Chandra Deka

Keyword(s):

Image Processing ◽

Graphics Processing Unit ◽

Hardware Architecture ◽

Central Processing Unit ◽

Processing Unit ◽

3D Image ◽

Central Processing ◽

Graphics Processing ◽

Graphics Engine ◽

Gpu Architecture

The Graphics Processing Unit (GPU) is a specialized and highly parallel microprocessor designed to offload 2D/3D image from the Central Processing Unit (CPU) to expedite image processing. The modern GPU is not only a powerful graphics engine, but also a parallel programmable processor with high precision and powerful features. It is forcasted that by 2020, 48 Core GPU will be available while by 2030 GPU with 3000 core is likely to be available.This chapter describes the chronology of evolution of GPU hardware architecture and the future ahead.

Download Full-text

Ortho-Rectification of Hyperspectral Camera Data with Central Processing Unit and Graphics Processing Unit

2019 9th International Conference on Recent Advances in Space Technologies (RAST) ◽

10.1109/rast.2019.8767856 ◽

2019 ◽

Author(s):

Yunus Emre Esin ◽

Berkan Demirel ◽

Omer Ozdil ◽

Safak Ozturk

Keyword(s):

Graphics Processing Unit ◽

Central Processing Unit ◽

Processing Unit ◽

Central Processing ◽

Graphics Processing ◽

Hyperspectral Camera

Download Full-text

Acceleration of synthetic aperture radar imaging via subaperture chirp-scaling approach based on heterogeneous graphics-processing-unit–central-processing-unit architecture

Journal of Applied Remote Sensing ◽

10.1117/1.jrs.9.095083 ◽

2015 ◽

Vol 9 (1) ◽

pp. 095083

Author(s):

Yabo Liu ◽

Hongyu Li ◽

Zheng Wu ◽

Yunkai Deng ◽

Robert Wang

Keyword(s):

Synthetic Aperture Radar ◽

Graphics Processing Unit ◽

Radar Imaging ◽

Central Processing Unit ◽

Synthetic Aperture ◽

Processing Unit ◽

Central Processing ◽

Graphics Processing ◽

Aperture Radar

Download Full-text

CENTRAL PROCESSING UNIT-GRAPHICS PROCESSING UNIT COMPUTING SCHEME FOR MULTI-OBJECT TRACKING IN SURVEILLANCE

Asian Journal of Pharmaceutical and Clinical Research ◽

10.22159/ajpcr.2017.v10s1.19651 ◽

2017 ◽

Vol 10 (13) ◽

pp. 251

Author(s):

Ankush Rai ◽

Jagadeesh Kannan R

Keyword(s):

Gpu Computing ◽

Graphics Processing Unit ◽

Research Work ◽

Central Processing Unit ◽

Processing Unit ◽

Minimal Processing ◽

Central Processing ◽

Gpu Processing ◽

Graphics Processing ◽

Computing Scheme

This research work presents a novel central processing unit-graphics processing unit (CPU-GPU) computing scheme for multiple object trackingduring a surveillance operation. This facilitates nonlinear computational jobs to avail completion of computation in minimal processing time for tracking function. The work is divided into two essential objectives. First is to dynamically divide the processing operations into parallel units, and second is to reduce the communication between CPU-GPU processing units.

Download Full-text