Mechanical and Biomedical Engineering Faculty Publications and Presentations

Scalability of Incompressible Flow Computations on Multi-GPU Clusters Using Dual-Level and Tri-Level Parallelism

Dana A. Jacobsen, Boise State University
Inanc Senocak, Boise State UniversityFollow

Document Type

Conference Proceeding

Publication Date

1-4-2011

Abstract

High performance computing using graphics processing units (GPUs) is gaining popularity in the scientific computing field, with many large compute clusters being augmented with multiple GPUs in each node. We investigate hybrid tri-level (MPI-OpenMP-CUDA) parallel implementations to explore the efficiency and scalability of incompressible flow computations on GPU clusters up to 128 GPUS. This work details some of the unique issues faced when merging fine-grain parallelism on the GPU using CUDA with coarse-grain parallelism using OpenMP for intra-node and MPI for inter-node communication. Comparisons between the tri-level MPI-OpenMP-CUDA and dual-level MPI-CUDA implementations are shown using computationally large computational fluid dynamics (CFD) simulations. Our results demonstrate that a tri-level parallel implementation does not provide a significant advantage in performance over the dual-level implementation, however further research is needed to justify our conclusion for a cluster with a high GPU per node density or when using software that can utilize OpenMP’s fine-grain parallelism more effectively.

Copyright Statement

This document was originally published by American Institute of Aeronautics and Astronautics (AIAA) in 49th AIAA Aerospace Sciences Meeting including the New Horizons Forum and Aerospace Exposition 4 - 7 January 2011, Orlando, Florida. Copyright restrictions may apply.

Publication Information

Jacobsen, Dana A. and Senocak, Inanc. (2011). "Scalability of Incompressible Flow Computations on Multi-GPU Clusters Using Dual-Level and Tri-Level Parallelism". 49th AIAA Aerospace Sciences Meeting including the New Horizons Forum and Aerospace Exposition, 4-7 January 2011, Orlando, Florida, .

Download

Included in

Biomedical Engineering and Bioengineering Commons, Mechanical Engineering Commons

COinS

ScholarWorks

Mechanical and Biomedical Engineering Faculty Publications and Presentations

Scalability of Incompressible Flow Computations on Multi-GPU Clusters Using Dual-Level and Tri-Level Parallelism

Document Type

Publication Date

Abstract

Copyright Statement

Publication Information

Included in

Browse

Links

Search

Author Corner

ScholarWorks

Mechanical and Biomedical Engineering Faculty Publications and Presentations

Scalability of Incompressible Flow Computations on Multi-GPU Clusters Using Dual-Level and Tri-Level Parallelism

Authors

Document Type

Publication Date

Abstract

Copyright Statement

Publication Information

Included in

Share

Browse

Links

Search

Author Corner