Accelerating 2-Dimensional CFD on Multi-GPU Supercomputer

Li, Sen; Li, **nliang; Wang, Long; Lu, Zhonghua; Chi, Xuebin

doi:10.1007/978-3-642-16405-7_27

Sen Li⁷,
**nliang Li⁷,
Long Wang⁷,
Zhonghua Lu⁷ &
…
Xuebin Chi⁷

Part of the book series: Lecture Notes in Earth System Sciences ((LNESS))

2830 Accesses

Abstract

In this paper, we describe the domain decomposing strategy of finite-difference to implement and optimize GPU codes in solving 2-D N-S equations. To satisfy GPU architecture, our algorithms emphasize on the decomposition strategy and the maximum of exploiting the GPU memory hierarchy so that high rate of speedup can be expected. Tests on two CFD cases, respectively being cavity flow and aerofoil RAE 2822, are used. For cavity flow, we ran our simulation both on CUDA and OpenCL platform and witnessed 30–60x speedup. In aerofoil, we used 6–60 GPU devices and get speedup of 5–29 times depending on the grid size and number of devices used.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Log in via an institution

Subscribe and save

Springer+ Basic

EUR 32.99 /Month

Get 10 units per month
Download Article/Chapter or Ebook
1 Unit = 1 Article or 1 Chapter
Cancel anytime

Subscribe now

Buy Now

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 84.99; Price excludes VAT (USA)

Softcover Book: USD 109.99; Price excludes VAT (USA)

Hardcover Book: USD 109.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Multi GPU Implementation to Accelerate the CFD Simulation of a 3D Turbo-Machinery Benchmark Using the RapidCFD Library

Accelerating CFD simulation with high order finite difference method on curvilinear coordinates for modern GPU clusters

Article Open access 08 February 2022

Parallelizing a High-Order CFD Software for 3D, Multi-block, Structural Grids on the TianHe-1A Supercomputer

References

Anderson WK, Bonhaus DL (2009) Airfoil design on unstructured grids for turbulent flows. AIAA J 37(2):185–191
Article Google Scholar
Baldwin BS, Lomax H (1978) Thin layer approximation and algebraic model for separated turbulent Flows. AIAA 78–257.
Google Scholar
Brandvik T, Pullan G (2007) Acceleration of a two-dimensional euler flow solver using commodity graphics hardware. J Proc Inst Mech Eng Part C: J Mech Eng Sci 221:1745–1748
Article Google Scholar
Jespersen DC (2009) Acceleration of a CFD code with a GPU. NAS Technical report NAS-09-003.
Google Scholar
Toro EF (1999) Riemann solvers and numerical methods for fluid dynamics-a practical introduction. Springer, Berlin
MATH Google Scholar
http://www.grc.nasa.gov/www./wind/valid/raetaf/raetaf.html
Sanders J, Kandrot E (2011) CUDA by example: an introduction to general purpose GPU programming. Addison-Wesley, Boston
Google Scholar
Khronos openCL working group (2008) The openCL specication, V1.0.
Google Scholar
NVIDIA Corporation (2007) Compute unified device architecture programming guide. http://www.nvidia.com
Tingxing D, **nliang L, Sen L (2010) Acceleration of computational fluid dynamic codes on GPU. In: 8th Asian computational fluid dynamics conference.
Google Scholar

Download references

Author information

Authors and Affiliations

Supercomputing center, Computer Network Information Center, Chinese Academy of Sciences, Bei**g, China
Sen Li, **nliang Li, Long Wang, Zhonghua Lu & Xuebin Chi

Authors

Sen Li
View author publications
You can also search for this author in PubMed Google Scholar
**nliang Li
View author publications
You can also search for this author in PubMed Google Scholar
Long Wang
View author publications
You can also search for this author in PubMed Google Scholar
Zhonghua Lu
View author publications
You can also search for this author in PubMed Google Scholar
Xuebin Chi
View author publications
You can also search for this author in PubMed Google Scholar

Editor information

Editors and Affiliations

University of Minnesota, Dep. of Earth Sciences and Minnesota, Supercomputing Institute, Pillsbury Hall 23, Minneapolis, 55455, Minnesota, USA
David A. Yuen
Network Information Center, Comuter Center and Computer, Zhong Guan Cun 4, Bei**g, 100190, China, People's Republic
Long Wang
Supercomputing Center, Zhong Guan Cun 4, Bei**g, 100190, China, People's Republic
Xuebin Chi
, Computer Science, University of Houston, Calhoun Street 4800, Houston, 77204, Texas, USA
Lennart Johnsson
Inst. Process Engineering (IPE), Chinese Academy of Sciences, Zhongguancun North Second Street 1, Bei**g, 100190, China, People's Republic
Wei Ge
, Laboratory of Computational Geodynamics,, Chinese Academy of Sciences, Yu Quan Lu 19a, Bei**g, 100049, China, People's Republic
Yaolin Shi

Appendix A

In this appendix, the code shows how GPU devices are assigned to different MPI processors. To implement domain decomposition strategy, GPU devices must be assigned in continuous numbers in x and y axis so that we can dispatch tasks according to their position.

Rights and permissions

Reprints and permissions

Copyright information

About this chapter

Cite this chapter

Li, S., Li, X., Wang, L., Lu, Z., Chi, X. (2013). Accelerating 2-Dimensional CFD on Multi-GPU Supercomputer. In: Yuen, D., Wang, L., Chi, X., Johnsson, L., Ge, W., Shi, Y. (eds) GPU Solutions to Multi-scale Problems in Science and Engineering. Lecture Notes in Earth System Sciences. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-16405-7_27

Download citation

DOI: https://doi.org/10.1007/978-3-642-16405-7_27
Published: 09 January 2013
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-642-16404-0
Online ISBN: 978-3-642-16405-7
eBook Packages: Earth and Environmental ScienceEarth and Environmental Science (R0)

Publish with us

Policies and ethics

Accelerating 2-Dimensional CFD on Multi-GPU Supercomputer

Abstract

Access this chapter

Subscribe and save

Buy Now

Similar content being viewed by others

Multi GPU Implementation to Accelerate the CFD Simulation of a 3D Turbo-Machinery Benchmark Using the RapidCFD Library

Accelerating CFD simulation with high order finite difference method on curvilinear coordinates for modern GPU clusters

Parallelizing a High-Order CFD Software for 3D, Multi-block, Structural Grids on the TianHe-1A Supercomputer

References

Author information

Authors and Affiliations

Editor information

Editors and Affiliations

Appendix A

Rights and permissions

Copyright information

About this chapter

Cite this chapter

Download citation

Publish with us

Subscribe and save

Buy Now

Navigation

Accelerating 2-Dimensional CFD on Multi-GPU Supercomputer

Abstract

Access this chapter

Subscribe and save

Buy Now

Similar content being viewed by others

Multi GPU Implementation to Accelerate the CFD Simulation of a 3D Turbo-Machinery Benchmark Using the RapidCFD Library

Accelerating CFD simulation with high order finite difference method on curvilinear coordinates for modern GPU clusters

Parallelizing a High-Order CFD Software for 3D, Multi-block, Structural Grids on the TianHe-1A Supercomputer

References

Author information

Authors and Affiliations

Editor information

Editors and Affiliations

Appendix A

Appendix A

Rights and permissions

Copyright information

About this chapter

Cite this chapter

Download citation

Share this chapter

Publish with us

Search

Navigation