-
Chapter and Conference Paper
Performance Evaluation of Compiler-Assisted OpenMP Codes on Various HPC Systems
As automatic parallelization functions are different among compilers, a serial code is often modified so that a particular target compiler can easily understand its code structure and data dependency, resultin...
-
Chapter and Conference Paper
A Compiler-Assisted OpenMP Migration Method Based on Automatic Parallelizing Information
Performance of a serial code often relies on compilers’ capabilities for automatic parallelization. In such a case, the performance is not portable to a new system because a new compiler on the new system may ...
-
Chapter and Conference Paper
Analysing the Performance Improvements of Optimizations on Modern HPC Systems
Recently, there are many types of supercomputing systems being equipped with vector processors, scalar processors, and accelerators as processing elements of the systems. Although all kinds of calculations can...
-
Chapter and Conference Paper
Performance Evaluation of a Next-Generation CFD on Various Supercomputing Systems
The Building-Cube Method (BCM) has been proposed as a new CFD method for an efficient three-dimensional flow simulation on large-scale supercomputing systems, and is based on equally-spaced Cartesian meshes. A...
-
Chapter
Automatic Tuning of CUDA Execution Parameters for Stencil Processing
Recently, Compute Unified Device Architecture (CUDA) has enabled Graphics Processing Units (GPUs) to accelerate various applications. However, to exploit the GPU’s computing power fully, a programmer has to ca...