22 Result(s)

within Chapter Kazuhiko Komatsu


Chapter and Conference Paper

An Asymptotic Parallel Linear Solver and Its Application to Direct Numerical Simulation for Compressible Turbulence

When numerically solving partial differential equations such as the Navier-Stokes equations, higher-order finite difference schemes are occasionally applied for spatial discretization. Compact finite differenc...

Mitsuo Yokokawa, Taiki Matsumoto, Ryo Takegami… in Computational Science – ICCS 2024 (2024)
Chapter and Conference Paper

File I/O Cache Performance of Supercomputer Fugaku Using an Out-of-Core Direct Numerical Simulation Code of Turbulence

Turbulent flows play important roles in many flow-related phenomena that appear in various fields. However, despite numerous studies on turbulence, the nature of turbulence has not yet been fully clarified. Di...

Yuto Hatanaka, Yuki Yamane, Kenta Yamaguchi… in Computational Science – ICCS 2024 (2024)
Chapter and Conference Paper

Analysis of Precision Vectors for Ising-Based Linear Regression

Quantum computing has attracted much attention as one of the new computational principles. In particular, annealing machines that use the Ising model of statistical mechanics are emerging and feasible next-generati...

Kaho Aoyama, Kazuhiko Komatsu… in Parallel and Distributed Computing, Applic… (2023)
Chapter and Conference Paper

A Partitioned Memory Architecture with Prefetching for Efficient Video Encoders

A hardware video encoder based on recent video coding standards such as HEVC and VVC needs to efficiently handle a massive number of memory accesses to search motion vectors. To this end, first, this paper pre...

Masayuki Sato, Yuya Omori, Ryusuke Egawa… in Parallel and Distributed Computing, Applic… (2023)
Chapter and Conference Paper

Optimizations of DNS Codes for Turbulence on SX-Aurora TSUBASA

Direct numerical simulations (DNSs) of incompressible turbulence have been performed since the late 1960s, but simulations that reproduce strongly nonlinear turbulent flows as in the real-world have not been r...

Yujiro Takenaka, Mitsuo Yokokawa… in Sustained Simulation Performance 2019 and … (2021)
Chapter and Conference Paper

A Deep Reinforcement Learning Based Feature Selector

In the field of data mining and machine learning, it is a challenge for researchers and engineers to analyze and classify the high-dimensional data. In order to minimize the classification error, it is critica...

Yiran Cheng, Kazuhiko Komatsu, Masayuki Sato… in Parallel Architectures, Algorithms and Pro… (2021)
Chapter and Conference Paper

A Dynamic Parameter Tuning Method for High Performance SpMM

Sparse matrix-matrix multiplication (SpMM) is a basic kernel that is used by many algorithms. Several studies focus on various optimizations for SpMM parallel execution. However, a division of a task for pa...

Bin Qi, Kazuhiko Komatsu, Masayuki Sato… in Parallel and Distributed Computing, Applic… (2021)
Chapter and Conference Paper

Optimization of the Himeno Benchmark for SX-Aurora TSUBASA

This paper focuses on optimizing the Himeno benchmark for the vector computing system SX-Aurora TSUBASA and analyzes its performance in detail. The Vector Engine (VE) of SX-Aurora TSUBASA achieves a high memor...

Akito Onodera, Kazuhiko Komatsu, Soya Fujimoto… in Benchmarking, Measuring, and Optimizing (2021)
Chapter and Conference Paper

Performance Evaluation of SX-Aurora TSUBASA and Its QA-Assisted Application Design

In this article, we present an overview of our ongoing project entitled R&D of a Quantum-Annealing Assisted Next Generation HPC Infrastructure and its Applications. We describe our system design concept of a n...

Hiroaki Kobayashi, Kazuhiko Komatsu in Sustained Simulation Performance 2019 and 2020 (2021)
Chapter and Conference Paper

Performance Evaluation of SX-Aurora TSUBASA by Using Benchmark Programs

This paper evaluates the basic performance of the latest vector supercomputer, SX-Aurora TSUBASA, in order to clarify its potential. First, the memory bandwidth, which is one of the features of SX-Aurora TSUBA...

Kazuhiko Komatsu, Hiroaki Kobayashi in Sustained Simulation Performance 2018 and 2019 (2020)
Chapter and Conference Paper

Developing an Efficient Vector-Friendly Implementation of the Breadth-First Search Algorithm for NEC SX-Aurora TSUBASA

Breadth-First Search (BFS) is an important computational kernel used as a building-block for many other graph algorithms. Different algorithms and implementation approaches aimed to solve the BFS problem have ...

Ilya V. Afanasyev, Vladimir V. Voevodin… in Parallel Computational Technologies (2020)
Chapter and Conference Paper

Performance Evaluation of Tsunami Inundation Simulation on SX-Aurora TSUBASA

As tsunamis may cause damage over a wide area, it is difficult to imme...

Akihiro Musa, Takashi Abe, Takumi Kishitani… in Computational Science – ICCS 2019 (2019)

Chapter and Conference Paper

Analysis of Relationship Between SIMD-Processing Features Used in NVIDIA GPUs and NEC SX-Aurora TSUBASA Vector Processors

This paper presents a comprehensive analysis of the main SIMD-processing features and computational characteristics of three high performance architectures: two NVIDIA GPU architectures (of Pascal and Volta generati...

Ilya V. Afanasyev, Vadim V. Voevodin… in Parallel Computing Technologies (2019)
Chapter and Conference Paper

Directive Translation for Various HPC Systems Using the Xevolver Framework

This paper proposes a directive translation approach that translates a special placeholder to different directives, depending on the target HPC system. The special placeholder in an application code is used as...

Kazuhiko Komatsu, Ryusuke Egawa, Hiroyuki Takizawa… in Sustained Simulation Performance 2016 (2016)
Chapter and Conference Paper

Performance Evaluation of an OpenMP Parallelization by Using Automatic Parallelization Information

To exploit the potential of many core processors, a serial code is generally optimized for a particular compiler called a target compiler, so that the compiler can understand the code structure for automatic p...

Kazuhiko Komatsu, Ryusuke Egawa, Hiroyuki Takizawa… in Sustained Simulation Performance 2014 (2015)
Chapter and Conference Paper

Code Optimization Activities Toward a High Sustained Simulation Performance

The rapid evolution of HPC systems brings us high computational capability. However, it is getting harder to exploit the potential of an HPC system due to the increases in system complexity. To create significant...

Ryusuke Egawa, Kazuhiko Komatsu, Hiroaki Kobayashi in Sustained Simulation Performance 2015 (2015)
Chapter and Conference Paper

Designing an HPC Refactoring Catalog Toward the Exa-scale Computing Era

Aiming at keeping performance portability of practical applications even in the exascale computing era, this paper presents a concept of HPC refactoring, and the design of the HPC refactoring catalog. The HPC ...

Ryusuke Egawa, Kazuhiko Komatsu, Hiroaki Kobayashi in Sustained Simulation Performance 2014 (2015)
Chapter and Conference Paper

Performance Evaluation of Compiler-Assisted OpenMP Codes on Various HPC Systems

As automatic parallelization functions are different among compilers, a serial code is often modified so that a particular target compiler can easily understand its code structure and data dependency, resultin...

Kazuhiko Komatsu, Ryusuke Egawa, Hiroyuki Takizawa… in Sustained Simulation Performance 2015 (2015)
Chapter and Conference Paper

A Compiler-Assisted OpenMP Migration Method Based on Automatic Parallelizing Information

Performance of a serial code often relies on compilers’ capabilities for automatic parallelization. In such a case, the performance is not portable to a new system because a new compiler on the new system may ...

Kazuhiko Komatsu, Ryusuke Egawa, Hiroyuki Takizawa, Hiroaki Kobayashi in Supercomputing (2014)
Chapter and Conference Paper

Analysing the Performance Improvements of Optimizations on Modern HPC Systems

Recently, there are many types of supercomputing systems being equipped with vector processors, scalar processors, and accelerators as processing elements of the systems. Although all kinds of calculations can...

Kazuhiko Komatsu, Toshihide Sasaki, Ryusuke Egawa… in Sustained Simulation Performance 2013 (2013)
