  1. No Access

    Article

    Improving blocked matrix-matrix multiplication routine by utilizing AVX-512 instructions on Intel Knights Landing and Xeon Scalable processors

    In high-performance computing, the general matrix-matrix multiplication (xGEMM) routine is the core of the Level 3 BLAS kernel for effective matrix-matrix multiplication operations. The performance of parallel...

    Yoosang Park, Raehyun Kim, Thi My Tuyen Nguyen, Jaeyoung Choi in Cluster Computing (2023)