Skip to main content

and
  1. No Access

    Chapter and Conference Paper

    Data Transfer and Reuse Analysis Tool for GPU-Offloading Using OpenMP

    In the high performance computing sector, researchers and application developers expend considerable effort to port their applications to GPU-based clusters in order to take advantage of the massive parallelis...

    Alok Mishra, Abid M. Malik, Barbara Chapman in OpenMP: Portable Multi-Level Parallelism o… (2020)

  2. No Access

    Chapter and Conference Paper

    Toward Supporting Multi-GPU Targets via Taskloop and User-Defined Schedules

    Many modern supercomputers such as ORNL’s Summit, LLNL’s Sierra, and LBL’s upcoming Perlmutter offer or will offer multiple, e.g., 4 to 8, GPUs per node for running computational science and engineering applic...

    Vivek Kale, Wenbin Lu, Anthony Curtis in OpenMP: Portable Multi-Level Parallelism o… (2020)

  3. No Access

    Chapter and Conference Paper

    An Application of Constraint Programming to Superblock Instruction Scheduling

    Modern computer architectures have complex features that can only be fully taken advantage of if the compiler schedules the compiled code. A standard region of code for scheduling in an optimizing compiler is ...

    Abid M. Malik, Michael Chase, Tyrel Russell in Principles and Practice of Constraint Prog… (2008)