Skip to main content

and
  1. No Access

    Chapter and Conference Paper

    An Application of Constraint Programming to Superblock Instruction Scheduling

    Modern computer architectures have complex features that can only be fully taken advantage of if the compiler schedules the compiled code. A standard region of code for scheduling in an optimizing compiler is ...

    Abid M. Malik, Michael Chase, Tyrel Russell in Principles and Practice of Constraint Prog… (2008)

  2. No Access

    Chapter and Conference Paper

    Automatic Static Feature Generation for Compiler Optimization Problems

    Modern compilers have many optimization passes which help to get a better binary code for a given program. These optimizations are NP-hard. People use different heuristics to get a near optimal solution. These...

    Abid M. Malik in AI 2011: Advances in Artificial Intelligence (2011)

  3. No Access

    Chapter and Conference Paper

    False Sharing Detection in OpenMP Applications Using OMPT API

    Writing a parallel shared memory application that scales well on the future multi-core processors is a challenging task. The contention among shared resources increases as the number of threads increases. This...

    Millad Ghane, Abid M. Malik, Barbara Chapman in OpenMP: Heterogenous Execution and Data Mo… (2015)

  4. No Access

    Chapter and Conference Paper

    Data Transfer and Reuse Analysis Tool for GPU-Offloading Using OpenMP

    In the high performance computing sector, researchers and application developers expend considerable effort to port their applications to GPU-based clusters in order to take advantage of the massive parallelis...

    Alok Mishra, Abid M. Malik, Barbara Chapman in OpenMP: Portable Multi-Level Parallelism o… (2020)

  5. No Access

    Chapter and Conference Paper

    Toward Supporting Multi-GPU Targets via Taskloop and User-Defined Schedules

    Many modern supercomputers such as ORNL’s Summit, LLNL’s Sierra, and LBL’s upcoming Perlmutter offer or will offer multiple, e.g., 4 to 8, GPUs per node for running computational science and engineering applic...

    Vivek Kale, Wenbin Lu, Anthony Curtis in OpenMP: Portable Multi-Level Parallelism o… (2020)

  6. No Access

    Chapter and Conference Paper

    Towards Efficient Remote OpenMP Offloading

    On modern heterogeneous HPC systems, the most popular way to realize distributed computation is the hybrid programming model of MPI+X (X being OpenMP/CUDA/etc.), as it has been proven to perform well with vari...

    Wenbin Lu, Baodi Shan, Eric Raut, Jie Meng in OpenMP in a Modern World: From Multi-devic… (2022)

  7. No Access

    Chapter and Conference Paper

    OpenMP Advisor: A Compiler Tool for Heterogeneous Architectures

    With the increasing diversity of heterogeneous architecture in the HPC industry, porting a legacy application to run on different architectures is a tough challenge. In this paper, we present OpenMP Advisor, a...

    Alok Mishra, Abid M. Malik, Meifeng Lin in OpenMP: Advanced Task-Based, Device and Co… (2023)