Search Results - Springer

Sort By Newest First Oldest First

Chapter and Conference Paper

New Opportunities for Compilers in Computer Security

Compiler techniques have been deployed to prevent various security attacks. Examples include mitigating memory access corruption, control flow integrity checks, race detection, software diversity, etc.

Junjie Shen, Zhi Chen, Nahid Farhady Ghalaty… in Languages and Compilers for Parallel Compu… (2019)
Chapter and Conference Paper

Towards an Achievable Performance for the Loop Nests

Numerous code optimization techniques, including loop nest optimizations, have been developed over the last four decades. Loop optimization techniques transform loop nests to improve the performance of the cod...

Aniket Shivam, Neftali Watkinson… in Languages and Compilers for Parallel Compu… (2019)
Chapter and Conference Paper

Using Hardware Counters to Predict Vectorization

Vectorization is the process of transforming the scalar implementation of an algorithm into vector form. This transformation aims to benefit from parallelism through the generation of microprocessor vector ins...

Neftali Watkinson, Aniket Shivam, Zhi Chen… in Languages and Compilers for Parallel Compu… (2019)
Chapter and Conference Paper

Polygonal Iteration Space Partitioning

This work presents a new set of loop transformations to expose and maximize data locality in loop-nests with non-uniform reuse patterns. The proposed set of transformations use the norms of the Polyhedral Mode...

Aniket Shivam, Alexandru Nicolau… in Languages and Compilers for Parallel Compu… (2017)
Book

Instruction Level Parallelism

Alex Aiken, Utpal Banerjee, Arun Kejariwal… (2016)
Chapter

Introduction

This introductory chapter discusses the role of instruction level parallelism (ILP) in optimizing compilers and in machine architectures that automatically reorder or parallelize programs. A brief overview of ...

Alex Aiken, Utpal Banerjee, Arun Kejariwal… in Instruction Level Parallelism (2016)
Chapter

Percolation Scheduling

Trace scheduling suffers from a number of problems related to its focus on a single trace at a time. Percolation scheduling overcomes these problems, to the extent possible at compile time, by providing a smal...

Alex Aiken, Utpal Banerjee, Arun Kejariwal… in Instruction Level Parallelism (2016)
Chapter

Epilogue

This book focuses on compiler-managed instruction level parallelism. While a great deal of the work on this topic has been only touched upon or mentioned only in references, the book does cover all of the majo...

Alex Aiken, Utpal Banerjee, Arun Kejariwal… in Instruction Level Parallelism (2016)
Chapter

Scheduling Basic Blocks

A basic block in a program is a sequence of consecutive operations, such that control flow enters at the beginning and leaves at the end without internal branches. While basic block scheduling is the simplest ...

Alex Aiken, Utpal Banerjee, Arun Kejariwal… in Instruction Level Parallelism (2016)
Chapter

Software Pipelining by Kernel Recognition

Kernel recognition techniques avoid the search for an appropriate initiation interval by dealing directly with a representation of the unrolled loop and its compaction. Intuitively, kernel recognition tries to...

Alex Aiken, Utpal Banerjee, Arun Kejariwal… in Instruction Level Parallelism (2016)
Chapter

Overview of ILP Architectures

In this chapter we trace the history of computer architecture, focusing on the evolution of techniques for instruction-level parallelism. After briefly summarizing the early years of machine design, we focus o...

Alex Aiken, Utpal Banerjee, Arun Kejariwal… in Instruction Level Parallelism (2016)
Chapter

Trace Scheduling

Since its introduction by Joseph A. Fisher in 1979, trace scheduling has influenced much of the work on compile-time ILP. Initially developed for use in microcode compaction, trace scheduling quickly became th...

Alex Aiken, Utpal Banerjee, Arun Kejariwal… in Instruction Level Parallelism (2016)
Chapter

Modulo Scheduling

Loop parallelization, and particularly the parallelization of innermost loops, is the most critical aspect of any parallelizing compiler. Trace scheduling can be applied to loops, but has the disadvantage that...

Alex Aiken, Utpal Banerjee, Arun Kejariwal… in Instruction Level Parallelism (2016)
Chapter and Conference Paper

A Compilation and Run-Time Framework for Maximizing Performance of Self-scheduling Algorithms

Ordinary programs contain many parallel loops which account for a significant portion of these programs’ completion time. The parallel executions of such loops can significantly speedup performance of modern m...

Yizhuo Wang, Laleh Aghababaie Beni, Alexandru Nicolau… in Network and Parallel Computing (2014)

Download PDF (1417 KB)
Chapter and Conference Paper

Just in Time Load Balancing

Leveraging Loop Level Parallelism (LLP) is one of the most attractive techniques for improving program performance on emerging multi-cores. Ordinary programs contain a large amount of parallel and DOALL loops,...

Rosario Cammarota, Alexandru Nicolau… in Languages and Compilers for Parallel Compu… (2013)
Chapter and Conference Paper

Optimizing Program Performance via Similarity, Using a Feature-Agnostic Approach

This work proposes a new technique for performance evaluation to predict performance of parallel programs across diverse and complex systems. In this work the term system is comprehensive of the hardware organ...

Rosario Cammarota, Laleh Aghababaie Beni… in Advanced Parallel Processing Technologies (2013)
Chapter and Conference Paper

On the Determination of Inlining Vectors for Program Optimization

In this paper we propose a new technique and a framework to select inlining heuristic constraints - referred to as an inlining vector, for program optimization. The proposed technique uses machine learning to mod...

Rosario Cammarota, Alexandru Nicolau, Alexander V. Veidenbaum… in Compiler Construction (2013)

Download PDF (725 KB)
Chapter and Conference Paper

How Many Threads to Spawn during Program Multithreading?

Thread-level program parallelization is key for exploiting the hardware parallelism of the emerging multi-core systems. Several techniques have been proposed for program multithreading. However, the existing t...

Alexandru Nicolau, Arun Kejariwal in Languages and Compilers for Parallel Computing (2011)
Chapter and Conference Paper

Performance Characterization of Itanium^® 2-Based Montecito Processor

This paper presents the performance characteristics of the Intel^®Itanium^®2-based Montecito processor and compares its performance to the previous generation Madison processor. Measurements on both are done using ...

Darshan Desai, Gerolf F. Hoflehner… in Computer Performance Evaluation and Benchm… (2009)
Chapter and Conference Paper

Using Recursion to Boost ATLAS’s Performance

We investigate the performance benefits of a novel recursive formulation of Strassen’s algorithm over highly tuned matrix-multiply (MM) routines, such as the widely used ATLAS for high-performance systems.

Paolo D’Alberto, Alexandru Nicolau in High-Performance Computing (2008)

54 Result(s)

New Opportunities for Compilers in Computer Security

Towards an Achievable Performance for the Loop Nests

Using Hardware Counters to Predict Vectorization

Polygonal Iteration Space Partitioning

Instruction Level Parallelism

Introduction

Percolation Scheduling

Epilogue

Scheduling Basic Blocks

Software Pipelining by Kernel Recognition

Overview of ILP Architectures

Trace Scheduling

Modulo Scheduling

A Compilation and Run-Time Framework for Maximizing Performance of Self-scheduling Algorithms

Just in Time Load Balancing

Optimizing Program Performance via Similarity, Using a Feature-Agnostic Approach

On the Determination of Inlining Vectors for Program Optimization

How Many Threads to Spawn during Program Multithreading?

Performance Characterization of Itanium^® 2-Based Montecito Processor

Using Recursion to Boost ATLAS’s Performance

Our Content

Other Sites

Help & Contacts