![Loading...](https://link.springer.com/static/c4a417b97a76cc2980e3c25e2271af3129e08bbe/images/pdf-preview/spacer.gif)
-
Article
PartialRC: A Partial Recomputing Method for Efficient Fault Recovery on GPGPUs
GPGPUs are increasingly being used to as performance accelerators for HPC (High Performance Computing) applications in CPU/GPU heterogeneous computing systems, including TianHe-1A, the world's fastest supercom...
-
Article
A Hybrid Circular Queue Method for Iterative Stencil Computations on GPUs
In this paper, we present a hybrid circular queue method that can significantly boost the performance of stencil computations on GPU by carefully balancing usage of registers and shared-memory. Unlike earlier ...
-
Article
PARBLO: Page-Allocation-Based DRAM Row Buffer Locality Optimization
DRAM row buffer conflicts can increase memory access latency significantly. This paper presents a new page-allocation-based optimization that works seamlessly together with some existing hardware and software ...
-
Article
Forword