-
Article
IterML: Iterative Machine Learning for Intelligent Parameter Pruning and Tuning in Graphics Processing Units
With the rise of graphics processing units (GPUs), the parallel computing community needs better tools to productively extract performance from the GPU. While modern compilers provide flags to activate differe...
-
Article
Open AccessIdentifying multi-hit carcinogenic gene combinations: Scaling up a weighted set cover algorithm using compressed binary matrix representation on a GPU
Despite decades of research, effective treatments for most cancers remain elusive. One reason is that different instances of cancer result from different combinations of multiple genetic mutations (hits). Ther...
-
Article
Open AccessAuthor Correction: Differentiating between cancer and normal tissue samples using multi-hit combinations of genetic mutations
An amendment to this paper has been published and can be accessed via a link at the top of the paper.
-
Article
GPU-Based Iterative Medical CT Image Reconstructions
The algebraic reconstruction technique (ART) is an iterative algorithm for CT (i.e., computed tomography) image reconstruction that delivers better image quality with less radiation dosage than the industry-st...
-
Article
Open AccessDifferentiating between cancer and normal tissue samples using multi-hit combinations of genetic mutations
Cancer is known to result from a combination of a small number of genetic defects. However, the specific combinations of mutations responsible for the vast majority of cancers have not been identified. Current...
-
Article
OpenDwarfs: Characterization of Dwarf-Based Benchmarks on Fixed and Reconfigurable Architectures
The proliferation of heterogeneous computing platforms presents the parallel computing community with new challenges. One such challenge entails evaluating the efficacy of such parallel architectures and ident...
-
Article
Open AccessmuBLASTP: database-indexed protein sequence search on multicore CPUs
The Basic Local Alignment Search Tool (BLAST) is a fundamental program in the life sciences that searches databases for sequences that are most similar to a query sequence. Currently, the BLAST algorithm utili...
-
Chapter and Conference Paper
On the Energy Proportionality of Distributed NoSQL Data Stores
The computing community is facing several big data challenges due to the unprecedented growth in the volume and variety of data. Many large-scale Internet companies use distributed NoSQL data stores to mitigate t...
-
Chapter and Conference Paper
CoreTSAR: Adaptive Worksharing for Heterogeneous Systems
The popularity of heterogeneous computing continues to increase rapidly due to the high peak performance, favorable energy efficiency, and comparatively low cost of accelerators. However, heterogeneous program...
-
Article
GBench: benchmarking methodology for evaluating the energy efficiency of supercomputers
Recent studies point to power consumption becoming the major design constraint in exascale computing systems. Current scientific benchmarks, such as LINPACK, only evaluate high-performance computing (HPC) syst...
-
Article
Performance characterization of data-intensive kernels on AMD Fusion architectures
The cost of data movement over the PCI Express bus is one of the biggest performance bottlenecks for accelerating data-intensive applications on traditional discrete GPU architectures. To address this bottlene...
-
Article
The Green500 list: escapades to exascale
Energy efficiency is now a top priority. The first four years of the Green500 have seen the importance of energy efficiency in supercomputing grow from an afterthought to the forefront of innovation as we appr...
-
Article
Parallel Mining of Neuronal Spike Streams on Graphics Processing Units
Multi-electrode arrays (MEAs) provide dynamic and spatial perspectives into brain function by capturing the temporal behavior of spikes recorded from cultures and living tissue. Understanding the firing patter...
-
Article
Reliable MapReduce computing on opportunistic resources
MapReduce offers an ease-of-use programming paradigm for processing large data sets, making it an attractive model for opportunistic compute resources. However, unlike dedicated resources, where MapReduce has ...
-
Article
Open AccessHigh-performance biocomputing for simulating the spread of contagion over large contact networks
Many important biological problems can be modeled as contagion diffusion processes over interaction networks. This article shows how the EpiSimdemics interaction-based simulation system can be applied to the g...
-
Article
Open AccessMulti-dimensional characterization of electrostatic surface potential computation on graphics processors
Calculating the electrostatic surface potential (ESP) of a biomolecule is critical towards understanding biomolecular function. Because of its quadratic computational complexity (as a function of the number of at...
-
Reference Work Entry In depth
Homology to Sequence Alignment, From
-
Article
Open AccessMissing genes in the annotation of prokaryotic genomes
Protein-coding gene detection in prokaryotic genomes is considered a much simpler problem than in intron-containing eukaryotic genomes. However there have been reports that prokaryotic gene finder programs hav...
-
Chapter and Conference Paper
Making a Case for Proactive Flow Control in Optical Circuit-Switched Networks
Optical circuit-switched networks such as National LambdaRail (NLR) offer dedicated bandwidth to support large-scale bulk data transfer. Though a dedicated circuit-switched network eliminates congestion from t...
-
Chapter and Conference Paper
Effective Dynamic Voltage Scaling Through CPU-Boundedness Detection
Dynamic voltage scaling (DVS) allows a program to execute at a non-peak CPU frequency in order to reduce CPU power, and hence, energy consumption; however, it is oftentimes done at the expense of performance d...