Search Results
-
Accelerating neural network architecture search using multi-GPU high-performance computing
Neural networks stand out from artificial intelligence because they can complete challenging tasks, such as image classification. However, designing...
-
Balancing of Web Applications Workload Using Hybrid Computing (CPU–GPU) Architecture
The current network architecture does not properly manage the load of web applications or keep track of user-web application interaction. There are...
-
Computing large 2D convolutions on GPU efficiently with the im2tensor algorithm
Attaining the best possible throughput when computing convolutions is a challenge for signal and image processing systems, be they HPC...
-
Hybridhadoop: CPU-GPU hybrid scheduling in hadoop
As a GPU has become an essential component in high performance computing, it has been attempted by many works to leverage GPU computing in Hadoop....
-
A parallel acceleration GPU algorithm for large deformation of thin shell structures based on peridynamics
Loaded shell structures may deform, rotate, and crack, leading to fracture. The traditional finite element method describes material internal forces...
-
SLO-Aware DL Job Scheduling for Efficient FPGA-GPU Edge Cloud Computing
Deep learning applications have become increasingly popular in recent years, leading to the development of specialized hardware accelerators such as...
-
GPU-based butterfly counting
When dealing with large bipartite graphs, butterfly counting is a crucial and time-consuming operation. Graphics processing units (GPUs) are widely...
-
A GPU-enabled acceleration algorithm for the CAM5 cloud microphysics scheme
The National Center for Atmospheric Research released a global atmosphere model named Community Atmosphere Model version 5.0 (CAM5), which aimed to...
-
OpenMP offload toward the exascale using Intel® GPU Max 1550: evaluation of STREAmS compressible solver
Nearly 20 years after the birth of general-purpose GPU computing, the HPC landscape is now dominated by GPUs. After years of undisputed dominance by...
-
A novel parallel mammogram sharpening framework using modified Laplacian filter for lumps identification on GPU
In medical diagnosis, mammographic imaging is mainly concerned with the breast parenchymal patterns (counterbalance of glandular tissue and fatty...
-
Using heterogeneous computing and edge computing to accelerate anomaly detection in remotely sensed multispectral images
This paper proposes a parallel algorithm exploiting heterogeneous computing and edge computing for anomaly detection (AD) in remotely sensed...
-
Utilization-prediction-aware energy optimization approach for heterogeneous GPU clusters
Optimizing energy consumption in heterogeneous GPU clusters is of paramount importance to enhance overall system efficiency and reduce operational...
-
Acceleration of 3D feature-enhancing noise filtering in hybrid CPU/GPU systems
FlowDenoising is a new approach to noise reduction in biological volumes obtained with three-dimensional electron microscopy (3DEM). Its abilities to...
-
Distributed out-of-memory NMF on CPU/GPU architectures
We propose an efficient distributed out-of-memory implementation of the non-negative matrix factorization (NMF) algorithm for heterogeneous...
-
Optimized Python library for reconstruction of ensemble-based gene co-expression networks using multi-GPU
Gene co-expression networks are valuable tools for discovering biologically relevant information within gene expression data. However, analysing...
-
Parallelization with load balancing of the weather scheme WSM7 for heterogeneous CPU-GPU platforms
This article provides an enhanced parallelization of the WSM7 microphysics scheme for the Weather Research and Forecasting Model (WRF). The...
-
Graph analysis using a GPU-based parallel algorithm: quantum clustering
The article introduces a new method for applying Quantum Clustering to graph structures. Quantum Clustering (QC) is a density-based unsupervised...
-
A perceptual and predictive batch-processing memory scheduling strategy for a CPU-GPU heterogeneous system
When multiple central processing unit (CPU) cores and integrated graphics processing units (GPUs) share off-chip main memory, CPU and GPU...
-
CC-RRTMG_SW++: Further optimizing a shortwave radiative transfer scheme on GPU
Atmospheric radiation is one of the most important atmospheric physics, and its expensive computation cost severely restricts the numerical...
-
Can GPU performance increase faster than the code error rate?
Graphics processing units (GPUs) are the reference architecture to accelerate high-performance computing applications and the training/inference...