Search
Search Results
-
BOUNCE: memory-efficient SIMD approach for lightweight integer compression
Integer compression plays an important role in columnar database systems to reduce the main memory footprint as well as to speedup query processing....
-
Partition-based SIMD Processing and its Application to Columnar Database Systems
The Single Instruction Multiple Data (SIMD) paradigm became a core principle for optimizing query processing in columnar database systems. Until now,...
-
SIMD vectorization for simultaneous solution of locally varying linear systems with multiple right-hand sides
Developments in numerical simulation of flows and high-performance computing influence one another. More detailed simulation methods create a...
-
Interruptible Nodes: Reducing Queueing Costs in Irregular Streaming Dataflow Applications on Wide-SIMD Architectures
Streaming dataflow applications are an attractive target to parallelize on wide-SIMD processors such as GPUs. These applications can be expressed as...
-
SIMD-based low bit-depth motion estimation with application to HEVC
Motion estimation (ME) is a crucial stage in video encoding process since it is the main contributor to high compression ratios. However, it is a...
-
SIMD Fundamentals
Chapter 1 introduces x86 SIMD fundamentals and essential concepts. It begins with a section that defines SIMD. This section also introduces SIMD... -
SIMD Usage and Optimization Guidelines
Congratulations if you have made it this far. I hope that your x86-AVX SIMD journey has been informative and worthwhile. I would like to conclude... -
Supplemental C++ SIMD Programming
In the previous eight chapters, you learned critical programming details about AVX, AVX2, and AVX-512. You also discovered how to create SIMD... -
On the Hardness of Scheme-Switching Between SIMD FHE Schemes
Fully homomorphic encryption (FHE) schemes are either lightweight and can evaluate boolean circuits or are relatively heavy and can evaluate... -
Efficient MIP volume rendering via fast SIMD interpolation and memory access reordering
This study describes an acceleration method that can perform efficient maximum intensity projection (MIP) visualization, which is essential for...
-
Introduction to X86-AVX SIMD Programming
In the first six chapters of this book, you learned essential details about an x86-64 processor including its data types, general-purpose registers,... -
A simple and efficient storage format for SIMD-accelerated SpMV
SpMV (Sparse matrix-vector multiplication) is an essential component in scientific computing and has attracted the attention of researchers in...
-
VIP: A SIMD vectorized analytical query engine
Query execution engines for analytics are continuously adapting to the underlying hardware in order to maximize performance. Wider SIMD registers and...
-
Design and Implementation of the CNN Accelator Based on Multi-streaming SIMD Mechanisms
In this paper, we propose a SIMD architecture for CNN model computing based on multi-streaming SIMD concepts. The proposed processor is called the... -
Fast Skinny-128 SIMD Implementations for Sequential Modes of Operation
This paper reports new software implementation results for the Skinny-128 tweakable block ciphers on various SIMD architectures. More precisely, we... -
uLog: a software-based approximate logarithmic number system for computations on SIMD processors
This paper presents a new number representation based on logarithmic number system (LNS) called unsigned logarithmic number system ( ulog ), as an...
-
A Quantum-Inspired Model for Bit-Serial SIMD-Parallel Computation
Bit-serial SIMD-parallel execution was once commonly used in supercomputers, but fell out of favor as it became practical to implement word-level... -
SIMD Vector Crunching
Vector computing can be read as equip** a processing unit with replicated ALUs. To benefit from this hardware concurrency, we have to phrase our... -
Parallel pairwise operations on data stored in DNA: sorting, XOR, shifting, and searching
Prior research has introduced the Single-Instruction-Multiple-Data paradigm for DNA computing (SIMD DNA). It offers the potential for storing...
-
A Case Study of LLVM-Based Analysis for Optimizing SIMD Code Generation
This paper presents a methodology for using LLVM-based tools to tune the DCA++ (dynamical cluster approximation) application that targets the new ARM...