-
Article
FPGA-based acceleration architecture for Apache Spark operators
Apache Spark has been the most popular in-memory processing framework for big data applications deployed in data centers. As a CPU-only parallel programming framework, Spark can satisfy the requirement of comp...
-
Article
Open AccessA heterogeneous 3-D stacked PIM accelerator for GCN-based recommender systems
Modern recommendation systems integrate graph convolution neural networks (GCN) for enhancing embedding representation. Compared with widely deployed neural network-based models, the extra message propagation ...
-
Article
Intensity-modulated radiotherapy alone compared with intensity-modulated radiotherapy plus concurrent chemotherapy in intermediate-risk nasopharyngeal carcinoma
This study aimed to investigate the clinical benefit of adding concurrent chemotherapy to intensity-modulated radiotherapy (IMRT) for nasopharyngeal carcinoma (NPC) patients with an intermediate risk (stage II...
-
Article
A hybrid memory architecture supporting fine-grained data migration
Hybrid memory systems composed of dynamic random access memory (DRAM) and Non-volatile memory (NVM) often exploit page migration technologies to fully take the advantages of different memory media. Most previo...
-
Article
ARCHER: a ReRAM-based accelerator for compressed recommendation systems
Modern recommendation systems are widely used in modern data centers. The random and sparse embedding lookup operations are the main performance bottleneck for processing recommendation systems on traditional ...
-
Article
A survey on dynamic graph processing on GPUs: concepts, terminologies and systems
Graphs that are used to model real-world entities with vertices and relationships among entities with edges, have proven to be a powerful tool for describing real-world problems in applications. In most real-w...
-
Article
Open AccessHypermethylated GPR135 gene expression is a favorable independent prognostic factor in nasopharyngeal carcinoma
To investigate the methylation status and expression level of G protein-coupled receptor 135 (GPR135) in nasopharyngeal carcinoma (NPC) and determine its prognostic value.
-
Chapter and Conference Paper
PCB Defect Detection Algorithm Based on Multi-scale Fusion Network
PCB defect detection is a crucial link for the production of PCB board, in order to pursue the yield, the production chain must ensure that there is an efficient PCB defect detection method. The traditional PC...
-
Chapter and Conference Paper
An Efficient Graph Accelerator with Distributed On-Chip Memory Hierarchy
Graph processing has evolved and expanded swiftly with artificial intelligence and big data technology. High-Bandwidth Memory (HBM), which delivers terabyte-level memory bandwidth, has opened up new development p...
-
Article
UCat: heterogeneous memory management for unikernels
Unikernels provide an efficient and lightweight way to deploy cloud computing services in application-specialized and single-address-space virtual machines (VMs). They can efficiently deploy hundreds of uniker...
-
Article
ReCSA: a dedicated sort accelerator using ReRAM-based content addressable memory
With the increasing amount of data, there is an urgent need for efficient sorting algorithms to process large data sets. Hardware sorting algorithms have attracted much attention because they can take advantag...
-
Article
Studies on Bioflocculant Production by Pseudoalteromonas sp. NUM8, a Marine Bacteria Isolated from the Circulating Seawater
A bioflocculant producing potential bacteria was isolated from the circulating seawater of bio-filter using streak plate methods. The bacteria was identified through biochemical characteristics, partial 16S ri...
-
Article
Editorial for the special issue on high performance distributed computing
-
Article
Cost Efficient Edge Service Placement for Crowdsensing via Bus Passengers
Edge computing is highly recommended to support Mobile Crowdsensing (MCS) applications for sensing data processing. In this paper, we consider the MCS applications supported by the mobile phones of bus passengers...
-
Article
Resource abstraction and data placement for distributed hybrid memory pool
Emerging byte-addressable non-volatile memory (NVM) technologies offer higher density and lower cost than DRAM, at the expense of lower performance and limited write endurance. There have been many studies on hyb...
-
Article
Effective runtime scheduling for high-performance graph processing on heterogeneous dataflow architecture
Graph processing is widely used in modern society, such as social networks, bioinformatics, and information networks. It is observed that the dataflow architecture has been demonstrated to effectively resolve ...
-
Chapter and Conference Paper
Superpage-Friendly Page Table Design for Hybrid Memory Systems
Page migration has long been adopted in hybrid memory systems comprising dynamic random access memory (DRAM) and non-volatile memories (NVMs), to improve the system performance and energy efficiency. However, pag...
-
Article
An effective framework for asynchronous incremental graph processing
Although many graph processing systems have been proposed, graphs in the real-world are often dynamic. It is important to keep the results of graph computation up-to-date. Incremental computation is demonstrat...
-
Article
Enhancing application performance via DAG-driven scheduling in task parallelism for cloud center
Nowadays, offloading technologies are applied to smart devices, which add more jobs into cloud data center. In cloud data center, limited physical resources and competitions of different jobs all need to be im...
-
Article
FunctionFlow: coordinating parallel tasks
With the growing popularity of task-based parallel programming, nowadays task-parallel programming libraries and languages are still with limited support for coordinating parallel tasks. Such limitation forces...