-
Chapter and Conference Paper
DHDSearch: A Framework for Batch Time Series Searching on MapReduce
We present DHDSearch, a framework for distributed batch time series searching on MapReduce. DHDSearch is based on a two-layer DHDTree. The upper DHDTree serves as a route tree to distribute the time series. Wh...
-
Chapter and Conference Paper
HDUMP: A Data Recovery Tool for Hadoop
Hadoop is a popular distributed framework for massive data processing. HDFS is the underlying file system of Hadoop. More and more companies use Hadoop as a data processing platform. Once Hadoop crashes, the dat...
-
Chapter and Conference Paper
Modeling and Evaluating MID1 ICAL Pipeline on Spark
The Square Kilometre Array (SKA) project generates one of the largest data volumes in the world. SKA data flow pipelines need near-real-time processing ability, which brings huge challenges to the execution framew...
-
Chapter and Conference Paper
Clustering Time Series Utilizing a Dimension Hierarchical Decomposition Approach
Time series clustering has attracted a great deal of attention recently. However, clustering massive time series faces the challenge of huge computation cost. To reduce the computation cost, we propose a novel D...
-
Chapter and Conference Paper
Gene Therapy in the Rd6 Mouse Model of Retinal Degeneration
The rd6 mouse is a natural model of an RPE-based (retinal pigment epithelium) autosomal recessive retinitis pigmentosa (RP) caused by mutations in the Mfrp (membrane-type frizzled related protein) gene. Previousl...
-
Chapter and Conference Paper
An Efficient K-means Clustering Algorithm on MapReduce
As an important approach to analyzing massive data sets, an efficient k-means implementation on MapReduce is crucial in many applications. In this paper we propose a series of strategies to improve the efficienc...
-
Chapter and Conference Paper
Combination of In-Memory Graph Computation with MapReduce: A Subgraph-Centric Method of PageRank
To improve the efficiency of the PageRank algorithm, parallelization methods, especially those based on MapReduce, have interested many researchers over the past several years. Previous implementations of...