  1. Chapter and Conference Paper

    HDUMP: A Data Recovery Tool for Hadoop

    Hadoop is a popular distributed framework for massive data processing, and HDFS is its underlying file system. More and more companies use Hadoop as a data processing platform. Once Hadoop crashes, the dat...

    Zhongsheng Li, Qiuhong Li, Wei Wang in Database Systems for Advanced Applications (2018)

  2. Chapter and Conference Paper

    Modeling and Evaluating MID1 ICAL Pipeline on Spark

    The Square Kilometre Array (SKA) project generates one of the largest data volumes in the world. SKA data flow pipelines need near-real-time processing ability, which brings huge challenges to the execution framew...

    Zhongsheng Li, Qiuhong Li, Yimin Liu in Database Systems for Advanced Applications (2018)