Search Results

Showing 1-13 of 13 results
  1. A Performance Comparison of Clustering Algorithms for Big Data on DataMPI

    Clustering algorithms for big data have important applications in finance. DataMPI is a communication library based on key-value pairs that extends...
    Mo Hai in Data Science
    Conference paper 2020
  2. Investigating the performance of Hadoop and Spark platforms on machine learning algorithms

    One of the most challenging issues in the big data research area is the inability to process a large volume of information in a reasonable time....

    Ali Mostafaeipour, Amir Jahangard Rafsanjani, ... Joshuva Arockia Dhanraj in The Journal of Supercomputing
    Article 13 May 2020
  3. MapReduce scheduling algorithms in Hadoop: a systematic study

    Hadoop is a framework for storing and processing huge volumes of data on clusters. It uses Hadoop Distributed File System (HDFS) for storing data and...

    Soudabeh Hedayati, Neda Maleki, ... Kamal Berahmand in Journal of Cloud Computing
    Article Open access 10 October 2023
  4. Interference-aware co-scheduling method based on classification of application characteristics from hardware performance counter using data mining

    Computational scientists and engineers who are eager to obtain the best performance of scientific applications need efficient application...

    Jieun Choi, Geunchul Park, Dukyun Nam in Cluster Computing
    Article 12 June 2019
  5. MapReduce: an infrastructure review and research insights

    In the current decade, doing the search on massive data to find “hidden” and valuable information within it is growing. This search can result in...

    Neda Maleki, Amir Masoud Rahmani, Mauro Conti in The Journal of Supercomputing
    Article 08 June 2019
  6. xCCL: A Survey of Industry-Led Collective Communication Libraries for Deep Learning

    Machine learning techniques have become ubiquitous both in industry and academic applications. Increasing model sizes and training data volumes...

    Adam Weingram, Yuke Li, ... Xiaoyi Lu in Journal of Computer Science and Technology
    Article 01 February 2023
  7. CirroData: Yet Another SQL-on-Hadoop Data Analytics Engine with High Performance

    This paper presents CirroData, a high-performance SQL-on-Hadoop system designed for Big Data analytics workloads. As a home-grown enterprise-level...

    Zheng-Hao Jin, Haiyang Shi, ... Xiaoyi Lu in Journal of Computer Science and Technology
    Article 17 January 2020
  8. Big Data and HPC Convergence: The Cutting Edge and Outlook

    The data growth over the last couple of decades increases on a massive scale. As the volume of the data increases so are the challenges associated...
    Sardar Usman, Rashid Mehmood, Iyad Katib in Smart Societies, Infrastructure, Technologies and Applications
    Conference paper 2018
  9. Combining Hadoop with MPI to Solve Metagenomics Problems that are both Data- and Compute-intensive

    Metagenomics, the study of all microbial species cohabitants in an environment, often produces large amount of sequence data varying from several GBs...

    Han Lin, Zhichao Su, ... Zheng Wu in International Journal of Parallel Programming
    Article 07 October 2017
  10. Accelerating Iterative Big Data Computing Through MPI

    Current popular systems, Hadoop and Spark, cannot achieve satisfied performance because of the inefficient overlapping of computation and...

    Article 13 March 2015
  11. Performance Benefits of DataMPI: A Case Study with BigDataBench

    Apache Hadoop and Spark are gaining prominence in Big Data processing and analytics. Both of them are widely deployed in Internet companies. On the...
    Conference paper 2014
  12. Feasibility analysis of AsterixDB and Spark streaming with Cassandra for stream-based processing

    For getting up-to-date insight into online services, extracted data has to be processed in near real time. For example, major big data companies...

    Pekka Pääkkönen in Journal of Big Data
    Article Open access 08 April 2016
  13. Preface

    Article 13 March 2015