Search Page | SpringerLink

A guide to creating an effective big data management framework

Many agencies and organizations, such as the U.S. Geological Survey, handle massive geospatial datasets and their auxiliary data and are thus faced...

S. T. Arundel, K. G. McKeehan, ... P. T. Thiem in Journal of Big Data

Article Open access 26 September 2023

Leveraging an open source serverless framework for high energy physics computing

CERN (Centre Europeen pour la Recherce Nucleaire) is the largest research centre for high energy physics (HEP). It offers unique computational...

Vincenzo Eduardo Padulano, Pablo Oliver Cortés, ... Germán Moltó in The Journal of Supercomputing

Article Open access 02 January 2023

TMaR: a two-stage MapReduce scheduler for heterogeneous environments

In the context of MapReduce task scheduling, many algorithms mainly focus on the scheduling of Reduce tasks with the assumption that scheduling of...

Neda Maleki, Hamid Reza Faragardi, ... Jay Lofstead in Human-centric Computing and Information Sciences

Article Open access 07 October 2020

A learning-based framework for spatial join processing: estimation, optimization and tuning

The importance and complexity of spatial join operation resulted in the availability of many join algorithms, some of which are tailored for big-data...

Tin Vu, Alberto Belussi, ... Ahmed Eldawy in The VLDB Journal

Article Open access 13 February 2024

Comprehensive techniques for multi-tenant deep learning framework on a Hadoop YARN cluster

We have designed and implemented a new data processing framework called “MeLoN” (Multi-tenant dEep Learning framework On yarN) which aims to...

Seoungbeom Heo, Dae-Cheol Kang, ... Jik-Soo Kim in Cluster Computing

Article 17 November 2022

Estimating runtime of a job in Hadoop MapReduce

Hadoop MapReduce is a framework to process vast amounts of data in the cluster of machines in a reliable and fault-tolerant manner. Since being aware...

Narges Peyravi, Ali Moeini in Journal of Big Data

Article Open access 06 July 2020

A distributed WND-LSTM model on MapReduce for short-term traffic flow prediction

Building data-driven intelligent transportation is a significant task for establishing data-centric smart cities, and exceptionally efficient and...

Dawen **a, Maoting Zhang, ... Huaqing Li in Neural Computing and Applications

Article 02 July 2020

Economic mining of thermal power plant based on improved Hadoop-based framework and Spark-based algorithms

In order to explore potential value of explosively growing data in thermal power unit, this paper proposes a big data mining method based on...

**aoqiang Wen, Zhibin Wu, ... Lifeng Wu in The Journal of Supercomputing

Article 12 June 2023

Mining Skyline Patterns from Big Data Environments based on a Spark Framework

Simultaneously, the application of resilient distributed datasets (RDD) in cloud computing provides a good environment for data analysis of big data....

Jimmy Ming-Tai Wu, Huiying Zhou, ... Mohamed Baza in Journal of Grid Computing

Article 05 April 2023

Efficient allocation of independent gridlet on small, medium, and large grid

Gridlet allocation in a computational grid environment is a major research issue to obtain not only the efficient gridlet allocation technique but...

D. Rajeswari, S. Ramamoorthy, R. Srinivasan in Personal and Ubiquitous Computing

Article 20 March 2023

Efficient verification of parallel matrix multiplication in public cloud: the MapReduce case

With the advent of cloud-based parallel processing techniques, services such as MapReduce have been considered by many businesses and researchers for...

Ramtin Bagheri, Morteza Amini, Somayeh Dolatnezhad Samarin in Journal of Big Data

Article Open access 15 October 2020

Design of ChaApache framework for securing Hadoop application in big data

Hadoop is one of the biggest software structures for distributing the data to compute and handle big data. Big data is a group of composite and...

Saritha Gattoju, V. Nagalakshmi in Multimedia Tools and Applications

Article 01 October 2022

A distributed framework for large-scale semantic trajectory similarity join

The similarity join is a common yet expensive operator for large-scale semantic trajectories analytics. In this paper, we propose DFST , an efficient...

Ruijie Tian, Jiajun Li, ... Fei Wang in Multimedia Tools and Applications

Article 13 July 2023

Parallel computation of probabilistic skyline queries using MapReduce

In recent years, numerous applications have been continuously generating large amounts of uncertain data. The advanced analysis queries such as...

Elaheh Gavagsaz in The Journal of Supercomputing

Article 18 April 2020

Distributed probabilistic top-k dominating queries over uncertain databases

In many real-world applications such as business planning and sensor data monitoring, one important, yet challenging, task is to rank objects (e.g.,...

Niranjan Rai, **ang Lian in Knowledge and Information Systems

Article 01 July 2023

A new Apache Spark-based framework for big data streaming forecasting in IoT networks

Analyzing time-dependent data acquired in a continuous flow is a major challenge for various fields, such as big data and machine learning. Being...

Antonio M. Fernández-Gómez, David Gutiérrez-Avilés, ... Francisco Martínez-Álvarez in The Journal of Supercomputing

Article 21 February 2023

Prefetched wald adaptive boost classification based Czekanowski similarity MapReduce for user query processing with bigdata

With large volumes of data being generated in recent years and the inception of big data analytics on social media necessitates accurate user query...

S. Tamil Selvan, P. Balamurugan, M. Vijayakumar in Distributed and Parallel Databases

Article 05 January 2021

Mining frequent Itemsets from transaction databases using hybrid switching framework

With the growing volume of data, mining Frequent Itemsets remains of paramount importance. These have applications in various domains such as market...

P.P Jashma Suresh, U Dinesh Acharya, N.V. Subba Reddy in Multimedia Tools and Applications

Article 16 February 2023