Adaptive Dynamic Load Balancing in Heterogeneous Multiple GPUs-CPUs Distributed Setting: Case Study of B&B Tree Search

Vu, Trong-Tuan; Derbel, Bilel; Melab, Nouredine

doi:10.1007/978-3-642-44973-4_11

Part of the book series: Lecture Notes in Computer Science (LNTCS, volume 7997)

Included in the following conference series:

International Conference on Learning and Intelligent Optimization

Abstract

The emergence of new hybrid and heterogeneous large-scale platforms combining multiple GPUs and multiple CPUs offers new opportunities and poses new challenges for solving difficult optimization problems. This paper targets irregular tree search algorithms in which the workload is unpredictable. We propose an adaptive distributed approach that balances the load dynamically at runtime while taking into account the computing abilities of both GPUs and CPUs. Using Branch-and-Bound and FlowShop as a case study, we deployed our approach on up to 20 GPUs and 128 CPUs. Through extensive experiments in different system configurations, we report near-optimal speedups, thus providing new insights into how to take full advantage of the power of both GPUs and CPUs in modern computing platforms.
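
As a rough illustration of what taking computing abilities into account can mean, the following Python sketch splits a pool of pending tree nodes between two workers in proportion to their relative speeds. It is a minimal sketch under stated assumptions, not the scheme described in the paper: the function name `steal`, the speed parameters, and the proportional rule itself are all hypothetical.

```python
from collections import deque


def steal(victim_pool: deque, victim_speed: float, thief_speed: float) -> list:
    """Move a speed-proportional share of pending nodes to the thief.

    Hypothetical rule: the thief receives the fraction
    thief_speed / (victim_speed + thief_speed) of the victim's pool,
    so a faster device (e.g. a GPU) takes more work than a slower one.
    """
    count = int(len(victim_pool) * thief_speed / (victim_speed + thief_speed))
    # Take nodes from the right end of the victim's pool.
    return [victim_pool.pop() for _ in range(count)]


# A worker with relative speed 1.0 holds 100 pending nodes;
# an idle worker with relative speed 4.0 requests work.
pool = deque(range(100))
stolen = steal(pool, victim_speed=1.0, thief_speed=4.0)
```

With these assumed speeds, the faster worker ends up with four fifths of the pending nodes, so both workers would need about the same time to finish their shares if node costs were uniform. In an irregular tree search they are not, which is why such a split would have to be repeated at runtime.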

Acknowledgments

This material is based on work supported by the INRIA HEMERA project. Experiments presented in this paper were carried out using the Grid5000 experimental testbed, which is being developed under the INRIA ALADDIN development action with support from CNRS, RENATER, and several universities as well as other funding bodies (see https://www.grid5000.fr). Thanks also to Imen Chakroun for her valuable contributions to the code development of the GPU kernel.

Author information

Authors and Affiliations

DOLPHIN, INRIA Lille - Nord Europe, University Lille 1, Lille, France
Trong-Tuan Vu, Bilel Derbel & Nouredine Melab

Corresponding author

Correspondence to Trong-Tuan Vu.

Editor information

Editors and Affiliations

Università Catania Dipto. Matematica e Informatica, Catania, Italy
Giuseppe Nicosia
Industrial & Systems Engineering, University of Florida, Gainesville, Florida, USA
Panos Pardalos

About this paper

Cite this paper

Vu, TT., Derbel, B., Melab, N. (2013). Adaptive Dynamic Load Balancing in Heterogeneous Multiple GPUs-CPUs Distributed Setting: Case Study of B&B Tree Search. In: Nicosia, G., Pardalos, P. (eds) Learning and Intelligent Optimization. LION 2013. Lecture Notes in Computer Science, vol 7997. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-44973-4_11

DOI: https://doi.org/10.1007/978-3-642-44973-4_11
Published: 26 November 2013
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-642-44972-7
Online ISBN: 978-3-642-44973-4
eBook Packages: Computer Science, Computer Science (R0)
