Towards an Optimized Containerization of HPC Job Schedulers Based on Namespaces

  • Conference paper
  • First Online:
Network and Parallel Computing (NPC 2021)

Part of the book series: Lecture Notes in Computer Science ((LNTCS,volume 13152))

Included in the following conference series:

Abstract

Recently, container technology is gaining increasing attention and has become an alternative to the traditional virtual machines artifact. The technology is used to deploy large-scale applications in several areas such as Big Data, AI, and High-Performance Computing (HPC). In the HPC field, several management tools exist as Slurm in one hand. On the other hand, the literature has considered many container scheduling strategies. The majority of container scheduling strategies don’t think about the amount of data transmitted between containers. This paper presents a new container scheduling strategy that automatically groups containers that belong to the same group (Namespace) on the same node. In brief, the plan is application-aware as long as someone knows which containers should be grouped in the same Namespace. The objective is to compact the nodes with containers of the same group to reduce the number of nodes used, the communication inter-node costs, and improve containerized applications’ overall Quality-of-Service (QoS). Our proposed strategy is implemented under the Kubernetes framework. Experiments demonstrate the potential of our strategy under different scenarios. Most importantly, we show first that cohabitation between our new scheduling strategy and the default Kubernetes strategy is possible and for the benefit of the system. Thanks to Namespaces, cohabitation is not limited to two methods for scheduling either batch jobs or online services. Second, thanks to the deployment automation, we also demonstrate that multiple Slurm clusters can be instantiated from a pool of bare metal nodes. This reality contributes to the concept of “HPC as a Service.”

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Subscribe and save

Springer+ Basic
EUR 32.99 /Month
  • Get 10 units per month
  • Download Article/Chapter or Ebook
  • 1 Unit = 1 Article or 1 Chapter
  • Cancel anytime
Subscribe now

Buy Now

Chapter
EUR 29.95
Price includes VAT (Germany)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
EUR 58.84
Price includes VAT (Germany)
  • Available as EPUB and PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
EUR 74.89
Price includes VAT (Germany)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free ship** worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Similar content being viewed by others

Notes

  1. 1.

    See https://golang.org and https://kubernetes.io/.

References

  1. Bauer, M.: Solving Problems in HPC with Singularity. CernVM Workshop 2019, June 2019. https://cds.cern.ch/record/2677637

  2. Beltre, A.M., Saha, P., Govindaraju, M., Younge, A., Grant, R.E.: Enabling HPC workloads on cloud infrastructure using kubernetes container orchestration mechanisms. In: 2019 IEEE/ACM International Workshop on Containers and New Orchestration Paradigms for Isolated Environments in HPC (CANOPIE-HPC), pp. 11–20 (2019)

    Google Scholar 

  3. Casalicchio, E., Iannucci, S.: The state-of-the-art in container technologies: application, orchestration and security. Concurrency Comput. Practice Exp. 32(17), e5668 (2020)

    Google Scholar 

  4. Cérin, C., Greneche, N., Menouer, T.: Towards pervasive containerization of HPC job schedulers. In: 2020 IEEE 32nd International Symposium on Computer Architecture and High Performance Computing (SBAC-PAD), pp. 281–288 (2020)

    Google Scholar 

  5. Guan, X., Wan, X., Choi, B., Song, S., Zhu, J.: Application oriented dynamic resource allocation for data centers using docker containers. IEEE Commun. Lett. 21(3), 504–507 (2017)

    Article  Google Scholar 

  6. Hoque, S., d. Brito, M.S., Willner, A., Keil, O., Magedanz, T.: Towards container orchestration in fog computing infrastructures. In: 2017 IEEE 41st Annual Computer Software and Applications Conference (COMPSAC), vol. 2, pp. 294–299, July 2017

    Google Scholar 

  7. Jiang, C., et al.: Characterizing co-located workloads in alibaba cloud datacenters. IEEE Trans. Cloud Comput., 1 (2020)

    Google Scholar 

  8. Liu, B., Li, P., Lin, W., Shu, N., Li, Y., Chang, V.: A new container scheduling algorithm based on multi-objective optimization. Soft Comput. 22, 1–12 (2018)

    Google Scholar 

  9. Marzolla, M., Babaoglu, Ö., Panzieri, F.: Server consolidation in clouds through gossi**. In: 12th IEEE International Symposium on a World of Wireless, Mobile and Multimedia Networks, WOWMOM, Lucca, Italy, 20–24 June, 2011, pp. 1–6 (2011)

    Google Scholar 

  10. Menouer, T., Darmon, P.: A new container scheduling algorithm based on multi-objective optimization. In: 27th Euromicro International Conference on Parallel, Distributed and Network-based Processing, Pavia, Italy, February 2019

    Google Scholar 

  11. Menouer, T., Manad, O., Cérin, C., Darmon, P.: Power efficiency containers scheduling approach based on machine learning technique for cloud computing environment. In: Esposito, C., Hong, J., Choo, K.-K.R. (eds.) I-SPAN 2019. CCIS, vol. 1080, pp. 193–206. Springer, Cham (2019). https://doi.org/10.1007/978-3-030-30143-9_16

    Chapter  Google Scholar 

  12. Pahl, C., Lee, B.: Containers and clusters for edge cloud architectures - a technology review. In: 2015 3rd International Conference on Future Internet of Things and Cloud, pp. 379–386, August 2015

    Google Scholar 

  13. Smarr, L., Catlett, C.: Metacomputing. Commun. ACM 35, 44–52 (1992)

    Google Scholar 

  14. Steve Buchanan, Janaka Rangama, N.B.: Introducing azure kubernetes service: a practical guide to container orchestration. In: Apress (2019)

    Google Scholar 

  15. Sureshkumar, M., Rajesh, P.: Optimizing the docker container usage based on load scheduling. In: 2017 2nd International Conference on Computing and Communications Technologies (ICCCT), pp. 165–168, February 2017

    Google Scholar 

  16. **n, L.: The evolution of large-scale co-location technology at alibaba, 28 November 2019. https://www.alibabacloud.com/blog/the-evolution-of-large-scale-co-location-technology-at-alibaba_595595

  17. Zhao, A., Huang, Q., Huang, Y., Zou, L., Chen, Z., Song, J.: Research on resource prediction model based on kubernetes container auto-scaling technology. IOP Conf. Ser. Materials Sci. Eng. 569, 052092 (2019)

    Google Scholar 

  18. Zhou, N., Georgiou, Y., Zhong, L., Zhou, H., Pospieszny, M.: Container orchestration on HPC systems. In: 2020 IEEE 13th International Conference on Cloud Computing (CLOUD), pp. 34–36. IEEE (2020)

    Google Scholar 

  19. The apache software foundation. mesos, apache. http://mesos.apache.org/

  20. Docker swarmkit. https://github.com/docker/swarmkit/

  21. Kubernetes scheduler. https://kubernetes.io/

Download references

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Tarek Menouer .

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2022 IFIP International Federation for Information Processing

About this paper

Check for updates. Verify currency and authenticity via CrossMark

Cite this paper

Menouer, T., Greneche, N., Cérin, C., Darmon, P. (2022). Towards an Optimized Containerization of HPC Job Schedulers Based on Namespaces. In: Cérin, C., Qian, D., Gaudiot, JL., Tan, G., Zuckerman, S. (eds) Network and Parallel Computing. NPC 2021. Lecture Notes in Computer Science(), vol 13152. Springer, Cham. https://doi.org/10.1007/978-3-030-93571-9_12

Download citation

  • DOI: https://doi.org/10.1007/978-3-030-93571-9_12

  • Published:

  • Publisher Name: Springer, Cham

  • Print ISBN: 978-3-030-93570-2

  • Online ISBN: 978-3-030-93571-9

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics

Navigation