Edge Workload Prediction Based on Deep Learning

  • Chapter
  • First Online:
5G Edge Computing
  • 79 Accesses

Abstract

In the post-COVID-19 era, the rapid growth of edge computing underscores the importance of accurate workload prediction to maximize utilization of limited edge resources. Both edge service providers (ESPs) and edge service consumers (ESCs) can benefit significantly from such predictions. Existing workload prediction paradigms (edge-only or cloud-only) fall short by neglecting inter-site correlations and encountering data transmission delays. Given the expanding adoption of edge platforms by web services, achieving accuracy and efficiency in workload prediction is of vital importance. This chapter introduces a cloud-edge collaborated edge workload prediction framework that employs a collaborative cloud-edge paradigm for edge workload prediction using multi-view graphs. Specifically, the global stage develops a learnable aggregation layer per edge site to enhance efficiency while capturing inter-site correlations. The local stage devises a disaggregation layer that merges intra-site and inter-site correlations to enhance prediction accuracy. Through extensive experimentation on real-world edge workload data from China’s largest edge service provider, the proposed workload prediction framework outperforms existing methods in minimizing time consumption and reducing communication costs.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Subscribe and save

Springer+ Basic
EUR 32.99 /Month
  • Get 10 units per month
  • Download Article/Chapter or Ebook
  • 1 Unit = 1 Article or 1 Chapter
  • Cancel anytime
Subscribe now

Buy Now

Chapter
EUR 29.95
Price includes VAT (Germany)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
EUR 128.39
Price includes VAT (Germany)
  • Available as EPUB and PDF
  • Read on any device
  • Instant download
  • Own it forever
Hardcover Book
EUR 171.19
Price includes VAT (Germany)
  • Durable hardcover edition
  • Dispatched in 3 to 5 business days
  • Free ship** worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

References

  1. C. Nguyen, C. Klein, and E. Elmroth, “Multivariate lstm-based location-aware workload prediction for edge data centers,” in IEEE/ACM International Symposium on Cluster, Cloud and Grid Computing, 2019, pp. 341–350.

    Google Scholar 

  2. E. Cortez, A. Bonde, A. Muzio, M. Russinovich, M. Fontoura, and R. Bianchini, “Resource central: Understanding and predicting workloads for improved resource management in large cloud platforms,” in Proceedings of the 26th Symposium on Operating Systems Principles, 2017, pp. 153–167.

    Google Scholar 

  3. A. Ousterhout, J. Fried, J. Behrens, A. Belay, and H. Balakrishnan, “Shenango: Achieving high cpu efficiency for latency-sensitive datacenter workloads,” in USENIX Symposium on Networked Systems Design and Implementation, 2019, pp. 361–378.

    Google Scholar 

  4. R. Singh, S. Agarwal, M. Calder, and P. Bahl, “Cost-effective cloud edge traffic engineering with cascara,” in USENIX Symposium on Networked Systems Design and Implementation, 2021, pp. 201–216.

    Google Scholar 

  5. C. Joo and N. B. Shroff, “A novel coupled queueing model to control traffic via qos-aware collision pricing in cognitive radio networks,” in Proceedings of the International Conference on Computer Communications, 2017, pp. 1–9.

    Google Scholar 

  6. F. Ye, Z. Lin, C. Chen, Z. Zheng, and H. Huang, “Outlier-resilient web service qos prediction,” in Proceedings of the ACM Web Conference, 2021, pp. 3099–3110.

    Google Scholar 

  7. H. Lin, Y. Fan, J. Zhang, and B. Bai, “Rest: Reciprocal framework for spatiotemporal-coupled predictions,” in Proceedings of Web Conference, 2021.

    Google Scholar 

  8. P. R. Winters, “Forecasting sales by exponentially weighted moving averages,” Management Science, 1960.

    Google Scholar 

  9. G. E. Box, G. M. Jenkins, G. C. Reinsel, and G. M. Ljung, Time series analysis: forecasting and control. John Wiley & Sons, 2015.

    Google Scholar 

  10. X. Shi, Z. Chen, H. Wang, D.-Y. Yeung, W.-K. Wong, and W.-c. Woo, “Convolutional lstm network: A machine learning approach for precipitation nowcasting,” Advances in Neural Information Processing Systems, vol. 28, 2015.

    Google Scholar 

  11. I. Sutskever, O. Vinyals, and Q. V. Le, “Sequence to sequence learning with neural networks,” Advances in Neural Information Processing Systems, vol. 27, pp. 3104–3112, 2014.

    Google Scholar 

  12. W. Sun and X. Xu, “Aledar: An attentions-based encoder-decoder and autoregressive model for workload forecasting of cloud data center,” in IEEE International Conference on Computer Supported Cooperative Work in Design, 2022, pp. 59–64.

    Google Scholar 

  13. H. Zhou, S. Zhang, J. Peng, S. Zhang, J. Li, H. **ong, and W. Zhang, “Informer: Beyond efficient transformer for long sequence time-series forecasting,” in Proceedings of the AAAI Conference on Artificial Intelligence, no. 12, 2021, pp. 11 106–11 115.

    Google Scholar 

  14. X. Wang, Y. Ma, Y. Wang, W. **, X. Wang, J. Tang, C. Jia, and J. Yu, “Traffic flow prediction via spatial temporal graph neural network,” in Proceedings of Web Conference, 2020.

    Google Scholar 

  15. Z. Wu, S. Pan, G. Long, J. Jiang, and C. Zhang, “Graph wavenet for deep spatial-temporal graph modeling,” ar**v preprint ar**v:1906.00121, 2019.

    Google Scholar 

  16. K. Guo, Y. Hu, Y. Sun, S. Qian, J. Gao, and B. Yin, “Hierarchical graph convolution network for traffic forecasting,” in Proceedings of the AAAI Conference on Artificial Intelligence, no. 1, 2021, pp. 151–159.

    Google Scholar 

  17. Y. Li, R. Yu, C. Shahabi, and Y. Liu, “Diffusion convolutional recurrent neural network: Data-driven traffic forecasting,” in Proceedings of International Conference on Learning Representations, 2018.

    Google Scholar 

  18. B. Yu, H. Yin, and Z. Zhu, “Spatio-temporal graph convolutional networks: A deep learning framework for traffic forecasting,” ar**v preprint ar**v:1709.04875, 2017.

    Google Scholar 

  19. B. Liu, J. Guo, C. Li, and Y. Luo, “Workload forecasting based elastic resource management in edge cloud,” Computers & Industrial Engineering, vol. 139, p. 106136, 2020.

    Article  Google Scholar 

  20. J. Kumar and A. K. Singh, “Performance assessment of time series forecasting models for cloud datacenter networks’ workload prediction,” Wireless Personal Communications, vol. 116, no. 3, pp. 1949–1969, 2021.

    Article  Google Scholar 

  21. Y. Zhu, W. Zhang, Y. Chen, and H. Gao, “A novel approach to workload prediction using attention-based lstm encoder-decoder network in cloud environment,” EURASIP Journal on Wireless Communications and Networking, vol. 2019, no. 1, pp. 1–18, 2019.

    Article  Google Scholar 

  22. J. Kumar, A. K. Singh, and R. Buyya, “Self directed learning based workload forecasting model for cloud resource management,” Information Sciences, vol. 543, pp. 345–366, 2021.

    Article  Google Scholar 

  23. Q. He, Z. Dong, F. Chen, S. Deng, W. Liang, and Y. Yang, “Pyramid: enabling hierarchical neural networks with edge computing,” in Proceedings of the ACM Web Conference, 2022, pp. 1860–1870.

    Google Scholar 

  24. M. Xu, Z. Fu, X. Ma, L. Zhang, Y. Li, F. Qian, S. Wang, K. Li, J. Yang, and X. Liu, “From cloud to edge: a first look at public edge platforms,” in Proceedings of the ACM Internet Measurement Conference, 2021, pp. 37–53.

    Google Scholar 

  25. H. Sakoe and S. Chiba, “Dynamic programming algorithm optimization for spoken word recognition,” IEEE Transactions on Acoustics, Speech, and Signal Processing, vol. 26, no. 1, pp. 43–49, 1978.

    Article  Google Scholar 

  26. Z. Zhang, M. Zhang, A. G. Greenberg, Y. C. Hu, R. Mahajan, and B. Christian, “Optimizing cost and performance in online service provider networks.” in Proceedings of the USENIX Symposium on Networked Systems Design and Implementation, 2010, pp. 33–48.

    Google Scholar 

  27. F. Yu and V. Koltun, “Multi-scale context aggregation by dilated convolutions,” ar**v preprint ar**v:1511.07122, 2015.

    Google Scholar 

  28. T. N. Kipf and M. Welling, “Semi-supervised classification with graph convolutional networks,” ar**v preprint ar**v:1609.02907, 2016.

    Google Scholar 

  29. L. Zhang, L. Chen, and J. Xu, “Autodidactic neurosurgeon: Collaborative deep inference for mobile edge intelligence via online learning,” in Proceedings of the ACM Web Conference, 2021, pp. 3111–3123.

    Google Scholar 

  30. D. P. Kingma and J. Ba, “Adam: A method for stochastic optimization,” ar**v preprint ar**v:1412.6980, 2014.

    Google Scholar 

  31. S. Guo, Y. Lin, N. Feng, and C. Song, “Attention based spatial-temporal graph convolutional networks for traffic flow forecasting,” in Proceedings of the AAAI Conference on Artificial Intelligence, no. 01, 2019, pp. 922–929.

    Google Scholar 

  32. Y. Seo, M. Defferrard, P. Vandergheynst, and X. Bresson, “Structured sequence modeling with graph convolutional recurrent networks,” in International Conference on Neural Information Processing. Springer, 2018, pp. 362–373.

    Google Scholar 

  33. K. Guo, Y. Hu, Z. Qian, H. Liu, K. Zhang, Y. Sun, J. Gao, and B. Yin, “Optimized graph convolution recurrent neural network for traffic prediction,” IEEE Transactions on Intelligent Transportation Systems, vol. 22, no. 2, pp. 1138–1149, 2020.

    Article  Google Scholar 

  34. K. Guo, Y. Hu, Z. S. Qian, Y. Sun, J. Gao, and B. Yin, “An optimized temporal-spatial gated graph convolution network for traffic forecasting,” IEEE Intelligent Transportation Systems Magazine, 2020.

    Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Rights and permissions

Reprints and permissions

Copyright information

© 2024 The Author(s), under exclusive license to Springer Nature Singapore Pte Ltd.

About this chapter

Check for updates. Verify currency and authenticity via CrossMark

Cite this chapter

Ma, X., Xu, M., Li, Q., Li, Y., Zhou, A., Wang, S. (2024). Edge Workload Prediction Based on Deep Learning. In: 5G Edge Computing. Springer, Singapore. https://doi.org/10.1007/978-981-97-0213-8_3

Download citation

  • DOI: https://doi.org/10.1007/978-981-97-0213-8_3

  • Published:

  • Publisher Name: Springer, Singapore

  • Print ISBN: 978-981-97-0212-1

  • Online ISBN: 978-981-97-0213-8

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics

Navigation