Abstract
In the post-COVID-19 era, the rapid growth of edge computing underscores the importance of accurate workload prediction to maximize utilization of limited edge resources. Both edge service providers (ESPs) and edge service consumers (ESCs) can benefit significantly from such predictions. Existing workload prediction paradigms (edge-only or cloud-only) fall short by neglecting inter-site correlations and encountering data transmission delays. Given the expanding adoption of edge platforms by web services, achieving accuracy and efficiency in workload prediction is of vital importance. This chapter introduces a cloud-edge collaborated edge workload prediction framework that employs a collaborative cloud-edge paradigm for edge workload prediction using multi-view graphs. Specifically, the global stage develops a learnable aggregation layer per edge site to enhance efficiency while capturing inter-site correlations. The local stage devises a disaggregation layer that merges intra-site and inter-site correlations to enhance prediction accuracy. Through extensive experimentation on real-world edge workload data from China’s largest edge service provider, the proposed workload prediction framework outperforms existing methods in minimizing time consumption and reducing communication costs.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
References
C. Nguyen, C. Klein, and E. Elmroth, “Multivariate lstm-based location-aware workload prediction for edge data centers,” in IEEE/ACM International Symposium on Cluster, Cloud and Grid Computing, 2019, pp. 341–350.
E. Cortez, A. Bonde, A. Muzio, M. Russinovich, M. Fontoura, and R. Bianchini, “Resource central: Understanding and predicting workloads for improved resource management in large cloud platforms,” in Proceedings of the 26th Symposium on Operating Systems Principles, 2017, pp. 153–167.
A. Ousterhout, J. Fried, J. Behrens, A. Belay, and H. Balakrishnan, “Shenango: Achieving high cpu efficiency for latency-sensitive datacenter workloads,” in USENIX Symposium on Networked Systems Design and Implementation, 2019, pp. 361–378.
R. Singh, S. Agarwal, M. Calder, and P. Bahl, “Cost-effective cloud edge traffic engineering with cascara,” in USENIX Symposium on Networked Systems Design and Implementation, 2021, pp. 201–216.
C. Joo and N. B. Shroff, “A novel coupled queueing model to control traffic via qos-aware collision pricing in cognitive radio networks,” in Proceedings of the International Conference on Computer Communications, 2017, pp. 1–9.
F. Ye, Z. Lin, C. Chen, Z. Zheng, and H. Huang, “Outlier-resilient web service qos prediction,” in Proceedings of the ACM Web Conference, 2021, pp. 3099–3110.
H. Lin, Y. Fan, J. Zhang, and B. Bai, “Rest: Reciprocal framework for spatiotemporal-coupled predictions,” in Proceedings of Web Conference, 2021.
P. R. Winters, “Forecasting sales by exponentially weighted moving averages,” Management Science, 1960.
G. E. Box, G. M. Jenkins, G. C. Reinsel, and G. M. Ljung, Time series analysis: forecasting and control. John Wiley & Sons, 2015.
X. Shi, Z. Chen, H. Wang, D.-Y. Yeung, W.-K. Wong, and W.-c. Woo, “Convolutional lstm network: A machine learning approach for precipitation nowcasting,” Advances in Neural Information Processing Systems, vol. 28, 2015.
I. Sutskever, O. Vinyals, and Q. V. Le, “Sequence to sequence learning with neural networks,” Advances in Neural Information Processing Systems, vol. 27, pp. 3104–3112, 2014.
W. Sun and X. Xu, “Aledar: An attentions-based encoder-decoder and autoregressive model for workload forecasting of cloud data center,” in IEEE International Conference on Computer Supported Cooperative Work in Design, 2022, pp. 59–64.
H. Zhou, S. Zhang, J. Peng, S. Zhang, J. Li, H. **ong, and W. Zhang, “Informer: Beyond efficient transformer for long sequence time-series forecasting,” in Proceedings of the AAAI Conference on Artificial Intelligence, no. 12, 2021, pp. 11 106–11 115.
X. Wang, Y. Ma, Y. Wang, W. **, X. Wang, J. Tang, C. Jia, and J. Yu, “Traffic flow prediction via spatial temporal graph neural network,” in Proceedings of Web Conference, 2020.
Z. Wu, S. Pan, G. Long, J. Jiang, and C. Zhang, “Graph wavenet for deep spatial-temporal graph modeling,” ar**v preprint ar**v:1906.00121, 2019.
K. Guo, Y. Hu, Y. Sun, S. Qian, J. Gao, and B. Yin, “Hierarchical graph convolution network for traffic forecasting,” in Proceedings of the AAAI Conference on Artificial Intelligence, no. 1, 2021, pp. 151–159.
Y. Li, R. Yu, C. Shahabi, and Y. Liu, “Diffusion convolutional recurrent neural network: Data-driven traffic forecasting,” in Proceedings of International Conference on Learning Representations, 2018.
B. Yu, H. Yin, and Z. Zhu, “Spatio-temporal graph convolutional networks: A deep learning framework for traffic forecasting,” ar**v preprint ar**v:1709.04875, 2017.
B. Liu, J. Guo, C. Li, and Y. Luo, “Workload forecasting based elastic resource management in edge cloud,” Computers & Industrial Engineering, vol. 139, p. 106136, 2020.
J. Kumar and A. K. Singh, “Performance assessment of time series forecasting models for cloud datacenter networks’ workload prediction,” Wireless Personal Communications, vol. 116, no. 3, pp. 1949–1969, 2021.
Y. Zhu, W. Zhang, Y. Chen, and H. Gao, “A novel approach to workload prediction using attention-based lstm encoder-decoder network in cloud environment,” EURASIP Journal on Wireless Communications and Networking, vol. 2019, no. 1, pp. 1–18, 2019.
J. Kumar, A. K. Singh, and R. Buyya, “Self directed learning based workload forecasting model for cloud resource management,” Information Sciences, vol. 543, pp. 345–366, 2021.
Q. He, Z. Dong, F. Chen, S. Deng, W. Liang, and Y. Yang, “Pyramid: enabling hierarchical neural networks with edge computing,” in Proceedings of the ACM Web Conference, 2022, pp. 1860–1870.
M. Xu, Z. Fu, X. Ma, L. Zhang, Y. Li, F. Qian, S. Wang, K. Li, J. Yang, and X. Liu, “From cloud to edge: a first look at public edge platforms,” in Proceedings of the ACM Internet Measurement Conference, 2021, pp. 37–53.
H. Sakoe and S. Chiba, “Dynamic programming algorithm optimization for spoken word recognition,” IEEE Transactions on Acoustics, Speech, and Signal Processing, vol. 26, no. 1, pp. 43–49, 1978.
Z. Zhang, M. Zhang, A. G. Greenberg, Y. C. Hu, R. Mahajan, and B. Christian, “Optimizing cost and performance in online service provider networks.” in Proceedings of the USENIX Symposium on Networked Systems Design and Implementation, 2010, pp. 33–48.
F. Yu and V. Koltun, “Multi-scale context aggregation by dilated convolutions,” ar**v preprint ar**v:1511.07122, 2015.
T. N. Kipf and M. Welling, “Semi-supervised classification with graph convolutional networks,” ar**v preprint ar**v:1609.02907, 2016.
L. Zhang, L. Chen, and J. Xu, “Autodidactic neurosurgeon: Collaborative deep inference for mobile edge intelligence via online learning,” in Proceedings of the ACM Web Conference, 2021, pp. 3111–3123.
D. P. Kingma and J. Ba, “Adam: A method for stochastic optimization,” ar**v preprint ar**v:1412.6980, 2014.
S. Guo, Y. Lin, N. Feng, and C. Song, “Attention based spatial-temporal graph convolutional networks for traffic flow forecasting,” in Proceedings of the AAAI Conference on Artificial Intelligence, no. 01, 2019, pp. 922–929.
Y. Seo, M. Defferrard, P. Vandergheynst, and X. Bresson, “Structured sequence modeling with graph convolutional recurrent networks,” in International Conference on Neural Information Processing. Springer, 2018, pp. 362–373.
K. Guo, Y. Hu, Z. Qian, H. Liu, K. Zhang, Y. Sun, J. Gao, and B. Yin, “Optimized graph convolution recurrent neural network for traffic prediction,” IEEE Transactions on Intelligent Transportation Systems, vol. 22, no. 2, pp. 1138–1149, 2020.
K. Guo, Y. Hu, Z. S. Qian, Y. Sun, J. Gao, and B. Yin, “An optimized temporal-spatial gated graph convolution network for traffic forecasting,” IEEE Intelligent Transportation Systems Magazine, 2020.
Author information
Authors and Affiliations
Rights and permissions
Copyright information
© 2024 The Author(s), under exclusive license to Springer Nature Singapore Pte Ltd.
About this chapter
Cite this chapter
Ma, X., Xu, M., Li, Q., Li, Y., Zhou, A., Wang, S. (2024). Edge Workload Prediction Based on Deep Learning. In: 5G Edge Computing. Springer, Singapore. https://doi.org/10.1007/978-981-97-0213-8_3
Download citation
DOI: https://doi.org/10.1007/978-981-97-0213-8_3
Published:
Publisher Name: Springer, Singapore
Print ISBN: 978-981-97-0212-1
Online ISBN: 978-981-97-0213-8
eBook Packages: Computer ScienceComputer Science (R0)