Improving the application performance of Loki via algorithm optimization

Zhu, Wenming; Su, Wen**g; Yang, Kai; Chen, Hao

doi:10.1007/s00530-023-01197-5

Improving the application performance of Loki via algorithm optimization

Regular Paper
Published: 10 January 2024

Volume 30, article number 2, (2024)
Cite this article

Multimedia Systems Aims and scope Submit manuscript

Wenming Zhu¹^na1,
Wen**g Su²^na1,
Kai Yang¹ &
…
Hao Chen²

88 Accesses
Explore all metrics

Abstract

Loki is a state-of-the-art adaptive bitrate algorithm for the transmission of real-time-communication (RTC) video. It fuses traditional heuristic methods with a learning-based model to maximize the quality of experience (QoE) under diverse network conditions. However, a recurring rebound pattern is observed in Loki’s decision-making process where the decision frequently oscillates between the two boundaries of the action space, making Loki fail to adapt to the fluctuating network bandwidth. To address this issue, we propose Loki+, which improves both the fusion mechanism and the design of the learning-based actor. Specifically, we replace the element-wise multiplication with a simple but effective trend fusion and further optimize the design of reward and loss functions for training Loki+. Extensive simulation results show that Loki+ significantly improves the QoE in the aspects of reducing the stall rate by 20%\(\sim\)60% and the frame delay by 3.5%\(\sim\)30.5% while maintaining a similar sending bitrate or video quality, compared with Loki.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Log in via an institution

Subscribe and save

Springer+ Basic

EUR 32.99 /Month

Get 10 units per month
Download Article/Chapter or Ebook
1 Unit = 1 Article or 1 Chapter
Cancel anytime

Subscribe now

Buy Now

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Institutional subscriptions

Multi-agent deep reinforcement learning: a survey

Article Open access 15 April 2021

Multi-Agent Reinforcement Learning: A Selective Overview of Theories and Algorithms

A practical guide to multi-objective reinforcement learning and planning

Article Open access 13 April 2022

Data availability

The data that support the findings of this study are available on request from the corresponding author HC, upon reasonable request.

Code availability

Not applicable.

References

Trade, U.N.C., Development: Estimates of global e-commerce 2019 and preliminary assessment of COVID-19 impact on online retail 2020. https://news.un.org/zh/story/2021/05/1083402 (2021)
Information, C.A., Technology, C.: Real-time interactive industry development research report. http://www.caict.ac.cn/kxyj/qwfb/ztbg/202206/t20220614_ 404308.htm (2022)
Carlucci, G., De Cicco, L., Holmer, S., Mascolo, S.: Congestion control for web real-time communication. IEEE/ACM Trans. Netw. 25(5), 2629–2642 (2017). https://doi.org/10.1109/TNET.2017.2703615
Article Google Scholar
Zhang, H., Zhou, A., Hu, Y., Li, C., Wang, G., Zhang, X., Ma, H., Wu, L., Chen, A., Wu, C.: Loki: Improving long tail performance of learning-based real-time video adaptation by fusing rule-based models. Association for Computing Machinery, New York, NY, USA (2021). https://doi.org/10.1145/3447993.3483259
Mao, H., Chen, S., Dimmery, D., Singh, S., Blaisdell, D., Tian, Y., Alizadeh, M., Bakshy, E.: Real-world Video Adaptation with Reinforcement Learning (2020)
Zhou, A., Zhang, H., Su, G., Wu, L., Ma, R., Meng, Z., Zhang, X., **e, X., Ma, H., Chen, X.: Learning to coordinate video codec with transport protocol for mobile video telephony. Association for Computing Machinery, New York, NY, USA (2019)
Book Google Scholar
Schulman, J., Wolski, F., Dhariwal, P., Radford, A., Klimov, O.: Proximal Policy Optimization Algorithms (2017)
Zhang, H., Zhou, A., Ma, H.: Improving mobile interactive video qoe via two-level online cooperative learning. IEEE Trans. Mobile Comput. 22(10), 5900–5917 (2023). https://doi.org/10.1109/TMC.2022.3179782
Article Google Scholar
Kingma, D.P., Ba, J.: Adam: A method for stochastic optimization. CoRR ar**v:abs/1412.6980 (2014)
Pereyra, G., Tucker, G., Chorowski, J., Kaiser, L., Hinton, G.: Regularizing Neural Networks by Penalizing Confident Output Distributions (2017). https://openreview.net/forum?id=HkCjNI5ex
Sutton, R.S., Barto, A.G.: Reinforcement Learning: An Introduction. A Bradford Book, Cambridge, MA, USA (2018)
Google Scholar
Schulman, J., Moritz, P., Levine, S., Jordan, M.I., Abbeel, P.: High-dimensional continuous control using generalized advantage estimation. CoRR ar**v:abs/1506.02438 (2015)
Commission, F.C.: Raw Data-Measuring Broad-band America 2016. https://www.fcc.gov/reports-research/reports/measuring-broadband-america/raw-data-measuring-broadband-ameri-2016 (2016)
Riiser, H., Vigmostad, P., Griwodz, C., Halvorsen, P.: Commute path bandwidth traces from 3g networks: Analysis and applications. Association for Computing Machinery, New York, NY, USA (2013)
Book Google Scholar
Yi, G., Yang, D., Bentaleb, A., Li, W., Li, Y., Zheng, K., Liu, J., Ooi, W.T., Cui, Y.: The acm multimedia 2019 live video streaming grand challenge. Association for Computing Machinery, New York, NY, USA (2019)
Book Google Scholar
Akhtar, Z., Nam, Y.S., Govindan, R., Rao, S., Chen, J., Katz-Bassett, E., Ribeiro, B., Zhan, J., Zhang, H.: Oboe: Auto-tuning video abr algorithms to network conditions. Association for Computing Machinery, New York, NY, USA (2018)
Google Scholar

Download references

Acknowledgements

This work was partially supported by the National Natural Science Foundation of China (62101241), Jiangsu Provincial Double-Innovation Doctor Program (JSSCBS20210001), and Changzhou Power Supply Branch of State Grid Jiangsu Electric Power Co., Ltd.(SGJSCZ00KJJS2311209).

Funding

Research grants from the National Natural Science Foundation of China (62101241), Jiangsu Provincial Double-Innovation Doctor Program (JSSCBS20210001), and Changzhou Power Supply Branch of State Grid Jiangsu Electric Power Co., Ltd. ( SGJSCZ00KJJS2311209).

Author information

Wenming Zhu and Wen**g Su have contributed equally to this work.

Authors and Affiliations

Changzhou Power Supply Branch of State Grid Jiangsu Electric Power Co., Ltd., Junqian Street, Changzhou, 213100, Jiangsu Province, China
Wenming Zhu & Kai Yang
School of Electronic Science and Engineering, Nan**g University, **anlin Street, Nan**g, 210023, Jiangsu Province, China
Wen**g Su & Hao Chen

Authors

Wenming Zhu
View author publications
You can also search for this author in PubMed Google Scholar
Wen**g Su
View author publications
You can also search for this author in PubMed Google Scholar
Kai Yang
View author publications
You can also search for this author in PubMed Google Scholar
Hao Chen
View author publications
You can also search for this author in PubMed Google Scholar

Contributions

Material preparation, data collection, and analysis were performed by WZ, WS, and KY. Conceptualization and methodology were performed by WS and HC. The first draft of the manuscript was written by WS and all authors commented on previous versions of the manuscript. All authors read and approved the final manuscript.

Corresponding author

Correspondence to Hao Chen.

Ethics declarations

Conflict of interest

The authors have no relevant financial or non-financial interests to disclose.

Ethical approval

Not applicable.

Consent to participate

All authors agreed to participate.

Consent for publication

All authors gave permission for publication.

Additional information

Communicated by B. Bao.

Publisher's Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Rights and permissions

Springer Nature or its licensor (e.g. a society or other partner) holds exclusive rights to this article under a publishing agreement with the author(s) or other rightsholder(s); author self-archiving of the accepted manuscript version of this article is solely governed by the terms of such publishing agreement and applicable law.

Reprints and permissions

About this article

Cite this article

Zhu, W., Su, W., Yang, K. et al. Improving the application performance of Loki via algorithm optimization. Multimedia Systems 30, 2 (2024). https://doi.org/10.1007/s00530-023-01197-5

Download citation

Received: 21 September 2023
Accepted: 08 December 2023
Published: 10 January 2024
DOI: https://doi.org/10.1007/s00530-023-01197-5

Keywords

Access this article

Log in via an institution

Subscribe and save

Springer+ Basic

EUR 32.99 /Month

Get 10 units per month
Download Article/Chapter or Ebook
1 Unit = 1 Article or 1 Chapter
Cancel anytime

Subscribe now

Buy Now

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Institutional subscriptions

Improving the application performance of Loki via algorithm optimization

Abstract

Access this article

Subscribe and save

Buy Now

Similar content being viewed by others

Multi-agent deep reinforcement learning: a survey

Multi-Agent Reinforcement Learning: A Selective Overview of Theories and Algorithms

A practical guide to multi-objective reinforcement learning and planning

Data availability

Code availability

References

Acknowledgements

Funding

Author information

Authors and Affiliations

Contributions

Corresponding author

Ethics declarations

Conflict of interest

Ethical approval

Consent to participate

Consent for publication

Additional information

Publisher's Note

Rights and permissions

About this article

Cite this article

Keywords

Subscribe and save

Buy Now

Navigation

Improving the application performance of Loki via algorithm optimization

Abstract

Access this article

Subscribe and save

Buy Now

Similar content being viewed by others

Multi-agent deep reinforcement learning: a survey

Multi-Agent Reinforcement Learning: A Selective Overview of Theories and Algorithms

A practical guide to multi-objective reinforcement learning and planning

Data availability

Code availability

References

Acknowledgements

Funding

Author information

Authors and Affiliations

Contributions

Corresponding author

Ethics declarations

Conflict of interest

Ethical approval

Consent to participate

Consent for publication

Additional information

Publisher's Note

Rights and permissions

About this article

Cite this article

Share this article

Keywords

Subscribe and save

Buy Now

Search

Navigation