Trust-driven reinforcement selection strategy for federated learning on IoT devices

Rjoub, Gaith; Wahab, Omar Abdel; Bentahar, Jamal; Bataineh, Ahmed

doi:10.1007/s00607-022-01078-1

Trust-driven reinforcement selection strategy for federated learning on IoT devices

Special Issue Article
Published: 01 April 2022

Volume 106, pages 1273–1295, (2024)
Cite this article

Computing Aims and scope Submit manuscript

Gaith Rjoub ORCID: orcid.org/0000-0002-7282-0687¹,
Omar Abdel Wahab²,
Jamal Bentahar¹ &
…
Ahmed Bataineh¹

1296 Accesses
28 Citations
Explore all metrics

Abstract

Federated learning is a distributed machine learning approach that enables a large number of edge/end devices to perform on-device training for a single machine learning model, without having to share their own raw data. We consider in this paper a federated learning scenario wherein the local training is carried out on IoT devices and the global aggregation is done at the level of an edge server. One essential challenge in this emerging approach is IoT devices selection (also called scheduling), i.e., how to select the IoT devices to participate in the distributed training process. The existing approaches suggest to base the scheduling decision on the resource characteristics of the devices to guarantee that the selected devices would have enough resources to carry out the training. In this work, we argue that trust should be an integral part of the decision-making process and therefore design a trust establishment mechanism between the edge server and IoT devices. The trust mechanism aims to detect those IoT devices that over-utilize or under-utilize their resources during the local training. Thereafter, we introduce DDQN-Trust, a double deep Q learning-based selection algorithm that takes into account the trust scores and energy levels of the IoT devices to make appropriate scheduling decisions. Finally, we integrate our solution into four federated learning aggregation approaches, namely, FedAvg, FedProx, FedShare and FedSGD. Experiments conducted using a real-world dataset show that our DDQN-Trust solution always achieves better performance compared to two main benchmarks: the DQN and random scheduling algorithms. The results also reveal that FedProx outperforms the competitor aggregation models in terms of accuracy when integrated into our DDQN-Trust solution.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Log in via an institution

Subscribe and save

Springer+ Basic

EUR 32.99 /Month

Get 10 units per month
Download Article/Chapter or Ebook
1 Unit = 1 Article or 1 Chapter
Cancel anytime

Subscribe now

Buy Now

Price includes VAT (Germany)

Instant access to the full article PDF.

Institutional subscriptions

A Trust and Energy-Aware Double Deep Reinforcement Learning Scheduling Strategy for Federated Learning on IoT Devices

One-Shot Federated Learning-based Model-Free Reinforcement Learning

Adaptive device sampling and deadline determination for cloud-based heterogeneous federated learning

Article Open access 03 November 2023

Notes

References

AbdulRahman S, Tout H, Mourad A, Talhi C (2020) FedMCCS: multicriteria client selection model for optimal IoT federated learning. IEEE Internet Things J 8(6):4723–4735
Article Google Scholar
Anh TT, Luong NC, Niyato D, Kim DI, Wang LC (2019) Efficient training management for mobile crowd-machine learning: a deep reinforcement learning approach. IEEE Wirel Commun Lett 8(5):1345–1348
Article Google Scholar
Bataineh AS, Bentahar J, Mizouni R, Abdel Wahab O, Rjoub G, El Barachi M (2021) Cloud computing as a platform for monetizing data services: a two-sided game business model. IEEE Trans Netw Serv Manag. https://doi.org/10.1109/TNSM.2021.3128160
Article Google Scholar
Bataineh AS, Bentahar J, Wahab OA, Mizouni R, Rjoub G (2020) A game-based secure trading of big data and IoT services: blockchain as a two-sided market. In: International conference on service-oriented computing. Springer, pp 85–100
Bentahar J, Drawel N, Sadiki A (2022) Quantitative group trust: a two-stage verification approach. In: Faliszewski P, Mascardi V, Pelachaud C, Taylor ME (eds) Proceedings of the 21st international conference on autonomous agents and multiagent systems (AAMAS 2022), pp xx–xx
Chen M, Yang Z, Saad W, Yin C, Poor HV, Cui S (2019) A joint learning and communications framework for federated learning over wireless networks. CoRR. ar**v:1909.07972
Chen S, Shen C, Zhang L, Tang Y (2021) Dynamic aggregation for heterogeneous quantization in federated learning. IEEE Trans Wirel Commun 20:6804–6819
Article Google Scholar
Dai H, Zeng X, Yu Z, Wang T (2019) A scheduling algorithm for autonomous driving tasks on mobile edge computing servers. J Syst Archit 94:14–23
Article Google Scholar
Ding D, Fan X, Zhao Y, Kang K, Yin Q, Zeng J (2020) Q-learning based dynamic task scheduling for energy-efficient cloud computing. Future Gener Comput Syst 108:361–371
Article Google Scholar
Dinh CT, Tran NH, Nguyen MN, Hong CS, Bao W, Zomaya AY, Gramoli V (2020) Federated learning over wireless networks: convergence analysis and resource allocation. IEEE/ACM Trans Netw 29:398–409
Article Google Scholar
Drawel N, Bentahar J, Laarej A, Rjoub G (2020) Formalizing group and propagated trust in multi-agent systems. In: Bessiere C (ed) Proceedings of the twenty-ninth international joint conference on artificial intelligence, IJCAI 2020, pp 60–66. https://doi.org/10.24963/ijcai.2020/9
Drawel N, Bentahar J, Laarej A, Rjoub G (2022) Formal verification of group and propagated trust in multi-agent systems. Auton Agents Multi-Agent Syst. https://doi.org/10.1007/s10458-021-09542-6 (in press)
Article Google Scholar
Hu C, Jiang J, Wang Z (2019) Decentralized federated learning: a segmented gossip approach. CoRR. ar**v:1908.07782
Huang J, Li S, Chen Y (2020) Revenue-optimal task scheduling and resource management for IoT batch jobs in mobile edge computing. Peer Peer Netw Appl 13(5):1776–1787
Article Google Scholar
Iglewicz B, Hoaglin DC (1993) How to detect and handle outliers, vol 16. ASQ Press, Milwaukee
Google Scholar
Khan LU, Saad W, Han Z, Hossain E, Hong CS (2021) Federated learning for internet of things: recent advances, taxonomy, and open challenges. IEEE Commun Surv Tutor 23:1759–1799
Article Google Scholar
Lei L, Tan Y, Zheng K, Liu S, Zhang K, Shen X (2020) Deep reinforcement learning for autonomous internet of things: model, applications and challenges. IEEE Commun Surv Tutor 22(3):1722–1760
Article Google Scholar
Li T, Sahu AK, Zaheer M, Sanjabi M, Talwalkar A, Smith V (2018) Federated optimization in heterogeneous networks. ar**v preprint. ar**v:1812.06127
Lin C, Deng D, Chih Y, Chiu H (2019) Smart manufacturing scheduling with edge computing using multiclass deep Q network. IEEE Trans Ind Inform 15(7):4276–4284
Article Google Scholar
Luo S (2020) Dynamic scheduling for flexible job shop with new job insertions by deep reinforcement learning. Appl Soft Comput 91:106208
Article Google Scholar
Ma Z, Zhao M, Cai X, Jia Z (2021) Fast-convergent federated learning with class-weighted aggregation. J Syst Archit 117:102125
Article Google Scholar
Mao H, Alizadeh M, Menache I, Kandula S (2016) Resource management with deep reinforcement learning. In: Proceedings of the 15th ACM workshop on hot topics in networks, pp 50–56
McMahan B, Moore E, Ramage D, Hampson S, y Arcas BA (2017) Communication-efficient learning of deep networks from decentralized data. In: Artificial intelligence and statistics. PMLR, pp 1273–1282
Nesterov Y et al (2018) Lectures on convex optimization, vol 137. Springer, Berlin
Google Scholar
Nguyen HT, Luong NC, Zhao J, Yuen C, Niyato D (2020) Resource allocation in mobility-aware federated learning networks: a deep reinforcement learning approach. In: 6th IEEE world forum on internet of things, WF-IoT 2020, New Orleans, LA, USA, June 2-16, 2020. IEEE, pp 1–6
Nishio T, Yonetani R (2019) Client selection for federated learning with heterogeneous resources in mobile edge. In: IEEE international conference on communications, ICC 2019, Shanghai, China, May 20-24, 2019. IEEE, pp 1–7
Rjoub G, Bentahar J, Abdel Wahab O, Saleh Bataineh A (2020) Deep and reinforcement learning for automated task scheduling in large-scale cloud computing systems. Pract Exp Concurr Comput 33:e5919
Article Google Scholar
Rjoub G, Bentahar J, Wahab OA (2020) Bigtrustscheduling: trust-aware big data task scheduling approach in cloud computing environments. Future Gener Comput Syst 110:1079–1097
Article Google Scholar
Rjoub G, Bentahar J, Wahab OA, Bataineh AS (2019) Deep smart scheduling: a deep learning approach for automated big data scheduling over the cloud. In: 7th International conference on future internet of things and cloud, FiCloud 2019, Istanbul, Turkey, August 26-28, 2019. IEEE, pp 189–196
Rjoub G, Wahab OA, Bentahar J, Bataineh A (2020) A trust and energy-aware double deep reinforcement learning scheduling strategy for federated learning on IoT devices. In: International conference on service-oriented computing. Springer, pp 319–333
Rjoub G, Wahab OA, Bentahar J, Bataineh AS (2021) Improving autonomous vehicles safety in snow weather using federated YOLO CNN learning. In: International conference on mobile web and intelligent information systems. Springer, pp 121–134
Shamir O, Srebro N, Zhang T (2014) Communication-efficient distributed optimization using an approximate newton-type method. In: International conference on machine learning. PMLR, pp 1000–1008
van Hasselt H, Guez A, Silver D (2016) Deep reinforcement learning with double Q-learning. In: Schuurmans D, Wellman MP (eds) Proceedings of the thirtieth AAAI conference on artificial intelligence, February 12-17, 2016, Phoenix, Arizona, USA. AAAI Press, pp 2094–2100
Wahab OA, Bentahar J, Otrok H, Mourad A (2019) Resource-aware detection and defense system against multi-type attacks in the cloud: repeated Bayesian Stackelberg game. IEEE Trans Dependable Secur Comput 18:605–622
Article Google Scholar
Wahab OA, Cohen R, Bentahar J, Otrok H, Mourad A, Rjoub G (2020) An endorsement-based trust bootstrap** approach for newcomer cloud services. Inf Sci 527:159–175
Article Google Scholar
Wahab OA, Mourad A, Otrok H, Taleb T (2021) Federated machine learning: survey, multi-level classification, desirable criteria and future directions in communication and networking systems. IEEE Commun Surv Tutor 23:1342–1397
Article Google Scholar
Wang X, Han Y, Wang C, Zhao Q, Chen X, Chen M (2019) In-edge AI: intelligentizing mobile edge computing, caching and communication by federated learning. IEEE Netw 33(5):156–165
Article Google Scholar
Yang HH, Liu Z, Quek TQS, Poor HV (2020) Scheduling policies for federated learning in wireless networks. IEEE Trans Commun 68(1):317–333
Article Google Scholar
Zhao Y, Li M, Lai L, Suda N, Civin D, Chandra V (2018) Federated learning with non-iid data. ar**v preprint. ar**v:1806.00582
Zhou Z, Yang S, Pu L, Yu S (2020) CEFL: online admission control, data scheduling, and accuracy tuning for cost-efficient federated learning across edge nodes. IEEE Internet Things J 7(10):9341–9356
Article Google Scholar
Zhu G, Liu D, Du Y, You C, Zhang J, Huang K (2020) Toward an intelligent edge: wireless communication meets machine learning. IEEE Commun Mag 58(1):19–25
Article Google Scholar

Download references

Funding

Funding was provided by Natural Sciences and Engineering Research Council of Canada (Discovery Grant), Defence Research and Development Canada (Innovation for Defence Excellence and Security (IDEaS)), Mitacs (Accelerate).

Author information

Authors and Affiliations

Concordia University, Montreal, QC, Canada
Gaith Rjoub, Jamal Bentahar & Ahmed Bataineh
Université du Québec en Outaouais, Gatineau, QC, Canada
Omar Abdel Wahab

Authors

Gaith Rjoub
View author publications
You can also search for this author in PubMed Google Scholar
Omar Abdel Wahab
View author publications
You can also search for this author in PubMed Google Scholar
Jamal Bentahar
View author publications
You can also search for this author in PubMed Google Scholar
Ahmed Bataineh
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Gaith Rjoub.

Additional information

Publisher's Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Rights and permissions

Reprints and permissions

About this article

Cite this article

Rjoub, G., Wahab, O.A., Bentahar, J. et al. Trust-driven reinforcement selection strategy for federated learning on IoT devices. Computing 106, 1273–1295 (2024). https://doi.org/10.1007/s00607-022-01078-1

Download citation

Received: 10 March 2021
Accepted: 15 March 2022
Published: 01 April 2022
Issue Date: April 2024
DOI: https://doi.org/10.1007/s00607-022-01078-1

Keywords

Mathematics Subject Classification

Access this article

Log in via an institution

Subscribe and save

Springer+ Basic

EUR 32.99 /Month

Get 10 units per month
Download Article/Chapter or Ebook
1 Unit = 1 Article or 1 Chapter
Cancel anytime

Subscribe now

Buy Now

Price includes VAT (Germany)

Instant access to the full article PDF.

Institutional subscriptions

Trust-driven reinforcement selection strategy for federated learning on IoT devices

Abstract

Access this article

Subscribe and save

Buy Now

Similar content being viewed by others

A Trust and Energy-Aware Double Deep Reinforcement Learning Scheduling Strategy for Federated Learning on IoT Devices

One-Shot Federated Learning-based Model-Free Reinforcement Learning

Adaptive device sampling and deadline determination for cloud-based heterogeneous federated learning

Notes

References

Funding

Author information

Authors and Affiliations

Corresponding author

Additional information

Publisher's Note

Rights and permissions

About this article

Cite this article

Keywords

Mathematics Subject Classification

Subscribe and save

Buy Now

Navigation

Trust-driven reinforcement selection strategy for federated learning on IoT devices

Abstract

Access this article

Subscribe and save

Buy Now

Similar content being viewed by others

A Trust and Energy-Aware Double Deep Reinforcement Learning Scheduling Strategy for Federated Learning on IoT Devices

One-Shot Federated Learning-based Model-Free Reinforcement Learning

Adaptive device sampling and deadline determination for cloud-based heterogeneous federated learning

Notes

References

Funding

Author information

Authors and Affiliations

Corresponding author

Additional information

Publisher's Note

Rights and permissions

About this article

Cite this article

Share this article

Keywords

Mathematics Subject Classification

Subscribe and save

Buy Now

Search

Navigation