Log in

Multi-robot Coordination and Planning in Uncertain and Adversarial Environments

  • Group Robotics (M Gini and F Amigoni, Section Editors)
  • Published:
Current Robotics Reports Aims and scope Submit manuscript

Abstract

Purpose of Review

Deploying a team of robots that can carefully coordinate their actions can make the entire system robust to individual failures. In this report, we review recent algorithmic development in making multi-robot systems robust to environmental uncertainties, failures, and adversarial attacks.

Recent Findings

We find the following three trends in the recent research in the area of multi-robot coordination: (1) resilient coordination to either withstand failures and/or attack or recover from failures/attacks; (2) risk-aware coordination to manage the trade-off risk and reward, where the risk stems due to environmental uncertainty; (3) Graph neural networks based coordination to learn decentralized multi-robot coordination policies. These algorithms have been applied to tasks such as formation control, task assignment and scheduling, search and planning, and informative data collection.

Summary

In order for multi-robot systems to become practical, we need coordination algorithms that can scale to large teams of robots dealing with dynamically changing, failure-prone, contested, and uncertain environments. There has been significant recent research on multi-robot coordination that has contributed resilient and risk-aware algorithms to deal with these issues and reduce the gap between theory and practice. Learning-based approaches have been seen to be promising, especially since they can learn who, when, and how to communicate for effective coordination. However, these algorithms have also been shown to be vulnerable to adversarial attacks, and as such develo** learning-based coordination strategies that are resilient to such attacks and robust to uncertainties is an important open area of research.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Subscribe and save

Springer+ Basic
EUR 32.99 /Month
  • Get 10 units per month
  • Download Article/Chapter or Ebook
  • 1 Unit = 1 Article or 1 Chapter
  • Cancel anytime
Subscribe now

Buy Now

Price includes VAT (France)

Instant access to the full article PDF.

Fig. 1
Fig. 2

Similar content being viewed by others

References

Papers of particular interest, published recently, have been highlighted as: • Of importance •• Of major importance

  1. Christensen H. 2020. A roadmap for us robotics: from internet to robotics (2020 edition), http://www.hichristensen.com/pdf/roadmap-2020.pdf. (Accessed on 11/18/2020).

  2. Mobile Industrial Robots A/S. Five collaborative mobile robots applications, https://www.mobile-industrial-robots.com/en/resources/whitepapers/5-collaborative-mobile-robots-applications/, Accessed: 2020-09-20.

  3. Tzoumas V, Jadbabaie A, Pappas G. 2018. Resilient non-submodular maximization over matroid constraints. ar**v:1804.01013.

  4. Zhou L, Tzoumas V, Pappas G, Tokekar P. Resilient active target tracking with multiple robots. IEEE Robot Autom Lett 2018;4(1):129–136.

    Article  Google Scholar 

  5. Martinelli A, Pont F, Siegwart R. Multi-robot localization using relative observations. 2005 IEEE international conference on robotics and automation (ICRA). IEEE; 2005. p. 2797–2802.

  6. Prorok A. Redundant robot assignment on graphs with uncertain edge costs. Distributed autonomous robotic systems. Springer; 2019. p. 313–327.

  7. Chung JJ, Smith AJ, Skeele R, Hollinger GA. Risk-aware graph search with dynamic edge cost discovery. Int J Robot Res 2019;38(2-3):182–195.

    Article  Google Scholar 

  8. • Park H, Hutchinson S. Robust rendezvous for multi-robot system with random node failures: an optimization approach. Auton Robots 2018;42(8):1807–1818. This study presents distributed robust algorithms for multi-robot rendezvous with random node failures.

    Article  Google Scholar 

  9. Matarić M. J., Sukhatme GS, Østergaard E.H. Multi-robot task allocation in uncertain environments. Auton Robot 2003;14(2-3):255–263.

    Article  MATH  Google Scholar 

  10. Denning T, Matuszek C, Koscher K, Smith JR, Kohno T. A spotlight on security and privacy risks with future household robots: attacks and lessons. Proceedings of the 11th International conference on ubiquitous computing. ACM; 2009. p. 105–114.

  11. Agmon N, Kaminka GA, Kraus S. Multi-robot adversarial patrolling: facing a full-knowledge opponent. J Artif Intell Res 2011;42:887–916.

    MathSciNet  MATH  Google Scholar 

  12. Sless E, Agmon N, Kraus S. Multi-robot adversarial patrolling: facing coordinated attacks. Proceedings of the 2014 international conference on autonomous agents and multi-agent systems. International Foundation for Autonomous Agents and Multiagent Systems; 2014. p. 1093–1100.

  13. Gil S, Kumar S, Mazumder M, Katabi D, Rus D. Guaranteeing spoof-resilient multi-robot networks. Auton Robot 2017;41(6):1383–1400.

    Article  Google Scholar 

  14. •• Saulnier K, Saldana D, Prorok A, J Pappas G, Kumar V. Resilient flocking for mobile robot teams. IEEE Robot Autom Lett 2017;2(2):1039–1046. This study firstly introduces resilience in formation control and presents a distributed resilient controller to achieve flocking behaviors of multi-robot systems, despite some adversarial team members.

    Article  Google Scholar 

  15. Schlotfeldt B, Tzoumas V, Thakur D, Pappas G. Resilient active information gathering with mobile robots. 2018 IEEE/RSJ International conference on intelligent robots and systems (IROS). IEEE; 2018. p. 4309–4316.

  16. Liteye, Counter uas (cuas), https://liteye.com/products/counter-uas/, Accessed: 2020-09-20.

  17. Parker LE. 1994. Heterogeneous multi-robot cooperation, Massachusetts Inst of Tech Cambridge Artificial Intelligence Lab, Tech Rep.

  18. Saldana D, Prorok A, Sundaram S, Campos MF, Kumar V. Resilient consensus for time-varying networks of dynamic agents. 2017 American control conference (ACC). IEEE; 2017. p. 252–258.

  19. •• Ramachandran RK, Preiss JA, Sukhatme GS. Resilience by reconfiguration: Exploiting heterogeneity in robot teams. 2019 IEEE/RSJ international conference on intelligent robots and systems (IROS). IEEE; 2019. p. 6518–6525. The first work to guarantee multi-robot resilience by exploiting resource heterogeneity and reconfiguring communication networks after resource failures.

  20. • Song J, Gupta S. Care: Cooperative autonomy for resilience and efficiency of robot teams for complete coverage of unknown environments under robot failures. Auton Robots 2020;44(3):647–671. This study presents a game-theoretical strategy that trade-offs resilience and efficiency in multi-robot coverage.

    Article  Google Scholar 

  21. • Mitra A, A Richards J, Bagchi S, Sundaram S. Resilient distributed state estimation with mobile agents: overcoming byzantine adversaries, communication losses, and intermittent measurements. Auton Robots 2019;43(3):743–768. This study develops resilient, fully-distributed, and provably correct algorithms for estimating the state of a target of interest in dynamic, failure-prone, and adversarial environments.

    Article  Google Scholar 

  22. • Yang F, Chakraborty N. Algorithm for optimal chance constrained linear assignment. 2017 IEEE international conference on robotics and automation (ICRA). IEEE; 2017. p. 801–808. This study formulates the multi-robot task assignment with payoff uncertainty as a chance-constrained combinatorial optimization problem and presents provably-good algorithms for solving this problem.

  23. •• Zhou L, Tokekar P. An approximation algorithm for risk-averse submodular optimization. International workshop on the algorithmic foundations of robotics. Springer; 2018. p. 144–159. This study presents the first polynomial-time algorithm with bounded guarantees for solving CVaR based discrete submodular maximization problems and verifies the performance of the proposed algorithm in the multi-robot assignment and environmental monitoring scenarios.

  24. A Oliehoek F, Amato C, et al, Vol. 1. A concise introduction to decentralized POMDPs. Berlin: Springer; 2016.

    Book  MATH  Google Scholar 

  25. Zhu H, Alonso-Mora J. Chance-constrained collision avoidance for mavs in dynamic environments. IEEE Robot Autom Lett 2019;4(2):776–783.

    Article  Google Scholar 

  26. •• Ruiz L, Gama F, Ribeiro A. Graph neural networks: Architectures, stability and transferability. ar**v:2008.01767. 2020. A fundamental study on Graph Neural Networks with analysis of their decentralized architectures, stability, and transferability that can act as theoretical bases for using GNNs to solve practical multi-robot coordination problems.

  27. •• Tolstaya E, Gama F, Paulos J, Pappas G, Kumar V, Ribeiro A. Learning decentralized controllers for robot swarms with graph neural networks. Conference on robot learning; 2020. p. 671–682. The first work that implements GNNs to learn decentralized controllers for multi-robot formation, e.g., flocking.

  28. Li Q, Gama F, Ribeiro A, Prorok A. 2019. Graph neural networks for decentralized multi-robot path planning. ar**v:1912.06095.

  29. • Wang Z, Gombolay M. Learning scheduling policies for multi-robot coordination with graph attention networks. IEEE Robot Autom Lett 2020;5(3):4509–4516. This study implements GNNs to learn real-time policies for multi-robot scheduling that is modeled as a combinatorial optimization problem.

    Article  Google Scholar 

  30. Chen J, Baskaran A, Zhang Z, Tokekar P. 2020. Multi-agent reinforcement learning for persistent monitoring. ar**v:2011.01129.

  31. Prorok A. 2018. Graph neural networks for learning robot team coordination. ar**v:1805.03737.

  32. Zhang T, Zhang W, Gupta MM. Resilient robots: concept, review, and future directions. Robotics 2017;6(4):22.

    Article  Google Scholar 

  33. Bezzo N, Weimer J, Pajic M, Sokolsky O, Pappas G, Lee I. Attack resilient state estimation for autonomous robotic systems. 2014 IEEE/RSJ International Conference on Intelligent Robots and Systems. IEEE; 2014. p. 3692–3698.

  34. Bezzo N, Weimer J, Du Y, Sokolsky O, Son SH, Lee I. A stochastic approach for attack resilient uav motion planning. 2016 American control conference (ACC). IEEE; 2016. p. 1366–1372.

  35. LeBlanc HJ, Zhang H, Koutsoukos X, Sundaram S. Resilient asymptotic consensus in robust networks. IEEE J Select Areas Commun 2013;31(4):766–781.

    Article  Google Scholar 

  36. Renganathan V, Summers T. Spoof resilient coordination for distributed multi-robot systems. 2017 International symposium on multi-robot and multi-agent systems (MRS). IEEE; 2017. p. 135–141.

  37. Saldana D, Prorok A, Campos MF, Kumar V. Triangular networks for resilient formations. Distributed autonomous robotic systems. Springer; 2018. p. 147–159.

  38. Guerrero-Bonilla L, Saldana D, Kumar V. Design guarantees for resilient robot formations on lattices. IEEE Robot Autom Lett 2018;4(1):89–96.

    Article  Google Scholar 

  39. Saldaña D., Guerrero-Bonilla L, Kumar V. Resilient backbones in hexagonal robot formations. Distributed autonomous robotic systems. Springer; 2019. p. 427–440.

  40. Guerrero-Bonilla L, Kumar V. Realization of r-robust formations in the plane using control barrier functions. IEEE Control Syst Lett 2019;4(2):343–348.

    Article  MathSciNet  Google Scholar 

  41. Usevitch J, Panagou D. Resilient leader-follower consensus to arbitrary reference values in time-varying graphs. IEEE Trans Autom Control 2019;65(4):1755–1762.

    Article  MathSciNet  MATH  Google Scholar 

  42. Usevitch J, Panagou D. Resilient finite-time consensus: a discontinuous systems perspective. 2020 American control conference (ACC). IEEE; 2020. p. 3285–3290.

  43. Senejohnny D, Sundaram S, De Persis C, Tesi P. Resilience against misbehaving nodes in self-triggered coordination networks. IEEE; 2018. p. 2848–2853.

  44. Senejohnny DM, Sundaram S, De Persis C, Tesi P. Resilience against misbehaving nodes in asynchronous networks. Automatica 2019;104:26–33.

    Article  MathSciNet  MATH  Google Scholar 

  45. Sun X, Nambiar R, Melhorn M, Shoukry Y, Nuzzo P. Dos-resilient multi-robot temporal logic motion planning. 2019 International conference on robotics and automation (ICRA). IEEE; 2019. p. 6051–6057.

  46. Mitra A, Sundaram S. Secure distributed observers for a class of linear time invariant systems in the presence of byzantine adversaries. 2016 IEEE 55th Conference on decision and control (CDC). IEEE; 2016. p. 2709–2714.

  47. Mitra A, Abbas W, Sundaram S. On the impact of trusted nodes in resilient distributed state estimation of lti systems. 2018 IEEE Conference on decision and control (CDC). IEEE; 2018. p. 4547–4552.

  48. Mitra A, Sundaram S. Byzantine-resilient distributed observers for lti systems. Automatica 2019;108:108487.

    Article  MathSciNet  Google Scholar 

  49. Zhou L, Tokekar P. An approximation algorithm for distributed resilient submodular maximization. 2019 international symposium on multi-robot and multi-agent systems (MRS). IEEE; 2019. p. 216–218.

  50. Zhou L, Tzoumas V, J Pappas G, Tokekar P. Distributed attack-robust submodular maximization for multi-robot planning. 2020 IEEE International conference on robotics and automation (ICRA). IEEE; 2020. to appear.

  51. Shi G, Zhou L, Tokekar P. Robust multiple-path orienteering problem: securing against adversarial attacks. 2020 robotics: science and systems (RSS); 2020. to appear.

  52. Shishika D, Kumar V. Local-game decomposition for multiplayer perimeter-defense problem. 2018 IEEE conference on decision and control (CDC). IEEE; 2018. p. 2093–2100.

  53. Shishika D, Paulos J, Dorothy MR, Hsieh MA, Kumar V. Team composition for perimeter defense with patrollers and defenders. 2019 IEEE 58th conference on decision and control (CDC). IEEE; 2019. p. 7325–7332.

  54. Shishika D, Paulos J, Kumar V. Cooperative team strategies for multi-player perimeter-defense games. IEEE Robot Autom Lett 2020;5(2):2738–2745.

    Article  Google Scholar 

  55. Ramachandran RK, Fronda N, Sukhatme GS. Resilience in multi-robot target tracking through reconfiguration. 2019 IEEE/International conference on robotics and automations (ICRA); 2020. p. 6518–6525.

  56. K Ramachandran R, Zhou L, A Preiss J, S Sukhatme G. Resilient coverage: exploring the local-to-global trade-off. 2020 IEEE/RSJ International conference on intelligent robots and systems (IROS); 2020. to appear.

  57. Mayya S, Saldaña D., Kumar V. 2020. Resilient task allocation in heterogeneous multi-robot systems. ar**v:2009.04593.

  58. Ramachandran RK, Pierpaoli P, Egerstedt M, Sukhatme GS. 2020. Resilient monitoring in heterogeneous multi-robot systems through network reconfiguration. ar**v:2008.01321.

  59. Oh K-K, Park M-C, Ahn H-S. A survey of multi-agent formation control. Automatica 2015; 53:424–440.

    Article  MathSciNet  MATH  Google Scholar 

  60. Tokekar P, Isler V, Franchi A. Multi-target visual tracking with aerial robots. 2014 IEEE/RSJ International conference on intelligent robots and systems (IROS). IEEE; 2014. p. 3067–3072.

  61. Atanasov N, Le Ny J, Daniilidis K, Pappas G. Information acquisition with sensing robots. 2014 IEEE International conference on robotics and automation (ICRA); 2014. p. 6447–6454.

  62. Zhou L, Tokekar P. Active target tracking with self-triggered communications. 2017 IEEE International conference on robotics and automation (ICRA). IEEE; 2017. p. 2117–2123.

  63. Zhou L, Tokekar P. Active target tracking with self-triggered communications in multi-robot teams. IEEE Trans Autom Sci Eng 2018;16(3):1085–1096.

    Article  Google Scholar 

  64. Zhou L, Tokekar P. Sensor assignment algorithms to improve observability while tracking targets. IEEE Trans Robot 2019;35(5):1206–1219.

    Article  Google Scholar 

  65. Michini M, Hsieh MA, Forgoston E, Schwartz IB. Robotic tracking of coherent structures in flows. IEEE Trans Robot 2014;30(3):593–603.

    Article  Google Scholar 

  66. Kumar V, Michael N. Opportunities and challenges with autonomous micro aerial vehicles. Int J Robot Res 2012;31(11):1279–1291.

    Article  Google Scholar 

  67. Nemhauser GL, Wolsey LA, Fisher ML. An analysis of approximations for maximizing submodular set functions-i. Math Program 1978;14(1):265–294.

    Article  MathSciNet  MATH  Google Scholar 

  68. Fisher ML, Nemhauser GL, Wolsey LA. An analysis of approximations for maximizing submodular set functions-ii. Polyhedral combinatorics. Springer; 1978. p. 73–87.

  69. •• Tzoumas V, Gatsis K, Jadbabaie A, J Pappas G. Resilient monotone submodular function maximization. 2017 IEEE 56th Annual conference on decision and control (CDC). IEEE; 2017. p. 1362–1367. A fundamental study that formulates a resilient submodular maximization problem and presents a polynomial-time and provably close-to-optimal algorithm for solving it. The findings from this paper are generic and can be applied to many multi-robot applications where a team of robots aims to optimize a submodular objective in adversarial environments.

  70. Song D, Kim C. -Y., Yi J. Simultaneous localization of multiple unknown and transient radio sources using a mobile robot. IEEE Trans Robot 2012;28(3):668–680.

    Article  Google Scholar 

  71. Peltzer O, Brown K, Schwager M, Kochenderfer MJ, Sehr M. 2020. Stt-cbs: A conflict-based search algorithm for multi-agent path finding with stochastic travel times. ar**v:2004.08025.

  72. Yel E, Lin TX, Bezzo N. Self-triggered adaptive planning and scheduling of uav operations. 2018 IEEE International conference on robotics and automation (ICRA). IEEE; 2018. p. 7518–7524.

  73. Toubeh M, Tokekar P. 2019. Risk-aware planning by confidence estimation using deep learning-based perception. ar**v:1910.00101.

  74. • Chow Y, Tamar A, Mannor S, Pavone M. Risk-sensitive and robust decision-making: a cvar optimization approach. Advances in neural information processing systems; 2015. p. 1522–1530. This study presents the first approximate value-iteration algorithm with error guarantees for solving CVaR MDPs.

  75. Chow Y, Ghavamzadeh M, Janson L, Pavone M. Risk-constrained reinforcement learning with percentile risk criteria. J Mach Learn Res 2017;18(1):6070–6120.

    MathSciNet  MATH  Google Scholar 

  76. •• Majumdar A, Pavone M. How should a robot assess risk? towards an axiomatic theory of risk in robotics. Robotics research. Springer; 2020. p. 75–84. This study analyzes and discusses how should a robot quantifies risk and what constitutes a “good” risk measure toward ensuring safety for robots performing under uncertainty.

  77. Fridovich-Keil D, Bajcsy A, Fisac JF, Herbert SL, Wang S, Dragan AD, Tomlin CJ. Confidence-aware motion prediction for real-time collision avoidance. Int J Robot Res 2020;39(2-3): 250–265.

    Article  Google Scholar 

  78. Singh S, Chow Y, Majumdar A, Pavone M. A framework for time-consistent, risk-sensitive model predictive control: Theory and algorithms. IEEE Trans Autom Control 2018;64(7):2905–2912.

    Article  MathSciNet  MATH  Google Scholar 

  79. Yang F, Chakraborty N. Algorithm for optimal chance constrained knapsack problem with applications to multi-robot teaming. 2018 IEEE international conference on robotics and automation (ICRA). IEEE; 2018. p. 1043–1049.

  80. Yang F, Chakraborty N. Chance constrained simultaneous path planning and task assignment for multiple robots with stochastic path costs. 2020 IEEE international conference on robotics and automation (ICRA). IEEE; 2020. p. 6661–6667.

  81. Jorgensen S, Chen RH, Milam MB, Pavone M. The team surviving orienteers problem: routing teams of robots in uncertain environments with survival constraints. Auton Robot 2018;42(4):927–952.

    Article  Google Scholar 

  82. Lacotte J, Ghavamzadeh M, Chow Y, Pavone M. Risk-sensitive generative adversarial imitation learning. 22nd international conference on artificial intelligence and statistics. PMLR; 2019. p. 2154–2163.

  83. Nam C, Shell DA. Analyzing the sensitivity of the optimal assignment in probabilistic multi-robot task allocation. IEEE Robot Autom Lett 2016;2(1):193–200.

    Google Scholar 

  84. Zhu H, Alonso-Mora J. B-uavc: Buffered uncertainty-aware voronoi cells for probabilistic multi-robot collision avoidance. 2019 international symposium on multi-robot and multi-agent systems (MRS). IEEE; 2019. p. 162–168.

  85. da Silva Arantes M, Toledo CFM, Williams BC, Ono M. Collision-free encoding for chance-constrained nonconvex path planning. IEEE Trans Robot 2019;35(2):433–448.

    Article  Google Scholar 

  86. Wang A, Jasour A, Williams BC. Non-gaussian chance-constrained trajectory planning for autonomous vehicles under agent uncertainty. IEEE Robot Autom Lett 2020;5(4):6041–6048.

    Article  Google Scholar 

  87. Indelman V. Cooperative multi-robot belief space planning for autonomous navigation in unknown environments. Auton Robot 2018;42(2):353–373.

    Article  Google Scholar 

  88. Kochenderfer MJ. Decision making under uncertainty: theory and application. Cambridge: MIT press; 2015.

    Book  MATH  Google Scholar 

  89. • Amato C, Konidaris G, Anders A, Cruz G, P How J, P Kaelbling L. Policy search for multi-robot coordination under uncertainty. Int J Robot Res 2016;35(14):1760–1778. The study presents a new MacDec-POMDP planning algorithm that utilizes macro-actions to solve significantly larger problems than existing Dec-POMDP planners.

    Article  Google Scholar 

  90. Omidshafiei S, Agha-Mohammadi A-A, Amato C, Liu S-Y, How JP, Vian J. Decentralized control of multi-robot partially observable markov decision processes using belief space macro-actions. Int J Robot Res 2017;36(2):231–258.

    Article  Google Scholar 

  91. Amato C, Konidaris G, Kaelbling LP, How JP. Modeling and planning with macro-actions in decentralized pomdps. J Artif Intell Res 2019;64:817–859.

    Article  MathSciNet  MATH  Google Scholar 

  92. Omidshafiei S, Pazis J, Amato C, How JP, Vian J. 2017. Deep decentralized multi-task multi-agent reinforcement learning under partial observability. ar**v:1703.06182.

  93. Toubeh M, Zhou L, Tokekar P. In: 2019 Northeast Robotics Colloquium (NERC), accepted as poster presentation. Risk-aware path planning and assignment with uncertainty extraction from deep learning; 2019.

  94. Sharma VD, Toubeh M, Zhou L, Tokekar P. Risk-aware planning and assignment for ground vehicles using uncertain perception from aerial vehicles. 2020 IEEE/RSJ international conference on intelligent robots and systems (IROS). IEEE; 2020. to appear.

  95. Rockafellar RT, Uryasev S. Optimization of conditional value-at-risk. J Risk 2000;2:21–42.

    Article  Google Scholar 

  96. Maehara T. Risk averse submodular utility maximization. Oper Res Lett 2015;43(5):526–529.

    Article  MathSciNet  MATH  Google Scholar 

  97. Ohsaka N, Yoshida Y. Portfolio optimization for influence spread. Proceedings of the 26th international conference on World Wide Web, International World Wide Web Conferences Steering Committee; 2017. p. 977–985.

  98. Wilder B. Risk-sensitive submodular optimization. Proceedings of the 32nd AAAI conference on artificial intelligence; 2018.

  99. Zhou L, Tokekar P. Risk-aware submodular optimization for multi-robot coordination. IEEE Trans Robot, submitted.

  100. Balasubramanian R, Zhou L, Tokekar P, Sujit P. 2020. Risk-aware submodular optimization for multi-objective travelling salesperson problem. ar**v:2011.01095.

  101. LaValle SM. Planning algorithms. Cambridge: Cambridge university press; 2006.

    Book  MATH  Google Scholar 

  102. Hart PE, Nilsson NJ, Raphael B. A formal basis for the heuristic determination of minimum cost paths. IEEE Trans Syst Sci Cybern 1968;4(2):100–107.

    Article  Google Scholar 

  103. Karaman S, Frazzoli E. Sampling-based algorithms for optimal motion planning. Int J Robot Res 2011;30(7):846–894.

    Article  MATH  Google Scholar 

  104. Hollinger GA, Pereira AA, Binney J, Somers T, Sukhatme GS. Learning uncertainty in ocean current predictions for safe and reliable navigation of underwater vehicles. J Field Robot 2016;33(1): 47–66.

    Article  Google Scholar 

  105. Monahan GE. State of the art—a survey of partially observable markov decision processes: theory, models, and algorithms. Manag Sci 1982;28(1):1–16.

    Article  MATH  Google Scholar 

  106. Liu M, Amato C, P Anesta E, D Griffith J, P How J. Learning for decentralized control of multiagent systems in large, partially-observable stochastic environments. AAAI; 2016. p. 2523–2529.

  107. Choudhury S, Gupta JK, Kochenderfer MJ, Sadigh D, Bohg J. 2020. Dynamic multi-robot task allocation under uncertainty and temporal constraints. ar**. Proceedings of the IEEE/CVF Conference on computer vision and pattern recognition; 2020. p. 4106–4115.

  108. Liu Y-C, Tian J, Ma C-Y, Glaser N, Kuo C-W, Kira Z. 2020. Who2com: Collaborative perception via learnable handshake communication. ar**v:2003.09575.

  109. Gama F, Bruna J, Ribeiro A. 2019. Stability properties of graph neural networks. ar**v:1905.04497.

  110. Khan A, Tolstaya E, Ribeiro A, Kumar V. Graph policy gradients for large scale robot control. Conference on robot learning; 2020. p. 823–834.

  111. Khan A, Kumar V, Ribeiro A. 2019. Graph policy gradients for large scale unlabeled motion planning with constraints. ar**v:1909.10704.

  112. Sutton RS, Barto AG. Reinforcement learning: an introduction. Cambridge: MIT press; 2018.

    MATH  Google Scholar 

  113. Liu S, Lever G, Merel J, Tunyasuvunakool S, Heess N, Graepel T. 2019. Emergent coordination through competition. ar**v:1902.07151.

  114. Blumenkamp J, Prorok A. 2020. The emergence of adversarial communication in multi-agent reinforcement learning. ar**v:2008.02616.

  115. Kurakin A, Goodfellow I, Bengio S. Adversarial examples in the physical world 2017. International conference on learning representation (ICLR) (Workshop); 2017.

  116. Eykholt K, Evtimov I, Fernandes E, Li B, Rahmati A, **ao C, Prakash A, Kohno T, Song D. Robust physical-world attacks on deep learning visual classification. Proceedings of the IEEE conference on computer vision and pattern recognition; 2018. p. 1625–1634.

  117. Madry A, Makelov A, Schmidt L, Tsipras D, Vladu A. 2017. Towards deep learning models resistant to adversarial attacks. ar**v:1706.06083.

  118. Athalye A, Carlini N, Wagner D. 2018. Obfuscated gradients give a false sense of security: circumventing defenses to adversarial examples. ar**v:1802.00420.

  119. Tramèr F., Kurakin A, Papernot N, Goodfellow I, Boneh D, McDaniel P. 2017. Ensemble adversarial training: Attacks and defenses. ar**v:1705.07204.

Download references

Funding

The authors would like to thank the National Science Foundation (NSF IIS-1637915) and the Office of Naval Research (ONR N00014-18-1-2829) for their supports.

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Pratap Tokekar.

Ethics declarations

Conflict of Interest

The authors declare no competing interests.

Additional information

Human and Animal Rights and Informed Consent

This article does not contain any studies with human or animal subjects performed by any of the authors.

Publisher’s Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

This article belongs to the Topical Collection: Topical Collection on Group Robotics

Rights and permissions

Reprints and permissions

About this article

Check for updates. Verify currency and authenticity via CrossMark

Cite this article

Zhou, L., Tokekar, P. Multi-robot Coordination and Planning in Uncertain and Adversarial Environments. Curr Robot Rep 2, 147–157 (2021). https://doi.org/10.1007/s43154-021-00046-5

Download citation

  • Accepted:

  • Published:

  • Issue Date:

  • DOI: https://doi.org/10.1007/s43154-021-00046-5

Keywords

Navigation