Multi-robot Coordination and Planning in Uncertain and Adversarial Environments

Zhou, Lifeng; Tokekar, Pratap

doi:10.1007/s43154-021-00046-5

Group Robotics (M Gini and F Amigoni, Section Editors)
Published: 19 April 2021

Volume 2, pages 147–157, (2021)

Current Robotics Reports

Abstract

Purpose of Review

Deploying a team of robots that can carefully coordinate their actions can make the entire system robust to individual failures. In this report, we review recent algorithmic developments in making multi-robot systems robust to environmental uncertainties, failures, and adversarial attacks.

Recent Findings

We find the following three trends in recent research on multi-robot coordination: (1) resilient coordination to either withstand failures and attacks or recover from them; (2) risk-aware coordination to manage the trade-off between risk and reward, where the risk stems from environmental uncertainty; and (3) graph neural network-based coordination to learn decentralized multi-robot coordination policies. These algorithms have been applied to tasks such as formation control, task assignment and scheduling, search and planning, and informative data collection.

Summary

For multi-robot systems to become practical, we need coordination algorithms that can scale to large teams of robots dealing with dynamically changing, failure-prone, contested, and uncertain environments. Significant recent research on multi-robot coordination has contributed resilient and risk-aware algorithms that address these issues and reduce the gap between theory and practice. Learning-based approaches appear promising, especially since they can learn who, when, and how to communicate for effective coordination. However, these algorithms have also been shown to be vulnerable to adversarial attacks, so developing learning-based coordination strategies that are resilient to such attacks and robust to uncertainties is an important open area of research.

Funding

The authors would like to thank the National Science Foundation (NSF IIS-1637915) and the Office of Naval Research (ONR N00014-18-1-2829) for their support.

Author information

Authors and Affiliations

Electrical and Computer Engineering, Virginia Tech, Blacksburg, VA, 24061, USA
Lifeng Zhou
Computer Science, University of Maryland, College Park, MD, 20742, USA
Pratap Tokekar

Corresponding author

Correspondence to Pratap Tokekar.

Ethics declarations

Conflict of Interest

The authors declare no competing interests.

Additional information

Human and Animal Rights and Informed Consent

This article does not contain any studies with human or animal subjects performed by any of the authors.

Publisher’s Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

This article belongs to the Topical Collection on Group Robotics

About this article

Cite this article

Zhou, L., Tokekar, P. Multi-robot Coordination and Planning in Uncertain and Adversarial Environments. Curr Robot Rep 2, 147–157 (2021). https://doi.org/10.1007/s43154-021-00046-5

Accepted: 04 March 2021
Published: 19 April 2021
Issue Date: June 2021
DOI: https://doi.org/10.1007/s43154-021-00046-5
