Reinforcement Learning for 3 vs. 2 Keepaway

Stone, Peter; Sutton, Richard S.; Singh, Satinder

doi:10.1007/3-540-45324-5_23

Peter Stone⁴,
Richard S. Sutton⁴ &
Satinder Singh⁴

Part of the book series: Lecture Notes in Computer Science ((LNAI,volume 2019))

Included in the following conference series:

Robot Soccer World Cup

686 Accesses
8 Citations

Abstract

As a sequential decision problem, robotic soccer can benefit from research in reinforcement learning. We introduce the 3 vs. 2 keepaway domain, a subproblem of robotic soccer implemented in the RoboCup soccer server. We then explore reinforcement learning methods for policy evaluation and action selection in this distributed, real-time, partially observable, noisy domain. We present empirical results demonstrating that a learned policy can dramatically outperform hand-coded policies.

Download to read the full chapter text

Chapter PDF

Learning to Run Faster in a Humanoid Robot Soccer Environment Through Reinforcement Learning

Towards a Principled Solution to Simulated Robot Soccer

rSoccer: A Framework for Studying Reinforcement Learning in Small and Very Small Size Robot Soccer

Keywords

These keywords were added by machine and not by the authors. This process is experimental and the keywords may be updated as the learning algorithm improves.

References

J. S. Albus. Brains, Behavior, and Robotics. Byte Books, Peterborough, NH, 1981.
Google Scholar
T. Andou. Refinement of soccer agents’ positions using reinforcement learning. In H. Kitano, editor, RoboCup-97: Robot Soccer World Cup I, pages 373–388. Springer Verlag, Berlin, 1998.
Chapter Google Scholar
R. H. Crites and A. G. Barto. Improving elevator performance using reinforcement learning. In D. S. Touretzky, M. C. Mozer, and M. E. Hasselmo, editors, Advances in Neural Processing Systems 8, Cambridge, MA, 1996. MIT Press.
Google Scholar
H. Kitano, M. Tambe, P. Stone, M. Veloso, S. Coradeschi, E. Osawa, H. Matsubara, I. Noda, and M. Asada. The RoboCup synthetic agent challenge 97. In Proceedings of the Fifteenth International Joint Conference on Artificial Intelligence, pages 24–29, San Francisco, CA 1997. Morgan Kaufmann.
Google Scholar
D. McAllester and P. Stone. Kee** the ball from cmunited-99. In P. Stone, T. Balch, and G. Kraetszchmar, editors, RoboCup-2000: Robot Soccer World Cup IV, Berlin, 2001. Springer Verlag. To appear.
Google Scholar
I. Noda, H. Matsubara, K. Hiraki, and I. Frank. Soccer server: A tool for research on multiagent systems. Applied Artificial Intelligence, 12:233-50, 1998.
Article Google Scholar
J. R. Quinlan. C4.5: Programs for Machine Learning. Morgan Kaufmann, San Mateo, CA, 1993.
Google Scholar
P. Stone. Layered Learning in Multiagent Systems: A Winning Approach to Robotic Soccer. MIT Press, 2000.
Google Scholar
P. Stone, P. Riley, and M. Veloso. The CMUnited-99 champion simulator team. In M. Veloso, E. Pagello, and H. Kitano, editors, RoboCup-99: Robot Soccer World Cup III, pages 35–48. Springer Verlag, Berlin, 2000.
Chapter Google Scholar
P. Stone and M. Veloso. Team-partitioned, opaque-transition reinforcement learning. In M. Asada and H. Kitano, editors, RoboCup-98: Robot Soccer World Cup II. Springer Verlag, Berlin, 1999. Also in Proceedings of the Third International Conference on Autonomous Agents,1999.
Google Scholar
R. S. Sutton and A. G. Barto. Reinforcement Learning: An Introduction. MIT Press, Cambridge,Massachusetts, 1998.
Google Scholar
R. S. Sutton and S. D. Whitehead. Online learning with random representations. In Proceedings of the Tenth International Conference on Machine Learning, pages 314-21, 1993.
Google Scholar
M. Tan. Multi-agent reinforcement learning: Independent vs. cooperative agents. In Proceedings of the Tenth International Conference on Machine Learning, pages 330-37, 1993.
Google Scholar
E. Uchibe. Cooperative Behavior Acquisition by Learning and Evolution in a Multi-Agent Environment for Mobile Robots. PhD thesis, Osaka University, January 1999.
Google Scholar
M. Veloso, P. Stone, and M. Bowling. Anticipation as a key for collaboration in a team of agents: A case study in robotic soccer. In Proceedings of SPIE Sensor Fusion and Decentralized Control in Robotic Systems II, volume 3839, Boston, September 1999.
Google Scholar
C. J. C. H. Watkins. Learning from Delayed Rewards. PhD thesis, King’s Cambridge, UK, 1989.
Google Scholar

Download references

Author information

Authors and Affiliations

AT&T Labs — Research, 180 Park Ave., Florham Park, NJ, 07932, USA
Peter Stone, Richard S. Sutton & Satinder Singh

Authors

Peter Stone
View author publications
You can also search for this author in PubMed Google Scholar
Richard S. Sutton
View author publications
You can also search for this author in PubMed Google Scholar
Satinder Singh
View author publications
You can also search for this author in PubMed Google Scholar

Editor information

Editors and Affiliations

AT & Labs — Research, 180 Park Ave., room A273, Florham Park, NJ, 07932, USA
Peter Stone
The Robotics Institute, Carnegie Mellon University, 5000 Forbes Avenue, Pittsburgh, PA, 15213-3891, USA
Tucker Balch
Neural Information Processing Department, University of Ulm, Oberer Eselsberg, 89069, Ulm, Germany
Gerhard Kraetzschmar

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Stone, P., Sutton, R.S., Singh, S. (2001). Reinforcement Learning for 3 vs. 2 Keepaway. In: Stone, P., Balch, T., Kraetzschmar, G. (eds) RoboCup 2000: Robot Soccer World Cup IV. RoboCup 2000. Lecture Notes in Computer Science(), vol 2019. Springer, Berlin, Heidelberg. https://doi.org/10.1007/3-540-45324-5_23

Download citation

DOI: https://doi.org/10.1007/3-540-45324-5_23
Published: 20 September 2001
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-42185-6
Online ISBN: 978-3-540-45324-6
eBook Packages: Springer Book Archive

Publish with us

Policies and ethics

Reinforcement Learning for 3 vs. 2 Keepaway

Abstract

Chapter PDF

Similar content being viewed by others

Learning to Run Faster in a Humanoid Robot Soccer Environment Through Reinforcement Learning

Towards a Principled Solution to Simulated Robot Soccer

rSoccer: A Framework for Studying Reinforcement Learning in Small and Very Small Size Robot Soccer

Keywords

References

Author information

Authors and Affiliations

Editor information

Editors and Affiliations

Rights and permissions

Copyright information

About this paper

Cite this paper

Download citation

Publish with us

Navigation

Reinforcement Learning for 3 vs. 2 Keepaway

Abstract

Chapter PDF

Similar content being viewed by others

Learning to Run Faster in a Humanoid Robot Soccer Environment Through Reinforcement Learning

Towards a Principled Solution to Simulated Robot Soccer

rSoccer: A Framework for Studying Reinforcement Learning in Small and Very Small Size Robot Soccer

Keywords

References

Author information

Authors and Affiliations

Editor information

Editors and Affiliations

Rights and permissions

Copyright information

About this paper

Cite this paper

Download citation

Share this paper

Publish with us

Search

Navigation