Log in

Path optimization for marine vehicles in ocean currents using reinforcement learning

  • Original article
  • Published:
Journal of Marine Science and Technology Aims and scope Submit manuscript

Abstract

This study proposes a path planning algorithm for marine vehicles based on machine learning. The algorithm considers the dynamic characteristics of the vehicle and disturbance effects in ocean environments. The movements of marine vehicles are influenced by various physical disturbances in ocean environments, such as wind, waves, and currents. In the present study, the effects of ocean currents are the primary consideration. A kinematic model is used to incorporate the nonholonomic motion characteristics of a marine vehicle, and the reinforcement learning algorithm is used for path optimization to generate a feasible path that can be tracked by the vehicle. The proposed approach determines a near-optimal path that connects the start and goal points with a reasonable computational cost when the map and current field data are provided. To verify the optimality and validity of the proposed algorithm, a set of simulations were performed in simulated and actual ocean current conditions, and their results are presented.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Subscribe and save

Springer+ Basic
EUR 32.99 /Month
  • Get 10 units per month
  • Download Article/Chapter or Ebook
  • 1 Unit = 1 Article or 1 Chapter
  • Cancel anytime
Subscribe now

Buy Now

Price includes VAT (Germany)

Instant access to the full article PDF.

Fig. 1
Fig. 2
Fig. 3
Fig. 4
Fig. 5
Fig. 6
Fig. 7
Fig. 8
Fig. 9
Fig. 10

Similar content being viewed by others

References

  1. Bryson AE, Ho YC (1975) Applied Optimal Control: Optimization. Estimation and Control. Hemisphere Pub, Corp

  2. Sutton RS, Barto AG (1998) Introduction to Reinforcement Learning. MIT Press

  3. Warren CW (1990) A technique for autonomous underwater vehicle route planning. IEEE J Ocean Eng 15(3):199–204

    Article  Google Scholar 

  4. Carroll KP, McClaran SR, Nelson EL, Barnett DM, Friesen DK, Williams GN (1992) AUV path planning: an A* approach to path planning with consideration of variable vehicle speeds and multiple, overlap**, time-dependent exclusion zones. In: Proceedings of the Symposium on AUV Technology, Washington, DC, June 02–03, pp 79–84

  5. Sugihara K, Yuh J (1997) GA-based motion planning for underwater robotic vehicle. In: Proceedings of the 10th International Symposium on Unmanned Untethered Submersible Technology, Durham, NH, USA, pp 406–415

  6. Alvarez A, Caiti A (2002) Interactions of autonomous underwater vehicles with variable scale ocean structures. In: Proceedings of the IFAC World Conference Systems, Barcelona, Spain

  7. Alvarez A, Caiti A, Onken R (2004) Evolutionary path planning for autonomous underwater vehicles in a variable ocean. IEEE J Ocean Eng 29(2):418–429

    Article  Google Scholar 

  8. Garau B, Alvarez A, Oliver G (2005) Path planning of autonomous underwater vehicles in current fields with complex spatial variability: an A* approach. In: Proceedings of the 2005 IEEE International Conference on Robotics and Automation, Barcelona, Spain, April 18–22, pp 194–198

  9. Lee T, Chung H, Myung H (2011) Multi-resolution path planning for marine surface vehicle considering environmental effects. In: 2011 IEEE OCEANS, Santander, Spain, June 06–09, pp 1–9

  10. Kim H, Lee T, Chung H, Son N, Myung H (2012) Any-angle path planning with limit-cycle circle set for marine surface vehicle. In: 2012 IEEE International Conference on Robotics and Automation, Saint Paul, MN, May 14–18, pp 2275–2280

  11. Leyland G, Philpott A (2005) Rowing to Barbados. Dagstuhl Seminar Proceedings 05031 Algorithms for Optimization with Incomplete Information. http://drops.dagstuhl.de/opus/volltexte/2005/65

  12. Shao W, Zhou P, Thong SK (2012) Development of a novel forward dynamic programming method for weather routing. J Mar Sci Technol 17(2):239–251

    Article  Google Scholar 

  13. Lin YH, Fang MC, Yeung RW (2013) The optimization of ship weather-routing algorithm based the composite influence of multi-dynamic elements. Appl Ocean Res 43:184–194

    Article  Google Scholar 

  14. Padhy CP, Sen D, Bhaskaran PK (2008) Application of wave model for weather routing of ships in the North Indian Ocean. Nat Hazards 44(3):373–385

    Article  Google Scholar 

  15. Rao D, Williams SB (2009) Large-scale path planning for underwater gliders in ocean currents. In: Proceedings of 2009 Australasian Conference on Robotics and Automation, Sydney, Australia, December 02–04

  16. Ross S, Chaib-draa B, Pineau J (2008) Bayesian reinforcement learning in continuous POMDPs with application to robot navigation. In: 2008 IEEE International Conference on Robotics and Automation, Pasadena, CA, May 19–23, pp 2845–2851

  17. Fjerdingen SA, Kyrkjeboe E, Transeth AA (2010) AUV Pipeline Following using Reinforcement learning. In: 2010 41st International Symposium on and 2010 6th German Conference on Robotics, Munich, Germany, June 07–09, pp 1–8

  18. El-Fakdi A, Carreras M (2013) Two-step gradient-based reinforcement learning for underwater robotics behavior learning. Robot Auton Syst 61(3):271–282

    Article  Google Scholar 

  19. Carreres M, Yuh Junku, Batlle J, Ridao P (2005) A behavior-based scheme using reinforcement learning for autonomous underwater vehicles. IEEE J Ocean Eng 30(2):416–427

    Article  Google Scholar 

  20. Laumond JP, Sekhavat S, Lamiraux F (1998) Robot Motion Planning and Control. Guidelines in nonholonomic motion planning for mobile robots. Springer, Berlin

    Book  Google Scholar 

  21. Zexiang L, Canny JF (2012) Nonholonomic Motion Planning. Springer Science & Business Media

  22. Chapra SC (2010) Numerical Methods for Engineers, 6th edn. McGraw-Hill

  23. Karaman S, Frazzoli E (2011) Sampling-based algorithms for optimal motion planning. Int J Robot Res 30(7):846–894

    Article  MATH  Google Scholar 

Download references

Acknowledgments

This research was a part of the project titled ‘Development of Management Technology for HNS Accident', funded by the Ministry of Oceans and Fisheries, Korea. This research was also supported by a grant from the Hanwha Corporation, R&D Center, Korea.

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to **whan Kim.

About this article

Check for updates. Verify currency and authenticity via CrossMark

Cite this article

Yoo, B., Kim, J. Path optimization for marine vehicles in ocean currents using reinforcement learning. J Mar Sci Technol 21, 334–343 (2016). https://doi.org/10.1007/s00773-015-0355-9

Download citation

  • Received:

  • Accepted:

  • Published:

  • Issue Date:

  • DOI: https://doi.org/10.1007/s00773-015-0355-9

Keywords

Navigation