An Intelligent Scheduling Method for Multi-domain Cooperative Operation Based on Deep Reinforcement Learning

  • Conference paper
  • First Online:
Proceedings of 2022 10th China Conference on Command and Control (C2 2022)

Part of the book series: Lecture Notes in Electrical Engineering ((LNEE,volume 949))

Included in the following conference series:

  • 1057 Accesses

Abstract

In order to deal with the Multi-Domain Operation (MDO) proposed by the US military, our army has begun to focus on develo** its joint operation capabilities, particularly the capability of multi-domain cooperative operation scheduling. Optimal scheduling allows for a reduction of operation costs and time. However, finding such schedules is usually difficult in a large-scale and complex environment, which is extremely prone to non-optimal and non-convergent behaviors. Recent advances in Deep Reinforcement Learning (DRL) in learning complex behaviors allow new approach options. This paper presents an efficient DRL environment for multi-domain cooperative operation scheduling in order to arrange suitable scheduling in MDO, in which multi-domain operation scheduling is modeled as a DRL problem. In addition, we present an end-to-end DRL-based method for automatically learning the multi-domain operation scheduling problem. Furthermore, we devise an action selection method that satisfies the task priority constraints. Finally, we provide a multi-domain operation instance, and the experiment demonstrates the effectiveness and applicability of the environment and method proposed in this paper for dealing with the multi-domain cooperative operation scheduling challenge.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Subscribe and save

Springer+ Basic
EUR 32.99 /Month
  • Get 10 units per month
  • Download Article/Chapter or Ebook
  • 1 Unit = 1 Article or 1 Chapter
  • Cancel anytime
Subscribe now

Buy Now

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 389.00
Price excludes VAT (USA)
  • Available as EPUB and PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 499.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free ship** worldwide - see info
Hardcover Book
USD 499.99
Price excludes VAT (USA)
  • Durable hardcover edition
  • Dispatched in 3 to 5 business days
  • Free ship** worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Similar content being viewed by others

References

  1. Zhang, W.M., Huang, S.P., Huang, J.C., et al.: Analysis on multi-domain operation and its command and control problems. Command Inf. Syst. Technol. 11(01), 1–6 (2020)

    Google Scholar 

  2. Liu, K.: Theoretical thinking on the joint all-domain command and control system of the U.S. army.  J. China Acad. Electron. Inf. Technol. 16(07), 722–727 (2021)

    Google Scholar 

  3. Liu, Y., Zhou, J., Lim, A., et al.: Lower bounds and heuristics for the Unit-Capacity resource constrained project scheduling problem with transfer time. Comput. Ind. Eng. 161, 107605 (2021)

    Article  Google Scholar 

  4. Tao, W.Q, Wei, Y.: Overview of unified management and scheduling technology of battlefield resources. Automat. Instrumen. 261(07), 55–57+63 (2021)

    Google Scholar 

  5. Levchuk, G.M., Levchuk, Y.N., Luo, J., et al.: Normative design of organization: Mission planning. IEEE Trans. Syst. Man  Cybern. Part A: Syst. Humans 32(3), 346–359 (2002)

    Article  Google Scholar 

  6. Liu, J., Wang, W., Li, X., et al.: A motif-based mission planning method for UAV swarms considering dynamic reconfiguration. Def. Sci. J. 68(2), 159–166 (2018)

    Article  Google Scholar 

  7. Blazewicz, J., Lenstra, J.K., Kan, A.H.G.R.: Scheduling subject to resource constraints: classification and complexity. Discret. Appl. Math. 5(1), 11–24 (1983)

    Article  MathSciNet  Google Scholar 

  8. Li, J.L., Wang, P., Lv, Z.G., et al.: An improved genetic algorithm for resource scheduling under combat command and control. Comput. Appli. Soft. 39(02), 55–62 (2022)

    Google Scholar 

  9. Poppenborg, J., Knust, S.: A flow-based tabu search algorithm for the RCPSP with transfer times. OR Spectrum 38(2), 305–334 (2015). https://doi.org/10.1007/s00291-015-0402-2

    Article  MathSciNet  MATH  Google Scholar 

  10. Ding, H., Gu, X.: Hybrid of human learning optimization algorithm and particle swarm optimization algorithm with scheduling strategies for the flexible job-shop scheduling problem. Neurocomputing 414, 313–332 (2020)

    Article  Google Scholar 

  11. Khurshid, B., Maqsood, S., Omair, M., et al.: An improved evolution strategy hybridization with simulated annealing for permutation flow shop scheduling problems. IEEE Access 9, 94505–94522 (2021)

    Article  Google Scholar 

  12. Li, Y., Qiu, X.H., Liu, X.D., et al.: Deep reinforcement learning and its application in autonomous fitting optimization for attack areas of UCAVs. J. Syst. Eng. Electron. 31(4), 734–742 (2020)

    Article  Google Scholar 

  13. Han, B.A., Yang, J.J.: Research on adaptive job shop scheduling problems based on dueling double DQN. IEEE Access 8, 186474–186495 (2020)

    Article  Google Scholar 

  14. Cheng, F., Huang, Y., Tanpure, B., Sawalani, P., Cheng, L., Liu, C.: Cost-aware job scheduling for cloud instances using deep reinforcement learning. Clust. Comput. 25(1), 619–631 (2021). https://doi.org/10.1007/s10586-021-03436-8

    Article  Google Scholar 

  15. Schulman, J., Wolski, F., Dhariwal, P., et al.: Proximal policy optimization algorithms. ar**v preprint ar**v:1707.06347 (2017)

  16. Sutton, R.S., Barto, A.G.: Reinforcement learning: An introduction, 2nd edn. MIT Press, Cambridge, USA (2018)

    MATH  Google Scholar 

  17. Schulman, J., Levine, S., Abbeel, P., et al.: Trust region policy optimization. Proc. Mach. Learn. Res. 37, 1889–1897 (2015)

    Google Scholar 

  18. Tassel P, Gebser M, Schekotihin K.: A reinforcement learning environment for job-shop scheduling. ar**v preprint ar**v:2104.03760 (2021)

  19. Wan, L.P., Lan, X.G., Zhang, H.B., et al.: A review of deep reinforcement learning theory and application. Pattern Recogn. Artifi. Intell. 32(1), 67–81 (2019)

    Google Scholar 

  20. Pasaraba, W.L.: The Conduct and Assessment of A2C2 Experiment 7. Naval Postgraduate School, Monterey, California (2000)

    Google Scholar 

Download references

Acknowledgments

Research for this paper was supported by the Equipment advance research project (50912020401), and the Aviation Science Foundation of China (201908052002). The authors also gratefully acknowledge the helpful comments and suggestions of the reviewers, which have improved the presentation.

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Haiying Liu .

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2022 Chinese Institute of Command and Control

About this paper

Check for updates. Verify currency and authenticity via CrossMark

Cite this paper

He, Z., Liu, H., Huang, K., Cheng, G. (2022). An Intelligent Scheduling Method for Multi-domain Cooperative Operation Based on Deep Reinforcement Learning. In: Proceedings of 2022 10th China Conference on Command and Control. C2 2022. Lecture Notes in Electrical Engineering, vol 949. Springer, Singapore. https://doi.org/10.1007/978-981-19-6052-9_47

Download citation

Publish with us

Policies and ethics

Navigation