Log in

Facilitating the Process of Performance Analysis of HPC Applications

  • Published:
Lobachevskii Journal of Mathematics Aims and scope Submit manuscript

Abstract

Performance analysis of supercomputing applications is an essential task that should be performed by any HPC user, since optimizing job performance reduces, sometimes by several times and even by orders of magnitude, the speed of carrying out computational experiments. There are many advanced analysis software tools that can be used for this purpose, but it is difficult for many users to figure out which tool to choose in their particular situation, how to use it and how to interpret the obtained results. In order to tackle this challenge, we are develo** a guide, which describes a number of common steps that can be helpful for analyzing the performance of most applications running on modern CPUs. The paper also shows how the proposed guide was used for the analysis of real-life application for modeling the atmospheric boundary layer, which was performed on the Lomonosov-2 supercomputer.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Subscribe and save

Springer+ Basic
EUR 32.99 /Month
  • Get 10 units per month
  • Download Article/Chapter or Ebook
  • 1 Unit = 1 Article or 1 Chapter
  • Cancel anytime
Subscribe now

Buy Now

Price includes VAT (Germany)

Instant access to the full article PDF.

Fig. 1
Fig. 2
Fig. 3

REFERENCES

  1. High Performance Computing Market Size to Surpass USD 64.65. https://www.globenewswire.com/news-release/2022/04/04/2415844/0/en/High-Performance-Computing-Market-Size-to- Surpass-USD-64-65-Bn-by-2030.html.

  2. D. Nikitenko, P. Shvets, and V. Voevodin, ‘‘Why do users need to take care of their HPC applications efficiency?,’’ Lobachevskii J. Math. 41, 1521 (2020). https://doi.org/10.1134/S1995080220080132

    Article  Google Scholar 

  3. Performance Optimisation and Productivity. A Centre of Excellence in Computing Applications. https://pop-coe.eu/.

  4. CPI Description. https://www.intel.com/content/www/us/en/develop/documentation/vtunehelp/top/reference/cpu-metrics-reference.html#cpu-metrics-reference_CLOCKTICKS-PER-INSTRUCTIONS-RETIRED-CPI.

  5. A. Yasin, ‘‘A Top-Down method for performance analysis and counters architecture,’’ in Proceedings of the ISPASS 2014—IEEE International Symposium on Performance Analysis of Systems and Software (IEEE Comput. Soc., 2014), pp. 35–44. https://doi.org/10.1109/ISPASS.2014.6844459

  6. Intel VTune Profiler homepage. https://software.intel.com/content/www/us/en/develop/tools/oneapi/components/vtune-profiler.html.

  7. Top-down Microarchitecture Analysis Method using VTune. https://software.intel.com/en-us/vtune-cookbook-top-down-microarchitecture-analysis-method.

  8. Intel Advisor Homepage. https://www.intel.com/content/www/us/en/developer/tools/oneapi/advisor.html#gs.s6l4io.

  9. J. Vetter and C. Chambreau, mpip: Lightweight, scalable mpi profiling (2005). http://gec.di.uminho.pt/Discip/MInf/cpd1415/PCP/MPI/mpiP_%20Lightweight,%20Scalable%20 MPI%20Profiling.pdf.

  10. D. Nikitenko, A. Antonov, P. Shvets, S. Sobolev, K. Stefanov, V. Voevodin, V. Voevodin, and S. Zhumatiy, ‘‘JobDigest—detailed system monitoring-based supercomputer application behavior analysis,’’ in Supercomputing, Proceedings of the 3rd Russian Supercomputing Days, RuSCDays 2017, Moscow, Russia, September 25–26, 2017 (Springer, Cham, 2017), pp. 516–529. https://doi.org/10.1007/978-3-319-71255-0_42

  11. K. Stefanov, V. Voevodin, S. Zhumatiy, and V. Voevodin, ‘‘Dynamically reconfigurable distributed modular monitoring system for supercomputers (DiMMon),’’ Proc. Comput. Sci. 66, 625–634 (2015). https://doi.org/10.1016/j.procs.2015.11.071

    Article  Google Scholar 

  12. POP Standard Metrics for Parallel Performance Analysis. Performance Optimisation and Productivity. https://pop-coe.eu/node/69.

  13. A. Glazunov, E. Mortikov, K. Barskov, E. Kadantsev, and S. Zilitinkevich, ‘‘Layered structure of stably stratified turbulent shear flows,’’ Izv., Atmos. Ocean. Phys. 55, 312–323 (2019).

    Article  Google Scholar 

  14. E. Mortikov, ‘‘Numerical simulation of the motion of an ice keel in a stratified flow,’’ Izv., Atmos. Ocean. Phys. 52, 108–115 (2016).

    Article  Google Scholar 

  15. E. Mortikov and A. Debolskiy, ‘‘Direct numerical simulation of stratified turbulent flows and passive tracer transport on HPC systems: Comparison of CPU architectures,’’ Supercomput. Front. Innov. 8 (4), 50–68 (2021). https://doi.org/10.14529/jsfi210405

  16. E. Tkachenko, A. Debolskiy, and E. Mortikov, ‘‘Intercomparison of subgrid scale models in large-eddy simulation of sunset atmospheric boundary layer turbulence: Computational aspects,’’ Lobachevskii J. Math. 42, 1580–1595 (2021). https://doi.org/10.1134/S1995080221070234

    Article  MathSciNet  MATH  Google Scholar 

  17. A. V. Debolskiy, E. V. Mortikov, A. V. Glazunov, and C. Lüpkes, ‘‘Evaluation of surface layer stability functions and their extension to first order turbulent closures for weakly and strongly stratified stable boundary layer,’’ Boundary-Layer Meteorol., No. 57719, 1–21 (2023).

  18. E. Kadantsev, E. Mortikov, and S. Zilitinkevich, ‘‘The resistance law for stably stratified atmospheric planetary boundary layers,’’ Quart. J. R. Meteorol. Soc. 147 (737), 2233–2243 (2021).

    Article  Google Scholar 

  19. Y. Morinishi, T. Lund, O. Vasilyev, and P. Moin, ‘‘Fully conservative higher order finite difference schemes for incompressible flow,’’ J. Comput. Phys. 143, 90–124 (1998). https://doi.org/10.1006/jcph.1998.5962

    Article  MathSciNet  MATH  Google Scholar 

  20. D. Lilly, ‘‘A proposed modification of the germano subgrid-scale closure method,’’ Phys. Fluids A 4, 633–635 (1992).

    Article  Google Scholar 

  21. C. Meneveau, T. Lund, and W. Cabot, ‘‘A lagrangian dynamic subgrid-scale model of turbulence,’’ J. Fluid Mech. 319, 353–385 (1996).

    Article  MATH  Google Scholar 

  22. G. Svensson at al., ‘‘Evaluation of the diurnal cycle in the atmospheric boundary layer over land as represented by a variety of single-column models: The second gabls experiment,’’ Bound.-Layer Meteorol. 140, 177–206 (2011).

    Article  Google Scholar 

  23. F. Jansson, G. van den Oord, I. Pelupessy, J. H. Grönqvist, A. P. Siebesma, and D. Crommelin, ‘‘Regional superparameterization in a global circulation model using large eddy simulations,’’ J. Adv. Model. Earth Syst. 11, 2958–2979 (2019).

    Article  Google Scholar 

  24. G. Van Den Oord, M. Chertova, F. Jansson, I. Pelupessy, P. Siebesma, and D. Crommelin, ‘‘Performance optimization and load-balancing modeling for superparametrization by 3d les,’’ in Proceedings of the Platform for Advanced Scientific Computing Conference (2021), pp. 1–8.

  25. W. W. Grabowski, ‘‘Coupling cloud processes with the large-scale dynamics using the cloud-resolving convection parameterization (CRCP),’’ J. Atmos. Sci. 58, 978–997 (2001).

    Article  Google Scholar 

  26. V. Voevodin, D. Shaikhislamov, and D. Nikitenko, ‘‘How to assess the quality of supercomputer resource usage,’’ Supercomput. Front. Innov. 9 (3), 4–18 (2022). https://doi.org/10.14529/jsfi220301

    Article  Google Scholar 

Download references

Funding

The development of the guide described in Section 2 was carried out within the framework of the scientific program of the National Center for Physics and Mathematics (‘‘National Center for Supercomputer Architecture Research’’ project). The practical results of the analysis of the HPC application described in Section 3 were achieved at Lomonosov Moscow State University with the financial support of the Russian Science Foundation (agreement no. 21-71-30003). The research is carried out using the equipment of shared research facilities of HPC computing resources at Lomonosov Moscow State University.

Author information

Authors and Affiliations

Authors

Corresponding authors

Correspondence to V. V. Voevodin, A. V. Debolskiy or E. V. Mortikov.

Additional information

(Submitted by E. E. Tyrtyshnikov)

Rights and permissions

Reprints and permissions

About this article

Check for updates. Verify currency and authenticity via CrossMark

Cite this article

Voevodin, V.V., Debolskiy, A.V. & Mortikov, E.V. Facilitating the Process of Performance Analysis of HPC Applications. Lobachevskii J Math 44, 3178–3190 (2023). https://doi.org/10.1134/S1995080223080589

Download citation

  • Received:

  • Revised:

  • Accepted:

  • Published:

  • Issue Date:

  • DOI: https://doi.org/10.1134/S1995080223080589

Keywords:

Navigation