Log in

Application of optimal subset regression and stacking hybrid models to estimate COVID-19 cases in Dhaka, Bangladesh

  • Research
  • Published:
Theoretical and Applied Climatology Aims and scope Submit manuscript

Abstract

The COVID-19 outbreaks revealed a severe healthcare crisis with many loopholes in the global healthcare system. It is more crucial to quantify COVID-19 cases when COVID-19 occurs in humid to semi-humid climatic conditions. There are issues with a lack of meteorological and air pollution data and future information on COVID-19 mortality, as is the case in Bangladesh. To deal with this issue, the present research aims to apply four single artificial intelligence models, including additive regression (AR), M5P tree (M5P), random subspace (RSS), and support vector machine (SVM), and construct their stacking hybrid ensemble models for predicting COVID-19 mortality cases at five sites in greater Dhaka City, Bangladesh. The proposed methods were developed using a total of eight input datasets that included climatic factors such as relative humidity, temperature, precipitation, wind speed, and air pollutants including sulfur dioxide (SO2), ozone (O3), carbon monoxide (CO), and nitrate oxide (NO2). Various input data combinations are appraised according to predictive performance, utilizing statistical tests and graphical presentation. The datasets were categorized into two classes (68:32) for model generation (training data) and model validation (testing data) with a fivefold cross-validation technique. Results show that SVM is superior to other AR, M5P, and RSS models (R2 testing = 0.86–0.91, MAE = 1.33–2.02, RMSE = 3.12–3.85, RAE% = 19.68–29.86, and RRSE% = 40.58–41.01). The sensitivity analysis findings reveal a higher sensitivity for all input parameters selected except CO in the predictive results. Relative humidity, wind speed, and SO2 were the three input parameters that most influenced the results of subset regression and sensitivity analysis. The SVM is a promising method because it can predict COVID-19 mortality in greater Dhaka City with fewer input parameters. The suggested model developed in this research produced satisfactory outcomes in COVID-19 mortality prediction. It will be a new method for future COVID-19 prevention for policymakers and health experts. Serious social concern and robust public health measures may lessen the environmental impact of COVID-19 cases.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Subscribe and save

Springer+ Basic
EUR 32.99 /Month
  • Get 10 units per month
  • Download Article/Chapter or Ebook
  • 1 Unit = 1 Article or 1 Chapter
  • Cancel anytime
Subscribe now

Buy Now

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Fig. 1
Fig. 2
Fig. 3
Fig. 4
Fig. 5
Fig. 6

Similar content being viewed by others

Data availability

Data will be available upon reasonable request to the corresponding authors.

Code availability

Code will be available upon reasonable request to the corresponding authors.

References

Download references

Acknowledgements

We acknowledge the anonymous reviewer to improve the quality of the manuscript. The authors would like to thank the Bangladesh Meteorological Department (BMD), and the Department of Environment (DOE) for providing the related datasets for this research.

Funding

Authors thankfully acknowledge the Deanship of Scientific Research providing administrative and financial supports. Funding for this research was given under award numbers RGP2/363/44 by the Deanship of Scientific Research; King Khalid University, Ministry of Education, Kingdom of Saudi Arabia.

Author information

Authors and Affiliations

Authors

Contributions

A. R. M. T. I. designed, conceptualized, and drafted the original manuscript and planned the documents; M. A. F. and S. C. P. were involved in the literature review, software, map**, statistical analysis, interpretation of the analysis, and discussion; M. S. and M. C. R. contributed to instrumental setup, data analysis, and validation; M. M. M. contributed to data collection and extraction; Islam, A. R. M. T., J. M., A. E., and M. A. P. had done the internal review and proofreading during the manuscript drafting stage.

Corresponding author

Correspondence to Abu Reza Md. Towfiqul Islam.

Ethics declarations

Ethics approval

Not applicable.

Consent to participate

Not applicable.

Consent for publication

Not applicable.

Conflict of interest

The authors declare no competing interests.

Additional information

Publisher's note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Supplementary Information

Below is the link to the electronic supplementary material.

Supplementary file1 (DOCX 692 KB)

Rights and permissions

Springer Nature or its licensor (e.g. a society or other partner) holds exclusive rights to this article under a publishing agreement with the author(s) or other rightsholder(s); author self-archiving of the accepted manuscript version of this article is solely governed by the terms of such publishing agreement and applicable law.

Reprints and permissions

About this article

Check for updates. Verify currency and authenticity via CrossMark

Cite this article

Islam, A.R.M.T., Elbeltagi, A., Mallick, J. et al. Application of optimal subset regression and stacking hybrid models to estimate COVID-19 cases in Dhaka, Bangladesh. Theor Appl Climatol 154, 797–814 (2023). https://doi.org/10.1007/s00704-023-04589-9

Download citation

  • Received:

  • Accepted:

  • Published:

  • Issue Date:

  • DOI: https://doi.org/10.1007/s00704-023-04589-9

Navigation