Log in

Low visibility event prediction using random forest and K-nearest neighbor methods

  • Research
  • Published:
Theoretical and Applied Climatology Aims and scope Submit manuscript

Abstract

Low visibility events at King Khalid airport in Riyadh, Saudi Arabia, are investigated using hourly time series of meteorological and air pollution data from April 2015 to December 2017. The analysis of binary classification is based on two machine learning classifiers (random forest (RF) and K-nearest neighbors (KNN)). Six models based on the feature selection methods of RF feature importance and Pearson correlation matrix are presented. The classification tasks include two resampling approaches (random oversampling and random undersampling) to address the problem of an imbalanced dataset of the visibility event classes. An important finding is that oversampling outperforms undersampling for the evaluated classifiers and achieves higher scores in terms of accuracy and F1 score metrics. The RF classifier has a better performance compared to the KNN in both sampling approaches. The RF classifier with oversampling approach provides the best overall performance in terms of accuracy, F1 score, and area under the receiver operating characteristics (AUROC). The best model has scores above 0.95 based on all the evaluation metrics considered in the study. Air temperature and dewpoint temperature have minimal impact on the performance, whereas the particulate matter with aerodynamic diameter <10 μm (PM10) has a profound impact on the performance. It is found that the PM10 has the highest importance (52%) for the low visibility events based on the analysis of RF feature importance. Other pollutants and meteorological variables show relative importance between 5 and 10% for low visibility events. Overall, the best model is found when all variables, except temperature and dewpoint temperature, are employed to predict the visibility classes.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Subscribe and save

Springer+ Basic
EUR 32.99 /Month
  • Get 10 units per month
  • Download Article/Chapter or Ebook
  • 1 Unit = 1 Article or 1 Chapter
  • Cancel anytime
Subscribe now

Buy Now

Price includes VAT (Germany)

Instant access to the full article PDF.

Fig. 1
Fig. 2
Fig. 3
Fig. 4
Fig. 5
Fig. 6
Fig. 7

Similar content being viewed by others

Data Availability

Data are available with the corresponding author and available upon request.

Code availability

Not applicable.

References

Download references

Funding

The authors received funding through a student scholarship from the Ministry of Education in Saudi Arabia.

Author information

Authors and Affiliations

Authors

Contributions

Saleh H. Alhathloul: formal analysis, investigation, writing — original draft. Abdul khan: writing — review and editing. Ashok Mishra: writing — review and editing.

Corresponding author

Correspondence to Ashok K. Mishra.

Ethics declarations

Ethics approval

Not applicable.

Consent to participate

Not applicable.

Consent for publication

All the authors have agreed to the present version of the manuscript and have no objection for its publication.

Conflict of interest

The authors declare no competing interests.

Additional information

Publisher’s Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Supplementary information

Rights and permissions

Springer Nature or its licensor (e.g. a society or other partner) holds exclusive rights to this article under a publishing agreement with the author(s) or other rightsholder(s); author self-archiving of the accepted manuscript version of this article is solely governed by the terms of such publishing agreement and applicable law.

Reprints and permissions

About this article

Check for updates. Verify currency and authenticity via CrossMark

Cite this article

Alhathloul, S.H., Mishra, A.K. & Khan, A.A. Low visibility event prediction using random forest and K-nearest neighbor methods. Theor Appl Climatol 155, 1289–1300 (2024). https://doi.org/10.1007/s00704-023-04697-6

Download citation

  • Received:

  • Accepted:

  • Published:

  • Issue Date:

  • DOI: https://doi.org/10.1007/s00704-023-04697-6

Navigation