Abstract
This work pointed out the use of machine learning tools to predict the effect of CO, O3, CH4, and CO2 on TBL (tracheal, bronchus, and lung cancer) deaths from 1990 to 2019. In this study, data from 203 countries/locations were used. We used evaluation metrics like accuracy, area under curve (AUC), recall, precision, and Matthews correlation coefficient (MCC) to determine the prediction efficiency of the models. The models that yielded accuracy between 89 and 90 were selected in this study. The essential features in the prediction process were extracted, and it was found that CO influenced the prediction process. Extra trees classifier, random forest classifier, gradient boosting classifier, and light gradient boosting machine were selected from 14 other classifiers based on the accuracy metric. The best-performing models, according to our benchmark standards, are the extra trees classifier (90.83%), random forest classifier (89.17%), gradient boosting classifier (89.17%), and light gradient boosting machine (89.17). We conclude that machine learning models can be used in predicting mortality, i.e., the number of deaths, and could assist us in predicting the role of air pollutants on TBL deaths globally.
![](http://media.springernature.com/m312/springer-static/image/art%3A10.1007%2Fs11356-023-29448-y/MediaObjects/11356_2023_29448_Fig1_HTML.png)
![](http://media.springernature.com/m312/springer-static/image/art%3A10.1007%2Fs11356-023-29448-y/MediaObjects/11356_2023_29448_Fig2_HTML.png)
![](http://media.springernature.com/m312/springer-static/image/art%3A10.1007%2Fs11356-023-29448-y/MediaObjects/11356_2023_29448_Fig3_HTML.png)
![](http://media.springernature.com/m312/springer-static/image/art%3A10.1007%2Fs11356-023-29448-y/MediaObjects/11356_2023_29448_Fig4_HTML.png)
![](http://media.springernature.com/m312/springer-static/image/art%3A10.1007%2Fs11356-023-29448-y/MediaObjects/11356_2023_29448_Fig5_HTML.png)
![](http://media.springernature.com/m312/springer-static/image/art%3A10.1007%2Fs11356-023-29448-y/MediaObjects/11356_2023_29448_Fig6_HTML.png)
![](http://media.springernature.com/m312/springer-static/image/art%3A10.1007%2Fs11356-023-29448-y/MediaObjects/11356_2023_29448_Fig7_HTML.png)
![](http://media.springernature.com/m312/springer-static/image/art%3A10.1007%2Fs11356-023-29448-y/MediaObjects/11356_2023_29448_Fig8_HTML.png)
![](http://media.springernature.com/m312/springer-static/image/art%3A10.1007%2Fs11356-023-29448-y/MediaObjects/11356_2023_29448_Fig9_HTML.png)
![](http://media.springernature.com/m312/springer-static/image/art%3A10.1007%2Fs11356-023-29448-y/MediaObjects/11356_2023_29448_Fig10_HTML.png)
![](http://media.springernature.com/m312/springer-static/image/art%3A10.1007%2Fs11356-023-29448-y/MediaObjects/11356_2023_29448_Fig11_HTML.png)
![](http://media.springernature.com/m312/springer-static/image/art%3A10.1007%2Fs11356-023-29448-y/MediaObjects/11356_2023_29448_Fig12_HTML.png)
![](http://media.springernature.com/m312/springer-static/image/art%3A10.1007%2Fs11356-023-29448-y/MediaObjects/11356_2023_29448_Fig13_HTML.png)
![](http://media.springernature.com/m312/springer-static/image/art%3A10.1007%2Fs11356-023-29448-y/MediaObjects/11356_2023_29448_Fig14_HTML.png)
![](http://media.springernature.com/m312/springer-static/image/art%3A10.1007%2Fs11356-023-29448-y/MediaObjects/11356_2023_29448_Fig15_HTML.png)
![](http://media.springernature.com/m312/springer-static/image/art%3A10.1007%2Fs11356-023-29448-y/MediaObjects/11356_2023_29448_Fig16_HTML.png)
![](http://media.springernature.com/m312/springer-static/image/art%3A10.1007%2Fs11356-023-29448-y/MediaObjects/11356_2023_29448_Fig17_HTML.png)
![](http://media.springernature.com/m312/springer-static/image/art%3A10.1007%2Fs11356-023-29448-y/MediaObjects/11356_2023_29448_Fig18_HTML.png)
![](http://media.springernature.com/m312/springer-static/image/art%3A10.1007%2Fs11356-023-29448-y/MediaObjects/11356_2023_29448_Fig19_HTML.png)
Similar content being viewed by others
Data availability
The datasets used in this study are available from https://giovanni.gsfc.nasa.gov/giovanni/ and https://www.healthdata.org/.
References
Abolhassani M, Guais A, Chaumet-Riffaud P et al (2009) Carbon dioxide inhalation causes pulmonary inflammation. Am J Physiol Lung Cell Mol Physiol 296:L657–L665. https://doi.org/10.1152/ajplung.90460.2008
Azad N, Rojanasakul Y, Vallyathan V (2008) Inflammation and lung cancer: roles of reactive oxygen/nitrogen species. J Toxicol Environ Health, Part B 11:1–15. https://doi.org/10.1080/10937400701436460
Babatola SS (2018) Global burden of diseases attributable to air pollution. J Public Health Afr 9:813. https://doi.org/10.4081/jphia.2018.813
Beeson WL, Abbey DE, Knutsen SF (1998) Long-term concentrations of ambient air pollutants and incident lung cancer in California adults: results from the AHSMOG study.Adventist Health Study on Smog. Environ Health Perspect 106:813–823. https://doi.org/10.1289/ehp.106-1533247
Bray F, Ferlay J, Soerjomataram I et al (2018) Global cancer statistics 2018: GLOBOCAN estimates of incidence and mortality worldwide for 36 cancers in 185 countries. CA Cancer J Clin 68:394–424. https://doi.org/10.3322/caac.21492
Chen T-M, Kuschner WG, Gokhale J, Shofer S (2007) Outdoor air pollution: nitrogen dioxide, sulfur dioxide, and carbon monoxide health effects. Am J Med Sci 333:249–256. https://doi.org/10.1097/MAJ.0b013e31803b900f
Cohen AJ, Brauer M, Burnett R et al (2017) Estimates and 25-year trends of the global burden of disease attributable to ambient air pollution: an analysis of data from the Global Burden of Diseases Study 2015. Lancet 389:1907–1918. https://doi.org/10.1016/S0140-6736(17)30505-6
Conibear L, Reddington CL, Silver BJ et al (2022) Sensitivity of air pollution exposure and disease burden to emission changes in China using machine learning emulation. GeoHealth 6:e2021GH000570. https://doi.org/10.1029/2021GH000570
Deng Y, Peng L, Li N et al (2021) Tracheal, bronchus, and lung cancer burden and related risk factors in the United States and China. Am J Transl Res 13:1928–1951
Fan C-Y, Fann JC-Y, Yang M-C et al (2021) Estimating global burden of COVID-19 with disability-adjusted life years and value of statistical life metrics. J Formos Med Assoc 120:S106–S117. https://doi.org/10.1016/j.jfma.2021.05.019
Global Burden of Disease (GBD).(2014) In: Institute for Health Metrics and Evaluation. https://www.healthdata.org/gbd. Accessed 28 May 2023
Guo H, Liu J, Wei J (2021) Ambient ozone, PM1 and female lung cancer incidence in 436 Chinese counties. Int J Environ Res Public Health 18:10386. https://doi.org/10.3390/ijerph181910386
Institute for Health Metrics and Evaluation (IHME) (2020) GBD Compare data visualization. Seattle, WA: IHME, University of Washington. http://vizhub.healthdata.org/gbd-compare. Accessed 28 May 2023
Juginović A, Vuković M, Aranza I, Biloš V (2021) Health impacts of air pollution exposure from 1990 to 2019 in 43 European countries. Sci Rep 11:22516. https://doi.org/10.1038/s41598-021-01802-5
Kim K-H, Jahan SA, Kabir E (2011) A review of diseases associated with household air pollution due to the use of biomass fuels. J Hazard Mater 192:425–431. https://doi.org/10.1016/j.jhazmat.2011.05.087
Kurmi OP, Lam KBH, Ayres JG (2012) Indoor air pollution and the lung in low- and medium-income countries. Eur Respir J 40:239–254. https://doi.org/10.1183/09031936.00190211
Li X, **ao J, Huang M et al (2020) Associations of county-level cumulative environmental quality with mortality of chronic obstructive pulmonary disease and mortality of tracheal, bronchus and lung cancers. Sci Total Environ 703:135523. https://doi.org/10.1016/j.scitotenv.2019.135523
Lim W-Y, Seow A (2012) Biomass fuels and lung cancer. Respirology 17:20–31. https://doi.org/10.1111/j.1440-1843.2011.02088.x
Liu F, Jiang X, Zhang M (2022) Global burden analysis and AutoGluon prediction of accidental carbon monoxide poisoning by Global Burden of Disease Study 2019. Environ Sci Pollut Res 29:6911–6928. https://doi.org/10.1007/s11356-021-15895-y
Merryman JI, Park PG, Schuller HM (1997) Carbon dioxide, an important messenger molecule for small cell lung cancer. Chest 112:779–784. https://doi.org/10.1378/chest.112.3.779
Molassiotis A, Kwok SWH, Leung AYM, Tyrovolas S (2022) Associations between sociodemographic factors, health spending, disease burden, and life expectancy of older adults (70 + years old) in 22 countries in the Western Pacific Region, 1995–2019: estimates from the Global Burden of Disease (GBD) Study 2019. GeroScience 44:925–951. https://doi.org/10.1007/s11357-021-00494-z
Oh SM, Kim HR, Park YJ et al (2011) Organic extracts of urban air pollution particulate matter (PM2.5)-induced genotoxicity and oxidative stress in human lung bronchial epithelial cells (BEAS-2B cells). Mutat Res Genet Toxicol Environ Mutagen 723:142–151. https://doi.org/10.1016/j.mrgentox.2011.04.003
Paul AR, Khan F, Jain A, Saha SC (2021) Deposition of smoke particles in human airways with realistic waveform. Atmosphere 12:912. https://doi.org/10.3390/atmos12070912
Rezaei F, Mazidimoradi A, Rayatinejad A et al (2023) Temporal trends of tracheal, bronchus, and lung cancer between 2010 and 2019, in Asian countries by geographical region and sociodemographic index, comparison with global data. Thorac Cancer 1759-7714:14912. https://doi.org/10.1111/1759-7714.14912
Rovira J, Domingo JL, Schuhmacher M (2020) Air quality, health impacts and burden of disease due to air pollution (PM10, PM2.5, NO2 and O3): application of AirQ+ model to the Camp de Tarragona County (Catalonia, Spain). Sci Total Environ 703:135538. https://doi.org/10.1016/j.scitotenv.2019.135538
Schuller HM (1994) Carbon dioxide potentiates the mitogenic effects of nicotine and its carcinogenic derivative, NNK, in normal and neoplastic neuroendocrine lung cells via stimulation of autocrine and protein kinase C-dependent mitogenic pathways. Neurotoxicology 15:877–886
Sørhaug S, Steinshamn S, Nilsen OG, Waldum HL (2006) Chronic inhalation of carbon monoxide: effects on the respiratory and cardiovascular system at doses corresponding to tobacco smoking. Toxicology 228:280–290. https://doi.org/10.1016/j.tox.2006.09.008
Soriano JB, Kendrick PJ, Paulson KR et al (2020) Prevalence and attributable health burden of chronic respiratory diseases, 1990–2017: a systematic analysis for the Global Burden of Disease Study 2017. Lancet Respir Med 8:585–596. https://doi.org/10.1016/S2213-2600(20)30105-3
Valavanidis A, Vlachogianni T, Fiotakis K, Loridas S (2013) Pulmonary oxidative stress, inflammation and cancer: respirable particulate matter, fibrous dusts and ozone as major causes of lung carcinogenesis through reactive oxygen species mechanisms. Int J Environ Res Public Health 10:3886–3907. https://doi.org/10.3390/ijerph10093886
Yao Y, Miao X, Wang L et al (2022) Methane alleviates lung injury through the IL-10 pathway by increasing T regulatory cells in a mouse asthma model. J Immunol Res 2022:e6008376. https://doi.org/10.1155/2022/6008376
Yin P, Brauer M, Cohen AJ et al (2020) The effect of air pollution on deaths, disease burden, and life expectancy across China and its provinces, 1990–2017: an analysis for the Global Burden of Disease Study 2017. Lancet Planet Health 4:e386–e398. https://doi.org/10.1016/S2542-5196(20)30161-3
Zhang Y, Mi M, Zhu N et al (2023) Global burden of tracheal, bronchus, and lung cancer attributable to occupational carcinogens in 204 countries and territories, from 1990 to 2019: results from the global burden of disease study 2019. Ann Med 55:2206672. https://doi.org/10.1080/07853890.2023.2206672
Author information
Authors and Affiliations
Contributions
The corresponding author contributes to all the sections of the manuscript.
Corresponding author
Ethics declarations
Ethics approval
This is an observational study, and ethical approval is not required, as confirmed by the data supplier.
Consent to participate
Not applicable.
Consent for publication
Not applicable.
Competing interests
The authors declare no competing interests.
Additional information
Responsible Editor: Lotfi Aleya
Publisher’s note
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Rights and permissions
Springer Nature or its licensor (e.g. a society or other partner) holds exclusive rights to this article under a publishing agreement with the author(s) or other rightsholder(s); author self-archiving of the accepted manuscript version of this article is solely governed by the terms of such publishing agreement and applicable law.
About this article
Cite this article
Mogaraju, .K. Machine learning strengthened prediction of tracheal, bronchus, and lung cancer deaths due to air pollution. Environ Sci Pollut Res 30, 100539–100551 (2023). https://doi.org/10.1007/s11356-023-29448-y
Received:
Accepted:
Published:
Issue Date:
DOI: https://doi.org/10.1007/s11356-023-29448-y