Association between two-component systems gene mutation and Mycobacterium tuberculosis transmission revealed by whole genome sequencing

Two-component systems (TCSs) play a crucial role in the growth of Mycobacterium tuberculosis (M. tuberculosis). However, the precise regulatory mechanism of their contribution remain to be elucidated, and only a limited number of studies have investigated the impact of gene mutations within TCSs on the transmission of M. tuberculosis. Therefore, this study aims to explore the relationship between TCSs gene mutation and the global transmission of M. tuberculosis.

Results

A total of 13531 M.tuberculosis strains were enrolled in the study. Most of the M.tuberculosis strains belonged to lineage4 (n=6497,48.0%), followed by lineage2 (n=5136,38.0%). Our results showed that a total of 36 single nucleotide polymorphisms (SNPs) were positively correlated with clustering of lineage2, such as Rv0758 (phoR, C820G), Rv1747(T1102C), and Rv1057(C1168T). A total of 30 SNPs showed positive correlation with clustering of lineage4, such as phoR(C182A, C1184G, C662T, T758G), Rv3764c (tcrY, G1151T), and Rv1747 C20T. A total of 19 SNPs were positively correlated with cross-country transmission of lineage2, such as phoR A575C, Rv1028c (kdpD, G383T, G1246C), and Rv1057 G817T. A total of 41 SNPs were positively correlated with cross-country transmission of lineage4, such as phoR(T758G, T327G, C284G), kdpD(G1755A, G625C), Rv1057 C980T, and Rv1747 T373G.

Conclusions

Our study identified that SNPs in genes of two-component systems were related to the transmission of M. tuberculosis. This finding adds another layer of complexity to M. tuberculosis virulence and provides insight into future research that will help to elucidate a novel mechanism of M. tuberculosis pathogenicity.

View this article's peer review reports

Iron-related gene mutations driving global Mycobacterium tuberculosis transmission revealed by whole-genome sequencing

Article Open access 06 March 2024

Association between fatty acid metabolism gene mutations and Mycobacterium tuberculosis transmission revealed by whole genome sequencing

Article Open access 01 December 2023

The geno-spatio analysis of Mycobacterium tuberculosis complex in hot and cold spots of Guangxi, China

Article Open access 01 July 2020

Background

Tuberculosis is a serious global health problem caused by Mycobacterium tuberculosis (M. tuberculosis), a pathogen that lives and thrives inside human cells [1]. It is a highly contagious and often fatal disease that affects millions of people worldwide, making it a significant burden on public health systems and societies. However, despite its enormous global burden, the factors that contribute to tuberculosis transmission are still poorly understood. Therefore, develo** a better understanding of M. tuberculosis transmission is critical for guiding effective tuberculosis control strategies and reducing the disease’s burden on society.

Bacterial two-component systems (TCSs) are the most important sensing mechanisms that respond to a diverse range of ligands, including ions, gases, and metabolites. In pathogenic bacteria, TCSs play a crucial role in promoting pathogenesis by regulating bacterial gene expression in response to hostile host environments or metabolic stresses [2, 37](Additional file 2: Tables S1-S2). Construction of the maximum likelihood phylogenetic tree was conducted through the IQ-TREE software package (version 1.6.12), utilizing the JC nucleotide substitution model and gamma model of rate heterogeneity, with 100 bootstrap replicates included [38]. Mycobacterium canettii CIPT140010059 was deemed to be an outlier. The resultant phylogenetic tree was visualized through the utilization of iTOL (https://itol.embl.de/) (Fig. 3, Additional file 1: Figs. S1–S7).

Propagation analysis

Cluster analysis was utilized to investigate the influence of two-component system gene mutations on the transmission of M. tuberculosis [39]. Based on a previous study [40], we applied clustering to define transmission clusters and used a threshold of less than 25 SNPs. In addition, we chose the threshold of 25 SNPs because our isolates were spread in terms of location and time (1991–2019) and because we were probably missing several intermediary isolates (and cases) in our collection. (Additional file 2: Tables S1-S2). Additionally, according to the classification of transmission clusters by scholars, we also divided transmission clusters into large, medium, or small (large, over 75th percentile; medium, between 25th and 75th percentile; and small, under 25th percentile) [14]. To enhance understanding of the global distribution patterns and conduct an extensive analysis of the transmission dynamics of M.tuberculosis strains, we classified them into cross-country and within-country clusters. Furthermore, we categorized the M. tuberculosis strains into cross-regional and within-regional clusters based on geographic location utilizing the United Nations standard regions (UN M.49).

Acquisition of two-component system genes

A total of 45 two-component system genes were obtained according to NCBI and literature search [2, 7, 41]. Python was utilized to detect mutations in genes associated with TCSs (Additional file 2: Table S3).

Modeling and statistical analysis

Prediction models including gradient boosting decision tree and random forest were established by machine learning using the Scikit-learn Python package. We randomly divided all samples into training and test sets at a ratio of 7:3. Each of the models was evaluated with the metrics of Kappa, sensitivity, specificity, accuracy, positive predictive value (PPV), negative predictive value (NPV), positive likelihood ratio (PLR), negative likelihood ratio (NLR) and area under curve (AUC) [42]. After the model was fitted, we evaluated the importance of the input variables on the model. To enhance the precision of predicting risk factors, we utilized the score to assess the influence of each input feature of the models, and take the intersection of both conditions and obtain the top-performing accessions as the important features [43, 44]. Subsequently, we established the generalized linear mixed model by using the statsmodels.api Python package to further analyze the important features and obtain the final influencing factors. All statistical analyses were performed using SPSS 26.0. All statistical tests were two-tailed, and P values less than 0.05 were considered statistically significant.

Data Availability

The whole genome sequences have been submitted to the NCBI under the accession number PRJNA1002108.

Change history

08 January 2024
A Correction to this paper has been published: https://doi.org/10.1186/s12864-023-09914-0

Abbreviations

M. tuberculosis :: Mycobacterium tuberculosis
TCSs:: Two-component systems
WGS:: Whole genome sequencing
SPHCC:: Shandong Public Health Clinical Research Center
WRCH:: Weifang Respiratory Clinical Hospital
CTAB:: Cetyltrimethylammonium Bromide
QC:: Quality control
SNP:: Single nucleotide polymorphism
SNPs:: Single nucleotide polymorphisms
NCBI:: National Center for Biotechnology Information
PPV:: Positive Predictive Value
NPV:: Negative Predictive Value
PLR:: Positive Likelihood Ratio
NLR:: Negative Likelihood Ratio
AUC:: Area Under Curve

References

World Health Organization. Global Tuberculosis report 2022. Geneva: World Health Organization; 2022.
Google Scholar
Parish T. Two-Component Regulatory Systems of Mycobacteria. Microbiol Spectr. 2014;2(1):MGM2-0010-2013.
Article Google Scholar
Zhou PF, Long QX, Zhou YX, Wang HH, **e J. Mycobacterium tuberculosis two-Component systems and implications in novel vaccines and Drugs. Crit Rev Eukar Gene Expr. 2012;22:37–52.
Article Google Scholar
Kusebauch U, Ortega C, Ollodart A, Rogers RS, Sherman DR, Moritz RL, et al. Mycobacterium tuberculosis supports protein tyrosine phosphorylation. Proc Natl Acad Sci U S A. 2014;111:9265–70.
Article CAS PubMed PubMed Central Google Scholar
Buglino JA, Sankhe GD, Lazar N, Bean JM, Glickman MS. Integrated sensing of host stresses by inhibition of a cytoplasmic two-component system controls M. Tuberculosis acute lung Infection. eLife. 2021;10:e65351.
Article CAS PubMed Central Google Scholar
Supply P, Magdalena J, Himpens S, Locht C. Identification of novel intergenic repetitive units in a mycobacterial two-component system operon. Mol Microbiol. 1997;26:991–1003.
Article CAS PubMed Google Scholar
Bretl DJ, Demetriadou C, Zahrt TC. Adaptation to Environmental Stimuli within the host: Two-Component Signal Transduction Systems of Mycobacterium tuberculosis. Microbiol Mol Biol Rev. 2011;75:566–82.
Article CAS PubMed Central Google Scholar
Takiff HE, Feo O. Clinical value of whole-genome sequencing of Mycobacterium tuberculosis. Lancet Infect Dis. 2015;15:1077–90.
Article CAS PubMed Google Scholar
Walker TM, Kohl TA, Omar SV, Hedge J, Del Ojo Elias C, Bradley P, et al. Whole-genome sequencing for prediction of Mycobacterium tuberculosis drug susceptibility and resistance: a retrospective cohort study. Lancet Infect Dis. 2015;15:1193–202.
Article CAS PubMed PubMed Central Google Scholar
Köser CU, Bryant JM, Becq J, Török ME, Ellington MJ, Marti-Renom MA, et al. Whole-genome sequencing for rapid susceptibility testing of M. Tuberculosis. N Engl J Med. 2013;369:290–2.
Article PubMed Google Scholar
Waturuocha UW, Krishna MS, Malhotra V, Dixit NM, Saini DK. A low-prevalence single-nucleotide polymorphism in the Sensor kinase PhoR in Mycobacterium tuberculosis suppresses its autophosphatase activity and reduces pathogenic fitness: implications in Evolutionary Selection. Front Microbiol. 2021;12:724482.
Article PubMed PubMed Central Google Scholar
Mendes MV, Tunca S, Antón N, Recio E, Sola-Landa A, Aparicio JF, et al. The two-component phor-phop system of Streptomyces natalensis: inactivation or deletion of phoP reduces the negative phosphate regulation of pimaricin biosynthesis. Metab Eng. 2007;9:217–27.
Article CAS PubMed Google Scholar
De Maio F, Berisio R, Manganelli R, Delogu G. PE_PGRS proteins of Mycobacterium tuberculosis: a specialized molecular task force at the forefront of host-pathogen interaction. Virulence. 2020;11:898–915.
Article PubMed PubMed Central Google Scholar
Chiner-Oms Á, Sánchez-Busó L, Corander J, Gagneux S, Harris SR, Young D et al. Genomic determinants of speciation and spread of the Mycobacterium tuberculosis complex. Sci Adv. 2019.
Vashist A, Malhotra V, Sharma G, Tyagi JS, Clark-Curtiss JE. Interplay of PhoP and DevR response regulators defines expression of the dormancy regulon in virulent Mycobacterium tuberculosis. J Biol Chem. 2018;293:16413–25.
Article CAS PubMed PubMed Central Google Scholar
Ryndak M, Wang S, Smith I. PhoP, a key player in Mycobacterium tuberculosis virulence. Trends Microbiol. 2008;16:528–34.
Article CAS PubMed Google Scholar
Agrawal R, Saini DK. Rv1027c-Rv1028c encode functional KdpDE two–component system in Mycobacterium tuberculosis. Biochem Biophys Res Commun. 2014;446:1172–8.
Article CAS PubMed Google Scholar
Sassetti CM, Boyd DH, Rubin EJ. Genes required for mycobacterial growth defined by high density mutagenesis. Mol Microbiol. 2003;48:77–84.
Article CAS PubMed Google Scholar
Sassetti CM, Rubin EJ. Genetic requirements for mycobacterial survival during Infection. Proc Natl Acad Sci U S A. 2003;100:12989–94.
Article CAS PubMed PubMed Central Google Scholar
Parish T, Smith DA, Kendall S, Casali N, Bancroft GJ, Stoker NG. Deletion of two-component regulatory systems increases the virulence of Mycobacterium tuberculosis. Infect Immun. 2003;71:1134–40.
Article CAS PubMed PubMed Central Google Scholar
Haydel SE, Benjamin WH, Dunlap NE, Clark-Curtiss JE. Expression, autoregulation, and DNA binding properties of the Mycobacterium tuberculosis TrcR response regulator. J Bacteriol. 2002;184:2192–203.
Article CAS PubMed PubMed Central Google Scholar
Wernisch L, Kendall SL, Soneji S, Wietzorrek A, Parish T, Hinds J, et al. Analysis of whole-genome microarray replicates using mixed models. Bioinformatics. 2003;19:53–61.
Article CAS PubMed Google Scholar
Haydel SE, Dunlap NE, Benjamin WH. In vitro evidence of two-component system phosphorylation between the Mycobacterium tuberculosis TrcR/TrcS proteins. Microb Pathog. 1999;26:195–206.
Article CAS PubMed Google Scholar
Chen X, He G, Wang S, Lin S, Chen J, Zhang W. Evaluation of whole-genome sequence method to Diagnose Resistance of 13 anti-tuberculosis Drugs and characterize resistance genes in clinical Multi-drug Resistance Mycobacterium tuberculosis isolates from China. Front Microbiol. 2019;10:1741.
Article PubMed PubMed Central Google Scholar
Yang C, Luo T, Shen X, Wu J, Gan M, Xu P, et al. Transmission of multidrug-resistant Mycobacterium tuberculosis in Shanghai, China: a retrospective observational study using whole-genome sequencing and epidemiological investigation. Lancet Infect Dis. 2017;17:275–84.
Article CAS PubMed Google Scholar
Koster KJ, Largen A, Foster JT, Drees KP, Qian L, Desmond E, et al. Genomic sequencing is required for identification of Tuberculosis transmission in Hawaii. BMC Infect Dis. 2018;18:608.
Article CAS PubMed PubMed Central Google Scholar
Hicks ND, Yang J, Zhang X, Zhao B, Grad YH, Liu L, et al. Clinically prevalent mutations in Mycobacterium tuberculosis alter propionate metabolism and mediate multidrug tolerance. Nat Microbiol. 2018;3:1032–42.
Article CAS PubMed PubMed Central Google Scholar
Liu Q, Ma A, Wei L, Pang Y, Wu B, Luo T, et al. China’s Tuberculosis epidemic stems from historical expansion of four strains of Mycobacterium tuberculosis. Nat Ecol Evol. 2018;2:1982–92.
Article PubMed PubMed Central Google Scholar
Huang H, Ding N, Yang T, Li C, Jia X, Wang G, et al. Cross-sectional whole-genome sequencing and epidemiological study of Multidrug-resistant Mycobacterium tuberculosis in China. Clin Infect Dis. 2019;69:405–13.
Article CAS PubMed Google Scholar
Luo T, Comas I, Luo D, Lu B, Wu J, Wei L, et al. Southern East Asian origin and coexpansion of Mycobacterium tuberculosis Bei**g family with Han Chinese. Proc Natl Acad Sci USA. 2015;112:8136–41.
Article CAS PubMed PubMed Central Google Scholar
Jiang Q, Liu Q, Ji L, Li J, Zeng Y, Meng L, et al. Citywide transmission of Multidrug-resistant Tuberculosis under China’s Rapid Urbanization: a Retrospective Population-based genomic spatial epidemiological study. Clin Infect Dis. 2020;71:142–51.
Article PubMed Google Scholar
Coll F, Phelan J, Hill-Cawthorne GA, Nair MB, Mallard K, Ali S, et al. Genome-wide analysis of multi- and extensively drug-resistant Mycobacterium tuberculosis. Nat Genet. 2018;50:307–16.
Article PubMed Google Scholar
Jung Y, Han D, BWA-MEME. BWA-MEM emulated with a machine learning approach. Bioinformatics. 2022;:btac137.
Li H, Handsaker B, Wysoker A, Fennell T, Ruan J, Homer N, et al. The sequence Alignment/Map format and SAMtools. Bioinformatics. 2009;25:2078–9.
Article PubMed PubMed Central Google Scholar
Liu F, Zhang Y, Zhang L, Li Z, Fang Q, Gao R, et al. Systematic comparative analysis of single-nucleotide variant detection methods from single-cell RNA sequencing data. Genome Biol. 2019;20:242.
Article CAS PubMed PubMed Central Google Scholar
Cingolani P, Platts A, Wang LL, Coon M, Nguyen T, Wang L, et al. A program for annotating and predicting the effects of single nucleotide polymorphisms, SnpEff: SNPs in the genome of Drosophila melanogaster strain w1118; iso-2; iso-3. Fly (Austin). 2012;6:80–92.
Article CAS PubMed Google Scholar
Coll F, McNerney R, Guerra-Assunção JA, Glynn JR, Perdigão J, Viveiros M, et al. A robust SNP barcode for ty** Mycobacterium tuberculosis complex strains. Nat Commun. 2014;5:4812.
Article CAS PubMed Google Scholar
Nguyen L-T, Schmidt HA, von Haeseler A, Minh BQ. IQ-TREE: a fast and effective stochastic algorithm for estimating maximum-likelihood phylogenies. Mol Biol Evol. 2015;32:268–74.
Article CAS PubMed Google Scholar
Seto J, Wada T, Suzuki Y, Ikeda T, Mizuta K, Yamamoto T, et al. Mycobacterium tuberculosis Transmission among Elderly persons, Yamagata Prefecture, Japan, 2009–2015. Emerg Infect Dis. 2017;23:448–55.
Article PubMed PubMed Central Google Scholar
Walker TM, Ip CL, Harrell RH, Evans JT, Kapatai G, Dedicoat MJ, et al. Whole-genome sequencing to delineate Mycobacterium tuberculosis outbreaks: a retrospective observational study. Lancet Infect Dis. 2013;13:137–46.
Article CAS PubMed PubMed Central Google Scholar
Li X, Lv X, Lin Y, Zhen J, Ruan C, Duan W, et al. Role of two-component regulatory systems in intracellular survival of Mycobacterium tuberculosis. J Cell Biochem. 2019;120:12197–207.
Article CAS PubMed Google Scholar
Luo Y, Xue Y, Song H, Tang G, Liu W, Bai H, et al. Machine learning based on routine laboratory indicators promoting the discrimination between active Tuberculosis and latent Tuberculosis Infection. J Infect. 2022;84:648–57.
Article PubMed Google Scholar
Bi X, Xu Q, Luo X, Sun Q, Wang Z. Weighted Random Support Vector machine clusters analysis of resting-state fMRI in mild cognitive impairment. Front Psychiatry. 2018;9:340.
Article PubMed PubMed Central Google Scholar
Agarwal G, Saade S, Shahid M, Tester M, Sun Y. Quantile function modeling with application to salinity tolerance analysis of plant data. BMC Plant Biol. 2019;19:526.
Article CAS PubMed PubMed Central Google Scholar

Download references

Acknowledgements

We thank Shandong Provincial Hospital, Shandong Provincial Chest Hospital, 13 municipal-level and 21 county-level local health departments for drug susceptibility data, demographic, and clinical data.

Funding

This work was supported by the Department of Science & Technology of Shandong Province (CN) (No.2007GG30002033; No.2017GSF218052) and the **an Science and Technology Bureau (CN) (No.201704100). The funding body/bodies did not provide any assistance in the design of the study and collection, analysis, and interpretation of data and in writing the manuscript.

Author information

Authors and Affiliations

Shandong University of Traditional Chinese Medicine, **an, Shandong, 250014, People’s Republic of China
Yameng Li, Yawei Hou, Tingting Wang, Yingying Li & Huaichen Li
Artificial Intelligence Institute, Qilu University of Technology (Shandong Academy of Sciences), **an, Shandong, 250011, People’s Republic of China
**anglong Kong
Department of Respiratory and Critical Care Medicine, The Third Affiliated Hospital of Shandong First Medical University (Affiliated Hospital of Shandong Academy of Medical Sciences), **an, Shandong, 250031, People’s Republic of China
Yifan Li
Department of Respiratory and Critical Care Medicine, Shandong Provincial Hospital Affiliated to Shandong University, Shandong Provincial Hospital Affiliated to Shandong First Medical University, **gwuweiqi Road, Huaiyin District, **an, Shandong, 250021, People’s Republic of China
Ningning Tao, Yao Liu & Huaichen Li
Shandong First Medical University & Shandong Academy of Medical Sciences, **an, Shandong, 250117, People’s Republic of China
Qilin Han

Authors

Yameng Li
View author publications
You can also search for this author in PubMed Google Scholar
**anglong Kong
View author publications
You can also search for this author in PubMed Google Scholar
Yifan Li
View author publications
You can also search for this author in PubMed Google Scholar
Ningning Tao
View author publications
You can also search for this author in PubMed Google Scholar
Yawei Hou
View author publications
You can also search for this author in PubMed Google Scholar
Tingting Wang
View author publications
You can also search for this author in PubMed Google Scholar
Yingying Li
View author publications
You can also search for this author in PubMed Google Scholar
Qilin Han
View author publications
You can also search for this author in PubMed Google Scholar
Yao Liu
View author publications
You can also search for this author in PubMed Google Scholar
Huaichen Li
View author publications
You can also search for this author in PubMed Google Scholar

Contributions

HCL, YL, and YML participated in the study design. YL, HCL, YML, XLK, NNT, and YFL performed data collection and statistical analyses. YL, TTW, YYL, and YWH helped draft the manuscript. YWH, QLH, and YYL overviewed and supervised the project. All authors read and approved the final manuscript.

Corresponding authors

Correspondence to Yao Liu or Huaichen Li.

Ethics declarations

Ethics approval and consent to participate

This study complies with the Declaration of Helsinki, and was approved by the Ethics Committee of Shandong Provincial Hospital, affiliated with Shandong University (SPH), the Ethics Weifang Respiratory Clinical Hospital (WRCH) and the Ethics Committee of Shandong Provincial Chest Hospital (SPCH), which waived informed patient consent because all patient records and information were anonymized and deidentified before the analysis.

Consent for publication

Not applicable.

Competing interests

The authors declare no competing interests.

Additional information

Publisher’s Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

The original version of this article was revised: there was an error in affiliation 1 and the Background section of the Abstract had to be revised.

Electronic supplementary material

Below is the link to the electronic supplementary material.

Supplementary Material 1: Additional file 2: Tables S1–S3

Supplementary Material 2: Additional file 2: Tables S4–S8

Supplementary Material 3: Additional file 1: Fig. S2

Supplementary Material 4: Additional file 1: Fig. S1

Supplementary Material 5: Additional file 1: Fig. S4

Supplementary Material 6: Additional file 1: Fig. S3

Supplementary Material 7: Additional file 1: Fig. S6

Supplementary Material 8: Additional file 1: Fig. S5

Supplementary Material 9: Additional file 1: Fig. S7

Supplementary Material 10: Additional file 1: Fig. S8

Supplementary Material 11: Additional file 1: Fig. S9

Supplementary Material 12: Additional file 1: Fig. S12

Supplementary Material 13: Additional file 1: Fig. S11

Supplementary Material 14: Additional file 1: Fig. S10

Supplementary Material 15: Additional file 1: Fig. S13

Supplementary Material 16: Additional file 1: Fig. S14

Supplementary Material 17: Additional file 1: Fig. S16

Supplementary Material 18: Additional file 1: Fig. S15

Supplementary Material 19: Additional file 1: Fig. S18

Supplementary Material 20: Additional file 1: Fig. S19

Supplementary Material 21: Additional file 1: Fig. S17

Supplementary Material 22: Additional file 2: Tables S9–S38

Supplementary Material 23: Legends of Additional files

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated in a credit line to the data.

Reprints and permissions

About this article

Cite this article

Li, Y., Kong, X., Li, Y. et al. Association between two-component systems gene mutation and Mycobacterium tuberculosis transmission revealed by whole genome sequencing. BMC Genomics 24, 718 (2023). https://doi.org/10.1186/s12864-023-09788-2

Download citation

Received: 04 August 2023
Accepted: 06 November 2023
Published: 28 November 2023
DOI: https://doi.org/10.1186/s12864-023-09788-2

Association between two-component systems gene mutation and Mycobacterium tuberculosis transmission revealed by whole genome sequencing

Abstract

Background

Results

Conclusions

Similar content being viewed by others

Background

Propagation analysis

Acquisition of two-component system genes

Modeling and statistical analysis

Data Availability

Change history

08 January 2024

Abbreviations

References

Acknowledgements

Funding

Author information

Authors and Affiliations

Contributions

Corresponding authors

Ethics declarations

Ethics approval and consent to participate

Consent for publication

Competing interests

Additional information

Publisher’s Note

Electronic supplementary material

Rights and permissions

About this article

Cite this article

Share this article

Keywords

Search

Navigation