Unraveling the link between PTBP1 and severe asthma through machine learning and association rule mining method

Pirmoradi, Saeed; Hosseiniyan Khatibi, Seyed Mahdi; Zununi Vahed, Sepideh; Homaei Rad, Hamed; Khamaneh, Amir Mahdi; Akbarpour, Zahra; Seyedrezazadeh, Ensiyeh; Teshnehlab, Mohammad; Chapman, Kenneth R.; Ansarin, Khalil

doi:10.1038/s41598-023-42581-5

Unraveling the link between PTBP1 and severe asthma through machine learning and association rule mining method

Article
Open access
Published: 16 September 2023

Volume 13, article number 15399, (2023)
Cite this article

Download PDF

You have full access to this open access article

Scientific Reports

Unraveling the link between PTBP1 and severe asthma through machine learning and association rule mining method

Download PDF

Saeed Pirmoradi¹^na1,
Seyed Mahdi Hosseiniyan Khatibi^2,3^na1,
Sepideh Zununi Vahed²^na1,
Hamed Homaei Rad³^na1,
Amir Mahdi Khamaneh⁴,
Zahra Akbarpour³,
Ensiyeh Seyedrezazadeh⁵,
Mohammad Teshnehlab⁶,
Kenneth R. Chapman⁷ &
…
Khalil Ansarin³

919 Accesses
2 Altmetric
Explore all metrics

Abstract

Severe asthma is a chronic inflammatory airway disease with great therapeutic challenges. Understanding the genetic and molecular mechanisms of severe asthma may help identify therapeutic strategies for this complex condition. RNA expression data were analyzed using a combination of artificial intelligence methods to identify novel genes related to severe asthma. Through the ANOVA feature selection approach, 100 candidate genes were selected among 54,715 mRNAs in blood samples of patients with severe asthmatic and healthy groups. A deep learning model was used to validate the significance of the candidate genes. The accuracy, F1-score, AUC-ROC, and precision of the 100 genes were 83%, 0.86, 0.89, and 0.9, respectively. To discover hidden associations among selected genes, association rule mining was applied. The top 20 genes including the PTBP1, RAB11FIP3, APH1A, and MYD88 were recognized as the most frequent items among severe asthma association rules. The PTBP1 was found to be the most frequent gene associated with severe asthma among those 20 genes. PTBP1 was the gene most frequently associated with severe asthma among candidate genes. Identification of master genes involved in the initiation and development of asthma can offer novel targets for its diagnosis, prognosis, and targeted-signaling therapy.

The early detection of asthma based on blood gene expression

Article 12 November 2018

Development and validation of asthma risk prediction models using co-expression gene modules and machine learning methods

Article Open access 12 July 2023

CDC167 exhibits potential as a biomarker for airway inflammation in asthma

Article Open access 05 April 2024

Introduction

Asthma is a common chronic airway disease and a global public health problem, affecting nearly 300 million people around the world¹ and ranked 16th among the main causes of years of life loss with a disability². Two recognized asthma endotypes exist based on the absence or presence of type 2 (T2) airway inflammation. Its most common form, Type 2 asthma is an eosinophilic chronic airway inflammation characterized by recurrent and reversible airway narrowing and obstruction, airway hyper-responsiveness, mucous hypersecretion, and oftentimes airway wall remodeling³. These effects are associated with T-helper2 (Th2) cells, innate lymphoid cells, eosinophils, mast cells, and B cells all interconnected by a complex interplay of chemokines and cytokines. Although our understanding of asthma is far from complete, there is a growing understanding of the molecular mechanisms that underlie common clinical phenotypes⁴.

Despite the worldwide distribution of asthma guidelines and advances in the treatment of asthma, a significant number of patients experience poor control of asthma stated as difficult-to-treat or severe asthma. Severe asthma comprises 3–10% of the asthmatic adult population^5,6 but accounts for more than 60% of the expenses^7,8. The management of this type of asthma has many challenges related to adherence, psychosocial morbidity, and treatment. Patients with severe asthma need repeated oral corticosteroid therapy that frequently is connected with a variety of adverse events⁹. Current therapies are insufficient for patients with severe asthma^10,11,12. Moreover, currently, there is no marker to define the length of therapy or assess the response. Therefore, the development of novel targeted therapies and biomarkers can introduce precision medicine for these patient and this form of asthma calls for more detailed studies to target the exploration of the pathologic mechanisms at genomic and molecular levels. Exploring severe asthma mechanisms at the genomic level may shed light on disease processes hel** to pave the way for a better understanding of the process with potential therapeutic consequences. Recent studies have focused on the immune cells^13,14 and signaling pathways in the asthma process¹⁵, generating a variety of associated gene candidates of interest.

In the U-BIOPRED (Unbiased Biomarkers in Prediction of Respiratory Disease Outcome) dataset¹⁶, researchers studied severe asthma based on omics data obtained from different tissue/cells including bronchial biopsies, bronchial and nasal brushings, sputum, urine, and blood. The current study aimed to employ and analyze the gene expression data provided in the U-BIOPRED study. In the present study, we focused on gene expression in blood samples, since inflammatory and immune cells, as well as systemic treatments, are transported through the blood to reach the lungs¹⁷; providing actual insights into the complex gene interactions associated with asthma severity.

New technologies are changing medicine, and this revolution starts with health data, including clinical images, genomic, and prescribed therapy data¹⁸. We are witnessing the exponential growth of machine learning applications in health-related information besides the traditional analysis techniques, which are not suitable for managing this vast amount of data¹⁹. Furthermore, applying state-of-the-art machine learning-based algorithms helped us to clarify and get a better understanding of trigger genes in severe asthma, yielding the potential to comprehensively characterize the actual mechanism of severe asthma at the gene level by considering gene expression and gene–gene interactions.

Materials and methods

Study population

Asthma-related gene expression datasets were obtained from the Gene Expression Omnibus (GEO) repository of the National Center for Biotechnology Information (NCBI). In this study, three datasets were employed, two of which are known as GSE69683¹⁷ (https://www.ncbi.nlm.nih.gov/geo/query/acc.cgi?acc=GSE69683) and GSE76262²⁰ (https://www.ncbi.nlm.nih.gov/geo/query/acc.cgi?acc=GSE76262). Table 1 elaborates more on the represented classes of these two datasets. In GSE69683 and GSE76262, gene expression levels were extracted from blood and sputum, respectively. Two datasets were measured based on the same platform, referred to as GPL13158. This platform applied the Affymetrix Human Genome HT U133 + PM array technique, in which gene expression levels are reported using 54,715 Probes for 20,277 genes. These datasets are subsets of Unbiased BIOmarkers in PREDiction of respiratory disease outcomes (U-BIOPRED), a multi-center prospective cohort study in which the researchers collected gene expression data from 16 clinical centers in 11 European countries.

Table 1 Datasets information.

Full size table

Another asthma-related dataset is GSE110551²¹ (https://www.ncbi.nlm.nih.gov/geo/query/acc.cgi?acc=GSE110551) containing gene expression levels of 78 patients with asthma and 78 individuals unaffected by asthma. This dataset is available on the NCBI-GEO website, and gene expression level measurement was carried out by using the GPL 570 platform. This platform utilizes the Affymetrix Human Genome U133 Plus 2.0 Array technique, to which 54,675 probes were applied to record gene expression data. In GSE110551, a variety of important clinical information such as Body Mass Index (BMI) is reported in addition to asthma status. BMI is a significant factor in determining the obesity status of donors and this property gives GSE110551 a unique position to facilitate the study of the relationship between asthma and obesity at the gene level. Table 2 represents the sample counts of this database in each category. The datasets were used during the current study are available in the GEO repository (https://www.ncbi.nlm.nih.gov/geo/). This study was conducted according to the principles of the Declaration of Helsinki (2013) and informed consent was obtained from all subjects and/or their legal guardian(s).

Table 2 GSE110551 dataset details.

Full size table

Methodology

The proposed method in this study has five main steps; Reading, Pre-processing, Feature Selection, Classification, and Association Rule Mining (Fig. 1).

Reading step

In the reading stage, each dataset was downloaded from the NCBI-GEO repository. Subsequently, clinical data and gene expression were extracted for all samples. Each sample had a features vector (with 54,715 rows) and a corresponding label. The class label had two values, 0 for the case (severe asthma) and 1 for the control (non-asthma) samples. Tables 2 and 3 represent case–control details for GSE69683, GSE76262, and GSE110551 datasets.

Table 3 Case–control details for GSE69683 and GSE76262.

Full size table

Pre-processing step

The pre-processing step included three sub-steps; cross-validation, normalization, and resampling. Data were split into two parts using the hold-out cross-validation method, at a 70% to 30% ratio, for training and test portions, respectively. Only the training data were used during the pre-processing, feature selection, model training for classification, and association rule mining. On the other hand, test data were utilized only within the performance evaluation of feature selection and classification steps.

For the normalization of the training data, two approaches were applied during (i) feature selection and (ii) classification; Min–max for the former and Z-score for the latter as described previously²². In order to address the side-effects of imbalanced data, we applied random duplicate oversampling technique to bring parity between minority and majority classes, only for the training part of the data of course.

Feature selection step

Feature selection (FS) is an essential step for feature processing in machine learning, pattern recognition, and data mining. Although FS is rooted deeply in the statistics literature, it plays an important role in machine learning applications. FS algorithms are applied to input data to eliminate irrelevant and redundant features, especially in high-dimensional spaces such as genomics data. By utilizing FS methods, generally, the main goal is to assist the machine learning algorithm to focus on those aspects of the data that are more valuable for analysis and future predictions.

Applying high-throughput molecular analysis methods has given rise to more genomic datasets with higher dimensionality and more complexity. As the data dimensionality grows, it also becomes harder for biology researchers to interpret genomic interactions. Selecting a subset of relevant features using FS techniques not only facilitates the interpretation of genomics data by nominating biomarkers associated with diseases but can also improve classification performance by avoiding the curse of dimensionality that comes with high-dimensional spaces.

FS methods can be classified into four categories, consisting of filter, wrapper, embedded, and hybrid approaches²³. Generally, filter methods use some scoring functions to rank features based on their differentiation strength. Filter methods detect the importance of features individually and do not consider possible interactions between them. In addition, they have a low computational cost due to not utilizing classifiers in the feature selection process; as a matter of fact, they are classifier-independent²⁴.

Regardless of the major developments in the feature selection field, the analysis of high dimension- low sample size (HDLSS) data is an active area for research²⁵. In the biology domain, genomics data fall into the HDLSS category and it is advisable to consider HDLSS methods for feature selection.

During this study, a serial combination method containing the analysis of variance (ANOVA) and association rule mining²² is used to select features in gene expression data as illustrated in Fig. 1. Later on, a classifier model is utilized to pinpoint the quality of selected features obtained after the ANOVA step. The classifier model would verify that selected features can exhibit highly differentiating characteristics; in order to discover possible hidden gene relationships, association rule mining is put to work subsequently.

ANOVA

ANOVA is a simple and powerful method to compare the mean value of multiple groups (classes) in a dataset. It highlights any significant difference between the mean values of groups²⁶. This method has many advantages including: being robust in point view violations of its assumptions, being more intuitive to analyze the interaction of two features, being effective even in datasets with imbalanced number of samples in target classes, and also being easy to generalize to more than two groups without increasing the Type 1 error. ANOVA is called F-statistic in statistics literature, and is calculated by Eq. (1)²⁷.

$${F}_{value}= \frac{BMS}{WMS}$$

(1)

In Eq. (1), BMS and WMS are between mean squares and within mean squares, respectively. BMS and WMS are calculated by Eqs. (2) and (3).

$$BMS= \frac{BSS}{{df}_{B}}= \frac{\sum_{i=1}^{C}{n}_{i}{({\overline{x} }_{i}-\overline{x })}^{2}}{{df}_{B}}$$

(2)

$$WMS= \frac{WSS}{{df}_{W}}= \frac{\sum_{i=1}^{C}({n}_{i}-1){{\sigma }_{i}}^{2}}{{df}_{W}}$$

(3)

BSS and WSS are between sum squares and within sum squares, respectively. Where $\overline{x }$ = mean value of total samples, ${\overline{x} }_{i}$ = mean value of ith class, ${\sigma }_{i}$ = standard deviation of ith class, ${n}_{i}$ = sample number of ith class. ${df}_{B}$ = C − 1 and ${df}_{W}$ = N–C represent degree of freedom, in which N = number of total sample and C = Number of classes.

ANOVA assigns calculated F-values for all features and performs the ranking process accordingly. Features with high F-values are significant, since they represent better differentiation capabilities between classes.

Classification step

The classification process plays a major role in machine learning tasks. During this stage, the classifier model learns to separate classes of data based on the input features. This separation is performed using linear or non-linear boundaries and the performance of the classifier demonstrates how much the classes are separable.

The classifier model acts as a predictor of disease status, such as case–control, patient-healthy, cancer subtypes, etc., in medical applications. It can also indicate the quality of selected features if a feature selection step is performed previously. A classifier model with a decent performance verifies that selected features embody important differentiating characteristics to determine disease status, and the elimination of irrelevant features does not hurt the model’s differentiation ability.

We have employed a deep model for classification purposes and the proposed deep learning algorithm utilizes Self-Organizing Auto-Encoder (SOAE)²⁸ as the building blocks of the deep model. The distinctive property of a self-organizing deep auto-encoder is that it can automatically determine its structure (number of layers and neurons) based on the input data.

Association rule mining step

Genes and their products (i.e., proteins and RNAs) in the human body act according to their exclusive functions, in a complicated and orchestrated way. Most of the classical methods in molecular biology fall short of providing an overall picture of gene functions and interactions. Nowadays, the DNA microarray technique is widely used to measure thousands of gene expression levels at a given time, under given conditions for any cells or tissues. Unlike traditional methods in molecular biology, the analysis of microarray data is challenging and not straightforward. The microarray data is reported in a matrix form of N × M, in which N rows and M columns represent genes (commonly thousands) and samples (generally hundreds), respectively. Biology researchers apply some computational strategies, such as clustering and bi-clustering, to analyze microarray data^29,30. However, discovering the existing interactions between genes is not achievable using the same course of action, since most of the genes have functions in more than one gene network³¹. The Association Rule Mining (ARM) method is an innovative Gene Association Analysis (GAA) practice that can help discover such relationships.

Association rule mining is a useful methodology in the data mining area to uncover possible hidden connections in large and high dimensional, yet sparse data. Apriori is probably the most used association rule mining algorithm to date³²; however, several improved algorithms are proposed as well, such as the Apriori-Hybrid algorithm³³, fuzzy association rule algorithm³⁴ and FP-Growth algorithm^35,36.

We have employed the Apriori algorithm to generate frequent item sets and association rules for this study. The algorithm contains three main steps; generating frequent itemsets, generating association rules, and filtering, as shown in Fig. 1.

Selected rules and features in the association rule mining stage are evaluated using prior biological knowledge; either available in the literature or open-access biological databases. Furthermore, related association rules are studied to unmask the selected gene interactions. The main purpose of this step is to verify the biological significance of selected genes and their association rules.

Ethical approval

This study was approved by the Ethics Committee of the National Institute for Medical Research Development (NIMAD), Teran, Iran (Ethical code: IR.NIMAD.REC.1398.099).

Results

We propose a multi-step procedure to discover the gene(s) that control the disease process, by considering gene-asthma and gene–gene interactions in severe asthma (Fig. 1). In the wake of going through the reading and preprocessing steps, training data gets ready for the feature selection stage. With 54,715 features reported for 436 samples (case and control) in training data, we calculated the F-value using the ANOVA method. Features with higher F-values imply more relevance to severe asthma and help distinguish healthy and asthmatic cases more accurately. The top 100 genes (features) with the highest F-value (shown in Fig. 2a.) were selected as the leading features for the proceeding steps. We chose the “100” feature count threshold based on our experiments, ensuring that they embody good differentiation power while not sacrificing much of the performance.

We used a classifier model to evaluate the differentiation quality of selected genes. A deep neural network model based on SOAE (self-organizing auto-encoder) as a building block was utilized to classify new training data with 100 selected features. The accuracy, F1-measure, and AUC-ROC are applied to evaluate the classifier performance, Eqs. (4), (5), (6) and (7). [True Positive (TP), True Negative (TN), False Positive (FP), and False Negative (FN)]

$$Accuracy= \frac{TP+TN}{TP+FN+FP+TN}\times 100$$

(4)

$$F-measure= \frac{Precision\times Recall}{Precision+Recall},$$

(5)

where

$$Recall=\frac{TP}{TP+FN}$$

(6)

$$Precision=\frac{TP}{TP+FP}$$

(7)

Accuracy, F1-score, Precision, Recall, and AUC-ROC were reported for training and test data of GSE69683 in Table 4. The classification performance of training data (accuracy = 89% and AUC-ROC = 0.88) and test data (accuracy = 83% and AUC-ROC = 0.89) showed that the deep model was able to learn and classify severe asthmatic and healthy groups, achieving very good scores. The confusion matrix and ROC curve for both training and test data are displayed in Fig. 2b and c, respectively.

Table 4 The performance metrics for the classification step (SOAE Model).

Full size table

Following the classification step, we employed the association rule mining method to first, discover the possible hidden connections between nominated genes and severe asthma disease and second, figure out the complex relationships among selected genes. We applied a binning preprocess to gene expression values (or features). Each gene expression value can be categorized into one of three bins, namely low, intermediate, and high. This allowed us to convert continuous gene expression values to discrete gene expression itemsets. Moving on, the Apriori algorithm was used to generate frequent itemsets, and rules with minimum support and lift threshold values set to 0.28 and 1.1, respectively. All of the mentioned configurations resulted in the generation of 30,261,700 association rules, based on 387,848 frequent itemsets obtained through the Apriori algorithm.

We selected 115 association rules with the consequent part relevant to severe asthma (Fig. 3a). Overall, 62 unique genes or probes could be identified from these rules, with varying appearance frequency which is shown in Fig. 3b and c. The PTBP1 (repeat count = 16), RAB11FIP3 (repeat count = 14), ZBTB20 (repeat count = 11), APH1A (repeat count = 9), SLC38A10 (repeat count = 8), GPM6B (repeat count = 7), TMEM101 (repeat count = 6), BCL11A (repeat count = 5), GNAQ (repeat count = 4), KCTD21 (repeat count = 4), TCF4 (repeat count = 3), B2M (repeat count = 3), DCAF8 (repeat count = 3), MYD88 (repeat count = 3), WIP12 (repeat count = 2), and EEF1A1 (repeat count = 2) are genes (or items) with the highest frequency of appearance amongst extracted rules. In addition, there are 4 frequent probes including 244308_PM_at (repeat count = 13), 241577_PM_at (repeat count = 10), 242052_PM_at (repeat count = 4), and 244711_PM_at (repeat count = 3); since gene symbol is not reported for these probes in Affymetrix guide, we refer to them as unknown genes with some numerical identifiers.

Moreover, a graph network for severe asthma association rules and the strength distribution of severe asthma association rules according to their support, lift, and confidence values are shown in Fig. 4a and b. We presented example 'if–then' association rules in Table 5, where the antecedent contained PTBP1 (most frequent itemset with 16 repeat counts) for asthmatic condition.

Table 5 Asthmatic association rules, where the antecedent contained PTBP1.

Full size table

Comparing the Boxplots of asthma and healthy groups, it is obvious that there is a significant difference in all reported genes (Fig. 5a). Density distribution and violin plots are also available for reported genes in Fig. 5b and c.

The PTBP1 was the most frequent itemset with 16 repeat counts in asthmatic association rules; it also appears to be impactful in steering moderate asthma to severe asthma. As a result, we decided to investigate this gene more closely using other datasets. The boxplot, density distribution plot, and violin plot for PTBP1 are shown for both severe asthma and healthy groups in Fig. 6. Also, the PTBP1 gene expression of GSE69683, GSE76262, and GSE110551 are presented in Fig. 6a–c, respectively. The same mentioned meaningful difference between the two categories is visible in the PTBP1 expression plots for the GSE69683 dataset. A comparison of medians, interquartile ranges, and whiskers for GSE76262 reveals an insignificant difference. This is predictable due to the sampling source of this dataset, sputum, which is considered noisy, because of the presence of many other cells. Hence, GSE69683 plots are more precise, since their data comes from blood samples, and the PTBP1 is more present in the blood.

The GSE110551 is another dataset available for severe asthma studies. It contains BMI scores as well, allowing us to study possible relationships between obesity and severe asthma at the gene level. We investigated the model’s performance on this dataset in two aspects. At first, groups of healthy (non-asthmatic and non-obese) and severe asthma (asthmatic and non-obese) were observed. The larger box length and interquartile range for the healthy group indicate more spread-out data points. A smaller interquartile range for the severe asthmatic group suggests that data is more centered and less dispersed. By paying closer attention to the whiskers, a clear spread and difference in point distributions of asthmatic and healthy group categories can be noticed.

Afterward, we studied the PTBP1 expressions in severe asthmatic and non-asthmatic groups, by considering obesity status; the corresponding probability density function plot, box plot, and violin plot reveal that obesity as internal stress, affects PTBP1 expression level in samples (Fig. 7a,b). In Fig. 7c and d, the PTBP1 expression box plot displays the poor difference between medians, interquartile ranges, and whiskers.

According to the insights extracted from data and the profound effect of the PTBP1 in severe asthma and its susceptibility to various stressors such as obesity and hypoxia, we selected 55 association rules in which the PTBP1 appeared as the antecedent part. The PTBP1 association rules based on lift measure are displayed in Fig. 8a (having minimum threshold values of 1.42). Additionally, a graph network for the PTBP1 is shown in Fig. 8b, which can help improve understanding of the PTBP1 gene interactions in severe asthma disease. More in-depth coverage of these findings is available in the discussion section of the study. Also, the Bar plot of PTBP1 expression in various tissues of the human body was illustrated in Fig. 8c³⁷.

Discussion

In the present work, 100 candidate mRNAs were identified using the ANOVA approach, considering their strong rules and correlation with asthma severity. Utilizing association rule mining, 16 candidate mRNAs were found that have more than 2 repetitions in asthma association rules. Through in-depth functional analysis of asthma candidate genes in literature and research surveys, the PTBP1, RAB11FIB3, APH1A, and MYD88 demonstrated promise as important factors in the development and pathogenesis of severe asthma, potentially involved in the airway hyperresponsiveness, inflammation, and remodeling.

Based on our results, the PTBP1 [polypyrimidine tract-binding protein 1; also known as hnRNP1 (heterogeneous nuclear ribonuclear protein I)] ranked as the 1^st important gene in mediating severe asthma (Fig. 9). PTBP1 is an RNA-binding protein with versatile molecular functions related to RNA splicing and metabolism³⁸. The PTBP1 is the main known repressive regulator of posttranscriptional gene expression that regulates mRNA stability, splicing, localization, and translation³⁹. It involves the processing of mRNAs, affecting the cleavage of 3′-end and alternative polyadenylation^38,40. Moreover, PTBP1 regulates the expression of several transcripts in different cells via its interactions with microRNAs (miRs). miR-326 by targeting the PTBP1 stimulates autophagy to lessen pulmonary fibrosis⁴¹. The PTBP1 also regulates cellular migration, proliferation, and apoptosis through different pathways⁴². Additionally, it regulates the alternative splicing of its downstream target genes involved in cell growth and DNA damage⁴³.

The imbalance of Th17/ regulatory T cells (Treg) and Th1/Th2 is a key factor in the pathogenesis of asthma¹⁵. Since alternative splicing of RNAs has vital roles during the maturation and activation of immune cells, abnormal splicing due to PTBP1 dysregulation can trigger autoimmune diseases. Researchers have reported that a specific deficiency of the PTBP1 in dendritic cells (DCs) can elevate the expression of MHC II and disturb T-cell homeostasis without impacting the development of the DC. However, it could increase the populations of memory and activated T cells. In an asthma mice model, deletion of Ptbp1 could elevate the immune response by immune cell recruitment, mainly, eosinophils into the lungs, inducing lung damage⁴⁴. Evidence also indicates that PTBP1 is involved in the activation of T-cells by targeting different molecules and mechanisms. This protein is needed for the optimal proliferation, expansion, activation, and survival of T cells. Moreover, optimal expression of T cell activation markers (IFN-γ, TNF-α, IL-2, CD25, CD69, and CD40 ligand (CD40L) is dependent on the PTBP1. It functions as a critical regulator of CD4 T-cell activation that controls the expression of IL-2 and CD40L via the activation of the nuclear factor-κB (NF-κB) and phospholipase Cγ1 (PLCγ1) pathways. Downregulation of the PTBP1 leads to a reduction of these signaling pathways in T cells, preceding alterations in cell division and cytokine expression⁴⁵. Moreover, the PTBP1 regulates the expression of a regulator of the T-cell receptor (TCR), CD5, via different polyadenylation⁴⁶. The PTBP1 participates in the regulation of the CD46 alternative splicing as well⁴⁷. The PTBP1 is also upregulated in B lymphocytes and has important roles in B cell receptor-mediated antibody production⁴⁸ and B-cell selection in germinal centers⁴⁹. Given the roles of PTBP1 in the regulation of different processes of immune cells, its impairment may lead to immune dysregulation in asthma.

Beyond dysregulated immunity and inflammation, structural alterations of the bronchial wall (airway remodeling) are involved in the pathogenesis of asthma, starting from the initial stages of the asthmatic natural history⁵⁰. Epithelial alterations, the thickness of airway subepithelium, and smooth muscle (ASM) along with bronchial neoangiogenesis are hallmarks of asthmatic bronchial remodeling. Epithelial injury/repair cycles are significant signs in asthma airway remodeling, which are followed by metaplasia/hyperplasia of mucus-producing goblet cells, sub-epithelial fibrosis development, epithelial-mesenchymal transition (EMT), and basal membrane thickness. In addition to the abovementioned damage⁵¹. The Th2 cytokines, particularly IL-13, are triggers of mucus production and goblet cell hyperplasia. Among cytokines, TGF-β is the most potent inducer of the EMT and fibroblast to myofibroblast transition (FMT) along with subepithelial and ASM remodeling, wherein downregulates PTEN (phosphatase and tensin homolog), a phosphatase^52,53.

Vascular endothelial growth factor (VEGF) has the most significant pathogenic role in asthmatic vascular permeability, remodeling, and angiogenesis. The VEGF is produced by inflammatory (mast cells, macrophages, and eosinophils) and endothelial cells in response to different stimuli, especially hypoxia-inducible factor-1 alpha (HIF-1α)⁵⁴. Activation of the HIF-1α under hypoxia increases inflammation and airway hyperresponsiveness via CD8⁺ type 2 cytotoxic T cells⁵⁵. In pulmonary endothelial cells, an elevated expression of HIF-1α is associated with alterations in nitric oxide and cellular metabolism that are the hallmarks of pulmonary hypertension⁵⁶. Moreover, in the inflammatory leukocytes, HIF-1 supports energy metabolism to prevent ATP depletion⁵⁷ by upregulating pyruvate kinase muscle 2 (PKM2). By stabilizing HIF-1α mRNA⁵⁸, the PTBP1 plays important roles in PKM splicing, regulating the PKM1/PKM2 ratio, PKM2 generation, and a metabolic switching from oxidative phosphorylation to glycolysis⁵⁹, an early asthmatic event^60,61. A decreased expression of miR-124 in endothelial cells of the pulmonary artery deregulates the splicing of PTBP1 and its target (PKM2), leading to hyperproliferation of endothelial cells⁶². Moreover, in the pulmonary hypertensive vessel, the inflammatory, proliferative, and metabolic states of fibroblasts are regulated by miR-124, PTBP1, and PKM signaling⁶². It is also reported that PKM2 induces the expression of proinflammatory factors and the glycolysis-inactive form of PKM2 has an important function in the pathogenesis of asthma⁶¹.

At the molecular level, different signaling pathways are involved in the pathogenesis of asthma. Protein kinase C-delta (PKC-δ) induces proinflammatory cytokine production via the NF-κB pathway, indicating its regulatory role in airway inflammation⁶³. The PKC-δ is a positive upstream controller of phosphoinositide 3-kinase (PI3K)/Akt/ mammalian target of rapamycin (mTOR)/HIF-1α/VEGF pathway in asthma⁶⁴. Evidence indicates that PI3K has essential roles in different aspects of asthma through HIF-1α-mediated VEGF expression^65,66. PTEN has also an impact on asthma⁶⁷ by controlling cytokine signaling and different signaling pathways⁶⁸; the PI3k/Akt pathway is mainly inhibited by PTEN⁶⁹. A recent report indicated that overexpression of the PTBP1 could decrease the PTEN expression and elevate the phosphorylation level of Akt significantly, inducing proliferation and migration of asthmatic airway ASM cells⁷⁰. The PTBP1 is itself positively regulated by neuro-oncological ventral antigen 1 (NOVA1), an RNA-binding protein⁷⁰. It is worth noting that the mammalian target of rapamycin (mTOR) signaling is another necessary factor for the initiation of HIF-1α activity and VEGF expression. The PTBP1 also regulates cellular migration, proliferation, and apoptosis through different molecules and pathways⁴². Additionally, it regulates the alternative splicing of its downstream target genes involved in cell growth and DNA damage⁴³.

The PTBP1 is involved in several mechanisms and pathways including motility and cell structure, localization and protein targeting, protein modification and metabolism, cell cycle, immunity, muscle contraction, and so on; it might be upregulated by TGF-β1 through C-MYC in Keloids, a connection that can be a possible pathogenic mechanism for fibrotic disease. Moreover, the response of fibroblasts to the TGF-β induces the PTBP1 activation in Keloid⁷¹. It was demonstrated that miR-124–mediated downregulation of cell proliferation was due to its effects on the PTBP1; this can exert exquisite regulation of many downstream molecules that are important in cell proliferation, such as cell cycle-related genes FOXO3, Notch1, PTEN, p27Kip1 and p21Cip1^62,72,73,74.

Evidence indicates a cause-effect and organ-organ interaction between the lung and adipose tissue. Obese subjects have an expanded chance of asthma and stout asthmatics have serious exacerbations, diminished reaction to a few asthmatic solutions, and diminished quality of life^75,76. The major alterations linked with obesity include activation of the immune system and a positive energy balance⁷⁷. This bidirectional control between inflammatory and metabolic pathways stimulates a movement from obesity to asthma severity.

To date, there is a lack of reports linking the PTBP1 with asthma and obesity. However, there is some evidence to indicate how the impairment of PTBP1 may lead to obesity. A long non-coding RNA (H19) has an important role in the metabolism of lipids, where its upregulation can improve insulin sensitivity and protect against obesity⁷⁸. The PTBP1 can interact with H19 to reprogram lipid homeostasis in the liver⁷⁹. On the other hand, the PTBP1 is needed for cleavage, activation, and translocation of the SREBP1 (sterol-regulatory element binding proteins). The SREBP1 is a transcription factor that regulates genes involved in the glycolysis and de novo lipogenesis pathways, leading to hepatic accumulation of lipid and insulin resistance⁸⁰. In obese patients, an elevated level of SREBP1 is associated with insulin resistance and hepatic steatosis⁸¹. Interaction of the PTBP1 with H19 blocks its function, inhibiting the cleavage of the SREBP1 precursor. However, nuclear translocation of the SREBP1 in the absence of H19, stimulates the transcription of lipid-related genes, resulting in the accumulation of lipid⁸².

PTBP1 can significantly affect airway inflammation and remodeling in asthma by modulating the PI3K/PTEN/AKT/mTOR/HIF-1/VEGF signaling pathways. The chronic asthma phenomena can be a result of cytokines/chemokines, especially TGF-β’s effect on the PTBP1 expression pattern and subsequently, its impact on key signaling molecules. Hallmarks of severe asthma and the possible role of the identified mRNAs by artificial intelligence methods are displayed in Fig. 9.

Our study has some limitations. First of all, we did not evaluate the expression of the identified genes in clinical samples. Moreover, the molecular mechanism of the identified RNAs was not evaluated in the asthma models. Further experimental and clinical studies are needed to be performed to achieve these goals. It should be also noted that due to their well-known accuracy, gene network analysis and Machine learning methods have been applied widely for discovering novel diagnostic, prognostic, and therapeutic targets in the realm of asthma. Moreover, plenty of genes have been identified to have key roles in the pathogenesis of asthma^{17,83,84,85,86}. Some other genes such as thyroid peroxidase and superoxide dismutase 2 were identified to play important roles in asthma^87,88. However, it is challenging to locate the exact genes complicated in a complex asthma disease owing to the nature of a gene’s multiple functions and heterogeneous mechanisms of the severe disease^85,89.

Conclusion and perspective

We analyzed the blood transcriptome profiles of severe asthma patients and compared them with healthy controls, using artificial intelligence approaches (e.g., ANOVA, deep learning, and association rule mining). Our findings determine the PTBP1 as a main candidate gene and the most frequent item among asthma association rules. Given the discussed pluripotent roles of the PTBP1, it is reasonable to speculate that PTBP1 may regulate the immune cells, proinflammatory responses, hypoxia-related cellular metabolism, and airway remodeling in asthma. Also, the PTBP1 affects the basic mechanisms of pre-mRNA splicing including spliceosome assembly, miRNA synthesis and maturation, and the expression, activity, and intracellular localization of splicing factors that can trigger severe asthma. PTBP1 may also establish a strong bridge between asthma and obesity. The notion of moving from traditional treatments toward more novel therapeutic strategies such as RNA-based targeted therapy can open a new horizon in medicine to overcome asthma and obesity disorders where there are no boundaries.

Data availability

The data obtained from the artificial intelligence approaches will be available from the corresponding authors upon request.

References

Masoli, M. et al. The global burden of asthma: executive summary of the GINA Dissemination Committee report. Allergy 59(5), 469–478 (2004).
PubMed Google Scholar
Schofield, J. P. et al. A topological data analysis network model of asthma based on blood gene expression profiles. bioRxiv 13, 516328 (2019).
Google Scholar
Bhalla, A., Mukherjee, M. & Nair, P. Airway eosinophilopoietic and autoimmune mechanisms of eosinophilia in severe asthma. Immunol. Allergy Clin. 38(4), 639–654 (2018).
Google Scholar
Gruffydd-Jones, K. Unmet needs in asthma. Ther. Clin. Risk Manag. 15, 409 (2019).
PubMed PubMed Central CAS Google Scholar
Chung, K. F. et al. International ERS/ATS guidelines on definition, evaluation and treatment of severe asthma. Eur. Respir. J. 43(2), 343–373 (2014).
PubMed ADS CAS Google Scholar
Hekking, P.-P.W. et al. The prevalence of severe refractory asthma. J. Allergy Clin. Immunol. 135(4), 896–902 (2015).
PubMed Google Scholar
Antonicelli, L. et al. Asthma severity and medical resource utilisation. Eur. Respir. J. 23(5), 723–729 (2004).
PubMed CAS Google Scholar
Sadatsafavi, M. et al. Direct health care costs associated with asthma in British Columbia. Can. Respir. J. 17(2), 74–80 (2010).
PubMed PubMed Central Google Scholar
Zazzali, J. L. et al. Risk of corticosteroid-related adverse events in asthma patients with high oral corticosteroid use. Allergy Asthma Proc. 36(4), 268–274 (2015).
PubMed Google Scholar
Adatia, A. & Vliagoftis, H. Challenges in severe asthma: Do we need new drugs or new biomarkers?. Front. Med. (Lausanne) 9, 921967 (2022).
PubMed Google Scholar
Kerstjens, H. A. et al. Tiotropium in asthma poorly controlled with standard combination therapy. N. Engl. J. Med. 367(13), 1198–1207 (2012).
PubMed CAS Google Scholar
Barnes, N. et al. Effectiveness of omalizumab in severe allergic asthma: A retrospective UK real-world study. J. Asthma 50(5), 529–536 (2013).
PubMed PubMed Central CAS Google Scholar
Grayson, M. H. et al. Advances in asthma in 2017: Mechanisms, biologics, and genetics. J. Allergy Clin. Immunol. 142(5), 1423–1436 (2018).
PubMed CAS Google Scholar
Boonpiyathad, T. et al. Immunologic mechanisms in asthma. Semin. Immunol. https://doi.org/10.1016/j.smim.2019.101333 (2019).
Article PubMed Google Scholar
Ma, B. et al. PI3K/AKT/mTOR and TLR4/MyD88/NF-κB signaling inhibitors attenuate pathological mechanisms of allergic asthma. Inflammation 44(5), 1895–1907 (2021).
PubMed CAS Google Scholar
Shaw, D. E. et al. Clinical and inflammatory characteristics of the European U-BIOPRED adult severe asthma cohort. Eur. Respir. J. 46(5), 1308–1321 (2015).
PubMed CAS Google Scholar
Bigler, J. et al. A severe asthma disease signature from gene expression profiling of peripheral blood from U-BIOPRED cohorts. Am. J. Respir. Crit. Care Med. 195(10), 1311–1320 (2017).
PubMed CAS Google Scholar
Li, Y. et al. Literature review on the applications of machine learning and blockchain technology in smart healthcare industry: A bibliometric analysis. J. Healthc. Eng. 2021, 9739219 (2021).
PubMed PubMed Central Google Scholar
Piccialli, F. et al. A survey on deep learning in medicine: Why, how and when?. Inform. Fusion 66, 111–137 (2021).
Google Scholar
Kuo, C.-H.S. et al. T-helper cell type 2 (Th2) and non-Th2 molecular phenotypes of asthma using sputum transcriptomics in U-BIOPRED. Eur. Respir. J. 49(2), 1602135 (2017).
PubMed Google Scholar
Michalovich, D. et al. Obesity and disease severity magnify disturbed microbiome-immune interactions in asthma patients. Nat. Commun. 10(1), 1–14 (2019).
Google Scholar
Aghayousefi, R. et al. A diagnostic miRNA panel to detect recurrence of ovarian cancer through artificial intelligence approaches. J. Cancer Res. Clin. Oncol. https://doi.org/10.1007/s00432-022-04468-2 (2022).
Article PubMed Google Scholar
Chandrashekar, G. & Sahin, F. A survey on feature selection methods. Comput. Electr. Eng. 40(1), 16–28 (2014).
Google Scholar
Duch, W. Filter methods. In Feature Extraction: Foundations and Applications (eds Guyon, I. et al.) 89–117 (Springer Berlin Heidelberg, 2006).
Google Scholar
Tsai, C.-F. & Sung, Y.-T. Ensemble feature selection in high dimension, low sample size datasets: Parallel and serial combination approaches. Knowl.-Based Syst. 203, 106097 (2020).
Google Scholar
Kim, T. K. Understanding one-way ANOVA using conceptual figures. Korean J. Anesthesiol. 70(1), 22 (2017).
PubMed PubMed Central Google Scholar
Kim, H.-Y. Analysis of variance (ANOVA) comparing means of more than two groups. Restor. Dent. Endod. 39(1), 74 (2014).
PubMed PubMed Central Google Scholar
Pirmoradi, S. et al. A self-organizing deep auto-encoder approach for classification of complex diseases using SNP genomics data. Appl. Soft Comput. 97, 106718 (2020).
Google Scholar
Bayardo Jr, R.J. Efficiently mining long patterns from databases. in Proceedings of the 1998 ACM SIGMOD international conference on Management of data. 1998.
Pan, F., et al. Carpenter: Finding closed patterns in long biological datasets. in Proceedings of the ninth ACM SIGKDD international conference on Knowledge discovery and data mining. 2003.
Alves, R., Rodriguez-Baena, D. S. & Aguilar-Ruiz, J. S. Gene association analysis: A survey of frequent pattern mining from gene expression data. Brief. Bioinform. 11(2), 210–224 (2010).
PubMed CAS Google Scholar
Agrawal, R. and R. Srikant. Fast algorithms for mining association rules. in Proc. 20th int. conf. very large data bases, VLDB. 1994. Citeseer.
Agrawal, R. et al. Fast discovery of association rules. Adv. Knowl. Discov. Data Min. 12(1), 307–328 (1996).
Google Scholar
Kuok, C. M., Fu, A. & Wong, M. H. Mining fuzzy association rules in databases. ACM SIGMOD Rec. 27(1), 41–46 (1998).
Google Scholar
Han, J. et al. Mining frequent patterns without candidate generation: A frequent-pattern tree approach. Data Min. Knowl. Disc. 8(1), 53–87 (2004).
MathSciNet Google Scholar
Han, J., Pei, J. & Yin, Y. Mining frequent patterns without candidate generation. ACM SIGMOD Rec. 29(2), 1–12 (2000).
Google Scholar
Bio GPS. 2021; Available from: http://biogps.org/#goto=welcome.
Zhu, W. et al. Roles of PTBP1 in alternative splicing, glycolysis, and oncogensis. J. Zhejiang Univ.-Sci. B https://doi.org/10.1631/jzus.B1900422 (2020).
Article PubMed PubMed Central Google Scholar
Takahashi, H. et al. Significance of polypyrimidine tract-binding protein 1 expression in colorectal cancer. Mol. Cancer Ther. 14(7), 1705–1716 (2015).
PubMed CAS Google Scholar
Fu, X.-D. & Ares, M. Context-dependent control of alternative splicing by RNA-binding proteins. Nat. Rev. Genet. 15(10), 689–701 (2014).
PubMed PubMed Central CAS Google Scholar
Xu, T. et al. MiR-326 inhibits inflammation and promotes autophagy in silica-induced pulmonary fibrosis through targeting TNFSF14 and PTBP1. Chem. Res. Toxicol. 32(11), 2192–2203 (2019).
PubMed CAS Google Scholar
Zhu, W. et al. Roles of PTBP1 in alternative splicing, glycolysis, and oncogensis. J. Zhejiang Univ. Sci. B 21(2), 122–136 (2020).
MathSciNet PubMed PubMed Central CAS Google Scholar
Huan, L. et al. Hypoxia induced LUCAT1/PTBP1 axis modulates cancer cell viability and chemotherapy response. Mol. Cancer 19(1), 1–17 (2020).
MathSciNet Google Scholar
Geng, G. et al. PTBP1 is necessary for dendritic cells to regulate T-cell homeostasis and antitumour immunity. Immunology 163(1), 74–85 (2021).
PubMed PubMed Central CAS Google Scholar
La Porta, J. et al. The RNA-binding protein, polypyrimidine tract-binding protein 1 (PTBP1) is a key regulator of CD4 T cell activation. PLoS ONE 11(8), e0158708 (2016).
MathSciNet PubMed PubMed Central Google Scholar
Domingues, R. G. et al. CD5 expression is regulated during human T-cell activation by alternative polyadenylation, PTBP1, and miR-204. Eur. J. Immunol. 46(6), 1490–1503 (2016).
PubMed PubMed Central CAS Google Scholar
Tang, S. J. et al. Characterization of the regulation of CD46 RNA alternative splicing. J. Biol. Chem. 291(27), 14311–14323 (2016).
PubMed PubMed Central CAS Google Scholar
Bielli, P. et al. Regulation of BCL-X splicing reveals a role for the polypyrimidine tract binding protein (PTBP1/hnRNP I) in alternative 5′ splice site selection. Nucleic Acids Res. 42(19), 12070–12081 (2014).
PubMed PubMed Central CAS Google Scholar
Monzón-Casanova, E. et al. The RNA-binding protein PTBP1 is necessary for B cell selection in germinal centers. Nat. Immunol. 19(3), 267–278 (2018).
PubMed PubMed Central Google Scholar
Holgate, S. T. et al. Epithelial-mesenchymal communication in the pathogenesis of chronic asthma. Proc. Am. Thorac. Soc. 1(2), 93–98 (2004).
MathSciNet PubMed CAS Google Scholar
Ijaz, T. et al. Systems biology approaches to understanding Epithelial Mesenchymal Transition (EMT) in mucosal remodeling and signaling in asthma. World Allergy Organ. J. 7(1), 1–14 (2014).
Google Scholar
Walker, E. J. et al. Transcriptomic changes during TGF-β-mediated differentiation of airway fibroblasts to myofibroblasts. Sci. Rep. 9(1), 20377 (2019).
PubMed PubMed Central ADS CAS Google Scholar
Lv, X. et al. TGF-β1 induces airway smooth muscle cell proliferation and remodeling in asthmatic mice by up-regulating miR-181a and suppressing PTEN. Int. J. Clin. Exp. Pathol. 12(1), 173–181 (2019).
PubMed PubMed Central CAS Google Scholar
Chetta, A. et al. Vascular endothelial growth factor up-regulation and bronchial wall remodelling in asthma. Clin. Exp. Allergy 35(11), 1437–1442 (2005).
PubMed CAS Google Scholar
Ning, F. et al. Hypoxia enhances CD8(+) T(C)2 cell-dependent airway hyperresponsiveness and inflammation through hypoxia-inducible factor 1α. J. Allergy Clin. Immunol. 143(6), 2026-2037.e7 (2019).
PubMed CAS Google Scholar
Fijalkowska, I. et al. Hypoxia inducible-factor1alpha regulates the metabolic shift of pulmonary hypertensive endothelial cells. Am. J. Pathol. 176(3), 1130–1138 (2010).
PubMed PubMed Central CAS Google Scholar
Sumbayev, V. V. & Nicholas, S. A. Hypoxia-inducible factor 1 as one of the “signaling drivers” of Toll-like receptor-dependent and allergic inflammation. Arch. Immunol. Ther. Exp. (Warsz) 58(4), 287–294 (2010).
PubMed CAS Google Scholar
Wang, M. J. & Lin, S. A region within the 5′-untranslated region of hypoxia-inducible factor-1α mRNA mediates its turnover in lung adenocarcinoma cells. J. Biol. Chem. 284(52), 36500–36510 (2009).
PubMed PubMed Central CAS Google Scholar
He, X. et al. Involvement of polypyrimidine tract-binding protein (PTBP1) in maintaining breast cancer cell growth and malignant properties. Oncogenesis 3(1), e84 (2014).
PubMed PubMed Central CAS Google Scholar
Qian, X. et al. IL-1/inhibitory κB kinase ε-induced glycolysis augment epithelial effector function and promote allergic airways disease. J. Allergy Clin. Immunol. 142(2), 435-450.e10 (2018).
PubMed CAS Google Scholar
van de Wetering, C. et al. Pyruvate kinase M2 promotes expression of proinflammatory mediators in house dust mite-induced allergic airways disease. J. Immunol. 204(4), 763–774 (2020).
PubMed PubMed Central Google Scholar
Zhang, H. et al. Metabolic and proliferative state of vascular adventitial fibroblasts in pulmonary hypertension is regulated through a MicroRNA-124/PTBP1 (polypyrimidine tract binding protein 1)/pyruvate kinase muscle axis. Circulation 136(25), 2468–2485 (2017).
PubMed PubMed Central CAS Google Scholar
Page, K. et al. Regulation of airway epithelial cell NF-kappa B-dependent gene expression by protein kinase C delta. J. Immunol. 170(11), 5681–5689 (2003).
PubMed CAS Google Scholar
Choi, Y. H. et al. Inhibition of protein kinase C delta attenuates allergic airway inflammation through suppression of PI3K/Akt/mTOR/HIF-1 alpha/VEGF pathway. PLoS ONE 8(11), e81773 (2013).
PubMed PubMed Central ADS Google Scholar
Lee, K. S. et al. Phosphoinositide 3-kinase-delta inhibitor reduces vascular permeability in a murine model of asthma. J. Allergy Clin. Immunol. 118(2), 403–409 (2006).
PubMed CAS Google Scholar
Kim, S. R. et al. HIF-1α inhibition ameliorates an allergic airway disease via VEGF suppression in bronchial epithelium. Eur. J. Immunol. 40(10), 2858–2869 (2010).
PubMed CAS Google Scholar
Yoo, E. J. et al. Phosphoinositide 3-kinase in asthma: Novel roles and therapeutic approaches. Am. J. Respir. Cell Mol. Biol. 56(6), 700–707 (2017).
PubMed PubMed Central CAS Google Scholar
Kim, S. R. & Lee, Y. C. PTEN as a unique promising therapeutic target for occupational asthma. Immunopharmacol. Immunotoxicol. 30(4), 793–814 (2008).
PubMed CAS Google Scholar
Boosani, C. S., Gunasekar, P. & Agrawal, D. K. An update on PTEN modulators - a patent review. Expert Opin. Ther. Pat. 29(11), 881–889 (2019).
PubMed PubMed Central CAS Google Scholar
Cheng, Y. et al. Knockdown of NOVA1 inhibits inflammation and migration of asthmatic airway smooth muscle cells to regulate PTEN/Akt pathway by targeting PTBP1. Mol. Immunol. 138, 31–37 (2021).
PubMed CAS Google Scholar
Jiao, H. et al. TGF-β1 induces polypyrimidine tract-binding protein to alter fibroblasts proliferation and fibronectin deposition in keloid. Sci. Rep. 6(1), 1–11 (2016).
MathSciNet Google Scholar
Wang, D. et al. MicroRNA-124 controls the proliferative, migratory, and inflammatory phenotype of pulmonary vascular fibroblasts. Circ. Res. 114(1), 67–78 (2014).
PubMed CAS Google Scholar
Xue, Y. et al. Genome-wide analysis of PTB-RNA interactions reveals a strategy used by the general splicing repressor to modulate exon inclusion or skip**. Mol. Cell 36(6), 996–1006 (2009).
PubMed PubMed Central CAS Google Scholar
Llorian, M. et al. Position-dependent alternative splicing activity revealed by global profiling of alternative splicing events regulated by PTB. Nat. Struct. Mol. Biol. 17(9), 1114 (2010).
PubMed PubMed Central CAS Google Scholar
Miethe, S. et al. Obesity and asthma. J. Allergy Clin. Immunol. 146(4), 685–693 (2020).
PubMed Google Scholar
Peters, U., Dixon, A. E. & Forno, E. Obesity and asthma. J. Allergy Clin. Immunol. 141(4), 1169–1179 (2018).
PubMed PubMed Central Google Scholar
Ortiz, V. E. & Kwo, J. Obesity: Physiologic changes and implications for preoperative management. BMC Anesthesiol. 15, 97 (2015).
PubMed PubMed Central Google Scholar
Schmidt, E. et al. LincRNA H19 protects from dietary obesity by constraining expression of monoallelic genes in brown fat. Nat. Commun. 9(1), 3622 (2018).
PubMed PubMed Central ADS Google Scholar
Liu, C. et al. Long noncoding RNA H19 interacts with polypyrimidine tract-binding protein 1 to reprogram hepatic lipid homeostasis. Hepatology 67(5), 1768–1783 (2018).
PubMed CAS Google Scholar
Ruiz, R. et al. Sterol regulatory element-binding protein-1 (SREBP-1) is required to regulate glycogen synthesis and gluconeogenic gene expression in mouse liver. J. Biol. Chem. 289(9), 5510–5517 (2014).
PubMed PubMed Central CAS Google Scholar
Pettinelli, P. et al. Enhancement in liver SREBP-1c/PPAR-alpha ratio and steatosis in obese patients: Correlations with insulin resistance and n-3 long-chain polyunsaturated fatty acid depletion. Biochim. Biophys. Acta 1792(11), 1080–1086 (2009).
PubMed CAS Google Scholar
Zhu, Y. et al. Knock-down of circular RNA H19 induces human adipose-derived stem cells adipogenic differentiation via a mechanism involving the polypyrimidine tract-binding protein 1. Exp. Cell Res. 387(2), 111753 (2020).
MathSciNet PubMed CAS Google Scholar
Weathington, N. et al. BAL cell gene expression in severe asthma reveals mechanisms of severe disease and influences of medications. Am. J. Respir. Crit. Care Med. 200(7), 837–856 (2019).
PubMed PubMed Central CAS Google Scholar
Wan, Y. I. et al. Genome-wide association study to identify genetic determinants of severe asthma. Thorax 67(9), 762–768 (2012).
PubMed CAS Google Scholar
Modena, B. D. et al. Gene expression correlated with severe asthma characteristics reveals heterogeneous mechanisms of severe disease. Am. J. Respir. Crit. Care Med. 195(11), 1449–1463 (2017).
PubMed PubMed Central CAS Google Scholar
Melén, E. & Pershagen, G. Pathophysiology of asthma: Lessons from genetic research with particular focus on severe asthma. J. Intern. Med. 272(2), 108–120 (2012).
PubMed Google Scholar
Voraphani, N. et al. An airway epithelial iNOS-DUOX2-thyroid peroxidase metabolome drives Th1/Th2 nitrative stress in human severe asthma. Mucosal. Immunol. 7(5), 1175–1185 (2014).
PubMed PubMed Central CAS Google Scholar
Huang, Y. et al. Key genes and co-expression modules involved in asthma pathogenesis. PeerJ 8, e8456 (2020).
PubMed PubMed Central Google Scholar
Li, Y. et al. A comprehensive genomic pan-cancer classification using The Cancer Genome Atlas gene expression data. BMC Genom. 18(1), 508 (2017).
MathSciNet Google Scholar

Download references

Acknowledgements

This work was financially supported by the National Institute for Medical Research Development (NIMAD), Tehran, Iran (#983118). Also, the authors would like to thank the Clinical Research Development Unit of Tabriz Valiasr Hospital and Rahat Breath, Kidney Research Center, and Sleep Research Center for their assistance in this research.

Funding

This research was funded by the National Institute for Medical Research Development (NIMAD) (Grant No: 983118).

Author information

These authors contributed equally: Saeed Pirmoradi, Seyed Mahdi Hosseiniyan Khatibi, Sepideh Zununi Vahed and Hamed Homaei Rad.

Authors and Affiliations

Clinical Research Development Unit of Tabriz Valiasr Hospital, Tabriz University of Medical Sciences, Tabriz, Iran
Saeed Pirmoradi
Kidney Research Center, Tabriz University of Medical Sciences, Tabriz, Iran
Seyed Mahdi Hosseiniyan Khatibi & Sepideh Zununi Vahed
Rahat Breath and Sleep Research Center, Tabriz University of Medical Science, Tabriz, Iran
Seyed Mahdi Hosseiniyan Khatibi, Hamed Homaei Rad, Zahra Akbarpour & Khalil Ansarin
Faculty of Advanced Medical Sciences, Tabriz University of Medical Sciences, Tabriz, Iran
Amir Mahdi Khamaneh
Tuberculosis and Lung Disease Research Center, Tabriz University of Medical Sciences, Tabriz, Iran
Ensiyeh Seyedrezazadeh
Department of Electric and Computer Engineering, K.N. Toosi University of Technology, Tehran, Iran
Mohammad Teshnehlab
Division of Respiratory Medicine, Department of Medicine, University of Toronto, Toronto, ON, Canada
Kenneth R. Chapman

Authors

Saeed Pirmoradi
View author publications
You can also search for this author in PubMed Google Scholar
Seyed Mahdi Hosseiniyan Khatibi
View author publications
You can also search for this author in PubMed Google Scholar
Sepideh Zununi Vahed
View author publications
You can also search for this author in PubMed Google Scholar
Hamed Homaei Rad
View author publications
You can also search for this author in PubMed Google Scholar
Amir Mahdi Khamaneh
View author publications
You can also search for this author in PubMed Google Scholar
Zahra Akbarpour
View author publications
You can also search for this author in PubMed Google Scholar
Ensiyeh Seyedrezazadeh
View author publications
You can also search for this author in PubMed Google Scholar
Mohammad Teshnehlab
View author publications
You can also search for this author in PubMed Google Scholar
Kenneth R. Chapman
View author publications
You can also search for this author in PubMed Google Scholar
Khalil Ansarin
View author publications
You can also search for this author in PubMed Google Scholar

Contributions

Conception and design study: S.Z.V., S.M.H.K., A.M.K., E.S.R.; Machine Learning: S.P., H.H.R.; Data analysis: S.M.H.K., S.Z.V. and Z.A.; Writing original draft: S.P., S.M.H.K., S.Z.V., H.H.R.; Review and editing: all authors, Final Revision: K.A., M.T. and K.R.C.

Corresponding authors

Correspondence to Kenneth R. Chapman or Khalil Ansarin.

Ethics declarations

Competing interests

The authors declare no competing interests.

Additional information

Publisher's note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Supplementary Information

Supplementary Information.

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/.

Reprints and permissions

About this article

Cite this article

Pirmoradi, S., Hosseiniyan Khatibi, S.M., Zununi Vahed, S. et al. Unraveling the link between PTBP1 and severe asthma through machine learning and association rule mining method. Sci Rep 13, 15399 (2023). https://doi.org/10.1038/s41598-023-42581-5

Download citation

Received: 22 March 2023
Accepted: 12 September 2023
Published: 16 September 2023
DOI: https://doi.org/10.1038/s41598-023-42581-5
Springer Nature Limited

Unraveling the link between PTBP1 and severe asthma through machine learning and association rule mining method

Abstract

Similar content being viewed by others

The early detection of asthma based on blood gene expression

Development and validation of asthma risk prediction models using co-expression gene modules and machine learning methods

CDC167 exhibits potential as a biomarker for airway inflammation in asthma

Introduction