Abstract
This study on severe malarial anemia (SMA: Hb < 6.0 g/dL), a leading global cause of childhood morbidity and mortality, compares the entire expressed whole blood host transcriptome between Kenyan children (3-48 mos.) with non-SMA (Hb ≥ 6.0 g/dL, n = 39) and SMA (n = 18). Differential expression analyses reveal 1403 up-regulated and 279 down-regulated transcripts in SMA, signifying impairments in host inflammasome activation, cell death, and innate immune and cellular stress responses. Immune cell profiling shows decreased memory responses, antigen presentation, and immediate pathogen clearance, suggesting an immature/improperly regulated immune response in SMA. Module repertoire analysis of blood-specific gene signatures identifies up-regulation of erythroid genes, enhanced neutrophil activation, and impaired inflammatory responses in SMA. Enrichment analyses converge on disruptions in cellular homeostasis and regulatory pathways for the ubiquitin-proteasome system, autophagy, and heme metabolism. Pathway analyses highlight activation in response to hypoxic conditions [Hypoxia Inducible Factor (HIF)−1 target and Reactive Oxygen Species (ROS) signaling] as a central theme in SMA. These signaling pathways are also top-ranking in protein abundance measures and a Ugandan SMA cohort with available transcriptomic data. Targeted RNA-Seq validation shows strong concordance with our entire expressed transcriptome data. These findings identify key molecular themes in SMA pathogenesis, offering potential targets for new malaria therapies.
Similar content being viewed by others
Introduction
Malaria remains a significant global public health challenge with 249 million annual cases and 608,000 deaths1. The majority of the cases (233 million, 93.6%) and mortality (580,000, 95.4%) occurred in the WHO African region and are due to infections with Plasmodium falciparum1, mainly in children under five years of age. Kenya faces a substantial challenge with P. falciparum malaria, reporting ∼3.42 (2.23–5.02) million annual cases and ∼11,788 (11,100–12,700) deaths, primarily in the under-five population1. Since the disease burden increases with transmission intensity, severe malaria remains among the leading causes of morbidity and mortality in children residing in holoendemic P. falciparum regions of Kenya, and other such regions of sub-Saharan Africa2,3,4. In high transmission regions, the primary manifestation of severe malaria is severe malarial anemia [SMA, hemoglobin (Hb)<6.0 g/dL] in the presence and absence of respiratory distress, with cerebral malaria occurring only in rare (atypical) cases2,5,6.
The etiology of SMA is multifaceted and includes overlap** characteristics such as the lysis of infected and uninfected erythrocytes, sequestration of erythrocytes in the spleen, and suppression of bone marrow functions7,8. Although natural immunity is acquired following repeated infections with P. falciparum9,10,11, innate immunity is particularly important for influencing disease severity in young, malaria-naïve children. Our previous longitudinal studies in Kenyan children using a combination of candidate-gene approaches, genome-wide association studies, high-throughput genoty**, and array-based whole transcriptional profiling revealed that the development of SMA is mediated, partially by innate immune response genes7,12,13,14,15. Our targeted transcriptome analyses also revealed that differentially expressed genes (DEGs) in host ubiquitination processes are a central feature of SMA pathogenesis16. Microarray analysis of candidate genes in whole blood identified DEGs that encode amino acid transport, phospholipid metabolic processes, and positive regulation of nitrogen metabolism in Gabonese children with SMA (<6 years old)17.
Advances in high-throughput sequencing technologies and bioinformatics analyses have provided important insights into the human immune response to P. falciparum and identified potential vaccine candidates17,18,19,20. For example, studies investigating gene expression in adults during controlled human malaria infection experiments identified >2,700 DEGs in the whole blood transcriptome, for which a subset of 265 genes was associated with transcription and cell-cycle regulation, phosphatidylinositol signaling, and erythrocytic development21. In addition, next-generation RNA sequencing in whole blood, which has the potential to concomitantly capture host and parasite gene expression, showed that severe malaria (i.e., cerebral malaria, hyperlactatemia, or their combination) in Gambian children (<16 years old) was associated with increased expression of granulopoiesis and interferon-γ–related genes, and inadequate suppression of type 1 interferon signaling18. Additional analysis of whole blood transcriptomes in Ugandan children (age; 18 mos. to 12 yrs.) revealed that erythropoietic and nuclear factor erythroid 2 like 2 (NRF2)-regulated genes were differentially expressed between cerebral malaria and SMA cases22. Despite progress in defining the human immune response to P. falciparum, a comprehensive investigation of the entire expressed transcriptome has not been reported in children who develop SMA, the group that suffers the highest global morbidity and mortality2,5,8,23,24,25. Here, we present the top emergent biological processes, networks, and pathways for the first entire expressed whole blood transcriptome in Kenya children (<5 years old) from a holoendemic region of western Kenya who develop SMA as the exclusive phenotype of severe malaria.
Results
Demographic and clinical characteristics of the study participants
Since anemia definitions, particularly in pediatrics, require georeferencing26, we utilized data from our cohort of 1644 children (3-48 mos) followed over a 36-month period for which we recorded over 19,000 hemoglobin measurements. Because childhood SMA is among the leading causes of anemia-related death in western Kenya, we utilized a dynamic programming algorithm to determine average Hb concentrations across 36 months of follow-up to define categories that best-captured childhood mortality27. This method revealed three distinct Hb categories at p < 0.001: ≤5.90 g/dL (n = 62, mortality fraction = 0.53), 5.91–8.09 g/dL (n = 209, mortality fraction = 0.15), and >8.09 g/dL (n = 1373, mortality fraction = 0.03, Supplementary Fig. S1). These results parallel those in a previous longitudinal birth cohort (0-48 mos.) with 14,317 repeated Hb measurements from the same geographical region28. Based on this data-driven approach, two groups were investigated: SMA defined as Hb<6.0 g/dL and non-SMA as Hb ≥ 6.0 g/dL. For the RNA-Seq analysis, 39 children (3–48 mos.) with non-SMA and 18 with SMA were selected from the overall cohort (n = 577, 1–59 mos.), excluding those with HbSS (Supplementary Fig. S2). Admission demographic and clinical characteristics of the children selected for RNA-Seq in whole blood are shown in Table 1. Based on the selection criteria, sex (p = 0.777), overall age (p = 0.283), and distribution within age categories (p = 0.781) were comparable. Glucose levels (p = 0.377) and axillary temperature (p = 0.121) were also comparable between the groups. Consistent with more profound anemia in children with SMA, hematocrit (p = 1.678E−09) and red blood cell counts (p = 2.072E−09) were lower. White blood cell (p = 0.007) and lymphocyte (p = 0.001) counts were also elevated in children with SMA. Other hematological measures were comparable between the groups, as were parasitological indices, clinical features, and genetic variants.
Comparative cellular pathways in non-SMA and SMA reveals divergent metabolic and immune responses
To identify unique and shared genes, a Venn diagram analysis was performed on the 53,286 transcripts, revealing 602 genes that were uniquely expressed in non-SMA, 493 in SMA, and 16,036 co-expressed genes (Fig. 1a). Uniquely expressed genes in non-SMA and SMA were then explored by canonical pathway analysis (MetaCoreTM), revealing four significant sub-networks (Supplementary Table S1). The top-ranked sub-network in children with non-SMA was [TFF3↔IL-6↔IL6RA↔ADAM17↔gp130, (p = 3.180E−12)], highlighting gene ontology (GO) processes for the crucial role of T-helper 17 cell lineage commitment and differentiation, interleukin-6-mediated signaling, and T-helper cell lineage commitment in driving T-helper 17 type immune responses. The second sub-network [SHOX2↔Neuregulin1↔FGFR2↔Endothelin-1↔ECE2, (p = 1.930E−07)] indicates processes that regulate cellular signaling and proliferation, particularly through receptor tyrosine kinase and enzyme-linked pathways. The top-ranked sub-network in children with SMA [c-Myc↔C/EBPbeta↔STAT1↔STAT5 ↔ERK1/2, (p = 1.240E−200)] was associated with signaling pathways linked to growth factors, peptide hormones, stimuli, and transmembrane receptor protein tyrosine kinases. The second sub-network [ACE1↔des-Arg9-bradykinin↔BDKRB1↔des-Arg10-kallidin↔BDKRB2, (p = 5.310E−103)] underscores positive regulation of cellular responses associated with hormones, chemicals, and metabolic processes. Collectively, these data indicate that non-SMA was characterized by changes in immune response and cell signaling, while SMA involved a broader spectrum of cellular activities, including metabolic regulation and hormone responses. Differential expression analysis of the 53,286 transcripts was then performed, identifying 1682 DEGs [false discovery rate (FDR)-adjusted, padj < 0.050]: 1403 up- and 279 down-regulated genes in children with SMA (Fig. 1b).
Insights on gene expression variance and group heterogeneity
A PCA analysis revealed that the first principal component (PCA1) accounted for 36.1% of total variance and differentiated a portion of the non-SMA and SMA groups, while PCA2 (13.8% total variance) had tighter clustering for the non-SMA group (Supplementary Fig. S3). The non-SMA group displayed tighter clustering (homogeneity) across age, sex, and sickle cell traits. In contrast, the SMA group exhibited more dispersion, notably among males and those with the HbAA trait. Thus, while there were distinct gene expression features that distinguished between the two groups, children with SMA were more heterogeneous.
Functional interactions and pathway networks indicate altered immune responses, metabolic processes, and erythrocyte differentiation
To further explore the pathogenesis of SMA, we performed non-supervised hierarchical cluster analysis of the significant DEGs (n = 1682). These analyses identified three unique co-regulated gene clusters that differentiated children with non-SMA and SMA with distinct patterns for WBCs and lymphocytes but not for parasitemia levels, sickle cell status (HbAA vs. HbAS), or age distribution (Fig. 1c). To gain further insight into the networks for the major co-regulated gene clusters, canonical pathway maps for direct functional interactions were generated. The network for down-regulated genes (n = 279, cluster 1) in SMA was IRF1↔SUZ12↔IL-1β↔NRF2↔LHX2 with the transcription factor, IRF1, as the central divergence hub (green-square box, 56 direct interactions) and IL-1β as the central convergence hub (red-square box, 44 direct interactions, Fig. 1d). The top processes associated with the functional interactions for cluster 1 were regulation of innate immune response (p = 3.272E−17), and defense response (p = 6.348E−17). Cluster 2 (n = 489, up-regulated) generated a TAL1↔LYL1↔BRD4↔FOXO3A↔EKLF1 network with the transcription factor, TAL1, as the central divergence hub (green-square box, 343 direct interactions). The central convergence hub in cluster 2 (red-square boxes) was the transcription factor E2F2 (7 functional interactions), while the secondary divergence hubs were GLUT1 and HMBS (both with 7 direct interactions, Fig. 1e). Top interactions for cluster 2 were metabolic process (p = 1.804E−25) and erythrocyte differentiation (p = 1.425E−13). Collectively, SMA was associated with suppressed innate immune stress responses, elevated metabolic processes, and promotion of erythrocyte differentiation.
Altered leukocytic immune cell profiles in SMA
To determine if leukocytic immune profiles differed in children who developed SMA, a bioinformatics approach was implemented using CIBERSORTx29,30. Despite interindividual variability, transcriptional profiling identified five immune cell types that were differentially expressed at p < 0.050 (Fig. 2a). Children with SMA had increased expression of naïve B cells (p = 9.741E-05) and CD8 T cells (p = 0.026) (Fig. 2b). In contrast, the SMA group had a lower proportion of expression for memory B cells (p = 0.035), activated dendritic cells (p = 0.007), and neutrophils (p = 0.026) (Fig. 2b). Based on the immune cell type profiles in the expression data, children with SMA appeared to have a decreased antigenic response and reduced immune priming.
Blood transcriptome module repertoire analysis highlights erythroid upregulation and neutrophil activation amidst impaired inflammation in SMA
To further identify host immunological profiles associated with SMA, blood-specific gene expression signatures were examined using the BloodGen3Module in R31,32. To focus on the most enriched signals, the top 10% of the up- and down-regulated gene clusters are presented. As shown in Fig. 2c, d, and Supplementary Table S2, up-regulated gene patterns were largely encompassed by erythroid cells (e.g., HBBP1, RHD, ST3GAL1, SLC2A1, and GCLC), protein modification (e.g., autophagy and ubiquitin-proteome system: FURIN, GABARAPL2, TM7SF2, RNF14, UBE2M, TAX1BP1), and neutrophil activation (e.g., CEACAM6, CEACAM8, CTSG, DEFA3, DEFA4). Conversely, the top down-regulated gene signatures were for inflammation (e.g., FCGR3B, TLR1, TLR5, FPR2, HIF1A, SOD2, HSD17B11, MAPK14, SOCS3, LIMK2, ACSL1, NDUFB3), interferon and cytokines/chemokines (CSF2RB, FPR2, IFIT2, IFIT5, MX1, NOD2, GBP1, GBP4, GBP5, CASP1, CASP4, CASP5, MAPK14, TAP1, PSMB9, SP100), and neutrophils (e.g., HSPA1A, ADAM8, FCGR2A, FCGR3B, FPR1, CYB5R4, NCF4, HLA-E, LCP1, BCL3, IRF1, NFKBIA, BCL3, and MCL1, Fig. 2c, d, and Supplementary Table S2). Individual-level data for the top aggregates and accompanying gene expression modules are shown in Fig. 2e. In summary, BloodGen3Module analysis showed that SMA was characterized by the upregulation of erythroid cells, enhanced neutrophil activation, and impaired inflammatory response.
Functional Enrichment Analysis Reveals Disruptions in Protein Degradation, Heme metabolism, Cellular Clearance Mechanisms, and Efferocytosis in SMA
To identify characteristics associated with develo** SMA, Gene Ontology (GO) enrichment analysis was performed for three domains (i.e., biological process, cellular components, and molecular functions) with the top 20 in each presented in Fig. 3a. Of the three domains, biological processes exhibited the most significant enrichment in children with SMA. For example, proteasome-mediated ubiquitin-dependent protein catabolic process (padj = 8.392E−05), protoporphyrinogen IX metabolic process (padj = 1.266E−04), and regulation of autophagy (padj = 1.275E−04) were the top-ranked biological processes. The ubiquitin ligase complex (padj = 2.659E−03), phagophore assembly site (padj = 6.669E−03), and hemoglobin complex (padj = 6.804E−03) were the top cellular components enriched in SMA. The chromatin DNA binding (padj = 5.046E−04), protein-macromolecule adaptor activity (padj = 6.103E−03), and transcription corepressor activity (padj = 6.103E−03) showed the greatest enrichment for molecular functions. Additional analysis using the Reactome enrichment identified the phospholipid metabolism (padj = 3.953E-04), activation of BH3-only proteins (padj = 1.030E−03), and heme biosynthesis (padj = 1.481E−03) as the top-ranked pathways in children with SMA (Fig. 3b and c). To complement this analysis, functional classification of the DEGs for the KEGG pathways revealed three significant pathways: efferocytosis (padj = 1.031E−04), mitophagy-animal (padj = 1.839E−03), and autophagy-animal (padj = 1.940E−02, Supplementary Fig. S4). Together, the enrichment analyses indicated that SMA was characterized by a multifaceted disturbance involving protein degradation (ubiquitination), heme metabolism, cellular clearance mechanisms (autophagy), and efferocytosis, consistent with disruptions in cellular homeostasis and regulatory pathways.
Canonical Pathways Converge on Alterations in Cellular Stress Response, Immune Modulation, and Metabolic Changes in SMA
Additional characterization of SMA pathogenesis was carried out by exploring the top 10 significant canonical pathway maps with a Log2FoldChange = 0.585 (1.5 linear FoldChange) and false discovery rate (FDR) < 0.050 (MetaCoreTM, Fig. 4a). The top-ranked pathways in the functional categories were: (i) apoptosis and survival—Regulation of apoptosis by mitochondrial proteins (padj = 4.150E−02); (ii) immune response—IFN-alpha/beta signaling via MAPKs (padj = 5.665E−03); (iii) oxidative stress—ROS signaling (padj = 2.089E-02); (iv) regulation of metabolism—Glucocorticoid receptor signaling in glucose and lipid metabolism (padj = 2.089E−02); (v) signal transduction—mTORC2 downstream signaling (padj = 2.105E−02); and (vi) transcription—HIF-1 targets (padj = 1.038E−02). These pathways collectively revealed a complex interplay of cellular stress response, immune modulation, metabolic changes, and transcriptional regulation in SMA.
Validation of whole blood transcriptome data using an external dataset and targeted RNA-Seq
To validate the RNA-Seq results, we utilized a transcriptomic dataset from a Ugandan cohort of children with SMA (n = 17) and community children (household controls, n = 12)22. Enrichment analysis with 323 genes [Log2FoldChange = 0.585 and false discovery rate (FDR) < 0.050] for pathway maps identified HIF-1 targets involved in transcription as the top emergent pathway (padj = 1.405E−03, Supplementary Fig. S5), the second top-ranked pathway map in our dataset. As such, both datasets were mapped on the HIF-1 targets pathway for a direct comparison of DEGs (Fig. 4b). Of the top 10 pathways in the Ugandan dataset (Supplementary Fig. S5), the only other common pathway in the Kenyan dataset was ROS signaling involved in oxidative stress (padj = 3.994E−02, Supplementary Fig. S6). To further validate DEGs identified in the whole blood transcriptomic analysis, we utilized a Qiagen targeted RNA sequencing panel of 491 immune response genes in a separate cohort of Kenyan children [non-SMA (n = 23) and SMA (n = 20), Supplementary Table S3]. To enhance biological relevance and reduce noise by excluding non-relevant variations that might be present in either or both datasets, cluster analysis was performed for the genes that were significant (p < 0.050) in both datasets (n = 147). A heatmap cluster analysis showed concordance in the fold-change and directionality in the two platforms (Fig. 5a) with the two datasets showing a robust correlation (r = 0.612; 95% confidence interval (CI), 0.496-0.706; p = 1.842E−16, Fig. 5b). The external dataset and targeted RNA sequencing panel validation have high concordance with the whole transcriptomics in Kenyan children, indicating consistent DEGs across different SMA cohorts.
Comparative analysis of transcriptomics and protein abundance
To determine if the DEGs from the RNA-Seq align with changes in protein levels, we compared the whole blood transcriptome data with protein abundance (measured in plasma with a 7k SomaScan platform). MetaCoreTM was used to map genes to their respective protein products for 35 children (n = 19 non-SMA and n = 16 SMA) who had both measurements available. Analysis of the 405 gene/protein pairs (p < 0.050) showed concordance in the heatmap (Fig. 6a) and a scatterplot with a modest positive correlation (r = 0.205; 95% CI, 0.107-0.299; p = 3.200E−05, Fig. 6b). Pathway maps were then generated in MetaCoreTM, without thresholds, allowing for a comprehensive view of all transcript/protein relationships. The most aligned pathway was HIF-1 targets, showing high significance in both transcripts (padj = 2.391E−21; 95/95 nodes) and proteins (padj = 4.344E−18; 70/95 nodes, Supplementary Fig. S7), indicating a concordant biological overlap between transcriptomic and protein abundance changes, especially for HIF-1-related processes. Collectively, these data demonstrated a modest relationship between the magnitude and directionality of the transcript and protein measurements with a consistent pattern of biological changes across both molecular layers for HIF-1 targets.
Discussion
Severe life-threatening malaria is represented by distinct and overlap** disease features (one or more) of the following: impaired consciousness, prostration, multiple convulsions, acidosis, hypoglycemia, SMA, renal impairment, jaundice, pulmonary edema, significant bleeding, shock, and hyperparasitemia33. The clinical manifestations of severe malaria and the age at which they present are largely driven by P. falciparum endemicity8,34. The overwhelming majority of life-threatening severe malaria occurs in holoendemic P. falciparum transmission areas of sub-Saharan Africa in children under five years who develop SMA, making this severe manifestation a leading cause of childhood deaths in such regions7,8.
Identifying common gene pathways/networks that encompass the diverse pathophysiological landscape of severe malaria (i.e., mixed phenotype) has presented significant challenges, likely because distinct biological processes may not share common networks. A major advantage of studies in holoendemic malaria regions, such as western Kenya, is that children have a distinct pathophysiological presentation of SMA, making discovery of gene-disease relationships more feasible. Our previous studies have identified innate immune response genes that influence the pathogenesis of SMA, largely through imparting changes in soluble mediators of inflammation7,8,12,35,36. However, this is the first investigation to examine the entire expressed peripheral blood transcriptome in children whose primary phenotype of severe disease is SMA. This study identified previously undescribed biological pathways and process networks that converge on perturbations in cellular and immune stress responses, illustrating that the pathogenesis of SMA is complex and multifaceted.
Differential expression analysis identified both overlap** and unique cellular pathways between non-SMA and SMA groups. Enhanced Th17-type immune responses and activation of signaling cascades that regulate developmental processes and tissue regeneration in the non-SMA group suggest enhanced pathogen defense and an improved capacity for maintaining tissue homeostasis37,38,39. Activation of gene networks involved in cell survival, growth, and differentiation, and responses to external stimuli in children with SMA appear to indicate adaptive responses to stress and damage40,41.
Identification of molecular patterns by hierarchical clustering of the DEGs revealed unique co-regulated gene clusters that differentiated non-severe from severe disease with distinct WBC and lymphocyte profiles but showed association with neither level of parasitemia nor sickle cell status. Relationships that emerged from the analysis were then further deciphered by creating canonical process networks. Children with SMA had a set of down-regulated genes (Cluster 1: IRF1↔SUZ12↔IL-1β↔NRF2↔LHX2) that are central to host signaling, cellular differentiation, and defense42,43,44,45. Decreased expression of the transcription factor, IRF1, and downstream targets (i.e., IL-1β, caspases 1 and 4, and FasR) in the absence of a Nod-like receptor family (i.e., NLRP1, NLRP3, and NLRC4) response suggest an inability to initiate pyroptosis and dysregulation of the inflammasome46,47,48. Up-regulation of the co-regulated sub-network (Cluster 2: TAL1↔LYL1↔BRD4↔FOXO3A↔EKLF1) in the SMA group indicates a physiological attempt to overcome hypoxia through enhanced erythropoiesis and metabolic adjusts for restoration of hemostasis and improved oxygen delivery49,50,51.
Results from the immune profiling with CIBERSORTx indicate that children with SMA appear to have a diminished capacity for immune memory response, antigen presentation, and immediate pathogen clearance (reduced neutrophils), typifying an immature or improperly regulated immune response52. This pattern of expression is consistent with the results obtained from immunological profiles that emerged from the blood transcriptome module repertoire analysis in which SMA was characterized by down-regulation of gene sets for neutrophils, inflammation, interferon, and cytokine/chemokines. Decreased expression of the gene networks in children with SMA identified by the repertoire analysis suggests impairments in innate immune function, stress responses, pyroptosis, autophagy, and antigen processing and presentation, as well as weakened respiratory burst, reduced energy production, and metabolic derangements53,54,55,56,57,58,59,60. The pattern of up-regulated gene sets in children with SMA implies an attempt to enhance erythropoiesis and oxygen transport, altered ubiquitination and protein degradation, and enhanced stress response49,54, This prospective acute febrile cohort study was conducted at Siaya County Referral Hospital (SCRH), located in a holoendemic P. falciparum transmission region in western Kenya where SMA is among the main causes of childhood morbidity and mortality in the community2,5,24,25,82,83. Individuals inhabiting the study area are predominantly from the Luo group ( > 96%), an ethnically homogeneous population. In the rural region of Siaya County, Kenya, the assignment of sex at birth is influenced by the available medical facilities and the cultural practices prevalent within the community. Local healthcare practitioners often follow traditional methods to assign sex at birth based on physical characteristics. It is important to note that the binary categorization used in this context does not fully capture the spectrum of gender identities recognized by some of the local cultures. This study relies on definitions provided by local health authorities and community leaders, acknowledging that these practices vary across different communities. Female and male children (sex at birth, age 1-59 mos.) presenting at SCRH with symptoms of infectious diseases with the following inclusion criteria were enrolled: temperature ≥37.5 °C (axillary), distance to hospital ≤25 km, parent/guardian willing/able to sign an informed consent and agree to present for day 14 follow-up. Exclusion criteria for the children included: previous hospitalization for any reason, positive malaria RDT test results but negative peripheral parasitemia, an episode of malaria within the past one month, and presentation for non-infectious diseases. Based on these criteria, 577 children were enrolled (03/2017 to 09/2020, Supplementary Fig. S2). At enrollment, demographic and clinical data were collected, and a physical examination was performed. Prior to treatment with antimalarials or other medications, Venipuncture blood samples (3-4 mL) were collected for laboratory measures. The study was approved by the University of New Mexico Institutional Review Board, and the Maseno University Scientific and Ethics Review Committee. Written informed consent was provided by the parents/legal guardians of the study participants. It is important to define anemia based on geographic location because varying environmental (e.g., malaria endemicity, altitude), genetic (e.g., hemoglobinopathies), and dietary factors influence the prevalence and severity of anemia. Regional variations necessitate tailored definitions of anemia to ensure accurate diagnosis and effective treatment. To geographically define anemia categories, we followed a cohort of children (n = 1654) over a 36-month period. For the determination of clinical disease definitions based on anemia status, we included 1644 children (3-48 mos.) with robust follow-up data that included over 19,000 Hb measurements for the modeling (Supplementary Fig. S1). Details of the study area have previously been published25. A description of the study participants and longitudinal follow-up schedule for the individuals utilized in the analyses were previously described14. The analysis presented utilized two cohorts of children recruited and followed with identical parameters across a temporal continuum: cohort 1 (2003–2005; n = 777) and cohort 2 (2007–2012; n = 877). Briefly, children presenting with suspected malaria infections or reporting for routine vaccinations were recruited at SCRH. Children with varying severities of malarial anemia (n = 1319) and aparasitemic controls (n = 335) were enrolled following screening for malaria parasites. Exclusion criteria included: children with non-falciparum parasite strains, confirmed cerebral malaria, previously hospitalized for any reason, or had reported use of antimalarial therapy in the two preceding weeks. After enrollment (day 0), children (n = 1654) were scheduled for follow-up visits on day 14 (if they were febrile upon enrollment) and quarterly over 36 months. Physical evaluations and laboratory tests required for comprehensive clinical management of the patients were performed at enrollment, day 14, and each acute and quarterly visit [complete blood counts (CBC), malaria parasitemia measures, and evaluation of bacteremia where indicated]. All samples and biological materials were collected before treatment with antimalarials or other medications. Children were treated according to the Ministry of Health-Kenya guidelines that include antimalarials and blood transfusion of children with Hb<5.0 g/dL and/or Hb < 7.0 g/dL in the context of respiratory distress. Since SMA is among the leading causes of malaria-related mortality in the region, hemoglobin concentrations were averaged for each individual across all visits and correlated with mortality rates. A dynamic programming approach was employed which exhaustively tested all possible splits from 2-way to 10-way27. The criterion for the optimal number of splits was the largest κ for which all pairwise chi-square tests between resulting groups were significant at p < 0.001. The dynamic programming algorithm exhaustively searched through all possible cut points for hemoglobin to minimize the function Fκ, which is defined as: Fκ = -∑ (from j = 1 to k) [−2nj(pj log(pj) + (1—pj) log(1—pj))], where pj is the proportion of one’s (deaths) in segment j, and nj is the number of observations in segment j. This function represents the sum of the negative double products of the number of observations in each segment and the binary log-likelihood for each segment. Data were segmented into k subgroups to minimize the sum. The chi-squared statistic for each potential split was calculated as: X^2 = F0—Fκ, where F0 is the function value for the entire sample. The p-value for assessing the significance of each split was obtained from the chi-squared distribution: p = chisqr(X^2, k − 1). This method revealed three significantly different (p < 0.001) Hb level groups: ≤5.9 g/dL (n = 62, mortality fraction=0.53), 5.91-8.09 g/dL (n = 209, mortality fraction=0.15), and >8.09 g/dL (n = 1,373, mortality fraction=0.03). These independently derived data parallel anemia categories defined from a previous longitudinal birth cohort (0-48 mos.) with 14,317 repeated Hb measurements in the same geographical region: mild anemia (8.0-9.0 g/dL), moderate anemia (6.0-7.9 g/dL, and severe anemia ( < 6.0 g/dL)28. As such, children with P. falciparum infections (any density parasitemia) were stratified into two groups based on Hb concentrations: Hb≥6.0 g/dL (non-SMA, n = 39), and Hb<6.0 g/dL (SMA, n = 18). Although the primary outcome of severe malaria in this study was SMA, other clinical complications were also determined. Respiratory distress was defined as: (tachypnea criteria by age); 0-2 mos. (60 breaths per min., bpm), 2-12 mos. (50 bpm), 1-5 yrs. (40 bpm), retractions (in-drawing of the chest wall); grunting; nasal flaring; use of accessory muscles for breathing; and hypoxia (SpO2 < 9084,85,86. Additional definitions included convulsions (tonic-clonic seizures), hypoglycemia (blood glucose levels<2.2 mM), jaundice (yellowing of skin and/or sclera), and thrombocytopenia (platelet count <150 × 103/mm3)33. Children with non-SMA were defined as those presenting with a malaria-positive smear from P. falciparum parasitemia (of any density) and Hb ≥6.0 g/dL in the absence/presence of other features of severe malaria defined above. Since we have shown that co-infections (i.e., HIV and bacteremia) exacerbate the development of SMA, HIV testing and blood cultures were performed on all children per our published methods79,80,81. Parents/legal guardians of participating children received pre- and post-test HIV&AIDS counseling. Upon presentation to hospital, children were screened for presence of malaria parasites using previously published methods25. Briefly, heel/finger-prick blood ( < 100 μL) was drawn and used to determine parasitemia and Hb status for initial screening. Giemsa-stained thick and thin blood smears were then prepared and examined for asexual malaria parasites under oil immersion microscopy25. The number of P. falciparum parasites was determined per 300 leukocytes, and the parasite density was estimated using the total leukocyte count for each patient. Complete blood counts (CBCs) were determined using a DxH 500 hematology analyzer (Beckman-Coulter). HIV-1 status was determined by two rapid serological antibody tests (i.e., Unigold™ and Determine™) and confirmed by HIV-1 proviral DNA PCR tests as previously described80. Bacterial cultures were performed on ~1.0 mL of Venipuncture blood collected aseptically, inoculated into pediatric blood culture bottles (Peds Plus, Becton-Dickinson), and incubated in an automated BACTEC 9050 system (Becton-Dickinson) for 5 days. Positive cultures were examined by Gram staining and sub-cultured on blood agar, chocolate agar, or MacConkey agar plates. Bacterial isolates were identified according to standard microbiologic procedures as described previously81. To further characterize potential causes of anemia, sickle-cell trait status was determined by alkaline cellulose acetate electrophoresis (Helena BioSciences). To select samples for the RNA-Seq from the 577 enrolled study participants, children with malaria were stratified into two groups based on Hb levels (i.e., Hb≥6.0 g/dL and Hb<6.0 g/dL), and then matched according to age and sex. Further selection criteria for the RNA-Seq included omitting children with any detected co-infections, namely HIV-1, and laboratory-confirmed blood-borne bacterial cultures79,80,81. Children with sickle cell disease (HbSS) were also excluded from analysis. All data presented for analysis, and used to generate figures and tables excluded children presenting with HbSS. This selection strategy yielded 39 children with non-SMA (Hb≥6.0 g/dL) and 18 children with SMA (Hb<6.0 g/dL). Since severe malaria in western Kenya presents as SMA, it was considered the primary outcome variable in RNA-Seq analysis. However, samples of participants with concomitant clinical features of respiratory distress, hypoxia, convulsions, hypoglycemia, jaundice, and thrombocytopenia were included in the analysis. Approximately 500μL of whole blood collected from venipuncture prior to treatment was stabilized with Trizol® (Thermo Fisher Scientific Inc.), immediately frozen in liquid nitrogen, and then subsequently stored at −80 °C. Total RNA was batch-isolated using E.Z.N.A® Total RNA Kit (Omega Bio-Tek Inc.), treated with RNase-free DNase I (New England Biolabs Inc.), and further processed using RNA Clean & Concentrator (ZYMO Research Corp.). Prior to library preparation and sequencing, RNA degradation and contamination were captured on agarose gels, with purity confirmed using a NanoPhotometer® (IMPLEN). RNA integrity and quantification were measured using the RNA Nano 6000 Assay Kit on a Bioanalyzer 2100 system (Agilent Technologies). RNA quality was assessed using RNA Integrity Number (RIN), and only samples with RIN value of >8 were used for library preparations. For quality control, a predefined exclusion criteria based on RNA quality and read counts was applied, with the error rate set at <1% (GEO accession number GSE255403). To capture the entire expressed transcriptome, an amount of 1 μg RNA/sample was used as input material for sequencing library construction and postglobin mRNA depletion using GLOBINclear™-Human Kit (Thermo Fisher Scientific Inc.). Sequencing libraries were generated using NEBNext® UltraTM RNA Library Prep Kit for Illumina (New England Biolabs) following the manufacturer’s protocol. mRNA was purified from total RNA using poly-T oligo-attached magnetic beads. Fragmentation was performed using divalent cations, with an elevated temperature in NEBNext First Strand Synthesis Reaction Buffer (5X). To synthesize a 1st strand cDNA, random hexamer primer and M-MuLV Reverse Transcriptase (RNase H-) were used. The 2nd strand cDNA was synthesized using DNA Polymerase I and RNase H. Any residual overhangs were transformed into blunt ends using the exonuclease/ polymerase activities. To prepare the DNA fragments for hybridization, NEBNext Adaptor with hairpin loop structure was ligated to the adenylated 3′ ends of the fragments. The resulting cDNA fragments were purified using the AMPure XP system (Beckman Coulter) to select preferential fragments of 150 ~ 200 bps in length. The USER Enzyme (New England Biolabs) was used with size-selected, adaptor-ligated cDNA at 37 °C for 15 min followed by 5 min at 95 °C before PCR. Amplification was performed with Phusion High-Fidelity DNA polymerase, Universal PCR primers, and Index (X) Primer. The PCR amplicons were purified (AMPure XP system) and library quality was assessed on the Agilent Bioanalyzer 2100 system (Agilent Technologies). Clustering of index-coded samples was performed on a cBot Cluster Generation System using PE Cluster Kit cBot-HS (Illumina Inc.). Triplicate libraries were generated from SMA and non-SMA samples collected at hospital visit (day 0). Sequencing was performed to a depth of >20 million high-quality mappable reads on an Illumina NovaSeq 6000 sequencer (Novogene Corporation Inc.). All the downstream analyses were based on clean data with high quality. None of the samples were excluded from the analysis due to either RNA quality or number of reads obtained. Raw data were quality-controlled and filtered using fastp87. Clean reads were obtained by removing reads containing adapter, ply-N sequences (N > 10%), and low-quality reads (base quality of >50% bases of the read is ≤5) from raw data. In addition, Q20, Q30, and GC content of the clean data were calculated, and downstream analysis was performed using high-quality clean data (GEO accession number GSE255403). Reference genome and gene model annotation files were downloaded from a genome website browser (NCBI/UCSC/Ensembl) directly. Sequences were aligned to the human reference genome (GRCh38.p13)88 using the STAR software version 2.589. With a depth of sequencing covering >20 million mappable reads, the total mapped reads ratio was >70% in the SMA group relative to non-SMA (GEO accession number GSE255403), representing more than half the subjects in the case group. HTSeq v0.6.1 was used to generate read counts mapped to each gene90. The expected number of Fragments Per Kilobase Million Reads (FPKM) of each gene was calculated according to the gene, gene length, and number of reads mapped to the gene91. An FPKM > 1 was set as a threshold for a gene to the considered expressed. Differential expression analysis of the two clinical conditions (non-SMA vs. SMA) was performed using the EdgeR (3.16.5) package, utilizing a negative binomial (NB) based statistical model on RNA count data. The resulting p-values were adjusted using the Benjamini-Hochberg procedure to control for the false discovery rate92. Genes identified by EdgeR with adjusted p-values (padj) of <0.050 were assigned as differentially expressed. To identify the correlation between different genes, samples were clustered using expression level FPKM utilizing the hierarchical clustering distance method. For high-dimensional data reduction, identification of variance, and visualization of groups and trends, PCA was performed on the DEGs in non-SMA and SMA using the factoextra (1.0.7) package in R. The ‘prcomp()’ function was used to calculate the principal components, with the model data consisting of 57 observations (participants) of 1,682 variables (genes). Demographic (age and sex) and genetic variants (Hb AA and HbAS) data were integrated with the gene annotation file to generate separate 2D plots. The relative percentage of different immune cell types/subtypes in whole blood was imputed using CIBERSORTx29,30. This analytical tool processes gene expression data from a bulk admixture of different cell types to estimate the abundance of member cell types in a mixed cell population29. The curated signature matrix file, LM22, was used as the reference to deconvolute the relative fraction of different cell types in whole blood, resulting in inference of 22 types/subtypes of leukocytes. Imputation of cell-type specific gene expression levels was performed at the sample level with the output presented as the fractional proportion in whole blood for each study participant. The relative proportions of immune cell types were then compared between the non-SMA and SMA groups. To gain insight into coordinated expression of gene groups and their role in biological processes, particularly for immune response genes, modular analysis of blood transcriptome data was analyzed using the BloodGen3Module package in R. BloodGen3 consists of a fixed repertoire of transcriptional modules tailored for the analysis and interpretation of blood transcriptome datasets. The repertoire includes 382 modules with functional annotations, covering >14,000 transcripts, and allows for fingerprint representations. Group comparison analyses were performed using the Welch’s-corrected t-test (‘Groupcomparison’) function. For individual-level comparisons, the FDR threshold was 20%. ClusterProfiler93,94 R package was used to implement the enrichment analysis. GO analysis of DEGs was used to infer functional and biological functions, correcting for gene length bias93. Reactome Enrichment Analysis was used to identify pathways that mapped to biological and cellular networks95. Significantly enriched pathways were identified with KEGG using the R package ‘ClusterProfiler’93. For all analyses, padj < 0.050 were considered significant enrichment. Confirmation and further discovery of the findings were implemented by using MetaCoreTM (https://clarivate.com/products/metacore/) to identify DEGs that mapped to GO processes, networks, and pathways. To perform an independent validation of our findings, we leveraged an independent cohort of Ugandan children (18 mos. to 12 years) in which whole blood transcriptomics was conducted on children with SMA (n = 17) and community children (household controls, n = 12)22. Validation was performed by identifying the top-expressed pathways in MetaCoreTM with identical parameters utilized in the analysis of the Kenyan cohort [log2foldchange = 0.585 (1.5 linear FoldChange) and FDR < 0.050]. Additional validation was performed by utilizing a Qiagen Targeted RNA-Seq panel (491 genes: immune response) conducted on a different set of children with SMA (n = 20) and non-SMA (n = 23) to corroborate the expression of overlap** genes identified in the transcriptomic analysis. Study participants for RNA-Seq analysis were selected from a pool of children with malaria (n = 1,218, aged 3-36 months) and stratified into two discrete groups of SMA (Hb<6.0 g/dL, Avg, 4.05 g/dL n = 20, cases) and non-SMA (Hb≥6.0 g/dL Hb, Avg, 9.96 g/dL, n = 23, controls). Exclusion criteria comprised children with co-morbidities such as bacteremia, HIV, and hemoglobinopathies (sickle cell disease), which can affect the anemia outcome. On the first hospital visit before treatment interventions, leukocytes of the children were collected and used for this study. To maintain the integrity of RNA, WBC pellets were kept in an equivalent volume of Trizol® LS (Thermo Fisher Scientific Inc.) after collection from the study site at SCRH. Total RNA was extracted using the RNeasy® Plus Micro Kit (Qiagen) according to the manufacturer’s protocol (Qiagen) and purified using the RNeasy Micro Kit (Qiagen). The RNA quality was validated using the 260/280 and 260/230 ratios on a NanoPhotometer® (IMPLEN), and the integrity assessed using an automated capillary electrophoresis on an Agilent 2100 Bioanalyzer (Agilent Technologies) according to the manufacturer’s protocol. RNA samples with a RIN < 8 were excluded from QIAseq analysis. Total RNA (125 ng) was used to synthesize cDNA using the First Strand Kit reagents (Qiagen) specific for the Targeted Human Inflammation and Immunity Transcriptome Panel (RHS-005Z, Qiagen). Following cDNA synthesis, gene-specific primers with twelve-base random molecular barcodes were added into each unique target strand via single primer extension, allowing quantification of gene expression in each multiplexed sample. A double QIAseq bead clean-up was done for the samples, followed by an eight-cycle limited gene-specific PCR reaction using a gene-specific primer and a universal primer in a total reaction volume of 25 µL. A second QIAseq bead clean-up was done to eliminate excess primer dimers prior to the final universal enrichment and sample-index PCR reaction of 18 cycles. A last universal enrichment index PCR step included the addition of dual unique index primers to allow for the pooling of all samples prior to sequencing. After the final PCR reaction, the QIAseq bead was cleaned to eliminate unincorporated primers. A QIAseq Library Quant Array Kit (Qiagen) was used to quantify the final purified QIAseq libraries. Individual QIAseq targeted RNA libraries were normalized and pooled using equimolar quantities based on the QIAseq Library Quant Array Kit (Qiagen) concentrations. Specifically, the pooled samples were denatured with 0.2 M NaOH and then diluted with hybridization buffer HT1 to a final dilution concentration of 1.2 pM. Each QIAseq Targeted RNA library was normalized to 4 nM with an average length of about 300 bp. Equal volumes of the equally concentrated libraries were mixed to create the final library pool for sequencing on an Illumina NextSeq500 device. Sequencing run were performed with 1×151 cycles and 8 bp dual index reads. Three independent sequencing reactions were used to complete the Forward Read 1 sequence with QIAseq Read1 Primer I (Qiagen) and then dual multiplexing of indices with Index 1 Read and Index 2 Read Primers (Illumina). The generated FASTQ files were analyzed using QIAseq RNA Quantification pipelines (Qiagen). Trimming of primers prior to read alignment was performed to confirm sequence identity by the internal sequence. Data normalization was performed by obtaining the average unique molecular index (UMI) tag count for the selected reference genes in each sample and in each group to ensure stability across Samples. The p-values were calculated using a generalized linear model with a negative binomial distribution of the replicate normalized gene expression data for each gene and expressed as a fold-change (FC > 1.5 and p < 0.050) in SMA relative to non-SMA, with p < 0.05 indicating significance. Expression analysis using molecular tag counts from RNA-Seq file was done using Targeted RNA Panel Secondary Data Analysis web portal (v1.0, Qiagen). Samples that failed the Data QC step from missing reference genes or suspected DNA contamination quality control were excluded. After normalizing the data with ten reference genes encompassed in the QIAseq Targeted RNA Panels, fold regulation and fold-change were calculated, and p-values generated. For this analysis, the significant (p < 0.050) transcripts and proteins were compared. To mitigate inter-individual variability in the analysis, we selected plasma samples from the same group of children whose whole blood transcriptome was performed. This strategy resulted in 40 children with plasma samples in sufficient quantities and quality available for proteomic analysis, stratified into SMA (Hb<6.0 g/dL, n = 18, cases) and non-SMA (Hb≥6.0 g/dL, n = 22, controls). Venipuncture whole blood samples (1.0 to 3.0 mL) were collected and instantly centrifuged (311 ×g) to separate plasma, aliquoted, and stored at −80 °C until use. The samples (no previous freeze-thaw cycles) were analyzed on a 7k SomaScan Assay v4.1 platform (SomaLogic), following the manufacturer’s protocol. Briefly, plasma samples were diluted, and SOMAmers synthesized with a fluorophore, photocleavable linker, and biotin. Diluted samples were incubated with dilution-specific SOMAmers attached to streptavidin beads. The bound proteins were tagged with biotin, while the unbound proteins were washed away. To dissociate the photocleavable linker, the mixture was exposed to ultraviolet (UV) light, releasing complexes into the solution. Specific complexes remained bound while non-specific ones dissociated. A polyanionic competitor was introduced to prevent the reformation of non-specific complexes. Subsequently, new streptavidin beads captured the biotinylated proteins and bound SOMAmers. The SOMAmers were released by denaturing the proteins, and fluorophores measured on a microarray chip. The fluorescence intensity, measured in relative fluorescence units (RFU), inferred the quantity of epitope in the original plasma sample96. The aptamer-based scan had median limit of detection (LOD) of 125 fM or 5.3 pg/mL97. Data standardization steps that comprised normalization and calibration were applied to mitigate systematic biases in raw assay following microarray feature aggregation (Supplementary Table S4). The normalization step involved a sample-by-sample adjustment to overall signals within the plasma dilutions, while calibration constituted an overall plate, and SOMAmer-by-SOMAmer adjustments, aimed at decreasing between-plate variability. The final analysis incorporated 35 samples that passed the quality control check, while five samples were excluded due to elevated normalization scale across dilution factors (n = 3), high hybridization scale factor (indicating a leak, n = 1), or clogged/low volume (n = 1). The protein measurements (RFU) were compared between non-SMA and SMA groups using a generalized linear model with a negative binomial distribution. Proteins were matched to their respective transcripts using network algorithms in MetaCoreTM, and correlation analyses determined using Spearman’s test. Further information on research design is available in the Nature Portfolio Reporting Summary linked to this article.Methods
Study participants
Ethics statement
Clinical disease definitions
Laboratory procedures
Study participant selection for RNA-Seq
RNA isolation, library construction, and sequencing
Bioinformatics analysis
Quality control
Map** to reference genome
Quantification
Differential expression analysis
Principal component analysis (PCA)
Leukocytic immune cell profiling
Module repertoire analysis of DEGs
Enrichment analysis
Validation of transcriptome profiles
Sample selection for QIA-seq analysis
RNA isolation, library construction, and sequencing
Data processing and analysis
Differential expression analysis
Comparative Analysis of Transcriptomics and Proteomics
Sample selection for proteomic analysis
Multiplex plasma proteome profiling by microarray detection
Data processing and analysis
Reporting summary
Data availability
Gene expression data is available in the Gene Expression Omnibus (GEO; https://www.ncbi.nlm.nih.gov/geo/) under the accession number GSE255403. Source data are provided with this paper.
References
WHO. World Malaria Report 2023. Report No. CC BY-NC-SA 3.0 IGO, (Geneva, 2023).
Akech, S. et al. The clinical profile of severe pediatric malaria in an area targeted for routine RTS, S/AS01 malaria vaccination in Western Kenya. Clin. Infect. Dis. 71, 372–380 (2020).
Amek, N. O. et al. Childhood cause-specific mortality in rural Western Kenya: application of the InterVA-4 model. Glob. Health Action 7, 25581 (2014).
Amek, N. O. et al. Infant and child mortality in relation to malaria transmission in KEMRI/CDC HDSS, Western Kenya: validation of verbal autopsy. Malar. J. 17, 37 (2018).
Obonyo, C. O., Vulule, J., Akhwale, W. S. & Grobbee, D. E. In-hospital morbidity and mortality due to severe malarial anemia in western Kenya. Am. J. Trop. Med. Hyg. 77, 23–28 (2007).
Zucker, J. R. et al. Clinical signs for the recognition of children with moderate or severe anaemia in western Kenya. Bull. World Health Organ. 75, 97–102 (1997).
Perkins, D. J. et al. The Global Burden of Severe Falciparum Malaria: An Immunological and Genetic Perspective on Pathogenesis in Dynamic Models of Infectious Diseases: 1: Vector-Borne Diseases (eds Vadrevu Sree Hari, Rao & Ravi Durvasula) 231-283 (Springer New York, 2013).
Perkins, D. J. et al. Severe malarial anemia: innate immunity and pathogenesis. Int J. Biol. Sci. 7, 1427–1442 (2011).
Barry, A. & Hansen, D. Naturally acquired immunity to malaria. Parasitology 143, 125–128 (2016).
Doolan, D. L., Dobano, C. & Baird, J. K. Acquired immunity to malaria. Clin. Microbiol. Rev. 22, 13–36 (2009).
Gonzales, S. J. et al. Naturally acquired humoral immunity against Plasmodium falciparum Malaria. Front. Immunol. 11, 594653 (2020).
Achieng, A. O. et al. Integrated OMICS platforms identify LAIR1 genetic variants as novel predictors of cross-sectional and longitudinal susceptibility to severe malaria and all-cause mortality in Kenyan children. EBioMedicine 45, 290–302 (2019).
Awandare, G. A. et al. A macrophage migration inhibitory factor promoter polymorphism is associated with high-density parasitemia in children with malaria. Genes Immun. 7, 568–575 (2006).
Kisia, L. E. et al. Genetic variation in CSF2 (5q31.1) is associated with longitudinal susceptibility to pediatric malaria, severe malarial anemia, and all-cause mortality in a high-burden malaria and HIV region of Kenya. Trop. Med. Health 50, 41 (2022).
Kwiatkowski, D. P. How malaria has affected the human genome and what human genetics can teach us about malaria. Am. J. Hum. Genet. 77, 171–192 (2005).
Anyona, S. B. et al. Differential gene expression in host ubiquitination processes in childhood malarial anemia. Front. Genet. 12, 764759 (2021).
Boldt, A. B. W. et al. The blood transcriptome of childhood malaria. EBioMedicine 40, 614–625 (2019).
Lee, H. J. et al. Integrated pathogen load and dual transcriptome analysis of systemic host-pathogen interactions in severe malaria. Sci Transl Med 10, https://doi.org/10.1126/scitranslmed.aar3619 (2018).
Mortazavi, A., Williams, B. A., McCue, K., Schaeffer, L. & Wold, B. Map** and quantifying mammalian transcriptomes by RNA-Seq. Nat. Methods 5, 621–628 (2008).
Yamagishi, J. et al. Interactive transcriptome analysis of malaria patients and infecting Plasmodium falciparum. Genome Res. 24, 1433–1444 (2014).
Rothen, J. et al. Whole blood transcriptome changes following controlled human malaria infection in malaria pre-exposed volunteers correlate with parasite prepatent period. PLoS One 13, e0199392 (2018).
Nallandhighal, S. et al. Whole-blood transcriptional signatures composed of Erythropoietic and NRF2-regulated genes differ between cerebral malaria and severe malarial anemia. J. Infect. Dis. 219, 154–164 (2019).
Marsh, K., English, M., Crawley, J. & Peshu, N. The pathogenesis of severe malaria in African children. Ann. Trop. Med Parasitol. 90, 395–402 (1996).
Novelli, E. M. et al. Clinical predictors of severe malarial anaemia in a holoendemic Plasmodium falciparum transmission area. Br. J. Haematol. 149, 711–721 (2010).
Ong’echa, J. M. et al. Parasitemia, anemia, and malarial anemia in infants and young children in a rural holoendemic Plasmodium falciparum transmission area. Am. J. Trop. Med. Hyg. 74, 376–385 (2006).
Chaparro, C. M. & Suchdev, P. S. Anemia epidemiology, pathophysiology, and etiology in low- and middle-income countries. Ann. N. Y. Acad. Sci. 1450, 15–31 (2019).
Hawkins, D. M. Fitting multiple change-point models to data. Computational Statistics & Data Analysis 37, 323–341, (2001).
McElroy, P. D. et al. Analysis of repeated hemoglobin measures in full-term, normal birth weight Kenyan children between birth and four years of age. III. The Asemobo Bay Cohort Project. Am. J. Trop. Med. Hyg. 61, 932–940 (1999).
Newman, A. M. et al. Determining cell type abundance and expression from bulk tissues with digital cytometry. Nat. Biotechnol. 37, 773–782 (2019).
Steen, C. B., Liu, C. L., Alizadeh, A. A. & Newman, A. M. Profiling cell type abundance and expression in bulk tissues with CIBERSORTx. Methods Mol. Biol. 2117, 135–157 (2020).
Altman, M. C. et al. Development of a fixed module repertoire for the analysis and interpretation of blood transcriptome data. Nat. Commun. 12, 4385 (2021).
Rinchai, D. et al. BloodGen3Module: blood transcriptional module repertoire analysis and visualization using R. Bioinformatics 37, 2382–2389 (2021).
White, N. J. Severe malaria. Malar. J. 21, 284 (2022).
Weiss, D. J. et al. Map** the global prevalence, incidence, and mortality of Plasmodium falciparum, 2000-17: a spatial and temporal modelling study. Lancet 394, 322–331 (2019).
Anyona, S. B. et al. Cyclooxygenase-2 haplotypes influence the longitudinal risk of malaria and severe malarial anemia in Kenyan children from a holoendemic transmission region. J. Hum. Genet. 65, 99–113 (2020).
Munde, E. O. et al. Association between Fcgamma receptor IIA, IIIA and IIIB genetic polymorphisms and susceptibility to severe malaria anemia in children in western Kenya. BMC Infect. Dis. 17, 289 (2017).
Hoffmann, W. Trefoil Factor Family (TFF) Peptides and Their Links to Inflammation: A Re-evaluation and New Medical Perspectives. Int. J. Mol. Sci. 22, https://doi.org/10.3390/ijms22094909 (2021).
McAleer, J. P. & Kolls, J. K. Mechanisms controlling Th17 cytokine expression and host defense. J. Leukoc. Biol. 90, 263–270 (2011).
Sezin, T., Selvakumar, B. & Scheffold, A. The role of A Disintegrin and Metalloproteinase (ADAM)−10 in T helper cell biology. Biochim. Biophys. Acta Mol. Cell Res. 1869, 119192 (2022).
Kültz, D. Evolution of cellular stress response mechanisms. J. Exp. Zool. Part A Ecol. Integr. Physiol. 333, 359–378 (2020).
Paudel, S., Ghimire, L., **, L., Jeansonne, D. & Jeyaseelan, S. Regulation of emergency granulopoiesis during infection. Front. Immunol. 13, 961601 (2022).
Feng, H., Zhang, Y. B., Gui, J. F., Lemon, S. M. & Yamane, D. Interferon regulatory factor 1 (IRF1) and anti-pathogen innate immune responses. PLoS Pathog. 17, e1009220 (2021).
Gonzalez-Carnicero, Z. et al. Regulation by Nrf2 of IL-1beta-induced inflammatory and oxidative response in VSMC and its relationship with TLR4. Front. Pharm. 14, 1058488 (2023).
Hammad, M. et al. Roles of oxidative stress and Nrf2 signaling in pathogenic and non-pathogenic cells: a possible general mechanism of resistance to therapy. Antioxidants 12, https://doi.org/10.3390/antiox12071371 (2023).
Kobayashi, E. H. et al. Nrf2 suppresses macrophage inflammatory response by blocking proinflammatory cytokine transcription. Nat. Commun. 7, 11624 (2016).
Franchi, L., Eigenbrod, T., Munoz-Planillo, R. & Nunez, G. The inflammasome: a caspase-1-activation platform that regulates immune responses and disease pathogenesis. Nat. Immunol. 10, 241–247 (2009).
Man, S. M. & Kanneganti, T. D. Converging roles of caspases in inflammasome activation, cell death and innate immunity. Nat. Rev. Immunol. 16, 7–21 (2016).
Reimer, T. et al. Experimental cerebral malaria progresses independently of the Nlrp3 inflammasome. Eur. J. Immunol. 40, 764–769 (2010).
Caulier, A. L. & Sankaran, V. G. Molecular and cellular mechanisms that regulate human erythropoiesis. Blood 139, 2450–2459 (2022).
Gnanapragasam, M. N. et al. EKLF/KLF1-regulated cell cycle exit is essential for erythroblast enucleation. Blood 128, 1631–1641 (2016).
Liang, R. et al. A systems approach identifies essential FOXO3 functions at key steps of terminal erythropoiesis. PLoS Genet. 11, e1005526 (2015).
Kumar, R., Loughland, J. R., Ng, S. S., Boyle, M. J. & Engwerda, C. R. The regulation of CD4(+) T cells during malaria. Immunol. Rev. 293, 70–87 (2020).
Hachim, M. Y., Khalil, B. A., Elemam, N. M. & Maghazachi, A. A. Pyroptosis: The missing puzzle among innate and adaptive immunity crosstalk. J. Leukoc. Biol. 108, 323–338 (2020).
Mathangasinghe, Y., Fauvet, B., Jane, S. M., Goloubinoff, P. & Nillegoda, N. B. The Hsp70 chaperone system: distinct roles in erythrocyte formation and maintenance. Haematologica 106, 1519–1534 (2021).
Pant, A. et al. Interactions of autophagy and the immune system in health and diseases. Autophagy Rep. 1, 438–515 (2022).
Pishesha, N., Harmand, T. J. & Ploegh, H. L. A guide to antigen processing and presentation. Nat. Rev. Immunol. 22, 751–764 (2022).
Quan, J., Bode, A. M. & Luo, X. ACSL family: The regulatory mechanisms and therapeutic implications in cancer. Eur. J. Pharm. 909, 174397 (2021).
Sperry, M. M. et al. Enhancers of host immune tolerance to bacterial infection discovered using linked computational and experimental approaches. Adv. Sci. 9, e2200222 (2022).
Taylor, C. T. & Scholz, C. C. The effect of HIF on metabolism and immunity. Nat. Rev. Nephrol. 18, 573–587 (2022).
Yu, G., Wang, L. G., Han, Y. & He, Q. Y. clusterProfiler: an R package for comparing biological themes among gene clusters. OMICS 16, 284–287 (2012).
Sheng, X., **a, Z., Yang, H. & Hu, R. The ubiquitin codes in cellular stress responses. Protein Cell 15, 157–190 (2024).
Li, Y., Li, S. & Wu, H. Ubiquitination-Proteasome System (UPS) and autophagy two main protein degradation machineries in response to cell stress. Cells 11, https://doi.org/10.3390/cells11050851 (2022).
Fujii, J. et al. Erythrocytes as a preferential target of oxidative stress in blood. Free Radic. Res 55, 562–580 (2021).
Anyona, S. B. et al. Ingestion of hemozoin by peripheral blood mononuclear cells alters temporal gene expression of ubiquitination processes. Biochem Biophys. Rep. 29, 101207 (2022).
Deretic, V. Autophagy in inflammation, infection, and immunometabolism. Immunity 54, 437–453 (2021).
Chiabrando, D., Vinchi, F., Fiorito, V., Mercurio, S. & Tolosano, E. Heme in pathophysiology: a matter of scavenging, metabolism and trafficking across cell membranes. Front Pharm. 5, 61 (2014).
Ge, Y., Huang, M. & Yao, Y. M. Efferocytosis and its role in inflammatory disorders. Front Cell Dev. Biol. 10, 839248 (2022).
Kourtzelis, I., Hajishengallis, G. & Chavakis, T. Phagocytosis of apoptotic cells in resolution of inflammation. Front Immunol. 11, 553 (2020).
Awandare, G. A. et al. Mechanisms of erythropoiesis inhibition by malarial pigment and malaria-induced proinflammatory mediators in an in vitro model. Am. J. Hematol. 86, 155–162 (2011).
Cumnock, K. et al. Host energy source is important for disease tolerance to malaria. Curr. Biol. 28, 1635–1642 e1633 (2018).
Samec, M. et al. Flavonoids Targeting HIF-1: Implications on cancer metabolism. Cancers 13, https://doi.org/10.3390/cancers13010130 (2021).
Watts, D. et al. Hypoxia pathway proteins are master regulators of Erythropoiesis. Int. J. Mol. Sci. 21, https://doi.org/10.3390/ijms21218131 (2020).
Lennicke, C. & Cocheme, H. M. Redox metabolism: ROS as specific molecular regulators of cell signaling and function. Mol. Cell 81, 3691–3707 (2021).
Bauernfeind, A. L. & Babbitt, C. C. The predictive nature of transcript expression levels on protein expression in adult human brain. BMC Genomics 18, 322 (2017).
de Sousa Abreu, R., Penalva, L. O., Marcotte, E. M. & Vogel, C. Global signatures of protein and mRNA expression levels. Mol. Biosyst. 5, 1512–1526 (2009).
Jiang, W. et al. Analytical considerations of large-scale aptamer-based datasets for translational applications. Cancers 14, https://doi.org/10.3390/cancers14092227 (2022).
Koussounadis, A., Langdon, S. P., Um, I. H., Harrison, D. J. & Smith, V. A. Relationship between differentially expressed mRNA and mRNA-protein correlations in a xenograft model system. Sci. Rep. 5, 10775 (2015).
Vogel, C. & Marcotte, E. M. Insights into the regulation of protein abundance from proteomic and transcriptomic analyses. Nat. Rev. Genet. 13, 227–232 (2012).
Davenport, G. C., Hittner, J. B., Were, T., Ong’echa, J. M. & Perkins, D. J. Relationship between inflammatory mediator patterns and anemia in HIV-1 positive and exposed children with Plasmodium falciparum malaria. Am. J. Hematol., https://doi.org/10.1002/ajh.23200 (2012).
Otieno, R. O. et al. Increased severe anemia in HIV-1-exposed and HIV-1-positive infants and children during acute malaria. AIDS 20, 275–280 (2006).
Were, T. et al. Bacteremia in Kenyan children presenting with malaria. J. Clin. Microbiol. 49, 671–676 (2011).
McElroy, P. D. et al. All-cause mortality among young children in western Kenya. VI: the Asembo Bay Cohort Project. Am. J. Trop. Med. Hyg. 64, 18–27 (2001).
Okiro, E. A., Alegana, V. A., Noor, A. M. & Snow, R. W. Changing malaria intervention coverage, transmission and hospitalization in Kenya. Malar. J. 9, 285 (2010).
Dondorp, A. M. et al. Artesunate versus quinine in the treatment of severe falciparum malaria in African children (AQUAMAT): an open-label, randomised trial. Lancet 376, 1647–1657 (2010).
Marsh, K. et al. Indicators of life-threatening malaria in African children. N. Engl. J. Med. 332, 1399–1404 (1995).
Taylor, W. R. J., Hanson, J., Turner, G. D. H., White, N. J. & Dondorp, A. M. Respiratory manifestations of malaria. Chest 142, 492–505 (2012).
Chen, S., Zhou, Y., Chen, Y. & Gu, J. fastp: an ultra-fast all-in-one FASTQ preprocessor. Bioinformatics 34, i884–i890 (2018).
Schneider, V. A. et al. Evaluation of GRCh38 and de novo haploid genome assemblies demonstrates the enduring quality of the reference assembly. Genome Res. 27, 849–864 (2017).
Dobin, A. et al. STAR: ultrafast universal RNA-seq aligner. Bioinformatics 29, 15–21 (2013).
Anders, S., Pyl, P. T. & Huber, W. HTSeq-a Python framework to work with high-throughput sequencing data. Bioinformatics 31, 166–169 (2015).
Trapnell, C. et al. Transcript assembly and quantification by RNA-Seq reveals unannotated transcripts and isoform switching during cell differentiation. Nat. Biotechnol. 28, 511–515 (2010).
Benjamini, Y. & Hochberg, Y. Controlling the false discovery rate: a practical and powerful approach to multiple testing. J. R. Stat. Soc.: Ser. B 57, 289–300 (1995).
Wu, T. et al. clusterProfiler 4.0: A universal enrichment tool for interpreting omics data. Innovation 2, 100141 (2021).
Yu, H. H., Yang, Y. H. & Chiang, B. L. Chronic Granulomatous disease: a comprehensive review. Clin. Rev. Allergy Immunol. 61, 101–113 (2021).
Haw, R., Hermjakob, H., D’Eustachio, P. & Stein, L. Reactome pathway analysis to enrich biological discovery in proteomics data sets. Proteomics 11, 3598–3613 (2011).
Candia, J., Daya, G. N., Tanaka, T., Ferrucci, L. & Walker, K. A. Assessment of variability in the plasma 7k SomaScan proteomics assay. Sci. Rep. 12, 17147 (2022).
Somalogic. SomaScan®Assay v4.1, https://somalogic.com/wp-content/uploads/2023/03/SomaScan-Assay-v4.1-Technical-Note.pdf (2023).
Acknowledgements
The authors gratefully acknowledge the assistance of the University of New Mexico-Kenya team: Nicholas O. Ondiek, Anne A. Ong’ondo, Chrispine W. Ochieng, Everlyne A. Modi, Joan L. A. Ochieng, Duncan O. Njega, Joseph Oduor, Moses Ebungure, Moses Lokorkeju, Rodney B. Mongare, and Vincent Omanje. We are also indebted to all the parents, guardians, and children who participated in the study. The work was supported by National Institutes of Health (NIH) Research Grants R01AI130473 and R01AI51305 (PI: Dr. Douglas J Perkins), NIH Fogarty International Center Grants K43TW011581 (PI: Dr. Samuel B. Anyona), D43TW05884 (DJP, SBA) and D43TW010543 (SBA, DJP), and LANL-LDRD 20150090DR (BHM, DJP).
Author information
Authors and Affiliations
Contributions
SBA: Conducted experiments, data analyses, and co-wrote the manuscript. QC: Project supervision, technical support, data analyses, and manuscript review and editing. SAW: Data analyses and manuscript review and editing. SWO: Data analyses and manuscript review and editing. YG: Data analyses and manuscript review and editing. ER: Technical support and manuscript review and editing. IH: Project supervision, technical support, and manuscript review and editing. COO: Technical support and manuscript review and editing. CO: Project supervision, technical support, and manuscript review and editing. PDS: Project supervision and manuscript review and editing. BHM: Project supervision and manuscript review and editing. CGL: Project supervision and manuscript review and editing. KAS: Data analyses and manuscript review and editing. DJP: Designed study, project supervision, data analyses, and co-wrote manuscript.
Corresponding authors
Ethics declarations
Competing interests
The authors declare no competing interests.
Peer review
Peer review information
Nature Communications thanks Yutaka Suzuki and the other, anonymous, reviewer(s) for their contribution to the peer review of this work. A peer review file is available.
Additional information
Publisher’s note Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Supplementary information
Source data
Rights and permissions
Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/.
About this article
Cite this article
Anyona, S.B., Cheng, Q., Wasena, S.A. et al. Entire expressed peripheral blood transcriptome in pediatric severe malarial anemia. Nat Commun 15, 5037 (2024). https://doi.org/10.1038/s41467-024-48259-4
Received:
Accepted:
Published:
DOI: https://doi.org/10.1038/s41467-024-48259-4
- Springer Nature Limited