Abstract
Acute COVID-19 infection can be followed by diverse clinical manifestations referred to as Post Acute Sequelae of SARS-CoV2 Infection (PASC). Studies have shown an increased risk of being diagnosed with new-onset psychiatric disease following a diagnosis of acute COVID-19. However, it was unclear whether non-psychiatric PASC-associated manifestations (PASC-AMs) are associated with an increased risk of new-onset psychiatric disease following COVID-19. A retrospective electronic health record (EHR) cohort study of 2,391,006 individuals with acute COVID-19 was performed to evaluate whether non-psychiatric PASC-AMs are associated with new-onset psychiatric disease. Data were obtained from the National COVID Cohort Collaborative (N3C), which has EHR data from 76 clinical organizations. EHR codes were mapped to 151 non-psychiatric PASC-AMs recorded 28–120 days following SARS-CoV-2 diagnosis and before diagnosis of new-onset psychiatric disease. Association of newly diagnosed psychiatric disease with age, sex, race, pre-existing comorbidities, and PASC-AMs in seven categories was assessed by logistic regression. There were significant associations between a diagnosis of any psychiatric disease and five categories of PASC-AMs with odds ratios highest for neurological, cardiovascular, and constitutional PASC-AMs with odds ratios of 1.31, 1.29, and 1.23 respectively. Secondary analysis revealed that the proportions of 50 individual clinical features significantly differed between patients diagnosed with different psychiatric diseases. Our study provides evidence for association between non-psychiatric PASC-AMs and the incidence of newly diagnosed psychiatric disease. Significant associations were found for features related to multiple organ systems. This information could prove useful in understanding risk stratification for new-onset psychiatric disease following COVID-19. Prospective studies are needed to corroborate these findings.
Similar content being viewed by others
Introduction
Severe acute respiratory syndrome coronavirus 2 (SARS-CoV-2), the virus that causes coronavirus disease 2019 (COVID-19), is responsible for over 650 million cases with 6 million deaths worldwide [1]. Many patients experience manifestations that persist following acute COVID-19 or with onset after the acute period, affecting various organ systems [2, 3]. These manifestations, when not explained by another cause, are part of the broader syndrome of Post Acute Sequelae of SARS-CoV2 Infection (PASC; colloquially referred to as Long COVID). The rate of newly diagnosed psychiatric disease has been found to be significantly increased in patients following COVID-19 infection. The most significant risk has been demonstrated for anxiety disorders, with hazard ratios ranging from 1.3-2.1 [4,5,6,7,8]. In a prior study, we found that this risk was only significant in early post-acute phase (28-120 days following COVID-19 diagnosis), reflecting a timeframe when these psychiatric sequelae are most likely to be diagnosed [5]. A smaller study using electronic health record (EHR) data from a Japanese cohort similarly showed that COVID-19 patients were more likely to receive a diagnosis of psychiatric disease one to three months after COVID-19 compared to controls with influenza or respiratory tract infections [9]. These findings have major public health ramifications, creating a need to further characterize the risk of newly diagnosed psychiatric disease following COVID-19.
One relevant question is how psychiatric sequelae relate to other manifestations of PASC. Acute COVID-19 may be characterized by neurological manifestations such as confusion, stroke, and neuromuscular disorders. Pathomechanisms include viral neuroinvasion, immune activation and inflammation within the central nervous system (CNS), and endotheliopathy associated with blood–brain barrier dysfunction, which could additionally lead to psychiatric manifestations through diverse pathophysiological mechanisms [10]. Following acute COVID-19, some patients exhibit manifestations including fatigue, headache, difficulty concentrating, cognitive impairment, anxiety and mood disorders, and dysautonomia [11]. The pathobiology of these post-COVID manifestations are thought to be the result of a combination of delayed recovery from inflammation, viral persistence, and autoimmunity resulting from infection [12]. The relationship between newly diagnosed psychiatric sequelae and other PASC manifestations has not been well characterized. One challenge in understanding the pathogenesis of the psychiatric manifestations of PASC is the fact that it is not a single, well-delineated disease; instead, PASC has multiple distinct pathogenetic mechanisms that may underlie different psychiatric manifestations [13].
In this study, we performed a detailed analysis of 151 PASC-associated manifestations (PASC-AMs) encoded using terms of the Human Phenotype Ontology (HPO), which is widely used to support differential diagnosis and translational research in human genetics [14]. We analyzed 2,391,006 patients for whom at least 120 days of follow-up data were available after acute COVID-19 in multicenter EHR data from the National COVID Cohort Collaborative (N3C) [15]. We show significant associations between new-onset psychiatric disease and PASC-AMs from five organ systems.
Methods
Study population and data sources
In this retrospective cohort study, we examined the association between PASC-AMs and diagnosis of new-onset psychiatric disease and four subcategories: anxiety disorder, depressive disorder, psychosis, and substance abuse disorder following acute COVID-19. We used patient data accessed through the N3C Data Enclave [15]. N3C has integrated EHRs from 76 clinical organizations in the United States. We analyzed N3C data frozen on November 3, 2023, which comprised records for 8 million COVID-19 positive patients. Data included over 30 billion rows of data and 21 million patients. Data were collected from the clinical organizations, normalized to the Observational Medical Outcomes Partnership (OMOP) 5.3.1 vocabulary [16], then de-identified and made available to participating N3C research institutions. The study was exempted by the Institutional Review Board (IRB) at the Jackson Laboratory under 45 CFR 46.101(b) (Common Rule). The N3C data transfer to the National Center for Advancing Translational Sciences (NCATS) is performed under a Johns Hopkins University Reliance Protocol # IRB00249128 or individual site agreements with NIH. The N3C Data Enclave is managed under the authority of the NIH; information can be found at https://ncats.nih.gov/n3c/resources.
Exposures and outcomes
Clinical data including comorbidities, medications, and outcomes were identified using concept identifiers in the OMOP common data model. Demographics, laboratory values, COVID-19 status, and psychiatric diagnoses were collected for each patient.
Patients were included in the primary analysis if SARS-CoV-2 was detected by polymerase chain reaction (PCR) or antigen test after January 1, 2020. Patients with a history of any psychiatric disease prior to 28 days after COVID-19 diagnosis and patients without a medical record covering at least a year prior to and 120 days after COVID-19 diagnosis were excluded from this analysis (Supplemental Fig. S1).
We identified patients with a diagnosis of any psychiatric disease and subcategories of anxiety disorder, depressive disorder, psychosis, and substance abuse using diagnostic codes defined by OMOP. We treated each category of psychiatric disease as an outcome. Psychiatric outcomes were considered if they were first diagnosed in the period of 28 to 120 days following COVID-19 diagnosis (Fig. 1, Supplemental Fig. S1).
We selected all patients with a positive PCR or antigen test for COVID-19 who had records extending greater than 1 year prior to COVID-19 infection and at least 120 days after diagnosis of COVID-19. Patients with any record of psychiatric disease prior to the post-COVID phase (28 days after diagnosis) were removed. For each outcome, patients with more than one outcome (anxiety disorder, depressive disorder, psychosis, and substance abuse) whose first psychiatric diagnosis was not the outcome of interest were excluded from the analysis. We report the number of patients included with the outcome (n) and the total number of patients.
We studied known PASC-AMs that had been previously described using 287 Human Phenotype Ontology (HPO) terms [2, 17]. We mapped 176 of these terms to terms in the OMOP data model using the OMOP2OBO map** algorithm, which leverages well-established automatic map** techniques and manually-derived expert-verified annotations to align concepts from each source (some HPO terms mapped to multiple OMOP terms and some HPO terms had no good OMOP counterparts) [13, 18]. We removed 15 HPO terms from our list of non-psychiatric PASC-AMS because they described psychiatric phenotypes, leaving 161 non-psychiatric HPO terms. 151 of these terms were found in the EHR data and were used for the final analysis. In this way, we were able to identify when patients experienced HPO-defined PASC-AMs using OMOP-encoded patient records. We then categorized HPO terms by affected system. These categories included cardiovascular, constitutional, endocrine, ear-nose-throat (ENT), eye, gastrointestinal, immunology, laboratory, neurological, pulmonary, and skin (Supplemental Table S2). The HPO terms were recorded in the time period from 28 days after onset of acute COVID-19 up to either the occurrence of newly diagnosed psychiatric disease or 120 days (if no diagnosis of psychiatric disease was recorded) (Supplemental Fig. S1).
In addition to PASC-AMs, we considered the effects of patient demographics, including age, race and ethnicity, sex, smoking status, BMI, visit type (inpatient or outpatient), length of stay (if applicable). We also considered pre-COVID comorbidities (Table 1). All comorbidities were defined as a binary variable indicating whether that patient had received the diagnosis of the comorbidity prior to COVID diagnosis. Only comorbidities present in more than 1% of the population were included in the final analysis to avoid biased estimates (Table 1) [19].
Statistical analysis
Data analysis was performed using Palantir Foundry (Palantir Technologies Inc., Denver, Colorado). The analysis was structured as a directed acyclic graph of data transformations. Individual transformations were implemented as nodes consisting of SQL, Python (version 3.6.10), or R (version 3.5.1) code.
To address data missingness, we applied a multiple imputation strategy and computed pooled estimates by applying Rubin’s rule [20]. BMI was the most commonly missing variable (in 58% of cases). We imputed BMI six times with the missRanger algorithm and applied a multiple imputation estimation pipeline to derive the log odds estimates. The missRanger algorithm was chosen based on a previous study comparing different multiple imputation techniques on an N3C cohort of diabetic patients [21]. We identified all HPO-defined PASC-AMs that occurred between 28 and 120 days following COVID-19 diagnosis. PASC-AMs that were first documented after or on the same day as the diagnosis of a psychiatric disease were not included in the analysis. For each patient, we counted the number of unique HPO terms in each of the eleven categories described in the preceding section. PASC-AM categories present in less than 1% of the population were not considered.
To investigate the association of PASC manifestations and other covariates with newly diagnosed psychiatric disease, we performed logistic regression using the glm function in R. The predictors included age, sex, race, pre-existing comorbidities, and the counts of manifestations in each of the eleven HPO categories. Only predictors present in more than 1% of the cohort were used. We conducted separate analyses to predict the occurrence of psychiatric disease and four subcategories: anxiety disorder, depressive disorder, psychosis, substance abuse, ADHD, and bipolar disorder. The control group was patients with no psychiatric diagnosis. For each regression, we recorded the estimated odds ratio, 95% confidence intervals, and corresponding p-value.
To better understand the temporal relationship between PASC-AMs and psychiatric diseases, we looked at the distribution of patients by time between PASC-AM and outcome. Non-exclusive groups were created for each PASC-AM category that contained any patient with the respective PASC-AM and any psychiatric outcome. Histograms were created to show the distribution across time (ranging 1-92 days) for patients in each group.
We performed a chi-squared test of independence to assess if the incidence of HPO terms in each category were significantly associated with an outcome. For each term, a contingency table was constructed containing the counts of study participants with or without the corresponding HPO annotation and with or without the corresponding new-onset psychiatric disease. For the chi-squared analysis, we removed 6,208 patients (8.1% of those with any psychiatric diagnosis) who had more than one psychiatric outcome. P-values were adjusted using Bonferroni correction.
Sensitivity analyses
PASC-AMs were only considered if they occurred prior to a diagnosis of psychiatric outcome or the 120-day cutoff. As a result, PASC-AMs were recorded over a longer period for patients without a diagnosis of a psychiatric disease. This leads to more conservative results with odds ratios that may be negatively shifted. To assess this effect, we computed the average time of receiving a psychiatric diagnosis for patients in the outcome group (72 days) and then repeated the logistic regression after excluding PASC-AMs in the control group that occurred after this time point.
Imputation was used to compensate for substantial missingness of BMI data. Since BMI is associated with risk of severe COVID-19 [22] and could therefore act as a confounder, we also performed the analysis without imputation, kee** only patients with complete data. To ascertain whether our results are consistent between inpatients and outpatients, we separated our cohorts into those who were admitted for their COVID-19 infection and those who were not.
Previous studies that investigated PASC and related manifestations have used different criteria to identify whether a manifestation is attributed to acute COVID-19 or PASC. To determine whether this had a large effect on our results, we repeated our regression analysis using the alternative definition of 42 days after initial diagnosis for outpatients or discharge for inpatients to 365 days after initial diagnosis.
In this study, we investigate the association between PASC-AMs and subsequent psychiatric disease. In our main analysis, we assume that psychiatric diseases diagnosed after PASC-AMs likewise have an onset subsequent to the date on which PASC-AMs were recorded. We performed an additional sensitivity analysis to assess whether our results could have been impacted by a greater delay between onset of psychiatric disease and diagnosis and recording of the diagnosis in the EHR. For this, we excluded PASC-AMs that were diagnosed within ten days prior to the psychiatric diagnosis, and otherwise repeated the analysis unchanged.
Role of the Funding source
The funders had no role in study design, data collection, analysis, interpretation, manuscript writing, or the decision to submit for publication. The corresponding authors had full access to all study data and had final responsibility for the decision to submit for publication.
Results
A total of 8 million patients with prior COVID-19 were assessed. We restricted analysis to patients with no previous recorded psychiatric disease and at least one year of data prior to acute COVID-19 diagnosis and 120 days after. This left 2,391,006 patients in the COVID-19 cohort. We found that 2.1% of patients had a newly diagnosed psychiatric disease in the early post-acute phase (28-120 days after COVID-19 diagnosis) (Table 1).
The outcomes of interest were psychiatric diagnoses and four subcategories: anxiety disorder, depressive disorder, psychosis, and substance abuse. To identify risk factors we exploited the hierarchical structure of the HPO to group into eleven categories the HPO terms describing clinical manifestations that may be observed in PASC [2]. We refer to these manifestations as PASC-AMs to reflect the difficulty of inferring causal relationships on the basis of EHR data. The patient’s symptomatology in each category is summarized by an integer-valued variable equal to the number of distinct HPO terms recorded in that category. Constitutional PASC-AMs were most commonly reported, being found in 4.13% of patients. A logistic regression was performed with these variables as well as binary variables derived from 25 pre-existing comorbidities and age, sex, and race. We then removed categories and variables that were recorded in less than 1% of the cohort (Table 1).
We found that there was a significant positive association with any newly diagnosed psychiatric disease for four of the seven investigated HPO categories, with the estimated odds ratio ranging from 1.15 to 1.31 (Fig. 2). Conversely, endocrine PASC-AMs were negatively associated with psychiatric diagnoses with an odds ratio of 0.93 (0.87–0.999, 95% CI). For the subcategory of anxiety disorders, the same four HPO categories and ENT manifestation showed a significant positive association with anxiety disorders. Of the seven HPO categories, three were significantly positively associated with newly diagnosed depressive disorders. Endocrine PASC-AMs had a significant negative association with new-onset depressive disorders. Fewer patients were available with newly diagnosed psychosis, but there was still a significant increase in risk for patients with neurological and constitutional manifestations for being diagnosed with psychosis (Fig. 2). There was a significant positive association with substance abuse and cardiovascular and constitutional PASC-AMs. Interestingly, substance abuse was the only outcome with a significant negative association with ENT.
Odds ratios and 95% confidence intervals for the association of features in the seven investigated HPO categories with all newly diagnosed psychiatric disease and the subcategories anxiety disorder, depressive disorder, psychosis, and substance abuse. See also Supplemental Table S2 for complete results including comorbidities and demographics.
We performed a sensitivity analysis by limiting the time course in the control group to end at the average time of diagnosis for the group with a psychiatric outcome (72 days). Results showed increased odds ratios in all outcomes resulting in no significant negative associations between outcomes and PASC-AMs (data not shown). Further sensitivity analysis of stratifying inpatients and outpatients, removing patients with any BMI missingness, and varying the definition for the early post-acute phase showed results similar to the main finding.
To understand the temporal relationship between PASC-AMs and new-onset psychiatric diseases, we selected patients with any PASC-AM and a psychiatric outcome. We then found the time (in days) between each patient’s first PASC-AM in any category and diagnosis of psychiatric disease (Fig. 3). To investigate whether main results were primarily from co-occurring PASC-AMs and psychiatric disease that were merely recorded at the same time, we repeated the main analysis excluding PASC-AMs that were recorded within ten days prior to the new-onset psychiatric disease. Results were negatively shifted from main analysis because of the shortened window for PASC-AMs to be recorded, however, some significant positive associations were retained and differences between findings of categories were largely the same, supporting the temporal relationship of PASC-AMs occurring before psychiatric disease.
Histogram showing the time between the first PASC-AM and psychiatric disease for each patient and each PASC-AM category. The X-axis shows the number of days between the first recorded PASC-AM for each patient and the recorded onset of psychiatric disease. A black line is drawn at eleven days to indicate data that was excluded in the sensitivity analysis where PASC-AMs occurring within 10 days before the first psychiatric disease were not considered.
We then investigated the distribution of individual HPO terms for the seven categories described above. We compared counts of the observed HPO terms for each category among patients with diagnoses of new-onset anxiety, depressive disorder, psychosis, and substance abuse. Statistical significance was assessed with a chi-squared test and adjusted for multiple testing. Only patients with at least one HPO term in the category were included in this analysis. To ensure that no patient was in multiple groups, we eliminated the category of all psychiatric diseases and removed patients who simultaneously presented with multiple outcomes, leaving 6,208 patients (Fig. 4, Supplemental Fig. S2 and Table S3).
Here we examine the breakdown of individual HPO phenotypic features from symptom categories. The X-axis shows the five most prevalent HPO features from the displayed category. The Y-axis shows the proportion of patients with that feature from the set patients with the indicated HPO feature category and outcome. Significance of each finding was tested using chi-squared test (Supplemental Table S3).
Discussion
In this retrospective observational study on a cohort of 2,391,006 individuals following acute COVID-19 infection, we find that constitutional, neurological, gastrointestinal, cardiovascular and ENT PASC-AMs are associated with an increased incidence of new-onset psychiatric disease.
Five studies, including our own previous work, have performed large-scale EHR analyses that show significant and consistent but modest associations between SARS-CoV-2 infection and increased rates of psychiatric disease [4, 5, 9, 23, 24]. Differences in cumulative risk between patients who had COVID-19 and those who had other respiratory tract infections persist for some time following acute infection. However, the risk of new-onset psychiatric disease decreases sharply after the first month after acute SARS-CoV-2 infection and two studies suggest the risk returns to baseline after 120 days [5, 9]. In contrast, a recent study showed that neuropsychiatric sequelae of severe COVID-19 infection were similar to those observed in other severe acute respiratory infections [8]. With the exception of altered mental status, one study showed that neuropsychiatric manifestations are similar between patients with influenza and those with SARS-CoV2 [25].
In the main analysis, endocrine PASC-AMs were associated with a lower frequency of new-onset anxiety and depressive disorders. We hypothesize that this is a result of the shorter sampling window of PASC-AMs for patients with a psychiatric diagnosis compared to controls. That is, PASC-AMs are recorded for controls for the full window of 28–120 days after COVID-19 diagnosis, whereas PASC-AMs occurring at the end of the window (after the psychiatric diagnosis) are not recorded for patients with new-onset psychiatric disease. This leads to a negative bias in the odds ratios because patients with a psychiatric outcome will have fewer PASC-AMs considered when compared to controls whose PASC-AMs occur at the same rate. Sensitivity analysis done by shortening the cutoff for including PASC-AMs in the control group shows a unanimous increase in odds ratios which removes the significant negative association between endocrine PASC-AMs and anxiety and depressive disorders. This supports the hypothesis that the varying time course explains this negative association and suggests that the positive associations we report are conservative.
Our results show that the presence of PASC-AMs in five clinical categories is associated with an increased incidence of newly diagnosed psychiatric disease following COVID-19. There are several possible explanations, including that pathophysiological mechanisms increase the risk of both new-onset psychiatric disease and PASC-AMs or that PASC-AMs themselves increase risk of newly diagnosed psychiatric disease following COVID-19. Our results show different PASC-AM category associations for anxiety disorders, depressive disorders, psychosis, and substance abuse and that individual PASC-AMs within the categories contribute to these associations for each outcome. This may suggest there are several pathophysiological mechanisms in PASC that could explain the heterogeneity of phenotypic presentation [12].
Our dataset is derived from over 76 institutions across the country with 8 million cases of COVID-19, and thus is a representative sample of the COVID-19 positive population in the United States, but inconsistent or incomplete data collection could introduce biases. N3C employs a comprehensive suite of data quality checks to mitigate this problem [26], but residual issues cannot be ruled out. Retrospective analysis of EHR data does not allow any conclusions about pathomechanisms and cannot easily detect misdiagnoses.
In summary, we have shown that the presence of PASC-AMs from any of five clinical categories is associated with increased incidence of newly diagnosed psychiatric disease. This is consistent with the association that has been described between clinical severity of several other chronic diseases (chronic heart failure, chronic kidney disease, chronic hepatitis C, and cancer) and psychiatric disease. In fact, that association is bidirectional: chronic disease is associated with increased rates of psychiatric disease, and psychiatric disease is associated with a higher risk of chronic disease occurrence, severity, or progression [27,28,29,30].
Our results have important implications for both individual and public health. Timely and accurate diagnosis of psychiatric conditions has the potential to improve the quality of life for affected individuals; it may be worth systematically testing PASC patients for psychiatric sequelae. The scope of the COVID-19 pandemic is enormous, and it is essential to gain a deeper understanding of the natural history of PASC-related psychiatric diseases and their differential diagnoses to optimize care for affected individuals and institute appropriate public health measures.
Data availability
The data presented in this paper can be accessed upon application to the NCATS N3C Data Enclave at https://covid.cd2h.org/enclave.
References
Cascella M, Rajnik M, Aleem A, Dulebohn SC, Di Napoli R. Features, evaluation, and treatment of coronavirus (COVID-19). StatPearls. Treasure Island (FL): StatPearls Publishing; 2023.
Deer RR, Rock MA, Vasilevsky N, Carmody L, Rando H, Anzalone AJ, et al. Characterizing long COVID: deep phenotype of a complex condition. EBioMedicine. 2021;74:103722.
Brodin P, Casari G, Townsend L, O’Farrelly C, Tancevski I, Löffler-Ragg J, et al. Studying severe long COVID to understand post-infectious disorders beyond COVID-19. Nat Med. 2022;28:879–82.
Taquet M, Luciano S, Geddes JR, Harrison PJ. Bidirectional associations between COVID-19 and psychiatric disorder: retrospective cohort studies of 62 354 COVID-19 cases in the USA. Lancet Psychiatry. 2020;8:130–40.
Coleman B, Casiraghi E, Blau H, Chan L, Haendel MA, Laraway B, et al. Risk of new-onset psychiatric sequelae of COVID-19 in the early and late post-acute phase. World Psychiatry. 2022;21:319–20.
Taquet M, Geddes JR, Husain M, Luciano S, Harrison PJ. 6-month neurological and psychiatric outcomes in 236 379 survivors of COVID-19: a retrospective cohort study using electronic health records. Lancet Psychiatry. 2021;8:416–27.
Castro VM, Rosand J, Giacino JT, McCoy TH, Perlis RH. Case-control study of neuropsychiatric symptoms in electronic health records following COVID-19 hospitalization in 2 academic health systems. Mol Psychiatry. 2022;27:3898–3903.
Clift AK, Ranger TA, Patone M, Coupland CAC, Hatch R, Thomas K, et al. Neuropsychiatric ramifications of severe COVID-19 and other severe acute respiratory infections. JAMA Psychiatry. 2022;79:690–698.
Murata F, Maeda M, Ishiguro C, Fukuda H. Acute and delayed psychiatric sequelae among patients hospitalised with COVID-19: a cohort study using LIFE study data. Gen Psych. 2022;35:e100802.
Spudich S, Nath A. Nervous system consequences of COVID-19. Science. 2022;375:267–9.
Stefanou M-I, Palaiodimou L, Bakola E, Smyrnis N, Papadopoulou M, Paraskevas GP, et al. Neurological manifestations of long-COVID syndrome: a narrative review. Ther Adv Chronic Dis. 2022;13:20406223221076890.
Mehandru S, Merad M. Pathological sequelae of long-haul COVID. Nat Immunol. 2022;23:194–202.
Reese JT, Blau H, Casiraghi E, Bergquist T, Loomba JJ, Callahan TJ, et al. Generalisable long COVID subtypes: findings from the NIH N3C and RECOVER programmes. EBioMedicine. 2023;87:104413.
Robinson PN, Köhler S, Bauer S, Seelow D, Horn D, Mundlos S. The Human Phenotype Ontology: a tool for annotating and analyzing human hereditary disease. Am J Hum Genet. 2008;83:610–5.
Haendel MA, Chute CG, Bennett TD, Eichmann DA, Guinney J, Kibbe WA, et al. The National COVID Cohort Collaborative (N3C): Rationale, design, infrastructure, and deployment. J Am Med Inform Assoc. 2021;28:427–43.
FitzHenry F, Resnic FS, Robbins SL, Denton J, Nookala L, Meeker D, et al. Creating a common data model for comparative effectiveness with the observational medical outcomes partnership. Appl Clin Inform. 2015;6:536–47.
Köhler S, Gargano M, Matentzoglu N, Carmody LC, Lewis-Smith D, Vasilevsky NA, et al. The Human Phenotype Ontology in 2021. Nucleic Acids Res. 2021;49:D1207–17.
Callahan TJ, Stefanski AL, Wyrwa JM, Zeng C, Ostropolets A, Banda JM, et al. Ontologizing health systems data at scale: making translational discovery a reality. NPJ Digit Med. 2023;6:89.
Doerken S, Avalos M, Lagarde E, Schumacher M. Penalized logistic regression with low prevalence exposures beyond high dimensional settings. PLoS ONE. 2019;14:e0217057.
Rubin DB. The Calculation of Posterior Distributions by Data Augmentation: Comment: A Noniterative Sampling/Importance Resampling Alternative to the Data Augmentation Algorithm for Creating a Few Imputations When Fractions of Missing Information Are Modest: The SIR Algorithm. J Am Stat Assoc. 82, 543–6.
Casiraghi E, Wong R, Hall M, Coleman B, Notaro M, Evans MD, et al. A methodological framework for the comparative evaluation of multiple imputation methods: multiple imputation of race, ethnicity and body mass index in the U.S. National COVID Cohort Collaborative. J Biomed Inform. 2023;139:104295.
Gao M, Piernas C, Astbury NM, Hippisley-Cox J, O’Rahilly S, Aveyard P, et al. Associations between body-mass index and COVID-19 severity in 6·9 million people in England: a prospective, community-based, cohort study. Lancet Diabetes Endocrinol. 2021;9:350–9.
Taquet M, Dercon Q, Luciano S, Geddes JR, Husain M, Harrison PJ. Incidence, co-occurrence, and evolution of long-COVID features: A 6-month retrospective cohort study of 273,618 survivors of COVID-19. PLoS Med. 2021;18:e1003773.
**e Y, Xu E, Al-Aly Z. Risks of mental health outcomes in people with covid-19: cohort study. BMJ. 2022;376:e068993.
Iosifescu AL, Hoogenboom WS, Buczek AJ, Fleysher R, Duong TQ. New-onset and persistent neurological and psychiatric sequelae of COVID?19 compared to influenza: A retrospective cohort study in a large New York City healthcare network. Int J Methods Psychiatr Res. 2022;31:e1914.
Pfaff ER, Girvin AT, Gabriel DL, Kostka K, Morris M, Palchuk M, et al. Synergies between centralized and federated approaches to data quality: a report from the National COVID cohort collaborative. J Am Med Inform Assoc. 2022;29:609–618.
Celano CM, Villegas AC, Albanese AM, Gaggin HK, Huffman JC. Depression and anxiety in heart failure: a review. Harv Rev Psychiatry. 2018;26:175–84.
Shirazian S, Grant CD, Aina O, Mattana J, Khorassani F, Ricardo AC. Depression in chronic kidney disease and end-stage renal disease: similarities and differences in diagnosis, epidemiology, and management. Kidney Int Rep. 2017;2:94–107.
Huang X, Liu X, Yu Y. Depression and chronic liver diseases: are there shared underlying mechanisms? Front Mol Neurosci. 2017;10:134.
Smith HR. Depression in cancer patients: pathogenesis, implications and treatment (Review). Oncol Lett. 2015;9:1509–14.
Acknowledgements
The analyses described in this publication were conducted with data or tools accessed through the NCATS N3C Data Enclave (covid.cd2h.org/enclave) and supported by CD2H—The National COVID Cohort Collaborative (N3C) IDeA CTR Collaboration (3U24TR002306-04S2 NCATS U24 TR002306). This research was possible because of the patients whose information is included within the data from participating organizations (covid.cd2h.org/dtas) and the organizations and scientists (covid.cd2h.org/duas) who have contributed to the ongoing development of this community resource [15]. This work is partially supported by PNRR-PE-AI FAIR project funded by the NextGeneration EU program. We gratefully acknowledge the following core contributors to N3C: Adam B. Wilcox, Adam M. Lee, Alexis Graves, Alfred (Jerrod) Anzalone, Amin Manna, Amit Saha, Amy Olex, Andrea Zhou, Andrew E. Williams, Andrew Southerland, Andrew T. Girvin, Anita Walden, Anjali A. Sharathkumar, Benjamin Amor, Benjamin Bates, Brian Hendricks, Brijesh Patel, Caleb Alexander, Carolyn Bramante, Cavin Ward-Caviness, Charisse Madlock-Brown, Christine Suver, Christopher Chute, Christopher Dillon, Chunlei Wu, Clare Schmitt, Cliff Takemoto, Dan Housman, Davera Gabriel, David A. Eichmann, Diego Mazzotti, Don Brown, Eilis Boudreau, Elaine Hill, Elizabeth Zampino, Emily Carlson Marti, Emily R. Pfaff, Evan French, Farrukh M Koraishy, Federico Mariona, Fred Prior, George Sokos, Greg Martin, Harold Lehmann, Heidi Spratt, Hemalkumar Mehta, Hongfang Liu, Hythem Sidky, J.W. Awori Hayanga, Jami Pincavitch, Jaylyn Clark, Jeremy Richard Harper, Jessica Islam, ** Ge, Joel Gagnier, Joel H. Saltz, Joel Saltz, Johanna Loomba, John Buse, Jomol Mathew, Joni L. Rutter, Julie A. McMurry, Justin Guinney, Justin Starren, Karen Crowley, Katie Rebecca Bradwell, Kellie M. Walters, Ken Wilkins, Kenneth R. Gersing, Kenrick Dwain Cato, Kimberly Murray, Kristin Kostka, Lavance Northington, Lee Allan Pyles, Leonie Misquitta, Lesley Cottrell, Lili Portilla, Mariam Deacy, Mark M. Bissell, Marshall Clark, Mary Emmett, Mary Morrison Saltz, Matvey B. Palchuk, Melissa A. Haendel, Meredith Adams, Meredith Temple-O’Connor, Michael G. Kurilla, Michele Morris, Nabeel Qureshi, Nasia Safdar, Nicole Garbarini, Noha Sharafeldin, Ofer Sadan, Patricia A. Francis, Penny Wung Burgoon, Peter Robinson, Philip R.O. Payne, Rafael Fuentes, Randeep Jawa, Rebecca Erwin-Cohen, Rena Patel, Richard A. Moffitt, Richard L. Zhu, Rishi Kamaleswaran, Robert Hurley, Robert T. Miller, Saiju Pyarajan, Sam G. Michael, Samuel Bozzette, Sandeep Mallipattu, Satyanarayana Vedula, Scott Chapman, Shawn T. O’Neil, Soko Setoguchi, Stephanie S. Hong, Steve Johnson, Tellen D. Bennett, Tiffany Callahan, Umit Topaloglu, Usman Sheikh, Valery Gordon, Vignesh Subbian, Warren A. Kibbe, Wenndy Hernandez, Will Beasley, Will Cooper, William Hillegass, **aohan Tanner Zhang. Details of contributions available at covid.cd2h.org/core-contributors. The content is solely the responsibility of the authors and does not necessarily represent the official views of the National Institutes of Health or the N3C program. The following institutions whose data is released or pending: Available: Advocate Health Care Network—UL1TR002389: The Institute for Translational Medicine (ITM) • Boston University Medical Campus—UL1TR001430: Boston University Clinical and Translational Science Institute • Brown University—U54GM115677: Advance Clinical Translational Research (Advance-CTR) • Carilion Clinic—UL1TR003015: iTHRIV Integrated Translational health Research Institute of Virginia • Charleston Area Medical Center—U54GM104942: West Virginia Clinical and Translational Science Institute (WVCTSI) • Children’s Hospital Colorado — UL1TR002535: Colorado Clinical and Translational Sciences Institute • Columbia University Irving Medical Center—UL1TR001873: Irving Institute for Clinical and Translational Research • Duke University—UL1TR002553: Duke Clinical and Translational Science Institute • George Washington Children’s Research Institute — UL1TR001876: Clinical and Translational Science Institute at Children’s National (CTSA-CN) • George Washington University — UL1TR001876: Clinical and Translational Science Institute at Children’s National (CTSA-CN) • Indiana University School of Medicine — UL1TR002529: Indiana Clinical and Translational Science Institute • Johns Hopkins University — UL1TR003098: Johns Hopkins Institute for Clinical and Translational Research • Loyola Medicine — Loyola University Medical Center • Loyola University Medical Center — UL1TR002389: The Institute for Translational Medicine (ITM) • Maine Medical Center — U54GM115516: Northern New England Clinical & Translational Research (NNE-CTR) Network • Massachusetts General Brigham — UL1TR002541: Harvard Catalyst • Mayo Clinic Rochester — UL1TR002377: Mayo Clinic Center for Clinical and Translational Science (CCaTS) • Medical University of South Carolina — UL1TR001450: South Carolina Clinical & Translational Research Institute (SCTR) • Montefiore Medical Center — UL1TR002556: Institute for Clinical and Translational Research at Einstein and Montefiore • Nemours — U54GM104941: Delaware CTR ACCEL Program • NorthShore University HealthSystem — UL1TR002389: The Institute for Translational Medicine (ITM) • Northwestern University at Chicago — UL1TR001422: Northwestern University Clinical and Translational Science Institute (NUCATS) • OCHIN — INV-018455: Bill and Melinda Gates Foundation grant to Sage Bionetworks • Oregon Health & Science University — UL1TR002369: Oregon Clinical and Translational Research Institute • Penn State Health Milton S. Hershey Medical Center — UL1TR002014: Penn State Clinical and Translational Science Institute • Rush University Medical Center — UL1TR002389: The Institute for Translational Medicine (ITM) • Rutgers, The State University of New Jersey — UL1TR003017: New Jersey Alliance for Clinical and Translational Science • Stony Brook University — U24TR002306 • The Ohio State University — UL1TR002733: Center for Clinical and Translational Science • The State University of New York at Buffalo — UL1TR001412: Clinical and Translational Science Institute • The University of Chicago — UL1TR002389: The Institute for Translational Medicine (ITM) • The University of Iowa — UL1TR002537: Institute for Clinical and Translational Science • The University of Miami Leonard M. Miller School of Medicine — UL1TR002736: University of Miami Clinical and Translational Science Institute • The University of Michigan at Ann Arbor — UL1TR002240: Michigan Institute for Clinical and Health Research • The University of Texas Health Science Center at Houston — UL1TR003167: Center for Clinical and Translational Sciences (CCTS) • The University of Texas Medical Branch at Galveston — UL1TR001439: The Institute for Translational Sciences • The University of Utah — UL1TR002538: Uhealth Center for Clinical and Translational Science • Tufts Medical Center — UL1TR002544: Tufts Clinical and Translational Science Institute • Tulane University — UL1TR003096: Center for Clinical and Translational Science • University Medical Center New Orleans — U54GM104940: Louisiana Clinical and Translational Science (LA CaTS) Center • University of Alabama at Birmingham — UL1TR003096: Center for Clinical and Translational Science • University of Arkansas for Medical Sciences — UL1TR003107: UAMS Translational Research Institute • University of Cincinnati — UL1TR001425: Center for Clinical and Translational Science and Training • University of Colorado Denver, Anschutz Medical Campus — UL1TR002535: Colorado Clinical and Translational Sciences Institute • University of Illinois at Chicago — UL1TR002003: UIC Center for Clinical and Translational Science • University of Kansas Medical Center — UL1TR002366: Frontiers: University of Kansas Clinical and Translational Science Institute • University of Kentucky — UL1TR001998: UK Center for Clinical and Translational Science • University of Massachusetts Medical School Worcester — UL1TR001453: The UMass Center for Clinical and Translational Science (UMCCTS) • University of Minnesota — UL1TR002494: Clinical and Translational Science Institute • University of Mississippi Medical Center—U54GM115428: Mississippi Center for Clinical and Translational Research (CCTR) • University of Nebraska Medical Center—U54GM115458: Great Plains IDeA-Clinical & Translational Research • University of North Carolina at Chapel Hill—UL1TR002489: North Carolina Translational and Clinical Science Institute • University of Oklahoma Health Sciences Center—U54GM104938: Oklahoma Clinical and Translational Science Institute (OCTSI) • University of Rochester—UL1TR002001: UR Clinical & Translational Science Institute • University of Southern California—UL1TR001855: The Southern California Clinical and Translational Science Institute (SC CTSI) • University of Vermont—U54GM115516: Northern New England Clinical & Translational Research (NNE-CTR) Network • University of Virginia—UL1TR003015: iTHRIV Integrated Translational health Research Institute of Virginia • University of Washington—UL1TR002319: Institute of Translational Health Sciences • University of Wisconsin-Madison — UL1TR002373: UW Institute for Clinical and Translational Research • Vanderbilt University Medical Center—UL1TR002243: Vanderbilt Institute for Clinical and Translational Research • Virginia Commonwealth University—UL1TR002649: C. Kenneth and Dianne Wright Center for Clinical and Translational Research • Wake Forest University Health Sciences—UL1TR001420: Wake Forest Clinical and Translational Science Institute • Washington University in St. Louis—UL1TR002345: Institute of Clinical and Translational Sciences • Weill Medical College of Cornell University—UL1TR002384: Weill Cornell Medicine Clinical and Translational Science Center • West Virginia University—U54GM104942: West Virginia Clinical and Translational Science Institute (WVCTSI). Submitted: Icahn School of Medicine at Mount Sinai—UL1TR001433: ConduITS Institute for Translational Sciences • The University of Texas Health Science Center at Tyler—UL1TR003167: Center for Clinical and Translational Sciences (CCTS) • University of California, Davis—UL1TR001860: UCDavis Health Clinical and Translational Science Center • University of California, Irvine—UL1TR001414: The UC Irvine Institute for Clinical and Translational Science (ICTS) • University of California, Los Angeles—UL1TR001881: UCLA Clinical Translational Science Institute • University of California, San Diego—UL1TR001442: Altman Clinical and Translational Research Institute • University of California, San Francisco—UL1TR001872: UCSF Clinical and Translational Science Institute. Pending: Arkansas Children’s Hospital — UL1TR003107: UAMS Translational Research Institute • Baylor College of Medicine—None (Voluntary) • Children’s Hospital of Philadelphia — UL1TR001878: Institute for Translational Medicine and Therapeutics • Cincinnati Children’s Hospital Medical Center—UL1TR001425: Center for Clinical and Translational Science and Training • Emory University—UL1TR002378: Georgia Clinical and Translational Science Alliance • HonorHealth—None (Voluntary) • Loyola University Chicago—UL1TR002389: The Institute for Translational Medicine (ITM) • Medical College of Wisconsin—UL1TR001436: Clinical and Translational Science Institute of Southeast Wisconsin • MedStar Health Research Institute—UL1TR001409: The Georgetown-Howard Universities Center for Clinical and Translational Science (GHUCCTS) • MetroHealth—None (Voluntary) • Montana State University—U54GM115371: American Indian/Alaska Native CTR • NYU Langone Medical Center—UL1TR001445: Langone Health’s Clinical and Translational Science Institute • Ochsner Medical Center—U54GM104940: Louisiana Clinical and Translational Science (LA CaTS) Center • Regenstrief Institute—UL1TR002529: Indiana Clinical and Translational Science Institute • Sanford Research—None (Voluntary) • Stanford University—UL1TR003142: Spectrum: The Stanford Center for Clinical and Translational Research and Education • The Rockefeller University—UL1TR001866: Center for Clinical and Translational Science • The Scripps Research Institute—UL1TR002550: Scripps Research Translational Institute • University of Florida—UL1TR001427: UF Clinical and Translational Science Institute • University of New Mexico Health Sciences Center—UL1TR001449: University of New Mexico Clinical and Translational Science Center • University of Texas Health Science Center at San Antonio—UL1TR002645: Institute for Integration of Medicine and Science • Yale New Haven Hospital—UL1TR001863: Yale Center for Clinical Investigation
Funding
NCATS U24 TR002306, DE-AC02-05CH11231., the Office of Science, Office of Basic Energy Sciences of the U.S. Department of Energy Contract No. DE-AC02-05CH11231 and NIH-NHGRI 5U24HG011449.
Author information
Authors and Affiliations
Contributions
Dr Robinson and Dr Reese had full access to all of the data in the study and take responsibility for the integrity of the data and the accuracy of the data analysis. Concept and design: Coleman, Casiraghi, Haendel, Reese, Robinson. Acquisition, analysis, or interpretation of data: Coleman, Casiraghi, Callahan, Laraway, Haendel, Reese, Robinson. Drafting of the manuscript: Coleman, Casiraghi, Haendel, Reese, Robinson. Critical revision of the manuscript for important intellectual content: Coleman, Casiraghi, Blau, Chan, Davis, Clark, Seltzer, Valentini, Haendel, Reese, Robinson. Statistical analysis: Coleman, Casiraghi, Wilkins, Reese, Robinson. Obtained funding: Haendel, Robinson. Supervision: Reese, Robinson.
Corresponding authors
Ethics declarations
Competing interests
The authors declare no competing interests.
Additional information
Publisher’s note Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Supplementary information
Rights and permissions
Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/.
About this article
Cite this article
Coleman, B., Casiraghi, E., Callahan, T.J. et al. Association of post-COVID phenotypic manifestations with new-onset psychiatric disease. Transl Psychiatry 14, 246 (2024). https://doi.org/10.1038/s41398-024-02967-z
Received:
Revised:
Accepted:
Published:
DOI: https://doi.org/10.1038/s41398-024-02967-z
- Springer Nature Limited