Background

The global increase in prevalence and incidence of dementia—mainly due to prolonged life expectancy—carries substantial individual, societal, and economic burden [1]. With currently no available cure, development of potential intervention is reliant on accurate identification of asymptomatic individuals at high risk of being in the preclinical phase of the disease [2]. Multiple risk factors for all-cause dementia and Alzheimer’s disease (AD) have been identified, ranging from non-modifiable factors such as genetics (e.g., APOE e4) to modifiable medical (e.g., cardiovascular health) and environmental influences (e.g., level of education) [3,4,5]. These risk factors can be used to estimate someone’s individual probability of develo** all-cause dementia or AD over a specified time through multivariable prognostic modeling. Accurate prognostic models can inform individuals and health professionals about individualized dementia risk, support personalized care, and aid selection of high-risk individuals for clinical trials and prevention [6,7,8].

Development of new models for dementia prediction has flourished across the past decades [9,10,11,12]. However, the majority of models has not been validated in a dataset other than the one it was developed in [10]. Model development and internal validation in one dataset typically lead to opportunistic predictive performance [13]. External validation of a model’s ability to differentiate who does and does not develop dementia (i.e., discrimination) and the model’s agreement between predicted and observed risks (i.e., calibration) in an independent dataset can quantify optimism due to overfitting or statistical modeling limitations [13]. Moreover, evaluating predictive models in another setting (e.g., health care, geographical, or cultural) than the one it was developed in, also called ‘transportability,’ can assess broader applicability of the model [14].

As external validation is essential to evaluate the generalizability and transportability of a prediction model, its lack thereof forms a major limitation for using prognostic scores in clinical practice and research [6]. This study aimed to evaluate the external performance of prediction models for all-cause dementia or AD in older adults that were developed for prediction horizons of 5–10 years. We validated these models in an independent population-based cohort: The Age, Gene/Environment Susceptibility—Reykjavik Study (AGES-RS).

Methods

Literature search, selection criteria, and screening process

Previous work on prediction models of dementia has been summarized in four extensive systematic reviews that cover prediction models generated up until April 1, 2018 [9,10,11,12]. Additionally, we searched PubMed for models published between January 1, 2018 and April 1, 2020 using the following search string: (model OR models) AND (risk) AND (dementia OR Alzheimer) AND (predict* AND (develop* OR create*)) AND (ROC OR (c statistic) OR (c-statistic) OR AUC OR (area under the curve)). This search string incorporated search terms as reported by Hou et al. [12] and Stephan et al. [9] to identify relevant results in a similar way as previous systematic reviews—the reviews by Tang et al. [10, 11] did not report search terms.

We also searched the reference lists from the relevant publications identified in this electronic search. From the systematic reviews, we screened every model that was included. In the electronic search, we filtered based on title and abstract whether a study investigated a prediction model for risk of dementia or AD in non-demented individuals.

A study was eligible when (1) a prediction model was developed in adults ≥ 65 years from the general population without dementia at baseline, (2) the outcome of the prediction model was all-cause dementia or AD, (3) a prediction model included at least age (i.e., the largest risk factor for dementia [15]) as a predictor, and (4) the area under the ROC curve (AUC or concordance (c) statistic) in the development cohort was at least 0.70 (values of 0.70–0.80 are acceptable, values of 0.80–0.90 good [16]).

We were unable to externally validate prediction models that were presented without regression coefficients and/or risk scores (i.e., absolute risk) per predictor. Furthermore, we could only calibrate models that included an intercept or baseline hazard. When risk estimates were presented, models without intercept or baseline hazard were recalibrated by estimating their intercept or baseline hazard. We could not validate models for which the predictors or an appropriate proxy was not available in our validation cohort AGES-RS.

In total, we identified 17 all-cause dementia and AD prediction models for external validation (Table 1). A flowchart for model selection is presented in Fig. 1.

Table 1 Selected prediction models for validation, description of their development cohorts, and development and validation c statistics
Fig. 1
figure 1

Flowchart of selection of prediction models for risk of all-cause dementia and/or AD in older adults for external validation

Independent validation cohort

AGES-RS is a prospective population-based cohort study aimed at investigating risk factors for conditions of old age; the cohort, recruitment, design, and procedures are described in detail elsewhere [17]. In short, AGES-RS is a continuation of the Reykjavik Study, which in 1967 recruited a random sample of 30,795 men and women living in Reykjavik who were born between 1907 and 1935. Between 2002 and 2006, AGES-RS randomly selected 5764 surviving participants from the original Reykjavik Study sample. A follow-up exam (AGES-RS2) was conducted five years later from 2007 to 2011 and included 3316 participants. Event status of participants (diagnosis of dementia or death) was continuously followed-up through nursing home records, medical records, and death certificates until October 2015.

Incident dementia status was assessed at AGES-RS2 following a 3-stage process, which was similar to the ascertainment of prevalent dementia in AGES-RS. The process included (1) a cognitive screening of the total sample; (2) a detailed neuropsychological exam in individuals who screened positive, and a further neurologic and proxy exam in the subset of individuals who also screened positive on test results of the detailed exam; (3) a consensus conference where a neurologist, geriatrician, neuropsychologist and radiographer reviewed relevant data. Additional cases were identified through comprehensive nursing home records, as well as medical records and death certificates following international guidelines for dementia diagnosis. When the individual moved into a nursing home, date of diagnosis was based on the intake exam into the nursing home (all-cause and AD diagnosis); additional incident dementia cases were identified in the nursing home following a standardized protocol followed by all Icelandic nursing homes [18].

The time at risk was measured from entry in the AGES-RS study to diagnosis of dementia, date of death, or end of follow-up (October 2015), whichever came first. In order to account for interval censoring, the time to diagnosis of dementia was calculated as half-way between the last record that reported a diagnosis without dementia and the first record that yielded a diagnosis of all-cause dementia or AD.

Of the 5764 participants at baseline, there were 421 cases of prevalent dementia; as such, our analytic sample of non-demented older adults from AGES-RS was 5343. During 45,021 person-years of follow-up (average follow-up time per person = 8.43 years) 1099 participants developed dementia, and during 40,917 person-years of follow-up 492 participants developed AD.

AGES-RS was approved by the Icelandic National Bioethics Committee (VSN: 00–063), the Icelandic Data Protection Authority (Iceland), and by the Institutional Review Board for the National Institute on Aging, National Institutes of Health (USA). Written informed consent was obtained from all participants.

Statistical analysis

Participant characteristics in AGES-RS were analyzed using descriptive statistics. Variables in the selected prediction models were matched to variables measured in AGES-RS, or if not available, a proxy variable was used (description of proxies is available in the Supplement). Table 2 lists the predictors in every model. To calculate the risk of dementia for each individual in AGES-RS, we applied the original regression equation for 13 of the 17 models; for 4 models, all from the same report [19], we validated the risk score chart.

Table 2 Predictors per model

Discrimination of each model was calculated with a c statistic with 95% confidence interval for 6-year or 10-year risk following the model’s original prediction horizon. Due to the timing of the follow-up assessment, a larger number of cases received an updated diagnosis at si× years rather than at 5 years; therefore, models developed for 5-year risk were validated at 6 years of follow-up for prediction accuracy.

We produced calibration plots to assess the relationship between predicted and e×pected probabilities, with predicted risk divided into five groups. In a calibration plot with grouped observations, more spread between the groups indicates better model performance than less spread [20]. Observed risks in logistic and Co× models were calculated with Kaplan–Meier estimates, and those in Fine & Gray models with cumulative incidence.Footnote 1 The average slope was calculated by using the slope (observed risk divided by predicted risk) across each of the four segments between the cut points. Models that presented a visual mismatch between the predicted and observed risks during calibration were recalibrated by re-estimating the intercept/baseline hazard and calibration slope by multiplying all coefficients with the same estimated factor [21]. No calibration-in-the-large was performed, which only re-estimates the intercept/baseline hazard, as visual inspection indicated the slope also needed to be updated.

For models of which the intercept or baseline hazard was reported, calibration was assessed using the known intercept or baseline hazard to calculate predicted risks. For models of which the intercept or baseline hazard was not reported, the intercept or baseline hazard was first estimated in AGES-RS to calculate predicted probabilities for recalibration. A model’s intercept was estimated by regressing the linear predictor as an offset variable on the event outcome using logistic regression. A model’s baseline hazard was obtained by estimating the baseline survival curve using a Cox model. The models by Anstey et al. [19] were based on risk scores obtained from previous literature, and no predicted risks corresponding to the obtained score were provided; therefore, these models could not be (re)calibrated.

This report is in accordance with the TRIPOD statement (Transparent Reporting of a Multivariable Prediction Model for Individual Prognosis or Diagnosis) [22]. Analyses were performed in R Version 3.5.1; missing data in AGES-RS were handled with multiple imputation analyses in R using the package mice [23].

Results

Validation population and occurrence of outcome

Descriptive characteristics of the 5343 individuals without dementia at baseline in AGES-RS are presented in Table 3. The mean observed risk of dementia was 10.1% [9.2–11.0] for 6 years and 22.8% [21.5–24.1] for 10 years; the risk for AD was 5.6% [4.9–6.3] and 12.1% [11.0–13.2], respectively.

Table 3 Validation sample characteristics of all variables included in at least one model

Models ranged from 2 to 12 predictors (Table 2); predictors could be categorized as demographics, medical history, genetics, anatomical characteristics, cognition, functional, lifestyle, social characteristics, and depression. Table 3 includes the percentage of missing data for each predictor variable in AGES-RS.

Discrimination and calibration: all-cause dementia

Table 1 and Fig. 2 present the validated models for dementia and their c statistics with confidence intervals. Figure 3a shows calibration plots including slopes. Among the seven 5/6-year models with all-cause dementia as an outcome, c statistics ranged from 0.68 to 0.80; a value of 0.80 was obtained by the models by Hogan et al. [24] and 2009 Barnes et al. [25] Calibration was deemed good for the models by Hogan et al. [24], Li et al. [26], and Licher et al. [27], while the 2009 model by Barnes et al. [25] overestimated the predicted risk. Recalibration of the 2009 model by Barnes et al. [25] improved agreement between observed and predicted risks. Recalibration due to a missing intercept/baseline hazard yielded reasonable calibration for the 2014 model by Barnes et al. [28], although risk was overestimated for the highest risk group.

Fig. 2
figure 2

Distribution of c statistics across studies with 95% confidence intervals (*model was developed for more than 1 outcome or time horizon)

Fig. 3
figure 3figure 3

Calibration plots of (re)calibrated prognostic models to predict risk of (a) all-cause dementia and (b) Alzheimer’s disease; an intercept of 0 and slope of 1 (i.e., the diagonal line) represents ideal calibration and more spread between the groups indicates better model performance than less spread—error bars in grouped observations represent 95% confidence intervals; Q = quartile

Among the four 10-year models with all-cause dementia as an outcome, c statistics ranged from 0.70 by the model by Li et al. [26] to 0.77 by the model by 2010 Tierney et al. [29]. Calibration for 10-year models was worse compared to the 5/6-year models. Risks were overestimated by the models by Downer et al. [30], Li et al. [26], and Licher et al. [27], particularly in the highest risk group. The 2010 model by Tierney et al. [29] systematically underestimated predicted risks. Recalibration of the model by Li et al. [26] hardly improved the agreement between observed and predicted risks.

Discrimination & calibration: Alzheimer’s disease

C statistics for models with AD outcome including confidence intervals are presented in Table 1, and calibration plots including slopes are shown in Fig. 3b. Among the three 5/6-year models with AD as an outcome, c statistics were 0.67 for the model by Anstey et al. [19] that used only common variables, 0.71 for the full model by Anstey et al. [19], and 0.81 for the model by Mura et al. [31]. Only the model by Mura et al. [31] could be calibrated, and showed systematic overestimation of predicted risk compared to observed risk; recalibration improved this relationship but resulted in some degree of underestimation in the highest risk group.

Among the three 10-year models with AD as an outcome, c statistics were 0.70 for the simple model by Verhaaren et al. [32] (a) that included age + sex/gender, 0.73 for the model by Verhaaren et al. [32] (b) that included the simple model + APOE e4, and 0.76 for the 2005 model by Tierney et al. [33]. Calibration of the 2005 model by Tierney et al. [33] and recalibration due to missing baseline hazards of the models by Verhaaren et al. [32] showed that all three models overestimated predicted risk, particularly in the highest risk group. Recalibration of the 2005 model by Tierney et al. [33] did not improve the relationship between predicted and observed risks, resulting in systematic underestimation of predicted risk.

Patterns among predictors

Hierarchically sorting the c-statistics from best discriminatory ability to worse revealed that all models with an objective cognitive test obtained c statistics ≥ 0.76 (n = 5), all models without ≤ 0.75 (n = 12) (Supplementary Table S1). Additionally, the two models by Licher et al. [27] with subjective memory complaints ranked highest after the models with objective cognition (c statistics 0.75 and 0.74)—models without any cognitive measure ranked ≤ 0.73. No clear patterns emerged among other categories of predictors. Additionally, no pattern was observed regarding the number of predictors, prediction outcome, or prediction horizon.

Complementary validation

For the five best discriminating models (c statistics ≥ 0.76) we performed discrimination and calibration for the complementary outcome (all-cause dementia vs. AD) and prediction horizon (6 years vs. 10 years). These models included the ones by Mura et al. [31], 2009 Barnes et al. [25], Hogan et al. [24], 2010 Tierney et al. [29], and 2005 Tierney et al. [33]—calibration was performed with the reported intercept or baseline hazard, which excluded complementary calibration for the 2010 model by Tierney et al. [29].

Discrimination showed that all five models maintained acceptable discriminatory ability (c statistics ≥ 0.76) for other combinations of prediction outcome and prediction horizon (Supplementary Table S2). Calibration for other combinations of prediction outcome and prediction horizon showed that, while not developed for that particular prediction outcome and/or horizon, the models by Mura et al. [31], 2009 Barnes et al. [25], and 2005 Tierney et al. [33] all three calibrated best for 10-year dementia risk (Supplementary Figure S1). The model by Hogan et al. [24] calibrated best for the original prediction outcome and horizon of 6-year dementia risk.

Conclusions

We externally validated 17 out of 36 eligible previously developed prediction models of all-cause dementia or AD for prediction horizons of five to ten years in the population-based AGES-RS cohort. We found acceptable (c statistic > 0.70) discriminatory ability to predict who will and who will not develop dementia for nearly all models. Calibration of models ranged from good calibration to poor calibration with systematic overestimation of predicted risks or overestimation for particularly the highest risk group. Recalibration of models often resulted in underestimation of predicted risks, particularly in the highest risk group. We observed a clear pattern that the best discriminating models all included cognition as a predictor; of these, only two models also had good (re)calibration. No clear patterns for discriminatory ability emerged among other categories of predictors, the number of predictors in a model, the prediction outcome, or the prediction horizon. Complementary discrimination of the five best performing models showed that all of them discriminated well for combinations of the other prediction outcome and/or prediction horizon.

When externally evaluating the performance of a prediction model, both discrimination and calibration should be assessed: good performance on one measure does not ensure good performance on the other [34]. Among the models that we externally validated, best performance combining both discrimination and calibration was achieved by the original model by Hogan et al. [24] and the recalibrated 2009 model by Barnes et al. [25] In our validation data, proxies had to be used for several predictors in both models; despite these proxies, the models obtained acceptable discrimination and good (re)calibration.

We observed that the dementia prediction models that included cognition as a predictor showed better discrimination compared to models that did not. Cognitive impairment is one of the core clinical criteria for diagnosis of all-cause dementia and AD [35] and subtle cognitive impairment arises several years before a clinical diagnosis can be established [

Data availability

Data are available from the National Institute on Aging (contact via Lenore J. Launer, corresponding author for AGES-RS) for researchers who meet the criteria for access to confidential data.

Notes

  1. Note that the model by Downer et al. provided a limited number of predicted risk-categories in a nomogram only for their risk score model—thus, we manually divided individuals into five groups based on the provided predicted risks to obtain five cut points for calibration (n per group: 2027 (≤ 5%), 1280 (> 5–6%), 717 (> 6–8%), 527 (> 8–11%), 792 (> 11%)).

References

  1. Nichols E, Szoeke CE, Vollset SE, Abbasi N, Abd-Allah F, Abdela J, et al. Global, regional, and national burden of Alzheimer’s disease and other dementias, 1990–2016: a systematic analysis for the global burden of disease study 2016. Lancet Neurol. 2019;18(1):88–106.

    Article  Google Scholar 

  2. Dubois B, Hampel H, Feldman HH, Scheltens P, Aisen P, Andrieu S, et al. Preclinical Alzheimer’s disease: definition, natural history, and diagnostic criteria. Alzheimers Dement. 2016;12(3):292–323.

    Article  PubMed  PubMed Central  Google Scholar 

  3. Vonk JMJ, Rentería MA, Medina VM, Pericak-Vance MA, Byrd GS, Haines J, et al. Education moderates the relation between APOE ɛ4 and memory in nondemented non-hispanic black older adults. J Alzheimer’s Dis. 2019;72:1–12.

    CAS  Google Scholar 

  4. Fitzpatrick AL, Kuller LH, Lopez OL, Diehr P, O’Meara ES, Longstreth W, et al. Midlife and late-life obesity and the risk of dementia: cardiovascular health study. Arch Neurol. 2009;66(3):336–42.

    Article  PubMed  PubMed Central  Google Scholar 

  5. Sharp ES, Gatz M. The relationship between education and dementia an updated systematic review. Alzheimer Dis Assoc Disord. 2011;25(4):289.

    Article  PubMed  PubMed Central  Google Scholar 

  6. Guerra B, Haile SR, Lamprecht B, Ramirez AS, Martinez-Camblor P, Kaiser B, et al. Large-scale external validation and comparison of prognostic models: an application to chronic obstructive pulmonary disease. BMC Med. 2018;16(1):33.

    Article  PubMed  PubMed Central  CAS  Google Scholar 

  7. Solomon A, Soininen H. Risk prediction models in dementia prevention. Nat Rev Neurol. 2015;11(7):375–7.

    Article  PubMed  Google Scholar 

  8. Licher S, Yilmaz P, Leening MJ, Wolters FJ, Vernooij MW, Stephan BC, et al. External validation of four dementia prediction models for use in the general community-dwelling population: a comparative analysis from the rotterdam study. Eur J Epidemiol. 2018;33(7):645–55.

    Article  PubMed  PubMed Central  Google Scholar 

  9. Stephan BC, Kurth T, Matthews FE, Brayne C, Dufouil C. Dementia risk prediction in the population: are screening models accurate? Nat Rev Neurol. 2010;6(6):318.

    Article  PubMed  Google Scholar 

  10. Tang EYH, Harrison SL, Errington L, Gordon MF, Visser PJ, Novak G, et al. Current developments in dementia risk prediction modelling: an updated systematic review. PLoS One. 2015;10(9):e0136181.

    Article  PubMed  PubMed Central  CAS  Google Scholar 

  11. Tang EYH, Robinson L, Stephan BCM. Dementia risk assessment tools: an update. Future Medicine; 2017.

  12. Hou X-H, Feng L, Zhang C, Cao X-P, Tan L, Yu J-T. Models for predicting risk of dementia: a systematic review. J Neurol Neurosurg Psychiatry. 2019;90(4):373–9.

    Article  PubMed  Google Scholar 

  13. Collins GS, de Groot JA, Dutton S, Omar O, Shanyinde M, Tajar A, et al. External validation of multivariable prediction models: a systematic review of methodological conduct and reporting. BMC Med Res Methodol. 2014;14(1):40.

    Article  PubMed  PubMed Central  Google Scholar 

  14. Hendriksen JM, Geersing G-J, Lucassen WA, Erkens PM, Stoffers HE, Van Weert HC, et al. Diagnostic prediction models for suspected pulmonary embolism: systematic review and independent external validation in primary care. Bmj. 2015;351:h4438.

    Article  PubMed  PubMed Central  CAS  Google Scholar 

  15. Song X, Mitnitski A, Rockwood K. Nontraditional risk factors combine to predict Alzheimer disease and dementia. Neurology. 2011;77(3):227–34.

    Article  PubMed  PubMed Central  Google Scholar 

  16. Hosmer DW, Lemeshow S. Applied logistic regression. New York: John Wiley & Sons; 2000.

    Book  Google Scholar 

  17. Harris TB, Launer LJ, Eiriksdottir G, Kjartansson O, Jonsson PV, Sigurdsson G, et al. Age, gene/environment susceptibility–Reykjavik Study: multidisciplinary applied phenomics. Am J Epidemiol. 2007;165(9):1076–87.

    Article  PubMed  Google Scholar 

  18. Jørgensen L, el Kholy K, Damkjaer K, Deis A, Schroll M. “ RAI”–an international system for assessment of nursing home residents. Ugeskr Laeger. 1997;159(43):6371–6.

    PubMed  Google Scholar 

  19. Anstey KJ, Cherbuin N, Herath PM, Qiu C, Kuller LH, Lopez OL, et al. A self-report risk index to predict occurrence of dementia in three independent cohorts of older adults: the ANU-ADRI. PLoS One. 2014;9(1):e86141.

    Article  PubMed  PubMed Central  CAS  Google Scholar 

  20. Steyerberg EW, Vickers AJ, Cook NR, Gerds T, Gonen M, Obuchowski N, et al. Assessing the performance of prediction models: a framework for some traditional and novel measures. Epidemiology. 2010;21(1):128.

    Article  PubMed  PubMed Central  Google Scholar 

  21. Royston P, Altman DG. External validation of a Cox prognostic model: principles and methods. BMC Med Res Methodol. 2013;13(1):33.

    Article  PubMed  PubMed Central  Google Scholar 

  22. Collins GS, Reitsma JB, Altman DG, Moons KG. Transparent reporting of a multivariable prediction model for individual prognosis or diagnosis (TRIPOD): the TRIPOD statement. BMC Med. 2015;13(1):1.

    Article  PubMed  PubMed Central  Google Scholar 

  23. Buuren SV, Groothuis-Oudshoorn K. mice: Multivariate imputation by chained equations in R. J Stat Softw. 2010;45:1–68.

    Google Scholar 

  24. Hogan DB, Ebly EM. Predicting who will develop dementia in a cohort of Canadian seniors. Can J Neurol Sci. 2000;27(1):18–24.

    Article  CAS  PubMed  Google Scholar 

  25. Barnes D, Covinsky K, Whitmer R, Kuller L, Lopez O, Yaffe K. Predicting risk of dementia in older adults: the late-life dementia risk index. Neurology. 2009;73(3):173–9.

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  26. Li J, Ogrodnik M, Devine S, Auerbach S, Wolf PA, Au R. Practical risk score for 5-, 10-, and 20-year prediction of dementia in elderly persons: Framingham Heart Study. Alzheimers Dement. 2018;14(1):35–42.

    Article  PubMed  Google Scholar 

  27. Licher S, Leening MJ, Yilmaz P, Wolters FJ, Heeringa J, Bindels PJ, et al. Development and validation of a dementia risk prediction model in the general population: an analysis of three longitudinal studies. Am J Psychiatry. 2019;176(7):543–51.

    Article  PubMed  Google Scholar 

  28. Barnes DE, Beiser AS, Lee A, Langa KM, Koyama A, Preis SR, et al. Development and validation of a brief dementia screening indicator for primary care. Alzheimer’s Dementia. 2014;10(6):656-65. e1.

    Article  PubMed  Google Scholar 

  29. Tierney MC, Moineddin R, McDowell I. Prediction of all-cause dementia using neuropsychological tests within 10 and 5 years of diagnosis in a community-based sample. J Alzheimers Dis. 2010;22(4):1231–40.

    Article  PubMed  Google Scholar 

  30. Downer B, Kumar A, Veeranki SP, Mehta HB, Raji M, Markides KS. Mexican-American dementia nomogram: development of a dementia risk index for Mexican-American older adults. J Am Geriatr Soc. 2016;64(12):e265–9.

    Article  PubMed  PubMed Central  Google Scholar 

  31. Mura T, Baramova M, Gabelle A, Artero S, Dartigues J-F, Amieva H, et al. Predicting dementia using socio-demographic characteristics and the free and cued selective reminding test in the general population. Alzheimer’s Res Ther. 2017;9(1):21.

    Article  CAS  Google Scholar 

  32. Verhaaren BF, Vernooij MW, Koudstaal PJ, Uitterlinden AG, van Duijn CM, Hofman A, et al. Alzheimer’s disease genes and cognition in the nondemented general population. Biol Psychiat. 2013;73(5):429–34.

    Article  CAS  PubMed  Google Scholar 

  33. Tierney MC, Yao C, Kiss A, McDowell I. Neuropsychological tests accurately predict incident Alzheimer disease after 5 and 10 years. Neurology. 2005;64(11):1853–9.

    Article  PubMed  Google Scholar 

  34. Heyard R, Timsit JF, Held L, Consortium CM. Validation of discrete time‐to‐event prediction models in the presence of competing risks. Biom J. 2019;62:643–57.

    Article  PubMed  PubMed Central  Google Scholar 

  35. McKhann GM, Knopman DS, Chertkow H, Hyman BT, Jack CR, Kawas CH, et al. The diagnosis of dementia due to Alzheimer’s disease: recommendations from the national institute on aging-Alzheimer’s Association workgroups on diagnostic guidelines for Alzheimer’s disease. Alzheimer’s Dementia. 2011;7(3):263–9.

    Article  PubMed  Google Scholar 

  36. Bateman RJ, **ong C, Benzinger TL, Fagan AM, Goate A, Fox NC, et al. Clinical and biomarker changes in dominantly inherited Alzheimer’s disease. N Engl J Med. 2012;367(9):795–804.

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  37. Linn RT, Wolf PA, Bachman DL, Knoefel JE, Cobb JL, Belanger AJ, et al. The “preclinical phase” of probable Alzheimer’s disease. A 13-year prospective study of the Framingham cohort. Arch Neurol. 1995;52(5):485–90. https://doi.org/10.1001/archneur.52.5.485.

    Article  CAS  PubMed  Google Scholar 

  38. Bondi MW, Edmonds EC, Jak AJ, Clark LR, Delano-Wood L, McDonald CR, et al. Neuropsychological criteria for mild cognitive impairment improves diagnostic precision, biomarker associations, and progression rates. J Alzheimers Dis. 2014;42(1):275–89.

    Article  PubMed  PubMed Central  Google Scholar 

  39. Albert MS, DeKosky ST, Dickson D, Dubois B, Feldman HH, Fox NC, et al. The diagnosis of mild cognitive impairment due to Alzheimer’s disease: recommendations from the national institute on aging-Alzheimer’s association workgroups on diagnostic guidelines for Alzheimer’s disease. Alzheimers Dement. 2011;7(3):270–9.

    Article  PubMed  PubMed Central  Google Scholar 

  40. Petersen RC, Smith GE, Waring SC, Ivnik RJ, Tangalos EG, Kokmen E. Mild cognitive impairment: clinical characterization and outcome. Arch Neurol. 1999;56:303–8 (PubMed PMID: 1791).

    Article  CAS  PubMed  Google Scholar 

  41. Stephan BC, Matthews FE, McKeith IG, Bond J, Brayne C, Function MRCC, et al. Early cognitive change in the general population: how do different definitions work? J Am Geriatr Soc. 2007;55(10):1534–40.

    Article  PubMed  Google Scholar 

  42. Matthews FE, Stephan BC, McKeith IG, Bond J, Brayne C, Function MRCC, et al. Two-year progression from mild cognitive impairment to dementia: to what extent do different definitions agree? J Am Geriatr Soc. 2008;56(8):1424–33.

    Article  PubMed  Google Scholar 

  43. Sachdev PS, Lipnicki DM, Crawford J, Reppermund S, Kochan NA, Trollor JN, et al. Factors predicting reversion from mild cognitive impairment to normal cognitive functioning: a population-based study. PloS one. 2013;8(3):e59649.

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  44. Stephan BC, Tzourio C, Auriacombe S, AmievaDufouil HC, Alpérovitch A, et al. Usefulness of data from magnetic resonance imaging to improve prediction of dementia: population based cohort study. Bmj. 2015;350:h2863.

    Article  PubMed  PubMed Central  Google Scholar 

  45. Steyerberg EW. Overfitting and optimism in prediction models. Clinical Prediction Models: Springer; 2019. p. 95–112.

    Google Scholar 

  46. Volkers EJ, Algra A, Kappelle LJ, Jansen O, Howard G, Hendrikse J, et al. Prediction models for clinical outcome after a carotid revascularization procedure: an external validation study. Stroke. 2018;49(8):1880–5.

    Article  PubMed  PubMed Central  Google Scholar 

  47. Vidarsdottir H, Gunnarsdottir HK, Olafsdottir EJ, Olafsdottir GH, Pukkala E, Tryggvadottir L. Cancer risk by education in Iceland; a census-based cohort study. Acta Oncol. 2008;47(3):385–90.

    Article  PubMed  Google Scholar 

  48. Barnes DE, Covinsky KE, Whitmer RA, Kuller LH, Lopez OL, Yaffe K. Dementia risk indices: a framework for identifying individuals with a high dementia risk. Alzheimer’s Dementia. 2010;6(2):138.

    Article  PubMed  Google Scholar 

  49. Chary E, Amieva H, Pérès K, Orgogozo J-M, Dartigues J-F, Jacqmin-Gadda H. Short-versus long-term prediction of dementia among subjects with low and high educational levels. Alzheimers Dement. 2013;9(5):562–71.

    Article  PubMed  Google Scholar 

  50. Jacqmin-Gadda H, Blanche P, Chary E, Loubère L, Amieva H, Dartigues J-F. Prognostic score for predicting risk of dementia over 10 years while accounting for competing risk of death. Am J Epidemiol. 2014;180(8):790–8.

    Article  PubMed  Google Scholar 

  51. Nielsen H, Lolk A, Andersen K, Andersen J, Kragh‐Sørensen P. Characteristics of elderly who develop Alzheimer’s disease during the next two years—a neuropsychological study using CAMCOG. The Odense study. Int J Geriatr Psychiatry. 1999;14(11):957–63.

    Article  CAS  PubMed  Google Scholar 

  52. Reitz C, Tang M-X, Schupf N, Manly JJ, Mayeux R, Luchsinger JA. A summary risk score for the prediction of Alzheimer disease in elderly persons. Arch Neurol. 2010;67(7):835–41.

    Article  PubMed  PubMed Central  Google Scholar 

  53. Mitchell AJ, Shiri-Feshki M. Rate of progression of mild cognitive impairment to dementia–meta-analysis of 41 robust inception cohort studies. Acta Psychiatr Scand. 2009;119(4):252–65.

    Article  CAS  PubMed  Google Scholar 

Download references

Acknowledgements

We gratefully acknowledge the help of Morgan Scarth and Gelan Ying in organizing articles for review, and the advice of Ewoud Schuit and Rene Eijkemans regarding statistical modeling.

Funding

The Age, Gene/Environment Susceptibility-Reykjavik Study has been funded by National Institute on Aging contract N01-AG-12100 with contributions from the National Eye Institute; National Institute on Deafness and Other Communication Disorders; National Heart, Lung and Blood Institute; National Institute on Aging Intramural Research Program; Hjartavernd (the Icelandic Heart Association, HHSN271201200022C); and the Althingi (Icelandic Parliament). The authors thank all the study participants and the IHA staff. This work was supported by Alzheimer Nederland Fellowship WE.15-2018-05 (PI: J.M.J. Vonk), Alzheimer Nederland grant WE.03-2017-06 (PI: M.I. Geerlings), National Institute on Aging K99/R00 award K99AG066934 (PI: J.M.J. Vonk), and NWO/ZonMw Veni Grant project number 09150161810017 (PI: J.M.J. Vonk). None of the funders had any role in the design and conduct of the study, the collection, management, analysis, and interpretation of the data, the preparation and writing of the manuscript, or the decision to submit the manuscript for publication.

Author information

Authors and Affiliations

Authors

Contributions

JMJ Vonk helped shape the project idea, performed the analyses, wrote the manuscript, and is responsible for the overall content as guarantor. JP Greving helped to conceive the idea, helped shape the project execution, helped supervise the project, made comments on the manuscript, and verified the analytical methods. V Gudnason created and directs the AGES-RS cohort, made comments on the manuscript, and provided critical feedback. LJ Launer created and directs the AGES-RS cohort, made comments on the manuscript, and provided critical feedback. MI Geerlings conceived the idea, supervised the current project, verified the analytical methods, provided critical feedback, helped shape the manuscript, and provided critical feedback. The corresponding author JMJ Vonk attests that all listed authors meet authorship criteria and that no others meeting the criteria have been omitted. The corresponding authors had full access to all the data in the study and had final responsibility for the decision to submit for publication.

Corresponding authors

Correspondence to Jet M. J. Vonk or Lenore J. Launer.

Ethics declarations

Conflict of interest

The authors have declared that no competing interests exist.

Ethics approval and consent to participate

AGES-RS was approved by the Icelandic National Bioethics Committee (VSN: 00–063), the Icelandic Data Protection Authority (Iceland), and by the Institutional Review Board for the National Institute on Aging, National Institutes of Health (USA). Written informed consent was obtained from all participants.

Additional information

Publisher's Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Supplementary Information

Below is the link to the electronic supplementary material.

Supplementary file1 (DOCX 632 KB)

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/.

Reprints and permissions

About this article

Check for updates. Verify currency and authenticity via CrossMark

Cite this article

Vonk, J.M.J., Greving, J.P., Gudnason, V. et al. Dementia risk in the general population: large-scale external validation of prediction models in the AGES-Reykjavik study. Eur J Epidemiol 36, 1025–1041 (2021). https://doi.org/10.1007/s10654-021-00785-x

Download citation

  • Received:

  • Accepted:

  • Published:

  • Issue Date:

  • DOI: https://doi.org/10.1007/s10654-021-00785-x

Keywords