Machine Learning–Based Prediction of Functional Disability: a Cohort Study of Japanese Older Adults in 2013–2019

Lu, Yongjian; Sato, Koryu; Nagai, Masato; Miyatake, Hirokazu; Kondo, Katsunori; Kondo, Naoki

doi:10.1007/s11606-023-08215-2

Machine Learning–Based Prediction of Functional Disability: a Cohort Study of Japanese Older Adults in 2013–2019

Original Research
Open access
Published: 01 May 2023

Volume 38, pages 2486–2493, (2023)
Cite this article

Download PDF

You have full access to this open access article

Journal of General Internal Medicine Aims and scope Submit manuscript

Machine Learning–Based Prediction of Functional Disability: a Cohort Study of Japanese Older Adults in 2013–2019

Download PDF

Lu Yongjian PhD ORCID: orcid.org/0000-0002-5975-817X¹^na1,
Koryu Sato MPH ORCID: orcid.org/0000-0002-8418-8535²^na1,
Masato Nagai PhD³,
Hirokazu Miyatake MS⁴,
Katsunori Kondo PhD^5,6 &
…
Naoki Kondo PhD²

1843 Accesses
1 Citation
1 Altmetric
Explore all metrics

Abstract

Background

It is important to identify older adults at high risk of functional disability and to take preventive measures for them at an early stage. To our knowledge, there are no studies that predict functional disability among community-dwelling older adults using machine learning algorithms.

Objective

To construct a model that can predict functional disability over 5 years using basic machine learning algorithms.

Design

A cohort study with a mean follow-up of 5.4 years.

Participants

We used data from the Japan Gerontological Evaluation Study, which involved 73,262 people aged ≥ 65 years who were not certified as requiring long-term care. The baseline survey was conducted in 2013 in 19 municipalities.

Main Measures

We defined the onset of functional disability as the new certification of needing long-term care that was ascertained by linking participants to public registries of long-term care insurance. All 183 candidate predictors were measured by self-report questionnaires.

Key Results

During the study period, 16,361 (22.3%) participants experienced the onset of functional disability. Among machine learning–based models, ridge regression (C statistic = 0.818) and gradient boosting (0.817) effectively predicted functional disability. In both models, we identified age, self-rated health, variables related to falls and posture stabilization, and diagnoses of Parkinson’s disease and dementia as important features. Additionally, the ridge regression model identified the household characteristics such as the number of members, income, and receiving public assistance as important predictors, while the gradient boosting model selected moderate physical activity and driving. Based on the ridge regression model, we developed a simplified risk score for functional disability, and it also indicated good performance at the cut-off of 6/7 points.

Conclusions

Machine learning–based models showed effective performance prediction over 5 years. Our findings suggest that measuring and adding the variables identified as important features can improve the prediction of functional disability.

Interpretable classifiers for prediction of disability trajectories using a nationwide longitudinal database

Article Open access 28 July 2022

A new tool for the evaluation of the rehabilitation outcomes in older persons: a machine learning model to predict functional status 1 year ahead

Article 29 August 2018

Predicting restriction of life-space mobility: a machine learning analysis of the IMIAS study

Article 07 September 2022

Use our pre-submission checklist

Avoid common mistakes on your manuscript.

INTRODUCTION

The world’s population of individuals aged > 60 years will almost double between 2015 and 2050.¹ As the population ages, functional disability becomes more prevalent. In the USA, 41.7% of people aged ≥ 65 years reported having one or more disabilities.² Functional disability is associated with adverse outcomes such as decreased quality of life and increased risks of hospitalization and mortality.³ However, functional declines in the aging process are dynamic and reversible. A meta-analysis reported that 13.7% of older adults improved their frailty status during the mean follow-up of 3.9 years.⁴ Therefore, it is important to identify older adults at high risk of functional disability and to take preventive measures for them at an early stage.

Several attempts have been made to predict functional disability and other functional statuses in older populations. A recent review identified 43 studies that predicted the functional status of community-dwelling older adults.⁵ In Japan, the Ministry of Health, Labour and Welfare developed a basic function checklist (the Kihon Checklist [KCL] in Japanese) comprising 25 items to identify older adults at high risk of needing long-term care. Tsuji and colleagues developed a risk score comprising ten items to predict functional disability in 3 years using data from the Japan Gerontological Evaluation Study (JAGES).⁶ Despite existing literature, there are two major research gaps. First, the variables in the developed models were selected based on expert knowledge and previous literature. Researchers can handle a limited number of potential variables and may overlook essential variables. Emerging machine learning algorithms are effective in selecting variables from many candidates without relying on a priori hypotheses or assumptions and may improve the performance of prediction models. However, none of the aforementioned studies has used these methods. Second, most of the previous studies had short follow-up periods, and only four studies from European countries followed participants for over 5 years.⁵ Because preventive measures for functional disability often need time to elicit effects, a model that can predict the distant future is necessary.

To fill these research gaps, the present study constructed prediction models of functional disability from 183 candidate predictors using machine learning algorithms. We studied functionally and cognitively independent Japanese older adults and followed them to evaluate the performance of the prediction models. To the best of our knowledge, no study has predicted functional disability using machine learning algorithms over 5 years among community-dwelling older adults.

METHODS

Baseline Survey

We used data from the JAGES, which studies Japanese people aged ≥ 65 years who are not certified as needing long-term care. Self-report questionnaires were mailed to 112,705 residents in 19 municipalities across nine prefectures from October to December 2013. In ten large municipalities, participants were randomly sampled, whereas in other smaller municipalities, a census of all eligible residents was conducted. Of the invited individuals, 79,291 responded, with a response rate of 70.4%. The analysis did not include 4994 respondents, whose sex and age could not be verified. All participants provided informed consent, and the study protocol was reviewed and approved by the ethics committees of Kyoto University (R3153-2) and Nihon Fukushi University (13–14).

Functional Disability

The onset of functional disability was defined as the new certification of needing long-term care and ascertained by linking participants to the public registries of long-term care insurance administrated by each municipality. This definition of functional disability has been widely used in previous studies.^6,7,8,9,10 All Japanese citizens aged ≥ 40 years sign up for public long-term care insurance, and they are eligible for benefits if they are determined to need care.¹¹ Through a nationally standardized protocol, applicants are classified into the following eight levels of needing long-term care: not certified, support-needs levels 1–2, and care-needs levels 1–5 (larger numbers indicate severer disability; see Supplementary Table 1 for more details).^12,13 The levels are determined according to a time estimation needed for care based on home-visit and computer-based assessments, a primary physician’s documented opinion, and a committee deliberation. In this study, those certified as one of the seven levels of needing care (except for those not certified) were considered to have functional disabilities. The follow-up period started between October and November 2013 and ended between March 2019 and March 2021 (mean follow-up, 5.4 years). Of the 74,297 eligible respondents, 73,262 participants were successfully linked to the administrative records (follow-up rate = 98.6%). Figure 1 shows a flowchart of the analytic sample.

Candidate Predictors

We considered all variables constructed by questions that the JAGES asked all participants to be included in the prediction models. A total of 183 variables included demographic characteristics, socioeconomic status, self-reported physical and mental health, health behaviors, social capital, and community environment (see Supplementary Table 2 for the list of candidate variables). To make variables measured using different scales comparable in machine learning algorithms, they were normalized to values ranging from zero to one.

Statistical Analysis

In general, parametric methods overweigh non-parametric methods when the relationship between an outcome and a predictor is linear, and vice versa.¹⁴ Thus, we examined the predictive performance of one parametric and three non-parametric machine learning algorithms: namely, ridge regression, gradient boosting, random forest, and eXtreme Gradient Boosting (XGBoost). They can be easily implemented using statistical packages and have been widely used.^14,15 We performed logistic regression with ridge regularization to prevent overfitting by penalizing large coefficients.^16,17 Gradient boosting¹⁸ and random forest¹⁹ are non-parametric ensemble methods that combine multiple decision trees to prevent overfitting. Whereas gradient boosting combines decision trees using boosting (iteratively correcting errors made by the previously trained tree), random forest uses bagging (bootstrap aggregating of independently developed trees). XGBoost is a new and fast algorithm of gradient boosting combining regularization.²⁰

For all models, we performed a threefold cross-validation procedure; the dataset was randomly split into three groups; in the three training and validation processes, each group was always used once as test data, while the remaining groups were used as training data. Then, we repeated the same process ten times. The feature importance of selected predictors was calculated; it represents coefficients in ridge regression, while it represents relative values of reductions in the Gini index due to splits over a given predictor in other tree-based algorithms. Based on the feature importance of ridge regression, we developed a simplified risk score for functional disability to facilitate the implementation of the model (it is hard for other non-parametric models to simplify the calculation of risk scores due to non-linearity). We selected the ten most important features in the ridge regression and assigned a score of 1 to the tenth feature. Then, scores proportional to the importance were assigned to each feature (decimals were rounded off). In the dataset, 4.9% of the values were missing. To reduce the potential bias due to missing variables, we imputed them using a random forest algorithm.²¹ All analyses were performed using Python 3 (CreateSpace, Scotts Valley, CA, USA).

RESULTS

Table 1 presents the baseline characteristics of the participants. For categorical variables, high scores indicate poor outcomes. Among the participants, 16,361 (22.3%) were newly disabled (i.e., needing long-term care) during the study period. Compared to those who remained independent, disabled people were more than 6 years older, lived with fewer household members, had lower household income, were more likely to receive public assistance, to provide self-reporting of needs for assistance in basic activities of daily living (e.g., walking, bathing, and using a toilet), to experience falls within 1 year, worry about falling, and feel bothersome, and to be diagnosed with dementia, Parkinson’s disease, and blood and immune diseases, and rated their health as poorer at baseline. In addition, disabled people were less likely to be able to climb stairs and stand up without support, engage in moderate physical activity (e.g., walking at a brisk pace, dancing, gymnastics, golf, farming, gardening, and car washing), and drive than those who remained independent. Supplementary Fig. 1 describes the distribution of certified levels of needing long-term care in the follow-up.

Table 1 Participants’ Characteristics

Full size table

Table 2 compares the performance of the proposed prediction models. Among the models, ridge regression showed the best performance in predicting functional disability (C statistics = 0.818), whereas gradient boosting showed a similar performance (0.817). Figure 2 shows the ten most important features of the two models. In both models, we identified age, self-rated health, variables related to falls and posture stabilization, and diagnoses of Parkinson’s disease and dementia as important features. In the ridge regression, household characteristics such as the number of members, income, and receiving public assistance were also important features (Fig. 2A). In the gradient boosting model, moderate physical activity and driving also predicted functional disability (Fig. 2B).

Table 2 Prediction Performance for Functional Disability

Full size table

Table 3 presents the simplified risk score for functional disability based on our ridge regression model. Figure 3 indicates the distribution of the risk score and the percentage of those who experienced the event. The continuous risk score indicated good performance (C statistics = 0.792). Youden index suggests that the cut-off of 6/7 points is optimal (sensitivity = 0.746, specificity = 0.699); those with a score of 7 or higher are at high risk of functional disability.

Table 3 Simplified Risk Score for Functional Disability Based on Ridge Regression

Full size table

We performed several sensitivity analyses. First, we excluded participants who were certified as needing long-term care within 1 year from the baseline survey. The C statistics declined in all models but still showed good performance (0.809 for ridge regression and 0.807 for gradient boosting; Supplementary Table 3). Second, we tested prediction performance for the onset of severe disabilities (i.e., certified as the care-needs level 2 or severer, which requires care for basic activities of daily living), as a previous study defined.²² Compared to the performance for any certified needs levels, that of predicting severer conditions was lower but still good (0.805 for ridge regression and 0.804 for gradient boosting; Supplementary Table 4). Similar to our main models, both prediction models for the alternative cut-off identified age, self-rated health, and diagnoses of Parkinson’s disease and dementia as important features (Supplementary Fig. 2). In the alternative ridge regression, the use of an electric wheelchair and body mass index appeared to be important features (Supplementary Fig. 2A). In the alternative gradient boosting model, several variables related to instrumental activities of daily living (e.g., going shop** and filling out documents) were selected (Supplementary Fig. 2B). Third, we also performed a Cox proportional hazard regression. During the study period, some experienced the onset of functional disability early, others experienced late, and others were censored without the onset of functional disability. However, our main models predicted whether the participant experienced the onset of functional disability, regardless of the duration of free from it. Thus, a prediction model accounting for the time to event may better perform. Our Cox model included a penalty term using ridge regularization to prevent overfitting.¹⁴ The Cox model performed similarly to the logistic regression with ridge regularization and gradient boosting (0.817; Supplementary Table 5). Fourth, we confirmed whether a voting ensemble method combining the four algorithms improved performance. However, the performance improvement was slight (0.819; Supplementary Table 5). Finally, we tested the performance of the 25-item KCL, which is often used as a screening tool for those at high risk of functional disability in Japan. Although its performance was acceptable (0.716 for ridge regression and 0.717 for gradient boosting; Supplementary Table 6), our machine learning–based models performed better.

DISCUSSION

This study constructed prediction models of functional disability using machine learning algorithms over 5 years among community-dwelling older adults. Among the models, ridge regression and gradient boosting effectively predicted functional disability. Machine learning improved prediction performance compared to models previously developed. The existing models not based on machine learning indicated median C statistics ranging between 0.65 and 0.76 for development models, and between 0.60 and 0.68 for validation models.⁵ While the 3-year prediction model developed by Tsuji and colleagues indicated a C statistic of 0.804,⁶ our model performed better with longer-term forecasts. Although the KCL (excluding five items related to depression) showed good performance in predicting functional disability in 1 year (C statistic = 0.83),²³ our additional analysis suggested that its performance degrades when forecasted for more than 5 years. The simplified risk score based on our ridge regression also indicated good performance. Our findings suggest that machine learning enables us to identify those at high risk for functional disability more precisely and to take preventive measures effectively.

Several important features were identified in both models. Both models identified variables related to falls and posture stabilization as important features, namely, the frequency of falls within 1 year, worry about falls, and ability to climb stairs and stand up without support. Moreover, the models have captured the process of functional declines due to aging. People with frailty have difficulty climbing stairs and standing up on their own, and are more likely to fall.³ Falls and traumatic injuries increase the risk of functional disability.²⁴ These four variables are also included in the KCL used in Japan’s long-term care insurance and the risk score of functional disability developed by Tsuji and colleagues.^6,23 In line with the previous models, this study suggested that adding these measures could improve the prediction performance of functional disability. In addition, our models suggest that diagnoses of Parkinson’s disease and dementia are important predictors of functional disability as previous studies have found.^25,26 These neurodegenerative diseases are common in the older population and result in functional impairments.^27,28

We also found that self-rated health predicted functional disability, which was consistent with previous studies.^29,30,31 Idler and Benyamini argued that there are four reasons why self-rated health can predict functional disability effectively; (1) it is more inclusive than other measures; (2) it can evaluate not only the current health status but also trajectory; (3) it affects behaviors that have an impact on future health status; and (4) it reflects about resources which one can access when health declines.³² Our findings suggest that self-rated health is a simple and useful measure to predict functional disability among older adults.

Furthermore, ridge regression and gradient boosting models have identified unique and important features. In the ridge regression, the household characteristics such as the number of members, income, and receiving public assistance were selected as important predictors. A previous study reported that the size of social networks, including family members, was not associated with functional disability.³³ In contrast, the present study suggested that household size mattered, and those who experienced functional disability had a smaller household size than those who did not. Household income and the status of public assistance may reflect the socioeconomic gradient of functional disability, as previous literature showed.^34,35,36

In the gradient boosting model, moderate physical activity and driving were identified as important features. Interestingly, moderate physical activity was the best predictor, although vigorous (e.g., running, swimming, cycling, tennis, exercise at the gym, and mountain climbing) and light (e.g., stretching, bowling, walking to shops or the station, and laundry) physical activities were candidate predictors in the model. Additionally, we found that in older adults, driving out of the house is a good predictor of disability. In order to prevent motor-vehicle collisions by older drivers, the Japanese National Police Agency requires drivers aged ≥ 75 years to take a special lecture, a cognitive function test, and a driving skills test when renewing their driver’s license as well as incentivize the voluntary return of license. Given such stringent measures in Japan, driving may be a proxy variable for the retention of physical function.

There are several limitations in this study. First, objectively measured variables could not be included as candidates. Prospective studies have shown that objective measures of physical function such as gait speed, one-leg-standing time, and handgrip strength can improve the prediction of functional disability.^37,38 Although there may be room to improve prediction performance by adding objectively measured variables, this study showed that prediction models constructed only with self-reported variables could predict functional disability with good performance. Second, this study did not provide causal models, but prediction models; thus, causality should not be inferred from our findings. There can be reverse causation and other potential biases between the identified predictors and functional disability. Readers should not interpret the results for etiology but use them to calculate the risk score of functional disability.³⁹ Further studies are required to confirm causality, and to propose preventive measures for functional disability. Third, some residents did not respond to the survey, which could have caused a selection bias. We could not assess the impact of non-respondents, because we did not have this data. However, a response rate of > 70% is comparable to or even higher than that of similar surveys involving community-dwelling older adults.⁴⁰ Fourth, given that we aimed at predicting functional disability for individuals, clinical and biological factors were chosen as important features. However, contextual factors should also be considered for community health. Previous studies demonstrated that living in a community with active social participation and rich social cohesion was associated with the reduced onset of functional disability.^7,8,9,22 Fifth, we combined all levels of needing long-term care as the outcome to predict the onset of functional disability. However, the clinical conditions of a person certified as the support-needs level 1 and a person certified as the care-needs level 5 are very different. We performed sensitivity analysis setting the care-needs level 2 as an alternative cut-off and found that the alternative models selected many of the same variables, but some were different from our primary models. We acknowledge that other prediction models may perform better to predict functional disability defined by different cut-offs and the severity of functional disability. Finally, we studied Japanese older adults, and the generalizability of our findings to other countries may be limited.

In summary, we present prediction models for functional disability that included important features selected from 183 candidate predictors using machine learning algorithms. The models showed effective performance prediction over 5 years. Our findings suggest that measuring and adding the variables identified as important features of ridge regression and gradient boosting can improve the prediction of functional disability. This study provides researchers and policymakers with valuable insights for improving the prediction of functional disabilities in community-dwelling older adults.

Data Availability

All JAGES datasets have ethical or legal restrictions for public deposition due to the inclusion of sensitive information from human participants. All enquiries are to be addressed to the data management committee via email: dataadmin.ml@jages.net.

References

World Health Organization. Ageing and health. World Health Organization. Published October 4, 2021. https://www.who.int/news-room/fact-sheets/detail/ageing-and-health.Accessed 12 Aug 2022.
Okoro CA. Prevalence of Disabilities and Health Care Access by Disability Status and Type Among Adults — United States, 2016. MMWR Morb Mortal Wkly Rep. 2018;67. https://doi.org/10.15585/mmwr.mm6732a3.
Fried LP, Ferrucci L, Darer J, Williamson JD, Anderson G. Untangling the concepts of disability, frailty, and comorbidity: implications for improved targeting and care. J Gerontol A Biol Sci Med Sci. 2004;59(3):255-263.
Kojima G, Taniguchi Y, Iliffe S, Jivraj S, Walters K. Transitions Between Frailty States Among Community-Dwelling Older People: a Systematic Review and Meta-analysis. Ageing Res Rev. 2019;50:81-88. https://doi.org/10.1016/j.arr.2019.01.010.
Van Grootven B, van Achterberg T. Prediction Models for Functional Status in Community Dwelling Older Adults: a Systematic Review. BMC Geriatr. 2022;22(1):465. https://doi.org/10.1186/s12877-022-03156-7.
Tsuji T, Kondo K, Kondo N, Aida J, Takagi D. Development of a Risk Assessment Scale Predicting Incident Functional Disability Among Older People: Japan Gerontological Evaluation Study. Geriatr Gerontol Int. 2018;18(10):1433-1438. https://doi.org/10.1111/ggi.13503.
Aida J, Kondo K, Kawachi I, et al. Does Social Capital Affect the Incidence of Functional Disability in Older Japanese? A Prospective Population-Based Cohort Study. J Epidemiol Community Health. 2013;67(1):42-47. https://doi.org/10.1136/jech-2011-200307.
Ashida T, Kondo N, Kondo K. Social Participation and the Onset of Functional Disability by Socioeconomic Status and Activity Type: the JAGES Cohort Study. Prev Med. 2016;89:121-128. https://doi.org/10.1016/j.ypmed.2016.05.006.
Fujihara S, Miyaguni Y, Tsuji T, Kondo K. Community-Level Social Participation and Functional Disability Among Older Adults: a JAGES Multilevel Longitudinal Study. Arch Gerontol Geriatr. 2022;100:104632. https://doi.org/10.1016/j.archger.2022.104632.
Watanabe R, Tsuji T, Ide K, et al. Predictive Validity of the Modified Kihon Checklist for the Incidence of Functional Disability Among Older People: a 3-Year Cohort Study from the JAGES. Geriatr Gerontol Int. 2022;22(8):667-674. https://doi.org/10.1111/ggi.14439.
Houde SC, Gautam R, Kai I. Long-term care insurance in Japan: implications for U.S. long-term care policy. J Gerontol Nurs. 2007;33(1):7–13.
Ministry of Health, Labour and Welfare. Long-term care insurance system of Japan. Ministry of Health, Labour and Welfare. Published November 2016. https://www.mhlw.go.jp/english/policy/care-welfare/care-welfare-elderly/dl/ltcisj_e.pdf.Accessed 15 Sept .2022
Ministry of Health, Labour and Welfare. Older Adult Care in 2015: Toward the Establishment of Care That Supports the Dignity of Older Adults. Ministry of Health, Labour and Welfare; 2003. https://www.mhlw.go.jp/topics/kaigo/kentou/15kourei/sankou3.html.Accessed 9 Mar 2023.
James G, Witten D, Hastie T, Tibshirani R. An Introduction to Statistical Learning: With Applications in R. 2nd Edition. Springer; 2021.
Hastie T, Tibshirani R, Friedman J. The Elements of Statistical Learning: Data Mining, Inference, and Prediction. 2nd edition. Springer; 2016.
Hoerl AE, Kennard RW. Ridge Regression: Biased Estimation for Nonorthogonal Problems. Technometrics. 1970;12(1):55-67. https://doi.org/10.1080/00401706.1970.10488634.
Hoerl AE, Kennard RW. Ridge Regression: Applications to Nonorthogonal Problems. Technometrics. 1970;12(1):69-82. https://doi.org/10.2307/1267352.
Natekin A, Knoll A. Gradient Boosting Machines, a Tutorial. Front Neurorobotics. 2013;7:21. https://doi.org/10.3389/fnbot.2013.00021.
Breiman L. Random Forests. Mach Learn. 2001;45(1):5-32. https://doi.org/10.1023/A:1010933404324.
Chen T, Guestrin C. XGBoost: A Scalable Tree Boosting System. In: Proceedings of the 22nd ACM SIGKDD International Conference on Knowledge Discovery and Data Mining. KDD ’16. Association for Computing Machinery; 2016:785–794. https://doi.org/10.1145/2939672.2939785.
Stekhoven DJ, Bühlmann P. MissForest—Non-parametric Missing Value Imputation for Mixed-Type Data. Bioinformatics. 2012;28(1):112-118. https://doi.org/10.1093/bioinformatics/btr597.
Noguchi T, Kondo K, Saito M, Nakagawa-Senda H, Suzuki S. Community Social Capital and the Onset of Functional Disability Among Older Adults in Japan: a Multilevel Longitudinal Study Using Japan Gerontological Evaluation Study (JAGES) Data. BMJ Open. 2019;9(10):e029279. https://doi.org/10.1136/bmjopen-2019-029279.
Tomata Y, Hozawa A, Ohmori-Matsuda K, et al. Validation of the Kihon Checklist for predicting the risk of 1-year incident long-term care insurance certification: the Ohsaki Cohort 2006 Study. Nihon Koshu Eisei Zasshi Jpn J Public Health. 2011;58(1):3-13.
Tinetti ME, Williams CS. The Effect of Falls and Fall Injuries on Functioning in Community-Dwelling Older Persons. J Gerontol Ser A. 1998;53A(2):M112-M119. https://doi.org/10.1093/gerona/53A.2.M112.
Murray AM, Bennett DA, Mendes de Leon CF, Beckett LA, Evans DA. A Longitudinal Study of Parkinsonism and Disability in a Community Population of Older People. J Gerontol Ser A. 2004;59(8):M864-M870. https://doi.org/10.1093/gerona/59.8.M864.
Greiner PA, Snowdon DA, Schmitt FA. The Loss of Independence in Activities of Daily Living: the Role of Low Normal Cognitive Function in Elderly Nuns. Am J Public Health. 1996;86(1):62-66. https://doi.org/10.2105/ajph.86.1.62.
Armstrong MJ, Okun MS. Diagnosis and Treatment of Parkinson Disease: a Review. JAMA. 2020;323(6):548-560. https://doi.org/10.1001/jama.2019.22360.
Sauvaget C, Yamada M, Fujiwara S, Sasaki H, Mimori Y. Dementia as a Predictor of Functional Disability: a Four-Year Follow-up Study. Gerontology. 2002;48(4):226-233. https://doi.org/10.1159/000058355.
Idler EL, Kasl SV. Self-ratings of Health: Do They Also Predict Change in Functional Ability? J Gerontol B Psychol Sci Soc Sci. 1995;50(6):S344-353. https://doi.org/10.1093/geronb/50b.6.s344.
Lee Y. The Predictive Value of Self Assessed General, Physical, and Mental Health on Functional Decline and Mortality in Older Adults. J Epidemiol Community Health. 2000;54(2):123-129. https://doi.org/10.1136/jech.54.2.123.
Takahashi S, Tanno K, Yonekura Y, et al. Poor Self-rated Health Predicts the Incidence of Functional Disability in Elderly Community Dwellers in Japan: a Prospective Cohort Study. BMC Geriatr. 2020;20(1):328. https://doi.org/10.1186/s12877-020-01743-0.
Idler EL, Benyamini Y. Self-rated health and mortality: a review of twenty-seven community studies. J Health Soc Behav. 1997;38(1):21-37.
McLaughlin D, Leung J, Pachana N, Flicker L, Hankey G, Dobson A. Social Support and Subsequent Disability: It Is Not the Size of Your Network That Counts. Age Ageing. 2012;41(5):674-677. https://doi.org/10.1093/ageing/afs036.
Minkler M, Fuller-Thomson E, Guralnik JM. Gradient of Disability across the Socioeconomic Spectrum in the United States. N Engl J Med. 2006;355(7):695-703. https://doi.org/10.1056/NEJMsa044316.
Zhong Y, Wang J, Nicholas S. Gender, Childhood and Adult Socioeconomic Inequalities in Functional Disability Among Chinese Older Adults. Int J Equity Health. 2017;16(1):165. https://doi.org/10.1186/s12939-017-0662-3.
Gjonça E, Tabassum F, Breeze E. Socioeconomic Differences in Physical Disability at Older Age. J Epidemiol Community Health. 2009;63(11):928-935. https://doi.org/10.1136/jech.2008.082776.
Chen T, Honda T, Chen S, Kishimoto H, Kumagai S, Narazaki K. Potential Utility of Physical Function Measures to Improve the Risk Prediction of Functional Disability in Community-Dwelling Older Japanese Adults: a Prospective Study. BMC Geriatr. 2021;21(1):476. https://doi.org/10.1186/s12877-021-02415-3.
Guralnik JM, Ferrucci L, Simonsick EM, Salive ME, Wallace RB. Lower-Extremity Function in Persons over the Age of 70 Years as a Predictor of Subsequent Disability. N Engl J Med. 1995;332(9):556-561. https://doi.org/10.1056/NEJM199503023320902.
Ramspek CL, Steyerberg EW, Riley RD, et al. Prediction or Causality? A Sco** Review of Their Conflation Within Current Observational Research. Eur J Epidemiol. 2021;36(9):889-898. https://doi.org/10.1007/s10654-021-00794-w.
Santos-Eggimann B, Cuénoud P, Spagnoli J, Junod J. Prevalence of Frailty in Middle-Aged and Older Community-Dwelling Europeans Living in 10 Countries. J Gerontol Ser A. 2009;64A(6):675-681. https://doi.org/10.1093/gerona/glp012.

Download references

Acknowledgements

Contributors: YL performed the statistical analyses. KS conceived the study design and drafted the manuscript. KK and NK collected the data. LY, MN, HM, KK, and NK interpreted the results and revised the manuscript. All authors approved the manuscript for publication and agreed to be accountable for all aspects of this work. Additionally, we thank Toshihiro Hayashi for his support.

Funding

This study used data from the JAGES, which was supported by Japan Society for the Promotion of Sciences (15H01972, 20K18931, 23H03164), Japanese Ministry of Health, Labor and Welfare (H28-Choju-Ippan-002), Japan Agency for Medical Research and Development (JP17dk0110017, JP18dk0110027, JP18ls0110002, JP18le0110009), and National Center for Geriatrics and Gerontology (29–42). The funders played no role in the design and conduct of the study; management, analysis, and interpretation of the data; preparation, review, or approval of the manuscript; or the decision to submit the manuscript for publication.

Author information

Yongjian Lu and Koryu Sato are equal contributors.

Authors and Affiliations

Tokyo, Japan
Lu Yongjian PhD
Department of Social Epidemiology, Graduate School of Medicine and School of Public Health, Kyoto University, Kyoto, Japan
Koryu Sato MPH & Naoki Kondo PhD
Department of Hygiene and Public Health, Osaka Medical and Pharmaceutical University, Osaka, Japan
Masato Nagai PhD
Mitsubishi Research Institute, Inc., Tokyo, Japan
Hirokazu Miyatake MS
Department of Social Preventive Medical Sciences, Center for Preventive Medical Sciences, Chiba University, Chiba, Japan
Katsunori Kondo PhD
Department of Gerontological Evaluation, Center for Gerontology and Social Science, Research Institute, National Center for Geriatrics and Gerontology, Aichi, Japan
Katsunori Kondo PhD

Authors

Lu Yongjian PhD
View author publications
You can also search for this author in PubMed Google Scholar
Koryu Sato MPH
View author publications
You can also search for this author in PubMed Google Scholar
Masato Nagai PhD
View author publications
You can also search for this author in PubMed Google Scholar
Hirokazu Miyatake MS
View author publications
You can also search for this author in PubMed Google Scholar
Katsunori Kondo PhD
View author publications
You can also search for this author in PubMed Google Scholar
Naoki Kondo PhD
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Koryu Sato MPH.

Ethics declarations

Conflict of Interest:

The authors declare that they do not have a conflict of interest.

Additional information

Publisher's Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Prior presentations: None.

Supplementary Information

Below is the link to the electronic supplementary material.

Supplementary file1 (DOCX 2559 KB)

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/.

Reprints and permissions

About this article

Cite this article

Lu, Y., Sato, K., Nagai, M. et al. Machine Learning–Based Prediction of Functional Disability: a Cohort Study of Japanese Older Adults in 2013–2019. J GEN INTERN MED 38, 2486–2493 (2023). https://doi.org/10.1007/s11606-023-08215-2

Download citation

Received: 29 September 2022
Accepted: 18 April 2023
Published: 01 May 2023
Issue Date: August 2023
DOI: https://doi.org/10.1007/s11606-023-08215-2

KEY WORDS

Use our pre-submission checklist

Avoid common mistakes on your manuscript.

Machine Learning–Based Prediction of Functional Disability: a Cohort Study of Japanese Older Adults in 2013–2019