1 Introduction

In recent years, the prevalence of chronic diseases has escalated worldwide, exerting a profound impact on healthcare systems. To address the challenges posed by these conditions, innovative healthcare delivery models, such as telerehabilitation, have emerged, offering customized programs that extend beyond traditional hospital settings.

Traditional home-based rehabilitation heavily relies on the exercise programs prescribed by physiotherapists, yet patient adherence to these regimens often falls short [1]. On the other hand, telerehabilitation involves consistent monitoring of patients’ health status and progress, aligning closely with treatment objectives and individual expectations [2, 3].

To ensure the effectiveness and safety of rehabilitation programs provided at distance, it is imperative to maintain comparable levels of supervision to those provided by in-person therapists. This necessitates the monitoring of both postural alignment and kinematics. An emerging field in the realm of rehabilitation technologies is dedicated to the advancement of digital biofeedback systems [4, 5]. These systems utilize external sensors to collect motion data, which is then meticulously analyzed and conveyed to the user, with the primary aim of delivering support and motivation throughout the home-based rehabilitation process. Widely adopted approaches often rely on wearable and/or vision-based systems to monitor patients during exercises, offering a cost-effective solution for remote analysis of human motion.

The integration of motion capture system technology for recording subjects’ performance can yield reliable outcomes when coupled with specific and precise data processing algorithms. Machine learning algorithms have become increasingly prevalent for motion assessment, with the aim of providing comprehensive movement evaluation and feedback to ensure the accurate execution of gestures. Assistive systems play a vital role in supporting patients with in-home exercises [6, 7]. To be effective, assistive systems need to be adequate, affordable, and easily accessible, while also ensuring user engagement [8]. Furthermore, it is essential for these systems to assess patients’ performance and provide therapists with the necessary data to track progress and make informed clinical decisions [9, 10].

Prior research has delved into computerized approaches for at-home rehabilitation. These advancing systems frequently incorporate intricate interaction designs, providing both visual and auditory cues [8, 12].

The development of digital assistive systems for precise human motion evaluation has posed a formidable challenge in the field of artificial intelligence (AI) research [13]. This challenge is amplified by the diverse range of movements exhibited by individuals, variations in physical capabilities, and the inherent subjectivity involved in the algorithmic approaches used to assess human motion. Constructing an automated system capable of addressing these intricacies is a substantial undertaking, demanding a profound understanding of the diverse spectrum of human movements, physical abilities, and the subjective nature of human motion assessment [14,15,16].

While most existing computational approaches to automatic human motion assessment involve comparing exercise movements to predefined templates or ground truth, the goal for digital assistive rehabilitation systems is to accurately identify and count each exercise repetition to provide assistance and feedback [11, 17]. These systems must also categorize motion patterns as correct or incorrect, evaluating factors such as speed, direction, quality, and compensatory patterns to replicate the nuanced work of an in-person physical therapist [18].

Current repositories of rehabilitation datasets often suffer from limited scope and inadequate data modeling. Recognizing the crucial need for ongoing monitoring and support in rehabilitation, our study aims to overcome these limitations. Our research addresses the pressing demand for assistive evaluation systems for rehabilitation exercises. By harnessing artificial intelligence to analyze human motion, we propose annotating exercise motion phases within an existing dataset, serving a dual purpose in analysis for correction evaluation for patients and providing feedback, as well as quantitative analysis. This solution can be a valuable tool for patients, ensuring correct exercise performance and offering real-time feedback during sessions. We suggest adapting a previously published dataset to fit the requirements of a digital assistive system aimed at remotely aiding patients, essentially acting as a virtual physical therapist [19, 20]. This innovative approach involves meticulously labeling human movement data to ensure a more personalized and effective rehabilitation experience. By closely monitoring the accuracy of exercise repetitions and offering corrective feedback for any deviations in movement patterns, our adapted dataset enables systems to deliver tailored support to patients [18]. Through these additional annotations, we not only enable systems to track exercise repetitions effectively but also provide actionable insights to rectify any erroneous movement patterns, thus enhancing the effectiveness and customization of rehabilitation endeavors.

This paper underscores the pivotal importance of assessing the quality of human motion within the context of rehabilitation. The innovative rehabilitation data modeling presented here sets the stage for the development of more effective and personalized rehabilitation programs, thereby offering vital support to individuals on their path to recovery. In Sect. 2, we conduct a comparative analysis with other existing studies and datasets. Section 3 outlines the methodology employed for dataset modeling, Sect. 4 presents the dataset obtained, and finally, Sect. 5 and Sect. 6, provide concluding remarks that underscore the valuable contributions of this work.

2 Related study

Recent advancements in research have ventured into the realm of mathematical modeling and evaluation techniques for the analysis of human motion in the context of physical therapy. These innovative methods can be broadly categorized into three distinct areas: comparison, categorization, and scoring [21].

Comparison techniques involve assessing a patient's exercise performance by contrasting their motion sequences with predefined template motion sequences. These templates are typically derived from accurate executions of exercises by healthy individuals, clinicians, or patients under clinical supervision. Categorization methods, on the other hand, are designed to classify patient activities into discrete groups, such as correct and incorrect movement patterns. Scoring approaches aim to provide automated evaluations of a patient’s motion using personalized clinical or non-clinical scoring systems, utilizing both continuous and categorical scales.

Unlike comparative and scoring analyses, categorization methods play a crucial role in delivering real-time patient feedback and guidance during rehabilitation sessions. The selection of the most suitable computational technique for monitoring and assessing human movements significantly impacts the collection and organization of datasets.

Several datasets have been published in the field of rehabilitation research. Datasets like SPHERE [22], MSR-3D [23], and UTD-MHAD [24] have been compiled and used for monitoring applications, primarily offering graphical or statistical comparisons of patient actions without providing definitive patient evaluations. For example, the SPHERE dataset includes examples of both normal and physically impaired movements for activities like walking, ascending stairs, and sit-to-stand transitions. The KIMORE [25] and UI-PRMD [26] datasets have been designed to develop an evaluation system that assigns a score to exercise performance.

Conversely, other datasets like the Toronto Rehab Stroke Pose (TRSP) [27], Kinect 3D Active (K3Da) [28], and IntelliRehabDS (IRDS) [29] were created to classify movements into categorical classes. For instance, the TRSP dataset contains pose estimations of stroke patients and healthy subjects performing upper limb rehabilitation movements, annotated for regular and compensatory patterns of motion during an assisted-robot rehabilitation session. The IRDS dataset contains gestures of simple rehabilitation movements performed by both healthy and pathological subjects in various conditions (sitting or standing), labeled for correct and incorrect executions.

This work aims to describe a methodology for modeling a rehabilitation dataset that can be adopted to support AI-driven digital assistive systems, capable of assisting and monitoring home programs and providing feedback to improve patient performance. Given the limitations of currently available published datasets in terms of motion data modeling, a subset of the freely available IRDS was chosen for manipulation in alignment with our objectives. The goal is to implement a set of categorical classes of movements associated with the corresponding exercise phase, distinguishing each exercise repetition in finer terms of the quality of the motion pattern [30].

3 Materials and methods

3.1 Original dataset

In pursuit of the primary goal of this study to model a dataset supporting AI-driven digital assistive system, a subset of the freely available IRDS dataset underwent manipulation.

This selected subset encapsulates repetitions of shoulder movements executed by 29 subjects, comprising 15 patients and 14 healthy controls. Recorded using a Microsoft Kinect One sensor at 30 frames per second, the dataset provides 3D coordinates of 25 body joints and corresponding depth maps for each frame. Each movement is meticulously annotated with exercise type (shoulder flexion, shoulder abduction, shoulder forward elevation), the limb side involved (right, left, both sides), the subject’s position (sitting or standing), and the correctness of pattern motion (correct or incorrect). Shoulder flexion and abduction exercises comprises gesture performed separately with the left body side and the right body side, the shoulder forward elevation is simultaneously performed with both sides.

Table 1 summarizes for each exercise the gesture index.

Table 1 Description of gesture categories included in the new dataset

What sets the IRDS apart from other datasets is its inclusion of natural motions performed by both patients and healthy subjects, encompassing correct and incorrect patterns. Moreover, the dataset features patients with diverse pathological conditions, such as stroke and spinal cord injury, offering a valuable resource for enhancing the monitoring and assistance of upper limb rehabilitation across various populations, including those with neurological and musculoskeletal disorders.

While the current published form of the IRDS dataset serves as a valuable resource for categorizing exercises and assessing their correctness, it falls short of addressing the needs of digital assistive rehabilitation systems. These systems require a finer evaluation, recognizing repetitions of each exercise and assessing them for the completeness of range of motion (ROM) while analyzing incorrect patterns of motion, as suggested by recent works [31, 32]. The ultimate aim is to provide a dataset that facilitates the evaluation of whether an exercise is performed correctly according to its prescription or not.

This work outlines the methods employed to model the IRDS dataset, supporting a movement classification approach for the evaluation of human motions. This approach enables real-time feedback to patients regarding the performance of their prescribed shoulder rehabilitation exercises, allowing for a more detailed analysis of each exercise repetition [30].

3.2 Data modeling

The dataset aims to provide a comprehensive evaluation of ROM and compensatory movement patterns exhibited by patients during shoulder exercise rehabilitation sessions.

To enhance the granularity of the analysis, we associated categorical classes of ROM with specific exercise phases in an individual’s performance. This enables a detailed examination of each movement, considering both the completeness and correctness of its execution.

In the IRDS dataset, each frame, capturing both correct and incorrect movements, underwent labeling for ROM execution classes. An expert in human physiology assessed depth images to assign classes such as complete concentric phase (up), complete eccentric phase (down), incomplete eccentric phase (incomplete down), and incomplete concentric phase (incomplete up). These labels serve three main objectives in rehabilitation evaluation: segmenting exercise routines into repetitions, assessing each repetition’s completion status and providing feedback for performance enhancement, indicating which phase of the repetition is considered invalid-concentric, eccentric, or totally incomplete.

Furthermore, incorrect movements were labeled to distinguish various classes of ROM, allowing for a nuanced evaluation of repetitions not only in terms of motion completeness but also in terms of the percentage of correct and incorrect patterns. This approach provides insights into the quality of motion and facilitates tracking advancements during the rehabilitation process.

The methodology is based on the hypothesis that separating the evaluation of ROM and the recognition of compensatory movements into distinct stages can enhance the accuracy of AI-assisted rehabilitation systems. This separation reduces the complexity of each task by minimizing the number of classes, thereby potentially improving overall system performance.

Formally, we consider a set of input–output pairs \(D=\{(x_i,y_i)\}_{i=1}^N\) where \(x_i \in R^n\) are the N samples of the input feature space obtained from the 3D video motion analysis and \(y_i \in C = \{C_j \vert 1 \le j \le j\}\) are the class of labels to which these features correspond. The evaluation look for a decision function \(f:R^n \rightarrow C\) which given a sample, that in this work contains features of body joints’ 3D pose data, determines the output class that includes the kind of execution performed: \(x \rightarrow y = f(x,w)\). Throughout the following, let \(X(z) = \{X_t, m_t, c_t \}\) denote a labeled sequence of N frames of the considered set of the three shoulder exercises \(z \in \{ S1, S2, S3 \}\), where \(X_t\) denotes the acquired RGB-D image at frame t, and \(m_t\) represents the ROM phase of the exercise execution. Moments of a given exercise comprise both the key ROM phases of the exercise, resulting as \(m_t \in C = \{C_j \vert 1 \le j \le 4\}\). Moreover, \(c_t \in C = \{C_j \vert 0 \le j \le 2\}\) denoting whether the t frame belongs either to a physiological or compensatory motion pattern class.

4 Results

The manipulated dataset contains 1488 files for shoulder rehabilitation movements, following the structure of the original dataset.

Each file is associated with i) subject ID, ii) gesture type, iii) number of repetitions, and iv) label of correct movement and incorrect movements.

The nomenclature of the files is as follows:

$$\begin{aligned} SubjectID.GestureType.RepetitionNo.CorrectMovement.Position.csv \end{aligned}$$
(1)

The data contents of each file, presented in csv format, can be described as follows: (i) each file contains n frames, (ii) each frame contains spatial information of 25 joints along the three axes (x, y, z), iii) each frame contain temporal information of ROM phase execution.

The ROM phase executions are available along with depth image files. Figure 1 displays a chronological sequence of motion for ROM classes, exemplifying their temporal order in execution.

Fig. 1
figure 1

A chronological sequence of left-side shoulder flexion, labeled within the context of ROM classes, captured in depth images of Patient ID 215

The comprehensive dataset composition is outlined in Table 2, providing a detailed breakdown of ROM classes for both the healthy and patient categories.

Table 2 Description of the new dataset in terms of samples for each ROM class category along the various gesture types

The distribution of data for each ROM class was systematically examined within each patient and healthy subject category for various gesture types. Figure 2 depicts a bar plot that distinguishes between patient and healthy category IDs for each gesture type.

Fig. 2
figure 2

Distribution of ROM classes for gesture type and healthy and patients category of subjects

5 Discussion

Ensuring the effectiveness of at-home rehabilitation programs relies on providing both qualitative and quantitative evaluations of exercises to patients. Recent advancements in machine learning and computer vision techniques have sparked increased interest in automating the assessment of rehabilitation exercises [

Data availability

Not applicable.

Code availability

Not applicable.

References

  1. Komatireddy R, Chokshi A, Basnett J, Casale M, Goble D, Shubert T. Quality and quantity of rehabilitation exercises delivered by a 3-d motion controlled camera: a pilot study. Int J Phys Med Rehabil. 2014. https://doi.org/10.4172/2329-9096.1000214.

    Article  Google Scholar 

  2. Rogante M, Grigioni M, Cordella D, Giacomozzi C. Ten years of telerehabilitation: a literature overview of technologies and clinical applications. NeuroRehabilitation. 2010;27(4):287–304.

    Article  Google Scholar 

  3. Hailey D, Roine R, Ohinmaa A, Dennett L. Evidence of benefit from telerehabilitation in routine care: a systematic review. J Telemed Telecare. 2011;17(6):281–7.

    Article  Google Scholar 

  4. Giggins OM, Persson UM, Caulfield B. Biofeedback in rehabilitation. J Neuroeng Rehabil. 2013;10(1):1–11.

    Article  Google Scholar 

  5. Huang H, Wolf SL, He J. Recent developments in biofeedback for neuromotor rehabilitation. J Neuroeng Rehabil. 2006;3(1):1–12.

    Article  Google Scholar 

  6. Simard PY, Steinkraus D, Platt JC. et al. Best practices for convolutional neural networks applied to visual document analysis. In: Icdar, vol. 3, 2003. Edinburgh

  7. Krizhevsky A, Sutskever I, Hinton GE. Imagenet classification with deep convolutional neural networks. Commun ACM. 2017;60(6):84–90.

    Article  Google Scholar 

  8. Ma L, Jia X, Sun Q, Schiele B, Tuytelaars T, Van Gool L. Pose guided person image generation. Advances in neural information processing systems. 2017;30.

  9. Balakrishnan G, Zhao A, Dalca AV, Durand F, Guttag J. Synthesizing images of humans in unseen poses. In: Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2018; 8340–8348.

  10. Shorten C, Khoshgoftaar TM. A survey on image data augmentation for deep learning. J Big Data. 2019;6(1):1–48.

    Article  Google Scholar 

  11. Perez L, Wang J. The effectiveness of data augmentation in image classification using deep learning. ar**v preprint ar**v:1712.04621, 2017.

  12. Lee M, Siewiorek D, Smailagic A, Bernardino A, Badia S. Enabling AI and robotic coaches for physical rehabilitation therapy: iterative design and evaluation with therapists and post-stroke survivors, 2021.

  13. Cust EE, Sweeting AJ, Ball K, Robertson S. Machine and deep learning for sport-specific movement recognition: a systematic review of model development and performance. J Sports Sci. 2019;37(5):568–600.

    Article  Google Scholar 

  14. Pereira A, Folgado D, Cotrim R, Sousa I. Physiotherapy exercises evaluation using a combined approach based on semg and wearable inertial sensors. In: Biosignals, pp 73–82, 2019.

  15. Gauthier LV, Kane C, Borstad A, Strahl N, Uswatte G, Taub E, Morris D, Hall A, Arakelian M, Mark V. Video game rehabilitation for outpatient stroke (vigorous): protocol for a multi-center comparative effectiveness trial of in-home gamified constraint-induced movement therapy for rehabilitation of chronic upper extremity hemiparesis. BMC Neurol. 2017;17(1):1–18.

    Article  Google Scholar 

  16. Maciejasz P, Eschweiler J, Gerlach-Hahn K, Jansen-Troy A, Leonhardt S. A survey on robotic devices for upper limb rehabilitation. J Neuroeng Rehabil. 2014;11(1):1–29.

    Article  Google Scholar 

  17. Difini GM, Martins MG, Barbosa JLV. Human pose estimation for training assistance: a systematic literature review. In: Proceedings of the Brazilian Symposium on Multimedia and the Web, pp 189–196, 2021

  18. Mennella C, Maniscalco U, De Pietro G, Esposito M. A deep learning system to monitor and assess rehabilitation exercises in home-based remote and unsupervised conditions. Computers Biol Med. 2023;166: 107485.

    Article  Google Scholar 

  19. Sardari S, Sharifzadeh S, Daneshkhah A, Nakisa B, Loke SW, Palade V, Duncan MJ. Artificial intelligence for skeleton-based physical rehabilitation action evaluation: a systematic review. Computers Biol Med. 2023;158: 106835. https://doi.org/10.1016/j.compbiomed.2023.106835.

    Article  Google Scholar 

  20. Mennella, C., Maniscalco, U., De Pietro, G., Esposito, M.: The role of artificial intelligence in future rehabilitation services: a systematic literature review. IEEE Access 2023.

  21. Debnath B, O’Brien M, Yamaguchi M, Behera A. A review of computer vision-based approaches for physical rehabilitation and assessment. Multimedia Syst. 2022. https://doi.org/10.1007/s00530-021-00815-4.

    Article  Google Scholar 

  22. Paiement A, Tao L, Hannuna S, Camplani M, Damen D, Mirmehdi M. Online quality assessment of human movement from skeleton data British Machine Vision. Durham: BMVA press; 2014.

    Google Scholar 

  23. Li W, Zhang Z, Liu Z. Action recognition based on a bag of 3d points. In: 2010 IEEE Computer Society Conference on Computer Vision and Pattern Recognition-workshops, pp 9–14, 2010. IEEE

  24. Chen C, Jafari R, Kehtarnavaz N. Utd-mhad: A multimodal dataset for human action recognition utilizing a depth camera and a wearable inertial sensor. In: 2015 IEEE International Conference on Image Processing (ICIP), pp 168–172, 2015. IEEE

  25. Capecci M, Ceravolo MG, Ferracuti F, Iarlori S, Monteriu A, Romeo L, Verdini F. The kimore dataset: Kinematic assessment of movement and clinical scores for remote monitoring of physical rehabilitation. IEEE Trans Neural Syst Rehabil Eng. 2019;27(7):1436–48.

    Article  Google Scholar 

  26. Vakanski A, Jun H-P, Paul D, Baker R. A data set of human body movements for physical rehabilitation exercises. Data. 2018;3(1):2.

    Article  Google Scholar 

  27. Dolatabadi E, Zhi YX, Ye B, Coahran M, Lupinacci G, Mihailidis A, Wang R, Taati B. The toronto rehab stroke pose dataset to detect compensation during stroke rehabilitation therapy. In: Proceedings of the 11th EAI International Conference on Pervasive Computing Technologies for Healthcare, pp 375–381, 2017

  28. Leightley D, Yap MH, Coulson J, Barnouin Y, McPhee JS. Benchmarking human motion analysis using kinect one: An open source dataset. In: 2015 Asia-Pacific Signal and Information Processing Association Annual Summit and Conference (APSIPA), pp 1–7, 2015. IEEE

  29. Miron A, Sadawi N, Ismail W, Hussain H, Grosan C. Intellirehabds (irds)-a dataset of physical rehabilitation movements. Data. 2021;6(5):46.

    Article  Google Scholar 

  30. Keogh J, Lake J, Swinton P. Practical applications of biomechanical principles in resistance training: moments and moment arms. J Fitness Res. 2013;2:39–48.

    Google Scholar 

  31. Brennan L, Bevilacqua A, Kechadi T, Caulfield B. Segmentation of shoulder rehabilitation exercises for single and multiple inertial sensor systems. J Rehabil Assistive Technol Eng. 2020;7:2055668320915377. https://doi.org/10.1177/2055668320915377. (PMID: 32913661).

    Article  Google Scholar 

  32. Liao Yalin, Vakanski Aleksandar, **an Min. A deep learning framework for assessing physical rehabilitation exercises. IEEE Trans Neural Syst Rehabil Eng. 2020;28(2):468–77. https://doi.org/10.1109/TNSRE.2020.2966249.

    Article  Google Scholar 

  33. Liao Y, Vakanski A, **an M, Paul D, Baker R. A review of computational approaches for evaluation of rehabilitation exercises. Computers Biol Med. 2020;119: 103687. https://doi.org/10.1016/j.compbiomed.2020.103687.

    Article  Google Scholar 

  34. Chong KM. Privacy-preserving healthcare informatics: a review. ITM Web of Conferences 2021;36, 04005 https://doi.org/10.1051/itmconf/20213604005

  35. Cao T, Armin MA, Denman S, Petersson L, Ahmedt-Aristizabal D. In-bed human pose estimation from unseen and privacy-preserving image domains. In: 2022 IEEE 19th International Symposium on Biomedical Imaging (ISBI), pp 1–5, 2022. IEEE

  36. Ferreira B, Menezes P, Batista J. Transformers for workout video segmentation. In: 2022 IEEE International Conference on Image Processing (ICIP), pp 3470–3474, 2022. IEEE

  37. Mennella C, Maniscalco U, De Pietro G, Esposito M. Generating a novel synthetic dataset for rehabilitation exercises using pose-guided conditioned diffusion models: a quantitative and qualitative evaluation. Computers in Biology and Medicine. 2023; 107665.

  38. Gharaei N, Ismail W, Grosan C, Hendradi R. Optimizing the setting of medical interactive rehabilitation assistant platform to improve the performance of the patients: A case study. Artif Intell Med. 2021;120: 102151. https://doi.org/10.1016/j.artmed.2021.102151.

    Article  Google Scholar 

Download references

Acknowledgements

Ciro Mennella is enrolled in the National PhD in Artificial Intelligence, XXXVII cycle, course on Health and life sciences, organized by Università Campus Bio-Medico di Roma (Via Alvaro del Portillo 21, 00,128 Roma, Italy). We acknowledge financial support from the project PNRR MUR project PE0000013-FAIR.

Funding

Not applicable.

Author information

Authors and Affiliations

Authors

Contributions

Ciro Mennella: supervision, Writing —original draft, writing —review and editing, rormal analysis, conceptualization, methodology. Umberto Maniscalco: supervision, writing —review and editing, conceptualization, methodology. Giuseppe De Pietro: conceptualization, review and editing. Massimo Esposito: supervision, writing – review and editing, conceptualization.

Corresponding author

Correspondence to Ciro Mennella.

Ethics declarations

Ethics approval and consent to participate

Not applicable as this study was done using an open access online database. The original IRDS dataset by Miron et. al is available at: https://zenodo.org/record/4,610,859 [38]. The Miron et. al study [38] was conducted according to the guidelines of the Declaration of Helsinki and approved by the Institutional Review Board of Pusat Rehabilitasi Perkeso Melaka (16.06.2019).

Competing interests

Not applicable.

Additional information

Publisher's Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/.

Reprints and permissions

About this article

Check for updates. Verify currency and authenticity via CrossMark

Cite this article

Mennella, C., Maniscalco, U., Pietro, G.D. et al. Modeling rehabilitation dataset to implement effective AI assistive systems. Discov Artif Intell 4, 43 (2024). https://doi.org/10.1007/s44163-024-00130-7

Download citation

  • Received:

  • Accepted:

  • Published:

  • DOI: https://doi.org/10.1007/s44163-024-00130-7

Keywords