Introduction

Dry eye (DE) is a multifactorial ocular surface disorder1,2 causing impaired visual function, ocular discomfort, decreased quality of life, and reduced work productivity3,4,5,6. Its current worldwide prevalence is 5%–50%, which is expected to increase due to an aging society and increased digital work7,8,9. DE is characterized by a disruption in the tear-film homeostasis, including tear-film instability, hyperosmolarity, ocular inflammation, and subsequent damage of the ocular surface1. However, current treatment focuses on symptomatic relief ex post facto, with no convincing preventative or curative therapies6. Additionally, due to its wide heterogeneity and diverse symptom profiles, including photophobia, eyestrain, and reduced visual acuity1, symptoms are often overlooked as non-specific complaints, hindering optimal treatment selection for varying presentations. Therefore, predictive, preventive, personalized, and participatory (P4) medicine10,11,12 may facilitate the prevention of the onset and progression of DE via management optimization12. Subjective symptom measurement is the diagnostic standard in DE2,6, via validated questionnaires with a cutoff score13. However, this approach for DE diagnosis13,14,15 ignores various presentations and etiologies that could optimize treatment, which is further complicated by its multifactorial nature that includes environmental, host contributory, and lifestyle factors7,9,16,17. Therefore, monitoring individual subjective symptoms and contributory factors to elucidate and stratify the heterogeneous disease phenotype is needed for optimizing personalized treatments and managing personal risk factors10,12,18.

Mobile health (mHealth) refers to both medical and medical-care-supporting activities performed using smartphones19. The digital phenotype was first defined in 2016 as the “moment-by-moment quantification of the individual-level human phenotype in situ using data from personal digital devices,” which encompasses all data generated through an individual’s interaction with digital technologies. This includes, but is not limited to the data collected through embedded global positioning system (GPS), accelerometers, and photoplethysmography (pulse oximeters) in modern devices which allow instantaneous in situ measurements of human phenotypes20. Measurements collected through modern smart device-embedded sensors, along with patient-reported outcomes (PRO), are potential clinically relevant digital phenotypes21. mHealth’s novel features of collecting individual subjective symptoms, lifestyle data, and electronic PROs may prove useful as routinely collected digital phenotypes22.

We used “DryEyeRhythm (Fig. 1a),”—a free in-house application for DE—to longitudinally observe DE symptoms, related behaviors, and lifestyles, highlighting mHealth as an essential platform for digital phenotype data by identifying individual risk factors and DE characteristics (Fig. 1b)9,17,23,24. mHealth elucidates DE’s diversity and heterogeneity by stratifying multidimensional integrated data on heterogeneous symptoms, individual lifestyle factors, and biosensor data. mHealth, with the advancements made in mobile smart devices, allows for a non-intrusive, real-time, continuous collection of lifestyle data, and individual digital information which remains a challenge in traditional research methodologies17. Additionally, its capability for the comprehensive data collection on each individual, as well as subsequent personalized medical advice for the patient and public, have great implications in actuating all four pillars of P4 medicine10,12. We conducted a large-scale crowdsourced cross-sectional study using our DryEyeRhythm app to develop a robust method to stratify and visualize individual, heterogenetic subjective DE symptoms. Through advancing digital phenoty** for highly heterogeneous and diverse diseases, we attempt to lay a foundation for future P4 medicine10,25,26 in DE management by establishing an individual digital phenoty** protocol.

Fig. 1: Screenshot and the description of user experience of DryEyeRhythm.
figure 1

(a) Screenshots of DryEyeRhythm (b) Description of the user experience of DryEyeRhythm. The copyright permission of the logo was obtained from Juntendo University, Tokyo, Japan.

Results

Application description and study enrollment

The DryEyeRhythm app was used to conduct the study. DryEyeRhythm’s key performance indicators between November 2, 2016, and September 30, 2019, are shown in Fig. 2(a–c). The total impression number was 16,161,564 times during this period (Fig. 2a). DryEyeRhythm was downloaded 22,810 times (Fig. 2b), with 28,408 total sessions (Fig. 2c). Study enrolment is presented in Fig. 2d; 35,218 records were identified in our crowd database, and 3,593 individuals completed the survey and were included. Supplementary Table 1 shows the sensitivity analysis between the included and excluded participants. Figure 2e shows the geographic distribution of the participants in Japan.

Fig. 2: Study cohort description.
figure 2

Specific key performance indicators for DryEyeRhythm are shown in (a) impression times, (b) download times, and (c) session times. (d) Enrollment process of this study. (e) Geographic distribution of included participants (n = 3,593).

Participant characteristics

Participants were DryEyeRhythm app users in Japan. Participant demographics are shown in Supplementary Table 2 (median age: 27 years; women: 59.8%). Symptomatic DE was defined as a total score ≥13 in the Japanese version of the Ocular Surface Disease Index (J-OSDI)15, which identified 2,619 (72.9%) individuals. Individuals with symptomatic DE tended to be younger (symptomatic DE vs. non-symptomatic DE; 26 years vs. 30 years; P < 0.001), female (65% vs. 45.8%; P < 0.001), have hay fever (52.3% vs. 45.9%; P < 0.001), and have a past DE diagnosis (28.8% vs. 14.5%; P < 0.001), compared with patients with non-symptomatic DE.

Symptom-based stratification

DE’s subjective symptoms were assessed using the 12-item J-OSDI questionnaire, and the participants were classified into two groups: non-symptomatic DE (J-OSDI total score <13) and symptomatic DE (J-OSDI total score ≥13). Our symptom-based stratification of the 12 J-OSDI items from 2,619 symptomatic individuals individually stratified DE’s heterogeneous symptoms (Fig. 3a). Seven clusters were determined using eigengaps of the normalized affinity matrix (Fig. 3b)27. Fig. 3c depicts a Uniform Manifold Approximation and Projection (UMAP) plot7,9. However, a one-size-fits-all approach has been predominant, and minimal efforts have been made to personalize regimens according to individual factors12. Increasing reports and discussion regarding P4’s validity suggest preemptive and personalized healthcare both increase treatment and cost-effectiveness26, which necessitates a comprehensive dataset on patients’ subjective symptoms, lifestyle factors, and their environment. mHealth enables remote, longitudinal collection of various personalized data, including biosensor data, and enables participatory medical research through facilitating user comprehension, consent, and feedback9,10,12,17,23,24,31. The widespread prevalence of smartphones32 underscores mHealth’s potential to revolutionize the current facility-based healthcare by re-introducing it within free-living settings. Digital phenoty** and interventions gather vast comprehensive data on the user population and retain personalized aspects, allowing providers selective data application33. Additionally, mHealth facilitates disease monitoring through its PRO-integrated digital phenoty** as a remote diagnostic and interventional tool.

We investigated the effectiveness of smartphone applications as a digital phenoty** platform and were the first to explore MBI and blink frequency as digital phenotypes. Previous reports demonstrated a positive correlation between the MBI and the tear-film break-up time (TFBUT), and concomitant usage of the J-OSDI and MBI demonstrated potential as a non-invasive substitute29,34. In this study, symptomatic DE displayed a significant decrease in MBI, particularly in severe subgroups, suggesting MBI measurement could be highly specific in screening severe DE.

Effective DE-stratification strategies based on multi-dimensional symptom analyses and lifestyle factors were investigated. A dimension reduction of the aggregate DE subjective symptoms identified seven clusters of DE with distinct biological and behavioral profiles. Notably, clusters 2, 5, and 6 correlated with younger age, supporting the association between DE and aging7. However, our results may explain observed discrepancies in prior reports7,17. Traditional facility-based studies largely collected data in an equipped research facility—hospitals or otherwise—which led to a heavy reliance on participants to travel to a dedicated location. This inevitably excluded the younger, working-group population from participation, creating an inevitable selection bias35. However, a smartphone-based study enables a closer look at younger age groups9,17,23,24,31, which enables more robust hierarchical analyses to verify new disease subtypes. Unlike the traditional “cutoff-score” diagnoses2,36, this methodology stratifies groups of individual criteria, creating clusters of an originally singular disease to elucidate patterns of disease heterogeneity. For example, while cluster 1 displayed aggravation of all 12 symptoms, cluster 5 displayed vulnerability to environmental triggers (Fig. 3e and f). Recent studies showed an increasing prevalence of individuals with a short TFBUT-type DE, likely in part due to the aging and digital society37. The observed shortened MBI with higher J-OSDI total score in cluster 1, 2, and 5 show consistency with the previously reported short TFBUT-type DE, and the early identification of these subtypes may aid in guiding the appropriate tear-film oriented therapy38.

Modifiable risk factors could also be assessed and expeditiously addressed with interventions through smartphone applications. We demonstrated that a multidimensional data-driven analysis may aid in providing effective treatment by stratifying disease characteristics and tailoring the regimen according to the specific disease subgroup. For patients with similarities to cluster 1 with severe symptoms, individualized interventions to minimize pollen exposure, contact lens wear, on-screen time, smoking, and depression-related symptoms could be initiated (Fig. 6). With more studies reporting the connection between DE symptom aggravation and major depressive disorder, concomitantly monitoring mood-related symptoms may be helpful24. Previous studies support the value of digital phenoty** for identifying at-risk individuals, and accessible behavioral treatments through mHealth, either as a stand-alone or coached intervention39,40, may be valuable for the DE population41. mHealth dismantles the monolithic approach through early identification of a patient’s disease characteristics as the first step in patient-specific intervention.

mHealth yields high-volume, multidimensional integrative data, inherently difficult to analyze and interpret due to non-uniformity and noise. In part, the noise stems from the subjectivity and response bias that results from the reliance on self-reporting. Dimensional reduction greatly improves data interpretability by data-clustering on simple topological manifolds, with subsequent per-cluster analyses and visualization unveiling unique characteristics33,45. The role of mHealth technology has been gaining global attention, evidenced by the “Be He@lthy, Be Mobile (BHBM)” initiative of the World Health Organization and position papers of renowned journals, including the American College of Allergy and Asthma and Immunology (ACAAI) and the European Academy of Allergy and Clinical Immunology (EAACI)46,47,48. As a notable example, a recent study demonstrated the effectiveness of an mHealth-driven stratification technique on the heterogeneous symptoms of hay fever in creating a foundation for individualized therapeutic interventions18. mHealth is positioned to seamlessly fit within the lifestyle of the global population, making it ideal in actualizing healthcare within one’s life and providing early identification and intervention of diseases.

Our study has several limitations. First, this cohort contains age-related selection bias, socioeconomic factors, and educational level, and its initial release was solely on the iOS platform9. Aging is associated with lower rates of smartphone use, partially due to cognitive changes and reduced fine motor control49. However, the major cause of this barrier was a lack of training, underscoring the social efforts needed to introduce modern devices to older populations. In addition, design guidelines should address older adults’ diminishing cognition skills, physical ability, and motivational barriers50. Regarding our study, an Android version was released on September 3, 2020, and socioeconomic and educational surveys were added to track their influences on the results. Nonetheless, to minimize any potential bias, both iOS and Android-supported apps should be used from inception for future studies. Additionally, health-seeking behavior may have limited this study’s generalizability, as participants who actively participated in the study were more likely to have DE-related symptoms. Additionally, the demographics of our participants suggest that the study results are more applicable to the younger population, likely due to the nature of an mHealth study. Therefore, careful external validation may be needed to correlate this study’s findings to previous studies that are based on the older population7. Second, self-administered questionnaires may introduce self-reporting bias. However, the J-OSDI questionnaire was validated through paper-based and DryEyeRhythm-based questionnaires; therefore, the results of self-administered questionnaires were deemed valid9. Third, this study identified symptomatic DE solely based on the J-OSDI questionnaire and without clinical examinations. A previous study reports that a considerable discrepancy may exist between subjectively reported symptoms and objectively measured clinical findings of DE51. Therefore, this study may be susceptible to false-positive findings, and the authors ask the readers to be mindful of the magnitude of self-reported data in this study when interpreting the results. However, recent reports suggest that MBI measurement is an effective non-invasive alternative to tear-film break-up time, and smartphone camera-based blink biosensors may be a valid DE diagnostic tool29,34. Finally, regarding sample size, this study was based on the best available sample size collected to date. Some stratified groups were small in size and, therefore, some variables might not have been identified as statistically significant.

While addressing the existing biases, future studies should assess the scalability, applicability, and practicality of mHealth by further extracting key digital phenotypes and demonstrating their value for other diseases. While considerable efforts have been made in numerous fields, including psychiatry33, timely integration of mHealth may be crucial for equity in healthcare regardless of specialty, especially during a pandemic. Support for diverse operating systems, simplification of the user interface, population training, and medical data security are all topics that require thorough discussion. With increasing evidence, including our own, of the value offered by mHealth instituting non-traditional methodologies, emphasis on broad usage and effective implementation may be paramount for a prompt uptake of mHealth and actualization of P4 medicine.

In conclusion, we developed a smartphone-based digital phenoty** strategy for heterogeneous DE presentations. This sequence of data collection and analysis that allows for stratification and visualization of diseases might be valuable in unveiling their heterogeneity and fundamental pathology and increasing the quality of healthcare through establishing a robust foundation for P4 medicine.

Methods

DryEyeRhythm smartphone application

The DryEyeRhythm app9 was initially developed using Apple Inc.’s (Cupertino, CA, USA) open-source framework, ResearchKit52. The DryEyeRhythm app was released in February 2016 for iOS and September 2020 for Android under a consignment contract with Juntendo University Graduate School of Medicine, Tokyo, Japan, and Inno**, Inc., Tokyo, Japan. It is freely available on Apple’s App Store and Google Play.

Study design and participants

This cross-sectional crowdsourced study was conducted using the iOS version of DryEyeRhythm (Fig. 1a). This study was conducted between November 2, 2016, and September 30, 2019. Previous studies have reported the enrollment process9,17,23,24. Key performance indicators, including impressions, downloads, and sessions, were measured using App Analytics within the App Store Connect. Electronic informed consent for participation was obtained from all users. We included DryEyeRhythm users in Japan, excluding duplicate and incomplete user data. This study was approved by the Independent Ethics Committee of Juntendo University Faculty of Medicine (approval numbers: 16-078 and 16-152) per the Declaration of Helsinki. The methodology and results of this survey are reported as per the CHERRIES reporting guidelines53.

User-data collection

Figure 1b depicts the DryEyeRhythm’s user experience. This method initially collects electronic informed consent for participation after explaining the study’s nature and possible consequences. The data collection parameters included questions about demographics, medical history, lifestyle questionnaire, blink sensing, and daily subjective symptoms (Supplementary Table 6)9,17,23,24. Participants also reported daily subjective symptoms (Supplementary Table 5) and answered, among others, the J-OSDI and the Zung SDS questionnaires14,15 for DE and depression54, respectively (Supplementary Tables 7 and 8).

Blink sensing using a blink-detection application programming interface

Blinking was measured with the users’ own smartphone cameras and CIFaceFeature in the iOS interface for facial detection (available at https://developer.apple.com/documentation/coreimage/cifacefeature). Blinking was defined as opening both eyes followed by their closure. MBI was defined as the maximum inter-blink duration per 30-s trial29,34.

Symptomatic DE ascertainment

DE’s subjective symptoms were assessed using the 12-item J-OSDI questionnaire (Supplementary Table 7) with three subscales: ocular symptoms, visual functioning, and environmental triggers14. Each response was recorded on a five-point scale [“None of the time” (0) / “All of the time” (4)], with “N/A” for questions not applicable to the user. The total score was reported on a 100-point scale, determining DE symptom severity (0–12: normal; 13–22: mild; 23–33: moderate; and 33–100: severe). The J-OSDI is validated in Japan as the Japanese version of the OSDI15; the DryEyeRhythm-based J-OSDI is validated based on the paper-based J-OSDI9.

We classified the study participants into two groups17: non-symptomatic DE (J-OSDI total score <13) and symptomatic DE (J-OSDI total score ≥13).

Zung SDS

The internationally used 20-item SDS evaluates depression symptoms (Supplementary Table 8)54 and is validated in Japan55,56. Each item is rated on the following 4-point Likert scale: “A little of the time,” “Some of the time,” “Good part of the time,” and “Most of the time.” Total scores range from 20 to 80; a score ≥40 suggests depression57.

Symptom-based stratification

Among symptomatic individuals with DE, normalized maximum eigengap values estimated the number of clusters during spectral clustering27. The UMAP with spectral clustering for dimension reduction was used to depict the stratification of subjective symptoms of symptomatic DE based on the 12 J-OSDI questions. UMAP was performed by the umap-learn python package (version 0.4.6)28.

Hierarchical clustering heatmap

A hierarchical clustering heatmap was constructed using the matplotlib module (version 0.9.0, Python 3). Stratified clusters identified by the UMAP are shown on the left side of the heatmap as a color bar. The dendrograms’ clustering for each of the J-OSDI’s 12 items is shown on the top of the heatmap. Heatmaps are colored by the score of the J-OSDI items (0–4) with a color bar legend at the left top.

Chord diagram

A chord diagram illustrates inter-relationships between each J-OSDI item in each cluster. Each of the 12 items was assigned to one of eight groups for illustration. Colors are based on the participants’ clustered group. Lines represent the connections between the participants’ clustered group (at the base of the lines in the lower half of the circle) and the J-OSDI items (at the base of the lines in the upper half of the circle). Line thickness represents the participants’ number. The Chord diagrams are colored by the threshold score of the J-OSDI items (0–4) with a color bar legend at the right. Plots were created in R version 4.0.3 (2020-10-10) using the circlize package (version 0.4.11)58.

Co-occurrence network analysis

Co-occurrence network analysis found cluster-based significant correlations between each J-OSDI item. Co-occurrence network analyses were conducted with the wTO metric, normalizing all shared correlations between a pair of parameters. The wTO metrics were calculated using R version’s 4.0.3 wTO package (2020-10-10). All item networks were constructed using Pearson correlation using 250 bootstraps59. Networks were filtered for a Benjamini and Hochberg adjusted P < 0.001. Significant links were visualized using the NetVis function in the wTO package.

In the co-occurrence network analysis representations, nodes (12 items of the J-OSDI) are represented as circles; node links (edges) are represented as lines. Each node size is proportional to the frequency of each J-OSDI-defined DE symptom. Edges show pairwise comparison strength for each J-OSDI item domain interrelation. A thicker edge across all networks implies a stronger association, facilitating strength comparison. Purple and green edges represent positive and negative interconnections, respectively.

Multivariate logistic regression analysis

Multivariable-adjusted logistic regression analyses identified risk factors for each cluster among symptomatic versus non-symptomatic DE (J-OSDI total score <13), and among symptomatic DE versus data of all participants. The covariates were selected based on previously established methodology7,9,23,24. The Circos plot of risk factors for each cluster among the symptomatic DE versus the data of all participants was visualized using the EpiCircos (available at https://github.com/mattlee821/EpiViz)30.

Statistical analysis

Participant characteristics were compared between symptomatic and asymptomatic DE and each of their cluster. Continuous variables are presented as median (interquartile range [IQR]) for non-normally distributed factors, based on Shapiro-Wilk tests; categorical variables are presented as a percentage. We conducted χ2 and Mann-Whitney U tests for categorical and non-normally distributed continuous variables.

All data were analyzed with STATA version 15 (Stata Corp, College Station, TX, USA). Moreover, a heatmap for the geographic distribution of the participants in Japan (Fig. 2e) was constructed using the matplotlib module (version 0.9.0, Python 3).

Reporting summary

Further information on research design is available in the Nature Research Reporting Summary linked to this article.