Area under the expiratory flow-volume curve: predicted values by artificial neural networks

Ioachimescu, Octavian C.; Stoller, James K.; Garcia-Rio, Francisco

doi:10.1038/s41598-020-73925-0

Area under the expiratory flow-volume curve: predicted values by artificial neural networks

Article
Open access
Published: 06 October 2020

Volume 10, article number 16624, (2020)
Cite this article

Download PDF

You have full access to this open access article

Scientific Reports

Area under the expiratory flow-volume curve: predicted values by artificial neural networks

Download PDF

Octavian C. Ioachimescu¹,
James K. Stoller² &
Francisco Garcia-Rio³

1995 Accesses
3 Citations
Explore all metrics

Abstract

Area under expiratory flow-volume curve (AEX) has been proposed recently to be a useful spirometric tool for assessing ventilatory patterns and impairment severity. We derive here normative reference values for AEX, based on age, gender, race, height and weight, and by using artificial neural network (ANN) algorithms. We analyzed 3567 normal spirometry tests with available AEX values, performed on subjects from two countries (United States and Spain). Regular linear or optimized regression and ANN models were built using traditional predictors of lung function. The ANN-based models outperformed the de novo regression-based equations for AEX_predicted and AEX z scores using race, gender, age, height and weight as predictor factors. We compared these reference values with previously developed equations for AEX (by gender and race), and found that the ANN models led to the most accurate predictions. When we compared the performance of ANN-based models in derivation/training, internal validation/testing, and external validation random groups, we found that the models based on pooling samples from various geographic areas outperformed the other models (in both central tendency and dispersion of the residuals, ameliorating any cohort effects). In a geographically diverse cohort of subjects with normal spirometry, we computed by both regression and ANN models several predicted equations and z scores for AEX, an alternative measurement of respiratory function. We found that the dynamic nature of the ANN allows for continuous improvement of the predictive models’ performance, thus promising that the AEX could become an essential tool in assessing respiratory impairment.

Estimation of Lung Properties Using ANN-Based Inverse Modeling of Spirometric Data

Normal spirometry prediction equations for the Iranian population

Article Open access 12 December 2022

Prediction of spirometry parameters of adult Indian population using machine learning technology

Article 24 February 2024

Introduction

Interpretation of pulmonary function testing by spirometry relies mainly on comparing measured volumes and flows with their predicted and lower limit of normal (LLN) values. These functional parameters are largely dependent on anthropometric characteristics such as race or ethnicity, gender, age, height and weight. Over the past 50 years, multiple equation sets have been developed and used, generally in separate nominal categories defined by gender and race^1,2,3,4,5. In current practice, for every lung function measurement (e.g., Forced Vital Capacity, FVC) or calculated variable, values smaller than the 5th percentile (or z scores < − 1.645) of gender and race-referenced healthy individuals define the LLN.

More than four decades ago, an analogue lung function index called the area under the maximum expiratory flow-volume curve (abbreviated AFVx) was computed and proposed for use by Vermaak et al.⁶. In order to describe functional abnormalities, a predicted AFVx based on age, gender and height was computed, and a measured to predicted AFVx ratio was assessed against other established lung function parameters. This ratio appeared to be a sensitive indicator of the degree of lung function impairment⁶. More recently, we published on the utility of a digital functional measurement called Area under Expiratory flow-volume loop (AEX)^7,8,9,10 and its approximations (AEX₁ through AEX₄, based on the instantaneous isovolumic flows at 25%, 50% and/or 75% of FVC, or FEF₂₅, FEF₅₀ and FEF₇₅, respectively)⁷ as global tools for diagnosis and severity stratification of respiratory functional impairment. The AEX_1–4 are good approximations of AEX, and they are especially relevant when the pulmonary function testing software does not provide the actual, measured AEX (as the integral function of flow by variable volume). It is currently unknown if constructs such as predicted AEX_1–4, which are derived from individual predicted volumes and flows, are useful as surrogates of AEX_predicted, since FEF₂₅, FEF₅₀ and FEF₇₅ tend to have high inter-test variability (or coefficients of variation), and thus wide confidence intervals for their predicted values. Several authors have also derived and published in the past linear regression-based predictive equations for normal AEX, based on subjects’ age, gender and/or height^6,11.

In this study, in order to define functional impairments by using AEX, we aimed to find a set of equations for AEX_predicted and its z scores (standard deviations) by using artificial neural networks (ANN). The ANN represent a modern computational methodology able to model more complex response surfaces and to circumvent limitations related to fixed equations, variable collinearities, non-gaussian distributions, wide variances and non-linear relationships between predictors. We performed analyses on two groups of normal spirometry tests, one originating from Cleveland, OH (USA), and one from the region of Madrid (Spain), and we compared this approach with optimized regression models using the same variables. The advantage conferred by this approach is that ANN-based models are adaptive and their learning capability could lead to improved predictive performance, thus allowing us to better differentiate between normal and abnormal, and to further define impairments in respiratory physiology.

Results

We analyzed 3111 spirometry tests constituting the Cleveland group, which were randomly divided into a derivation/training (66%) and an internal validation/testing set (33%). In this group of tests originating from the USA, approximately 66% of the subjects were women; 87% of the tested individuals were White and 13% self-identified as Black. In addition, we analyzed 457 normal spirometry tests from Spain, which constituted the Madrid group. In this group, 61% were women, and all subjects were characterized as White. The main anthropometric characteristics and pulmonary function measurements of the two groups are shown in Table 1. Figure 1 shows the AEX distributions by gender and race, while Fig. 2 shows the relationship between AEX and the subject’s age at the time of testing.

Table 1 Demographic and functional characteristics of the study participants.

Full size table

Next, we computed the AEX approximations called AEX₁ through AEX₄ from FVC, Peak Expiratory Flow (PEF), FEF₂₅, FEF₅₀ and FEF₇₅, based on the areas of the triangles and trapezoids delineated by these flows and volumes, as described elsewhere⁷. Then, we compared them with their predicted values, as derived from the main predictive equation sets for FVC, PEF and for the respective isovolumic flows (for the latter, we computed the same triangles and trapezoids’ areas from the predicted values of the instantaneous flows and volumes). For comparison, we used European Community of Steel and Coal (ECSC), National Health and Nutrition Evaluation Survey (NHANES) III and the more recent Global Lung Initiative (GLI) formulas (Fig. 3). The AEX₁, AEX₂, AEX₃ and AEX₄ approximations of AEX based on one, two, three or four flows, respectively were very close to the actual AEX values (i.e., small deviance and dispersion, Fig. 3—dark grey box plots). First and as iterated before, these approximations are valuable when the pulmonary function software does not provide the actual AEX. All in-between group comparisons showed correlation coefficients > 0.97 and p < 0.0001 (Table 2), findings consistent with our prior investigations^7,8,9,10. Second, we found that predicted AEX_k (k = 1–4) based on the major equation sets overestimated on average the actual AEX or its approximations AEX_k (k = 1–4)—Fig. 3, light grey box plots. Among the three predicted sets compared, the ECSC equations overestimated the AEX₁ through AEX₄ and, indirectly AEX, the most (correlation coefficients were the lowest, i.e., ~ 0.80, p < 0.0001).

Table 2 Mean differences (with 95% Confidence Intervals, CI) between actual AEX, AEX approximations (AEX₁ through AEX₄) and predicted AEX values by four different formulas (Vermaak et al.⁶; Garcia-Rio et al.¹¹, regression and artificial neural networks or ANN, 2020) in the training, testing and validation sets.

Full size table

In a side-by-side bar graph format, Fig. 4 illustrates the median and interquartile ranges (IQR) of AEX_1–4, actual AEX and the four predictive models for AEX, i.e., derived from the formulas published by Vermaak et al.⁶, Garcia-Rio et al.¹¹, the current linear regression and the ANN-based models. Standard least square-based regression predictive equations for AEX developed de novo in the two groups combined found R² between 0.62 and 0.71, depending on the gender and race-based subset. In these models, weight was a predictive variable only in White men, while race, gender, age, and height remained significant predictors in all the other groups. Regression optimization by transforming the AEX variable for normalization and variance reduction (either by logarithmic or by gamma function transformation), and by using regression regularization techniques (‘generalized regression’) such as ridge penalty regression, single or double lasso (with or without adaptive features), and elastic net led to only minor improvements in Akaike Information Criterion (AICc, maximal delta 2324), generalized R² (maximal delta 0.01, up to 0.75), in Average Absolute Error (AAE, delta 0.24, ~ 2.11) or in the square root of the mean squared prediction error (RASE, likely one of the most important performance measurements here, with maximal delta 0.02, > 3.39) in the random validation subsets of the entire population of tests, by either tenfold crossvalidation or fixed rate holdback validation methods.

For the ANN, we used as inputs the same parameters, i.e., age, weight, height, gender and race, and the output was AEX or its gender plus race-determined z scores [derived from the formula (X − Mean)/Standard Deviation]. As mentioned earlier (see full details here: Supplemental_Material_S1), the chosen neural network architecture included two ‘hidden’ layers, each containing three sigmodal, three linear and three gaussian activation function nodes. In our analyses, this represented the best architecture in the trade-off between performance and speed, bias and variance, underfitting and overfitting (see also Table 3, which shows the results of ANN ablation experiments). Expectedly, mean predicted AEX was larger in Whites vs. African Americans, and in men vs. women. The ANN–based model predicted the AEX with the highest accuracy, with a median difference of − 0.01 (IQR − 1.66 to 1.30) L²/s, and a correlation coefficient of 0.89. The residuals remained low in the external validation lot (Madrid group, Fig. 5a): median difference of − 0.36 (IQR − 1.66 to 1.30) L²/s, and a correlation coefficient of 0.76. The model performed well due to its small dispersion, without significant heteroscedasticity, i.e., residuals were not progressively larger at higher values. The model’s R² ranged from 0.80 and 0.83 in the derivation/training and the internal validation/testing sets, and 0.55 in the external validation set (Fig. 5a). These were much higher than prior models’ R² (regression-based), which ranged from 0.39 to 0.42¹¹. More importantly, other measurements of model error (Fig. 5a) remained lower vs other regression techniques used. By contrast, in our analyses, the regression-based predicted AEX had a median difference of 0.12 (IQR − 1.90 to 2.03) L²/s, and a correlation coefficient of 0.86; in the external validation lot (Madrid group), the median difference was − 1.04 (IQR − 2.73 to 1.21) L²/s, and the correlation coefficient was 0.78. Similarly, in our ANN models, the AEX z score prediction, which is important for determining LLN, was also very robust (Fig. 5b). While all inputs were significant independent predictors, the most important factors (total effects, %) for predicted AEX were gender (28.6%), race (28.6%), height (21.6%) and age (20.5%), while for AEX z scores (which are computed by gender and race) were height (50.3%), age (30.7%) and weight (18.8%), respectively.

Table 3 Comparison of the Linear Regression (LR) using Standard Least Squares method, Generalized Regression (GR) model using a logarithmic transformation and the double-lasso method, and the main ablation experiments of the Artificial Neural Network (ANN) methods tried.

Full size table

Figures 5 and 6 show two possible modelling approaches by ANN methods. The approach shown in Fig. 5a,b is represented by models developed for AEX_predicted and AEX z scores, respectively, on two thirds of the Cleveland group (derivation/training set) and verified on the rest of the subjects (internal validation/testing set), followed by validation (external validation) in the Madrid group. In this case, one can observe the classic ‘cohort effect’, i.e., the model is ‘overfitting’ in the Cleveland group and it loses its precision when applied to another cohort, of different subjects. The alternative approach, which is shown in Fig. 6a,b, takes advantage of the adaptability or optimization functions of the ANN models, by mixing the two cohorts and deriving a model on ~ 50% of the subjects, followed by testing in 25% of the cohort (internal validation) and validation on the rest of the tests from the two groups combined. This allowed for better fitting models, in this case with larger R² (0.79–0.82) and improved precision of AEX_predicted (consistently lower measurements of error/bias and dispersion). Figures 5 and 6 also show that the condition of homoscedasticity for the models is generally met, i.e., residuals remain roughly in the same range at higher values, with the exception of very few outliers.

In a more comprehensive one-on-one analysis of various variables, Table 2 illustrates the main differences (with 95% Confidence Intervals, CI) between observed AEX, computed AEX₁ through AEX₄, predicted AEX values by previously published formulas^6,11, and by the new regression and ANN-based models.

Discussion

The main finding of this article is that artificial neural networks (ANN) can provide a great alternative to traditional methodologies in computing normal predicted equations, as well as LLNs based on z scores, in this case applied to Area Under Expiratory flow-volume curve (AEX). The adaptive, machine learning model performed better than a de novo linear regression model (smaller dispersion) and was superior to two previously published equations for AEX^6,11.

Traditional regression-based models used for deriving predictive equations for pulmonary function have been flawed by internal and external validity biases (‘cohort effects’), or by various degrees of untrue assumptions of normality, additivity or linearity¹². For these reasons, we used here a more modern method of modelling, able to circumvent collinearities and non-linear relationships, and which can be used in spirometry reference equation derivation, i.e., the ANN. In addition, we found that this methodology outperformed more advanced regression regularization techniques in reducing the bias and the dispersion of the residuals. Nowadays, in an era of exploding computational capabilities, neural networks represent the backbone of many emerging artificial intelligence techniques, which could successfully be applied in our field^13,14,15,16.

We explored first a comparison between measured AEX and its approximations called AEX₁, AEX₂, AEX₃ and AEX_4. As described before⁷, these parameters are computed based on FVC and PEF (AEX₁); FVC, PEF and FEF₅₀ (AEX₂); FVC, PEF, FEF₂₅ and FEF₇₅ (AEX₃), FVC, PEF, FEF₂₅, FEF₅₀ and FEF₇₅ (AEX₄). Then we used the most common, validated predictive equations such as ECSC^17,18, NHANES III¹⁹ and GLI² sets, stratified by gender and race to derive predicted values for AEX₁ through AEX₄.

We illustrate in Fig. 3 several salient findings of our investigation. First, we confirmed our previously published findings⁷, i.e., that AEX_1–4 are acceptable approximations of AEX (with great metrics of central tendency and dispersion for the estimations). The analyses were performed on a subset of subjects with normal lung function from the Cleveland group (in which inclusion was adjudicated by normal lung volume determinations), and on an external validation set of non-smoking elderly subjects with normal spirometry (the Madrid group). Second, we show that the ECSC equations tend to overestimate these spirometric parameters the most, while GLI-based predicted values for AEX₁ through AEX₄ are the closest to the actual normal AEX values.

In Fig. 4 we show both central tendency (medians) and dispersion (IQR) metrics for actual AEX, AEX₁ through AEX₄, and for two AEX predicted values, as published before by Vermaak et al.⁶ and Garcia-Rio et al.¹¹. Of note, the distribution of these parameters was non-gaussian (sinusoidal or logarithmic-like). In addition to these functional parameters, we included in Fig. 4 the values derived from the linear regression and ANN-based models developed de novo in this article. The ANN-based median AEX_predicted (dark blue bar in Fig. 4) was the closest to the actual median AEX (red, double-hashed bar), while the model’s dispersion (as assessed by the IQR) was also the smallest in the ANN-based model. Supplemental Figures S2 and S3 show the distributions of residuals (AEX_predicted—AEX) by both methods and by gender and race, combining all tests from the two groups. In the figures, highlighted (dark green in Supplemental Figure S2 and dark blue in Supplemental Figure S3) represent the men, while lighter colors illustrate the distributions in women. The linear regression model tended to overestimate AEX in males, while the ANN model provided a more precise estimate of the central tendency in all subgroups. In Table 2, we show the in-between variables’ average differences and their 95% CIs (yet we caution the reader that the residuals are non-normally distributed), together with RMSE (root mean square error), RASE (square root of the mean squared prediction error, calculated as the square root of the sum of squares error divided by n, measurement considered by some as equivalent to an off-sample RSME) and R². As such, we confirmed the high correlations and small dispersions for ANN-based model, both in aggregate and by cohort (for the latter, data not shown).

The ANN-based models described here had as input parameters traditional predictors of lung function, i.e., subjects’ gender, race or ethnicity, height, weight, and age, two layers of nine ‘hidden’ nodes (with three sigmoidal, three linear and three gaussian activation functions), and AEX as the output. The model developed in the Cleveland group was also validated internally—dark green (males) and light green (females) dots, followed by external validation in the Madrid group—black (males) and grey (females) dots, Fig. 5a,b. Expectedly, there was a significant ‘step-down’ in the model’s performance, even when ANN methodology was used and by employing a traditional approach of derivation and internal validation in a population, followed by external validation in another cohort. Instead, taking advantage of the learning property of the ANN models (Fig. 6a,b), pooling all tests from the two groups leads to better predictive ability (better central tendency, smaller dispersion and higher percentage of variance explained by the model). See additional online information (link: Supplemental_Material_S1), which also shows the formulas and the code used, for future validation or refinements of the models in other pulmonary function sets.

Several limitations of this investigation deserve to be mentioned. First, the current predictive models for AEX do not consider the intra-individual, test-to-test variability of the AEX measurement, which needs to be explored in future investigations. It is conceivable that, similarly to the large variability of FEF₂₅, FEF₅₀ and FEF₇₅, AEX could also present large variations. This intrinsic variability can be explored and, if found to be high, could potentially be minimized by using AEX variables in concert with other spirometric measurements, approach which can further refine the characterization of the functional impairments. Second, the Madrid cohort included very different subjects, i.e., older, White, and from a small geographic footprint. This limitation could be overcome in the future by extending the geographic coverage and the diversity of the pooled tests. This will allow the ANN models to continue to evolve (trying to the minimize the gradient descent) and to further refine the node equations based on additional variation of the inputs. Third, additional predictors of lung function can be assessed, as modern computational techniques allow us to employ fast and powerful mathematical models, leveraging the unprecedented access to big data, unavailable decades ago, or when using traditional modeling methods. Fourth, one of the disadvantages of the ANN is the complexity of the equations in the hidden nodes, leading to a perceived lack of transparency or ‘black box’ effect, yet it can be visualized easily at each node and in all layers. Fifth, the accuracy of the presented models or equations may not be optimal in a new experimental study that considers different ranges of age, weight and height or other racial profiles. In future training, testing and validation sets, ANN-based models may differ mathematically and deal differently with possible new sources of variance from other factors and with the potential of higher systematic bias. However, this is exactly the point we are making here when we illustrate modeling outcomes in one population with external validation in a different cohort vs ‘pooling’ of all tests together and devising the ANN models that use input variability from all demographic categories. Lastly, the utility of AEX needs to be explored in relationship to specific conditions and outcomes, as most measurements in modern medicine need to be ‘anchored’ against prevention, early diagnosis and development of personalized therapies.

Conclusion

In this investigation, we used neural network models in a pooled, geographically diverse cohort, in order to compute predicted Area Under Expiratory flow-volume curve, a spirometric measurement that may have great impact on how we define respiratory functional impairment in the future. In a large pool of normal spirometry tests, we found that the learning property of the artificial neural networks allows continuous improvement of the predictive models that compute the reference values for AEX and that these models may outperform traditional methods and validation approaches.

Methods

Analyses were performed on a development cohort (the Cleveland group) of 3111 consecutive adult subjects who had normal spirometry and normal same-day lung volume testing in the Cleveland Clinic Pulmonary Function Laboratory over a 10-year time span. A second cohort (the Madrid group) was constituted by 457 never-smoker healthy volunteers who met the American Thoracic Society criteria for reference subjects and participated in a Spanish study that was aimed at deriving spirometry reference values for elderly European individuals¹¹.

Spirometry was performed and interpreted per the current, joint American Thoracic Society (ATS) and European Respiratory Society (ERS) standards and recommendations^{1,20,21,22,23}. Lung volume assessments⁴ were performed only in the Cleveland group, by either body plethysmography^24,25,26 or helium dilution^27,28 methods. Normal lung volume testing was defined as values between lower and upper limits of normal for the following parameters: total lung capacity, functional residual capacity and residual volume. All tests were done using a Jaeger-Viasys Master Lab Pro system (Wurzberg, Germany). The most recent, validated and widely applicable reference values, as developed in ‘semi-parametric’ regression-type models and published by the Global Lung Initiative (GLI) were used for spirometry interpretation and definition of normality^2,19. For lung volumes, the reference values used were those published by Crapo et al.²⁹. We did not use the previously published lung volume reference values developed for 65–85 year-old Europeans³⁰, as the Cleveland group (the only group with lung volume determinations, which constitute gold standard in pulmonary function testing) was overall younger and likely with different anthropometric characteristics. We calculated the parameters AEX₁ through AEX₄ from FVC, FEF₂₅, FEF₅₀ and FEF₇₅, as done elsewhere⁷, and compared them with their predicted values using three of the most popular and widely used equation sets, i.e., European Community for Steel and Coal (ECSC)¹⁸, National Health and Nutrition Survey (NHANES) III¹⁹ and Global Lung Initiative (GLI)². The largest AEX was selected from all the pre-bronchodilator spirometry trials performed. In addition, predicted AEX was computed by using two predictive equations for AEX, as published before by Vermaak et al.⁶ and Garcia-Rio et al.¹¹.

Statistical analyses were performed using JMP Pro15 (SAS Institute, Cary, NC, USA) and open-access R software (R version 3.6.2, R: A Language and Environment for Statistical Computing, R Core Team, R Foundation for Statistical Computing, Vienna, Austria, 2019, https://www.R-project.org, R Studio 1.2.5033, RStudio, Inc).

Descriptive statistical analysis of available variables was performed. Categorical variables were summarized as frequencies or percentages. Continuous variables were characterized by mean, standard deviation, median and 25^th–75th interquartile range (IQR), as appropriate (as most distributions were non-gaussian).

The GLI equations² were developed and made available as Generalized Additive Models for Location, Scale and Shape (GAMLSS) in the R software package. The methods are ‘parametric’ in the sense that they require a parametric distribution assumption for the response variables, and ‘semi’ because modelling of the parameters of distribution as functions of exploratory variables may involve non-parametric smoothing functions (link: GAMLSS).

Some of the prior models for pulmonary function normal values used regular linear regression (standard least squares method) by gender and race, relying on predictive variables such as age, height and, occasionally, weight. In this work, regular regression models were improved by several types of optimization approaches, e.g., generalized additive models defining splines for means, variance and skewness (as in the GLI equations²), regression regularization techniques such as ridge regression, lasso, elastic net and double lasso techniques, with and without adaptive features, using both native values and logarithmic or gamma transformations (as they represented the closest distribution fits) and comparing them with deep learning algorithms or artificial intelligence (AI) methods. The latter models were based on ANN, which could adjust for more complex relationships and interactions between variables, thus modeling more efficiently complex response surfaces. The machine learning models used here are described in more detail online (link: Supplemental_Material_S1). We tried different ANN architectures, with variable number of nodes (3–5) in the first and second layer, and different activation functions in the hidden nodes. During ablation study experiments, we selected the simplest models that provided the lowest dispersion of the predicted variables (variance) vs smallest bias, and the best trade-off between speed and performance, fitting and overfitting. We used the approach of a derivation (training) and an internal validation (testing) set from the Cleveland group with a random holdback method at 33% rate for the internal validation; following this step, we applied the model on an external validation (validation) set constituted by data points from the Madrid group (Fig. 5a,b). In another approach (Fig. 6a,b), we pooled the data from the two cohorts and developed new ANN-based models; we used a 50–25–25% random partition for training–testing–validation (‘ongoing validation’), respectively. In the AI models used, we performed an analysis of the residuals (i.e., the differences between predicted and actual AEX), checking for normality, internal consistency by various parameters and for homoscedasticity of the residuals. The variables’ weight in various models, independent of the model type and fitting used, was assessed by the dependent resampled inputs methods in JMP Pro15, in which factor values are constructed from observed combinations using a k-nearest neighbors’ approach (k = 5 was used), in order to account for correlation. This method, used mainly when there is an assumption that the inputs (such as height, weight, gender, race and age) are possibly correlated, and treats observed variance and covariance as representative of the covariance structure for the used factors³¹. The performance of the standard least squares fit method (regression) and ANN models were assessed by using the JMP Pro15 platform and comparing the means, the residuals, as well as R², square root of the mean squared prediction error (RASE) and average absolute errors (AAE).

Institutional research oversight approvals were obtained to conduct the study and to waive subjects’ informed consent (Cleveland Clinic IRB EX#0504/EX#19-1129; Emory IRB# 00049576/Atlanta VA R&D Ioachimescu-002; and Ethics Committee of the University Hospital of La Paz HULP #PI-70).

Ethics

These analyses were performed were performed in accordance with the relevant rules, guidelines and regulations (and regulatory approvals obtained from Institutional Review Boards).

Informed consent

No informed consent was necessary, as these were data analyses of existing databases.

Abbreviations

AAE:: Average absolute error
AEX:: Area under expiratory flow-volume curve
AEX_k :: Area under expiratory flow-volume curve approximation based on number = k isovolumic flows
AFVx:: Area under maximum expiratory flow-volume curve
AI:: Artificial intelligence
ANN:: Artificial neural network
ATS:: American Thoracic Society
BMI:: Body mass index
CI:: Confidence intervals
ECSC:: European Community for Steel and Coal
ERS:: European Respiratory Society
FDR:: False discovery rate
FEF₂₅, FEF₅₀ and FEF₇₅ :: Forced expiratory flow at 25, 50 and 75% of forced vital capacity
FEV:: Forced expiratory volume
FEV₁ :: Forced expiratory volume in 1 s
FVC:: Forced vital capacity
GLI:: Global lung initiative
HSD:: (Tukey–Kramer) honest significant differences test
IQR:: Inter-quartile range
IRB:: Institution Review Board
NHANES III:: National Health and Nutrition Survey 3
PEF:: Peak expiratory flow
RASE:: Square root of the mean squared prediction error (square root of [sum of square error divided by number of observations])

References

American Thoracic Society. Standardization of spirometry, 1994 update. Am. J. Respir. Crit. Care Med. 152, 1107–1136. https://doi.org/10.1164/ajrccm.152.3.7663792 (1995).
Article Google Scholar
Quanjer, P. H. et al. Multi-ethnic reference values for spirometry for the 3–95-year age range: The global lung function 2012 equations. Eur. Respir. J. 40, 1324–1343. https://doi.org/10.1183/09031936.00080312 (2012).
Article PubMed PubMed Central Google Scholar
Staitieh, B. S. & Ioachimescu, O. C. Interpretation of pulmonary function tests: Beyond the basics. J. Investig. Med. 65, 301–310. https://doi.org/10.1136/jim-2016-000242 (2017).
Article PubMed Google Scholar
Wanger, J. et al. Standardisation of the measurement of lung volumes. Eur. Respir. J. 26, 511–522. https://doi.org/10.1183/09031936.05.00035005 (2005).
Article CAS PubMed Google Scholar
Pellegrino, R. et al. Interpretative strategies for lung function tests. Eur. Respir. J. 26, 948–968. https://doi.org/10.1183/09031936.05.00035205 (2005).
Article CAS PubMed Google Scholar
Vermaak, J. C., Bunn, A. E. & de Kock, M. A. A new lung function index: The area under the maximum expiratory flow-volume curve. Respiration 37, 61–65. https://doi.org/10.1159/000194008 (1979).
Article CAS PubMed Google Scholar
Ioachimescu, O. C. & Stoller, J. K. Area under the expiratory flow-volume curve (AEX): Actual versus approximated values. J. Investig. Med. 68, 403–411. https://doi.org/10.1136/jim-2019-001137 (2020) (Epub ahead of print Sep 11).
Article PubMed Google Scholar
Ioachimescu, O. C. & Stoller, J. K. An alternative spirometric measurement: Area under the expiratory flow-volume curve (AEX). Ann. Am. Thorac. Soc. 17, 582–588. https://doi.org/10.1513/AnnalsATS.201908-613OC (2020).
Article PubMed PubMed Central Google Scholar
Ioachimescu, O. C. & Stoller, J. K. Assessing small airway disease in GLI versus NHANES III based spirometry using area under the expiratory flow-volume curve. BMJ Open Respir. Res. 6, e000511. https://doi.org/10.1136/bmjresp-2019-000511 (2019).
Article PubMed PubMed Central Google Scholar
Ioachimescu, O. C., McCarthy, K. & Stoller, J. K. Alternative measurements to aid interpretation of spirometry: The role of Area under the Expiratory flow-volume curve (AEX). Chest 130, 119S (2006).
Article Google Scholar
Garcia-Rio, F., Pino, J. M., Dorgham, A., Alonso, A. & Villamor, J. Spirometric reference equations for European females and males aged 65–85 years. Eur. Respir. J. 24, 397–405. https://doi.org/10.1183/09031936.04.00088403 (2004).
Article CAS PubMed Google Scholar
Steyerbeg, E. W. Clinical Prediction Models 213–230 (Springer Science + Business Media, LLC, New York, 2009).
Book Google Scholar
Das, N. et al. Deep learning algorithm helps to standardise ATS/ERS spirometric acceptability and usability criteria. Eur. Respir. J. https://doi.org/10.1183/13993003.00603-2020 (2020).
Article PubMed PubMed Central Google Scholar
Velickovski, F. et al. Automated spirometry quality assurance: Supervised learning from multiple experts. IEEE J. Biomed. Health Inform. 22, 276–284. https://doi.org/10.1109/JBHI.2017.2713988 (2018).
Article PubMed Google Scholar
Kavitha, A., Sujatha, M. & Ramakrishnan, S. Evaluation of flow-volume spirometric test using neural network based prediction and principal component analysis. J. Med. Syst. 35, 127–133. https://doi.org/10.1007/s10916-009-9349-7 (2011).
Article PubMed Google Scholar
Castaldi, P. J. et al. Machine learning characterization of COPD subtypes: Insights from the COPDGene Study. Chest 157, 1147–1157. https://doi.org/10.1016/j.chest.2019.11.039 (2020).
Article CAS PubMed Google Scholar
Quanjer, P. H. et al. Lung volumes and forced ventilatory flows. Report Working Party Standardization of Lung Function Tests, European Community for Steel and Coal. Official Statement of the European Respiratory Society. Eur. Respir. J. Suppl. 16, 5–40 (1993).
Article CAS Google Scholar
Quanjer, P. H. et al. Lung volumes and forced ventilatory flows. Work Group on Standardization of Respiratory Function Tests. European Community for Coal and Steel. Official position of the European Respiratory Society. Rev. Maladies Respir. 11, 5–40 (1994).
Google Scholar
Hankinson, J. L., Odencrantz, J. R. & Fedan, K. B. Spirometric reference values from a sample of the general US population. Am. J. Respir. Crit. Care Med. 159, 179–187. https://doi.org/10.1164/ajrccm.159.1.9712108 (1999).
Article CAS PubMed Google Scholar
American Thoracic Society. Lung function testing: Selection of reference values and interpretative strategies. Am. Rev. Respir. Disease 144, 1202–1218. https://doi.org/10.1164/ajrccm/144.5.1202 (1991).
Article Google Scholar
Miller, M. R. et al. Standardisation of spirometry. Eur. Respir. J. 26, 319–338. https://doi.org/10.1183/09031936.05.00034805 (2005).
Article CAS PubMed Google Scholar
Culver, B. H. et al. Recommendations for a Standardized Pulmonary Function Report. An Official American Thoracic Society Technical Statement. Am. J. Respir. Crit. Care Med. 196, 1463–1472. https://doi.org/10.1164/rccm.201710-1981ST (2017).
Article PubMed Google Scholar
Graham, B. L. et al. Standardization of Spirometry 2019 Update. An Official American Thoracic Society and European Respiratory Society Technical Statement. Am. J. Respir. Crit. Care Med. 200, e70–e88. https://doi.org/10.1164/rccm.201908-1590ST (2019).
Article PubMed PubMed Central Google Scholar
Pfluger, E. Das pneumonometer. Pfluger’s Arch. f. d. ges. Physiol. 29, 244 (1882).
Article Google Scholar
Dubois, A. B., Botelho, S. Y. & Comroe, J. H. Jr. A new method for measuring airway resistance in man using a body plethysmograph: Values in normal subjects and in patients with respiratory disease. J. Clin. Investig. 35, 327–335. https://doi.org/10.1172/JCI103282 (1956).
Article CAS PubMed PubMed Central Google Scholar
Coates, A. L., Peslin, R., Rodenstein, D. & Stocks, J. Measurement of lung volumes by plethysmography. Eur. Respir. J. 10, 1415–1427 (1997).
Article CAS Google Scholar
Darling, R. C., Cournand, A. & Richards, D. W. Studies on the intrapulmonary mixture of gases. III. An open circuit method for measuring residual air. J. Clin. Investig. 19, 609–618. https://doi.org/10.1172/JCI101163 (1940).
Article CAS PubMed PubMed Central Google Scholar
Meneeley, G. R. & Kaltreider, N. L. The volume of the lung determined by helium dilution. Description of the method and comparison with other procedures. J. Clin. Investig. 28, 129–139 (1949).
Article Google Scholar
Crapo, R. O., Morris, A. H., Clayton, P. D. & Nixon, C. R. Lung volumes in healthy nonsmoking adults. Bull. Eur. Physiopathol. Respir. 18, 419–425 (1982).
CAS PubMed Google Scholar
Garcia-Rio, F. et al. Lung volume reference values for women and men 65 to 85 years of age. Am. J. Respir. Crit. Care Med. 180, 1083–1091. https://doi.org/10.1164/rccm.200901-0127OC (2009).
Article PubMed Google Scholar
Saltelli, A. Making best use of model evaluations to compute sensitivity indices. Comput. Phys. Commun. 145, 280–297 (2002).
Article ADS CAS Google Scholar

Download references

Acknowledgements

Kevin McCarthy RCPT (data extraction).

Funding

None.

Author information

Authors and Affiliations

Division of Pulmonary, Allergy, Critical Care and Sleep Medicine, School of Medicine, Emory University, Atlanta VA Sleep Medicine Center, 250 N Arcadia Ave, Decatur, GA, 30030, USA
Octavian C. Ioachimescu
Jean Wall Bennett Professor of Medicine, Chair-Education Institute, Cleveland Clinic, 9500 Euclid Ave, Cleveland, OH, USA
James K. Stoller
Servicio de Neumología, Hospital Universitario La Paz, IdiPAZ-Departamento de Medicina, Universidad Autónoma de Madrid-Centro de Investigación Biomédica en Red en Enfermedades Respiratorias (CIBERES), Madrid, Spain
Francisco Garcia-Rio

Authors

Octavian C. Ioachimescu
View author publications
You can also search for this author in PubMed Google Scholar
James K. Stoller
View author publications
You can also search for this author in PubMed Google Scholar
Francisco Garcia-Rio
View author publications
You can also search for this author in PubMed Google Scholar

Contributions

O.C.I.—concept, data collection and analysis, manuscript writing; J.K.S.—concept, manuscript writing; F.G.R.—concept, data collection, manuscript writing.

Corresponding author

Correspondence to Octavian C. Ioachimescu.

Ethics declarations

Competing interests

The authors declare no competing interests.

Additional information

Publisher's note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Supplementary information

Supplementary Legends.

Supplementary Information.

Supplementary Figure S2.

Supplementary Figure S3.

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/.

Reprints and permissions

About this article

Cite this article

Ioachimescu, O.C., Stoller, J.K. & Garcia-Rio, F. Area under the expiratory flow-volume curve: predicted values by artificial neural networks. Sci Rep 10, 16624 (2020). https://doi.org/10.1038/s41598-020-73925-0

Download citation

Received: 06 April 2020
Accepted: 23 September 2020
Published: 06 October 2020
DOI: https://doi.org/10.1038/s41598-020-73925-0
Springer Nature Limited

We’re sorry, something doesn't seem to be working properly.

Please try refreshing the page. If that doesn't work, please contact support so we can address the problem.

Area under the expiratory flow-volume curve: predicted values by artificial neural networks

Abstract

Similar content being viewed by others

Estimation of Lung Properties Using ANN-Based Inverse Modeling of Spirometric Data

Normal spirometry prediction equations for the Iranian population

Prediction of spirometry parameters of adult Indian population using machine learning technology

Introduction

Results

Discussion

Conclusion

Methods

Ethics

Informed consent

Abbreviations

References

Acknowledgements

Funding

Author information

Authors and Affiliations

Contributions

Corresponding author

Ethics declarations

Competing interests

Additional information

Publisher's note

Supplementary information

Supplementary Legends.

Supplementary Information.

Supplementary Figure S2.

Supplementary Figure S3.

Rights and permissions

About this article

Cite this article

Navigation

Area under the expiratory flow-volume curve: predicted values by artificial neural networks

Abstract

Similar content being viewed by others

Estimation of Lung Properties Using ANN-Based Inverse Modeling of Spirometric Data

Normal spirometry prediction equations for the Iranian population

Prediction of spirometry parameters of adult Indian population using machine learning technology

Introduction

Results

Discussion

Conclusion

Methods

Ethics

Informed consent

Abbreviations

References

Acknowledgements

Funding

Author information

Authors and Affiliations

Contributions

Corresponding author

Ethics declarations

Competing interests

Additional information

Publisher's note

Supplementary information

Supplementary Legends.

Supplementary Information.

Supplementary Figure S2.

Supplementary Figure S3.

Rights and permissions

About this article

Cite this article

Share this article

Search

Navigation