Abstract
Traditional methods for the analysis of binary response data are generalized linear models that employ logistic or probit link functions. Unfortunately, effect measures for these type of models do not have a straightforward interpretation. Hence, in this paper we survey probability-based effect measures that can be simpler to understand than logistic and probit regression model parameters and their corresponding effect measures, such as odds ratios. For describing the effect of an explanatory variable while adjusting for others, it is sometimes possible to employ the identity and log link functions to generate simple effect measures. When such link functions are inappropriate, one can still construct analogous effect measures. For comparing groups that are levels of categorical explanatory variables or relevant values for quantitative explanatory variables, such measures can be based on average differences or log-ratios of the probability modeled. For quantitative explanatory variables, they can also be based on average instantaneous rates of change for the probability. We also propose analogous measures for interpreting effects in models with nonlinear predictors, such as generalized additive models. We illustrate the measures for two examples and show how to implement them with R software.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
References
Agresti, A.: Categorical Data Analysis, 3rd edn. Wiley, Hoboken (2013)
Agresti, A., Kateri, M.: Ordinal probability effect measures for group comparisons in multinomial cumulative link models. Biometrics 73, 214–219 (2017)
Agresti, A., Tarantola, C.: Simple ways to interpret effects in modeling ordinal categorical data. Statistica Neerlandica 72, 210–223 (2018)
Brumback, B., Berg, A.: On effect-measure modification: relationships among changes in the relative risk, odds ratio, and risk difference. Stat. Med. 27, 3453–3465 (2008)
Fasiolo, M., Nedellec, R., Goude, Y., Wood, S.N.: Scalable visualisation methods for modern generalized additive models. J. Comput. Graph. Stat. 29, 78–86 (2020)
Franses P.H., Paap, R.: Quantitative Models in Marketing Research. Cambridge University Press, Cambridge (2004)
Gail, M.H.: Adjusting for covariates that have the same distribution in exposed and unexposed cohorts. In: Moolgavkar, S.H., Prentice, R.L. (eds.) Modern Statistical Methods in Chronic Disease Epidemiology, pp. 3–18. Wiley, New York (1986)
Gail, M.H., Wieand, S., Piantadosi, S.: Biased estimates of treatment effect in randomized experiments with nonlinear regressions and omitted covariates. Biometrika 71, 431–444 (1984)
Greene, W.H.: Econometric Analysis, 6th edn. Pearson Prentice Hall, Upper Saddle River (2008)
Greenland, S., Robins, J.M., Pearl, J. Confounding and collapsibility in causal inference. Stat. Sci. 14, 29–46 (1999)
Hastie, T., Tibshirani, R.: Generalized Additive Models. Chapman and Hall, London (1990)
Iannario, M., Tarantola, C.: How to interpret the effect of covariates on the extreme categories in ordinal data models. Sociol. Methods Res. (2021). https://doi.org/10.1177/0049124120986179
Leeper, T.J.: Margins: An R Port of Stata’s Margins Command. R package version 0.3.23 (2018)
Long, J.S.: Regression Models for Categorical and Limited Dependent Variables. Sage, Thousand Oaks (1997)
Long, J.S., Freese, J.: Regression Models for Categorical Dependent Variables Using Stata, 3rd edn. Stata Press, College Station (2014)
Long, J.S., Mustillo, S.A.: Using predictions and marginal effects to compare groups in regression models for binary outcomes. Sociol. Methods Res. 50, 1284–1320 (2021)
McKelvey, R.D., Zavoina, W.: A statistical model for the analysis of ordinal level dependent variables. J. Math. Sociol. 4, 103–120 (1975)
Mood, C.: Logistic regression: Why we cannot do what we think we can do, and what we can do about it. Eur. Sociol. Rev. 26, 67–82 (2010)
Robinson, L., Jewell, N.P.: Some surprising results about covariate adjustment in logistic regression models. Int. Stat. Rev. 59, 227–240 (1991)
Stanghellini, E., Doretti, M.: On marginal and conditional parameters in logistic regression models. Biometrika 106, 732–739 (2019)
Sun, C.: Empirical Research in Economics: Growing up with R. Pine Square LLC, Baton Rouge (2015)
VanderWeele, T.J.: Optimal approximate conversions of odds ratios and hazard ratios to risk ratios. Biometrics 76, 746–752 (2020)
Wood, S.N.: Generalized Additive Models: An Introduction with R, 2nd edn. CRC Press, Boca Raton (2017)
Acknowledgements
The authors appreciate helpful comments from two referees and from Pablo Inchausti and Maria Kateri.
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Appendix
Appendix
This appendix provides the source code for the R analyses described in the text.
Rights and permissions
Copyright information
© 2023 Springer Nature Switzerland AG
About this chapter
Cite this chapter
Agresti, A., Tarantola, C., Varriale, R. (2023). Simple Ways to Interpret Effects in Modeling Binary Data. In: Kateri, M., Moustaki, I. (eds) Trends and Challenges in Categorical Data Analysis. Statistics for Social and Behavioral Sciences. Springer, Cham. https://doi.org/10.1007/978-3-031-31186-4_5
Download citation
DOI: https://doi.org/10.1007/978-3-031-31186-4_5
Published:
Publisher Name: Springer, Cham
Print ISBN: 978-3-031-31185-7
Online ISBN: 978-3-031-31186-4
eBook Packages: Mathematics and StatisticsMathematics and Statistics (R0)