De-confounding representation learning for counterfactual inference on continuous treatment via generative adversarial network

Zhao, Yonghe; Huang, Qiang; Zeng, Haolong; Peng, Yun; Sun, Huiyan

doi:10.1007/s10618-024-01058-3

Published: 11 July 2024

Data Mining and Knowledge Discovery

Yonghe Zhao¹, Qiang Huang¹, Haolong Zeng¹, Yun Peng² & Huiyan Sun¹ (ORCID: orcid.org/0000-0002-4664-7147)

Abstract

Real-world causal inference tasks more often involve continuous than binary treatment variables. Existing sample reweighting methods based on marginal structural models can reduce confounding bias, but they generally remove only the treatment's linear dependence on confounders and rely on the accuracy of assumed parametric models, which is usually unverifiable. In this paper, we propose a de-confounding representation learning (DRL) framework for counterfactual outcome estimation under continuous treatment, which generates covariate representations that are decorrelated from the treatment variables. DRL is a non-parametric model that eliminates both linear and nonlinear dependence between treatment and covariates. Specifically, we train the correlations between the de-confounding representations and the treatment variables against the correlations between the covariate representations and the treatment variables to eliminate confounding bias. Further, a counterfactual inference network is embedded into the framework so that the learned representations serve both de-confounding and reliable inference. Extensive experiments on synthetic and semi-synthetic datasets show that DRL learns de-confounding representations effectively and outperforms state-of-the-art counterfactual inference models for continuous treatment variables. In addition, we apply DRL to the real-world medical dataset MIMIC-III and demonstrate a detailed causal relationship between red cell distribution width and mortality.
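
The distinction the abstract draws between linear and nonlinear dependence can be illustrated with a toy simulation. The sketch below is not the DRL model; it only assumes a hypothetical confounder x and a continuous treatment t that depends on x through x squared (the variable names, sample size, and noise level are all illustrative choices). It shows that a treatment can be almost perfectly uncorrelated with a confounder in the linear sense while still depending on it strongly.

```python
import numpy as np

rng = np.random.default_rng(0)
n = 20_000  # illustrative sample size

# Hypothetical confounder x; the treatment t depends on x only through x**2,
# so the dependence is purely nonlinear.
x = rng.normal(size=n)
t = x**2 + 0.5 * rng.normal(size=n)


def pearson(a, b):
    """Pearson correlation coefficient between two 1-D arrays."""
    return float(np.corrcoef(a, b)[0, 1])


# Linear dependence: close to zero, because Cov(x, x**2) = E[x**3] = 0
# for a standard normal x.
linear_corr = pearson(x, t)

# Nonlinear dependence: large once x is passed through a nonlinear feature.
nonlinear_corr = pearson(x**2, t)

print(f"corr(x, t)    = {linear_corr:.3f}")
print(f"corr(x**2, t) = {nonlinear_corr:.3f}")
```

In this toy setting, a method that only checks or removes the linear correlation between x and t would treat the two as balanced even though t is still largely determined by x.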

Funding

The authors acknowledge funding support from the National Natural Science Foundation of China (No. 62372210) and the Natural Science Foundation of Jilin Province (No. 20240101025JJ).

Author information

Authors and Affiliations

School of Artificial Intelligence, Jilin University, Qianjin Street, Changchun, 130012, Jilin, China
Yonghe Zhao, Qiang Huang, Haolong Zeng & Huiyan Sun
Department of Data Analysis, Baidu, Shangdi Street, Beijing, 100085, China
Yun Peng

Contributions

All authors contributed to the study conception and design. Material preparation, data collection and analysis were performed by YZ, QH, HZ, YP and HS. The first draft of the manuscript was written by YZ and all authors commented on previous versions of the manuscript. All authors read and approved the final manuscript.

Corresponding author

Correspondence to Huiyan Sun.

Ethics declarations

Conflict of interest

The authors have no conflicts of interest to declare that are relevant to the content of this article.

Additional information

Responsible editor: Matteo Riondato.

Publisher's Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Rights and permissions

Springer Nature or its licensor (e.g. a society or other partner) holds exclusive rights to this article under a publishing agreement with the author(s) or other rightsholder(s); author self-archiving of the accepted manuscript version of this article is solely governed by the terms of such publishing agreement and applicable law.

About this article

Cite this article

Zhao, Y., Huang, Q., Zeng, H. et al. De-confounding representation learning for counterfactual inference on continuous treatment via generative adversarial network. Data Min Knowl Disc (2024). https://doi.org/10.1007/s10618-024-01058-3

Received: 18 July 2023
Accepted: 30 June 2024
Published: 11 July 2024
DOI: https://doi.org/10.1007/s10618-024-01058-3
