Brief Report

On the role of marginal confounder prevalence – implications for the high-dimensional propensity score algorithm

Corresponding Author

Tibor Schuster

Department of Epidemiology, Biostatistics and Occupational Health, McGill University, Montreal, Quebec, Canada

Centre for Clinical Epidemiology, Lady Davis Institute, Jewish General Hospital, Montreal, Quebec, Canada

Correspondence to: T. Schuster, Department of Epidemiology, Biostatistics and Occupational Health, McGill University, Purvis Hall 1020 Pine Ave. West, Montreal, Quebec, Canada, H3A 1A2. E-mail: [email protected]Search for more papers by this author

Menglan Pang,

Menglan Pang

Centre for Clinical Epidemiology, Lady Davis Institute, Jewish General Hospital, Montreal, Quebec, Canada

Search for more papers by this author

Robert W. Platt,

Robert W. Platt

Department of Epidemiology, Biostatistics and Occupational Health, McGill University, Montreal, Quebec, Canada

Department of Pediatrics, McGill University, Montreal, Quebec, Canada

Search for more papers by this author

Tibor Schuster,

Corresponding Author

Tibor Schuster

Department of Epidemiology, Biostatistics and Occupational Health, McGill University, Montreal, Quebec, Canada

Centre for Clinical Epidemiology, Lady Davis Institute, Jewish General Hospital, Montreal, Quebec, Canada

Menglan Pang,

Menglan Pang

Centre for Clinical Epidemiology, Lady Davis Institute, Jewish General Hospital, Montreal, Quebec, Canada

Search for more papers by this author

Robert W. Platt,

Robert W. Platt

Department of Epidemiology, Biostatistics and Occupational Health, McGill University, Montreal, Quebec, Canada

Department of Pediatrics, McGill University, Montreal, Quebec, Canada

Search for more papers by this author

First published: 10 April 2015

https://doi.org/10.1002/pds.3773

Citations: 17

Share a link

Email
Wechat
Bluesky

Abstract

Purpose

The high-dimensional propensity score algorithm attempts to improve control of confounding in typical treatment effect studies in pharmacoepidemiology and is increasingly being used for the analysis of large administrative databases. Within this multi-step variable selection algorithm, the marginal prevalence of non-zero covariate values is considered to be an indicator for a count variable's potential confounding impact. We investigate the role of the marginal prevalence of confounder variables on potentially caused bias magnitudes when estimating risk ratios in point exposure studies with binary outcomes.

Methods

We apply the law of total probability in conjunction with an established bias formula to derive and illustrate relative bias boundaries with respect to marginal confounder prevalence.

Results

We show that maximum possible bias magnitudes can occur at any marginal prevalence level of a binary confounder variable. In particular, we demonstrate that, in case of rare or very common exposures, low and high prevalent confounder variables can still have large confounding impact on estimated risk ratios.

Conclusions

Covariate pre-selection by prevalence may lead to sub-optimal confounder sampling within the high-dimensional propensity score algorithm. While we believe that the high-dimensional propensity score has important benefits in large-scale pharmacoepidemiologic studies, we recommend omitting the prevalence-based empirical identification of candidate covariates. Copyright © 2015 John Wiley & Sons, Ltd.

Supporting Information

References

1Schneeweiss S, Rassen JA, Glynn RJ, et al. High-dimensional propensity score adjustment in studies of treatment effects using health care claims data. Epidemiology 2009; 20(4): 512–522. doi:10.1097/EDE.0b013e3181a663cc.
10.1097/EDE.0b013e3181a663cc
PubMed Web of Science® Google Scholar
2Rassen JA, Avorn J, Schneeweiss S. Multivariate-adjusted pharmacoepidemiologic analyses of confidential information pooled from multiple health care utilization databases. Pharmacoepidemiol Drug Saf 2010; 19(8): 848–857. doi:10.1002/pds.1867.
10.1002/pds.1867
PubMed Web of Science® Google Scholar
3Toh S, García Rodríguez LA, Hernán MA. Confounding adjustment via a semi-automated high-dimensional propensity score algorithm: an application to electronic medical records. Pharmacoepidemiol Drug Saf 2011; 20(8): 849–857. doi:10.1002/pds.2152.
10.1002/pds.2152
CAS PubMed Web of Science® Google Scholar
4Polinski JM, Schneeweiss S, Glynn RJ, et al. Confronting “confounding by health system use” in Medicare Part D: comparative effectiveness of propensity score approaches to confounding adjustment. Pharmacoepidemiol Drug Saf 2012; 21(Suppl 2): 90–98. doi:10.1002/pds.3250.
10.1002/pds.3250
PubMed Web of Science® Google Scholar
5Garbe E, Kloss S, Suling M, et al. High-dimensional versus conventional propensity scores in a comparative effectiveness study of coxibs and reduced upper gastrointestinal complications. Eur J Clin Pharmacol 2013; 69(3): 549–557. doi:10.1007/s00228-012-1334-2.
10.1007/s00228-012-1334-2
CAS PubMed Web of Science® Google Scholar
6Neugebauer R, Schmittdiel JA, Zhu Z, Rassen JA, Seeger JD, Schneeweiss S. High-dimensional propensity score algorithm in comparative effectiveness research with time-varying interventions. Stat Med 2015; 34(5): 753–781. doi:10.1002/sim.6377.
10.1002/sim.6377
PubMed Web of Science® Google Scholar
7 Mini-Sentinel. Statistical Methods Development, Available at: URL http://www.mini-sentinel.org/work_products/Statistical_Methods/Mini-Sentinel_High-Dimensional-Propensity-Score-Adjustment.pdf [10 March 2015].
Google Scholar
8Suissa S, Henry D, Caetano P, et al. CNODES: the Canadian Network for Observational Drug Effect Studies. Open Med 2012; 6(4): 134–140.
PubMed Google Scholar
9Bross IDJ. Spurious effects from an extraneous variable. J Chronic Dis 1966; 19: 637–647. doi:10.1016/0021-9681(66)90062-2.
10.1016/0021-9681(66)90062-2
CAS PubMed Web of Science® Google Scholar
10Schlesselman JJ. Assessing effects of confounding variables. Am J Epidemiol 1978; 108(1): 3–8.
CAS PubMed Web of Science® Google Scholar
11Flanders WD, Khoury MJ. Indirect assessment of confounding: graphic description and limits on effect of adjusting for covariates. Epidemiology 1990; 1(3): 239–246. doi:10.1097/00001648-199005000-00010.
10.1097/00001648-199005000-00010
CAS PubMed Google Scholar
12Arah OA, Chiba Y, Greenland S. Bias formulas for external adjustment and sensitivity analysis of unmeasured confounders. Ann Epidemiol 2008; 18(8): 637–646. doi:10.1016/j.annepidem.2008.04.003.
10.1016/j.annepidem.2008.04.003
PubMed Web of Science® Google Scholar
13Chiba Y. Simple formulae for evaluating the potential impact of confounding bias. Communications Stat-Theory Methods 2011; 40(23): 4278–4288. doi:10.1080/03610926.2010.508864.
10.1080/03610926.2010.508864
Web of Science® Google Scholar
14VanderWeele TJ, Arah OA. Unmeasured confounding for general outcomes, treatments, and confounders: bias formulas for sensitivity analysis. Epidemiol (Cambridge Mass) 2011; 22(1): 42–52.
10.1097/EDE.0b013e3181f74493
PubMed Web of Science® Google Scholar
15 R Core Team. R: a language and environment for statistical computing. R Foundation for Statistical Computing, Vienna, Austria 2014. URL http://www.R-project.org/.
Google Scholar

Citing Literature

Volume24, Issue9

September 2015

Pages 1004-1007

On the role of marginal confounder prevalence – implications for the high-dimensional propensity score algorithm

Abstract

Purpose

Methods

Results

Conclusions

Supporting Information

References

Citing Literature

References

Information

About Wiley Online Library

Help & Support

Opportunities

Connect with Wiley

On the role of marginal confounder prevalence – implications for the high-dimensional propensity score algorithm

Abstract

Purpose

Methods

Results

Conclusions

Supporting Information

References

Citing Literature

References

Related

Information