Volume 67, Issue 4

A New Criterion for Confounder Selection

Tyler J. VanderWeele

Corresponding Author

Departments of Epidemiology and Biostatistics, Harvard School of Public Health, 677 Huntington Avenue, Boston, Massachusetts 02115, U.S.A.

email: tvanderw@hsph.harvard.edu

Ilya Shpitser

Department of Epidemiology, Harvard School of Public Health, 677 Huntington Avenue, Boston, Massachusetts 02115, U.S.A.

First published: 31 May 2011
Citations: 129

Abstract

Summary: We propose a new criterion for confounder selection when the underlying causal structure is unknown and only limited knowledge is available. We assume all covariates being considered are pretreatment variables and that for each covariate it is known (i) whether the covariate is a cause of treatment, and (ii) whether the covariate is a cause of the outcome. The causal relationships the covariates have with one another are assumed unknown. We propose that control be made for any covariate that is either a cause of treatment or of the outcome or both. We show that irrespective of the actual underlying causal structure, if any subset of the observed covariates suffices to control for confounding, then the set of covariates chosen by our criterion will also suffice. We show that other, commonly used, criteria for confounding control do not have this property. We use formal theory concerning causal diagrams to prove our result, but the application of the result does not rely on familiarity with causal diagrams. An investigator simply need ask, “Is the covariate a cause of the treatment?” and “Is the covariate a cause of the outcome?” If the answer to either question is “yes” then the covariate is included for confounder control. We discuss some additional covariate selection results that preserve unconfoundedness and that may be of interest when used with our criterion.
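
The two-question rule stated in the abstract can be illustrated with a minimal Python sketch. This is an illustration only and not code from the article: the `Covariate` class, the `select_confounders` function, and the example covariate names are hypothetical, and the yes/no answers are assumed to come from the investigator's subject-matter knowledge about pretreatment covariates.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Covariate:
    """A pretreatment covariate with the investigator's two yes/no answers.

    Hypothetical structure for illustration; not taken from the article.
    """
    name: str
    causes_treatment: bool  # "Is the covariate a cause of the treatment?"
    causes_outcome: bool    # "Is the covariate a cause of the outcome?"


def select_confounders(covariates):
    """Return names of covariates that cause treatment, outcome, or both."""
    return [
        c.name
        for c in covariates
        if c.causes_treatment or c.causes_outcome
    ]


# Hypothetical example covariates; the answers are assumed, not data.
covariates = [
    Covariate("age", causes_treatment=True, causes_outcome=True),
    Covariate("referring_clinic", causes_treatment=True, causes_outcome=False),
    Covariate("baseline_severity", causes_treatment=False, causes_outcome=True),
    Covariate("shoe_size", causes_treatment=False, causes_outcome=False),
]

selected = select_confounders(covariates)
print(selected)
```

In this sketch only the covariate answered "no" to both questions is left out. The rule needs no knowledge of how the covariates relate to one another, which matches the limited-knowledge setting the abstract describes.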
