Volume 61, Issue 4

Doubly Robust Estimation in Missing Data and Causal Inference Models

Heejung Bang

Division of Biostatistics and Epidemiology, Department of Public Health, Weill Medical College of Cornell University, New York, New York 10021, U.S.A.
email:heb2013@med.cornell.edu

Search for more papers by this author
James M. Robins

Departments of Biostatistics and Epidemiology, Harvard School of Public Health, Boston, Massachusetts 02115, U.S.A.

Search for more papers by this author
First published: 02 December 2005
Citations: 651

Abstract

Summary The goal of this article is to construct doubly robust (DR) estimators in ignorable missing data and causal inference models. In a missing data model, an estimator is DR if it remains consistent when either (but not necessarily both) a model for the missingness mechanism or a model for the distribution of the complete data is correctly specified. Because with observational data one can never be sure that either a missingness model or a complete data model is correct, perhaps the best that can be hoped for is to find a DR estimator. DR estimators, in contrast to standard likelihood‐based or (nonaugmented) inverse probability‐weighted estimators, give the analyst two chances, instead of only one, to make a valid inference. In a causal inference model, an estimator is DR if it remains consistent when either a model for the treatment assignment mechanism or a model for the distribution of the counterfactual data is correctly specified. Because with observational data one can never be sure that a model for the treatment assignment mechanism or a model for the counterfactual data is correct, inference based on DR estimators should improve upon previous approaches. Indeed, we present the results of simulation studies which demonstrate that the finite sample performance of DR estimators is as impressive as theory would predict. The proposed method is applied to a cardiovascular clinical trial.

Number of times cited according to CrossRef: 651

  • A causal framework for classical statistical estimands in failure‐time settings with competing events, Statistics in Medicine, 10.1002/sim.8471, 39, 8, (1199-1236), (2020).
  • The overall effect of parental supply of alcohol across adolescence on alcohol‐related harms in early adulthood—a prospective cohort study, Addiction, 10.1111/add.15005, 115, 10, (1833-1843), (2020).
  • Causal isotonic regression, Journal of the Royal Statistical Society: Series B (Statistical Methodology), 10.1111/rssb.12372, 82, 3, (719-747), (2020).
  • G-computation, propensity score-based methods, and targeted maximum likelihood estimator for causal inference with different covariates sets: a comparative simulation study, Scientific Reports, 10.1038/s41598-020-65917-x, 10, 1, (2020).
  • Understanding marginal structural models for time-varying exposures: pitfalls and tips, Journal of Epidemiology, 10.2188/jea.JE20200226, (2020).
  • The Effect of Prenatal Treatments on Offspring Events in the Presence of Competing Events, Epidemiology, 10.1097/EDE.0000000000001222, 31, 5, (636-643), (2020).
  • Defining and Identifying Per-protocol Effects in Randomized Trials, Epidemiology, 10.1097/EDE.0000000000001234, 31, 5, (692-694), (2020).
  • Improved Doubly Robust Estimation in Learning Optimal Individualized Treatment Rules, Journal of the American Statistical Association, 10.1080/01621459.2020.1725522, (1), (2020).
  • Doubly Robust Estimation of Causal Effect, Circulation: Cardiovascular Quality and Outcomes, 10.1161/CIRCOUTCOMES.119.006065, 13, 1, (2020).
  • Doubly robust difference-in-differences estimators, Journal of Econometrics, 10.1016/j.jeconom.2020.06.003, (2020).
  • Doubly robust inference when combining probability and non‐probability samples with high dimensional data, Journal of the Royal Statistical Society: Series B (Statistical Methodology), 10.1111/rssb.12354, 82, 2, (445-465), (2020).
  • Causal Inference, Engineering, 10.1016/j.eng.2019.08.016, (2020).
  • Endometrial stromal cells treated by tumor necrosis factor‐α stimulate macrophages polarized toward M2 via interleukin‐6 and monocyte chemoattractant protein‐1, Journal of Obstetrics and Gynaecology Research, 10.1111/jog.14135, 46, 2, (293-301), (2020).
  • Defining ecological restoration of peatlands in Central Kalimantan, Indonesia, Restoration Ecology, 10.1111/rec.13097, 28, 2, (435-446), (2020).
  • Multiply robust causal inference with double‐negative control adjustment for categorical unmeasured confounding, Journal of the Royal Statistical Society: Series B (Statistical Methodology), 10.1111/rssb.12361, 82, 2, (521-540), (2020).
  • Treatment Effect Estimation via Differentiated Confounder Balancing and Regression, ACM Transactions on Knowledge Discovery from Data (TKDD), 10.1145/3365677, 14, 1, (1-25), (2020).
  • Effects of intergenerational exposure interventions on adolescent outcomes: An application of inverse probability weighting to longitudinal pre‐birth cohort data, Paediatric and Perinatal Epidemiology, 10.1111/ppe.12646, 34, 3, (366-375), (2020).
  • Sumca: simple, unified, Monte‐Carlo‐assisted approach to second‐order unbiased mean‐squared prediction error estimation, Journal of the Royal Statistical Society: Series B (Statistical Methodology), 10.1111/rssb.12358, 82, 2, (467-485), (2020).
  • Rubusoside alleviates the ovalbumin‐induced mice allergic asthma by modulating the NF‐κB activation, Journal of Food Biochemistry, 10.1111/jfbc.13187, 44, 5, (2020).
  • Metastasis, Mortality, and Quality of Life for Men with NCCN High and Very High Risk Localized Prostate Cancer following Surgical and/or Combined Modality Radiation Therapies, Clinical Genitourinary Cancer, 10.1016/j.clgc.2019.11.023, (2020).
  • Extending inferences from a randomized trial to a new target population, Statistics in Medicine, 10.1002/sim.8426, 39, 14, (1999-2014), (2020).
  • Toward Causally Interpretable Meta-analysis, Epidemiology, 10.1097/EDE.0000000000001177, 31, 3, (334-344), (2020).
  • A Causal Dirichlet Mixture Model for Causal Inference from Observational Data, ACM Transactions on Intelligent Systems and Technology, 10.1145/3379500, 11, 3, (1-29), (2020).
  • The Early Impact of the Centers for Medicare & Medicaid Services State Innovation Models Initiative on 30-Day Hospital Readmissions Among Adults With Diabetes, Medical Care, 10.1097/MLR.0000000000001276, 58, (S22-S30), (2020).
  • gfoRmula: An R Package for Estimating the Effects of Sustained Treatment Strategies via the Parametric g-formula, Patterns, 10.1016/j.patter.2020.100008, (100008), (2020).
  • Towards Non-I.I.D. Image Classification: A Dataset and Baselines, Pattern Recognition, 10.1016/j.patcog.2020.107383, (107383), (2020).
  • Modelado estadístico y matemático en la epidemia del coronavirus: algunas consideraciones para minimizar los sesgos en los resultados, Archivos de Bronconeumología, 10.1016/j.arbres.2020.04.022, (2020).
  • Far from MCAR, Epidemiology, 10.1097/EDE.0000000000001215, 31, 5, (620-627), (2020).
  • Doubly robust estimator of risk in the presence of censoring dependent on time-varying covariates: application to a primary prevention trial for coronary events with pravastatin, BMC Medical Research Methodology, 10.1186/s12874-020-01087-8, 20, 1, (2020).
  • Statistical and mathematical modeling in the coronavirus epidemic: some considerations to minimize biases in the results, Archivos de Bronconeumología (English Edition), 10.1016/j.arbr.2020.04.006, 56, 9, (601-602), (2020).
  • Capital Expenditures Increased at Rural Hospitals That Merged Between 2012 and 2015, Journal of Healthcare Management, 10.1097/JHM-D-19-00219, 65, 5, (346-364), (2020).
  • Causal Inference in Pharmacoepidemiology, Quantitative Methods in Pharmaceutical Research and Development, 10.1007/978-3-030-48555-9, (181-224), (2020).
  • Estimation of causal effect using propensity score and weighted-average method, Procedia Computer Science, 10.1016/j.procs.2020.09.076, 176, (810-817), (2020).
  • Access to savings and household welfare evidence from a household survey in The Gambia, African Development Review, 10.1111/1467-8268.12423, 32, 2, (138-149), (2020).
  • A beyond multiple robust approach for missing response problem, Computational Statistics & Data Analysis, 10.1016/j.csda.2020.107111, (107111), (2020).
  • Research Note: Causal inference, Journal of Physiotherapy, 10.1016/j.jphys.2020.07.010, (2020).
  • Doubly robust‐type estimation of population moments and parameters in biased sampling, Stat, 10.1002/sta4.241, 8, 1, (2020).
  • “The Name Is Sherlock Holmes, and the Address Is 221B Baker Street”: Virtual Reality, Fan Communities, and Tourism, The Journal of Popular Culture, 10.1111/jpcu.12870, 52, 6, (1494-1511), (2020).
  • “Do the Ghosts Roam Along the Corridors Here at Ordsall Hall?”: Paranormal Media, Haunted Heritage, and Investing Historical Capital, The Journal of Popular Culture, 10.1111/jpcu.12852, 52, 6, (1312-1333), (2020).
  • The Art of Wall Building: The Caminata Nocturna and the Rhetoric of Nationalism, The Journal of Popular Culture, 10.1111/jpcu.12863, 52, 6, (1274-1289), (2020).
  • Statistical considerations in the design and analysis of non-inferiority trials with binary endpoints in the presence of non-adherence: a simulation study, Wellcome Open Research, 10.12688/wellcomeopenres.15636.2, 4, (207), (2020).
  • Assessing the Causal Impact of Delayed Oral Health Care on Emergency Department Utilization, North American Actuarial Journal, 10.1080/10920277.2020.1735448, (1-13), (2020).
  • Demystifying a class of multiply robust estimators, Biometrika, 10.1093/biomet/asaa026, (2020).
  • Estimation and Validation of Ratio-based Conditional Average Treatment Effects Using Observational Data, Journal of the American Statistical Association, 10.1080/01621459.2020.1772080, (1-18), (2020).
  • Regression with incomplete multivariate surrogate responses for a latent covariate, Biostatistics & Epidemiology, 10.1080/24709360.2020.1794705, (1-18), (2020).
  • Nonparametric targeted Bayesian estimation of class proportions in unlabeled data, Biostatistics, 10.1093/biostatistics/kxaa022, (2020).
  • Parametric g‐formula implementations for causal survival analyses, Biometrics, 10.1111/biom.13321, 0, 0, (2020).
  • Technology-based Stepped Care to Stem Transgender Adolescent Risk Transmission: Study Protocol for a Randomized Controlled Trial (TechStep) (Preprint), JMIR Research Protocols, 10.2196/18326, (2020).
  • The Collateral Damage of In-School Suspensions: A Counterfactual Analysis of High-Suspension Schools, Math Achievement and College Attendance, Urban Education, 10.1177/0042085920902256, (004208592090225), (2020).
  • A Stochastic Second-Order Generalized Estimating Equations Approach for Estimating Association Parameters, Journal of Computational and Graphical Statistics, 10.1080/10618600.2019.1710156, (1-15), (2020).
  • Visually Communicating and Teaching Intuition for Influence Functions, The American Statistician, 10.1080/00031305.2020.1717620, (1-11), (2020).
  • Effects of Maryland’s Affordable Care Act Medicaid Health Home Waiver on Quality of Cardiovascular Care Among People with Serious Mental Illness, Journal of General Internal Medicine, 10.1007/s11606-020-05690-9, (2020).
  • Quantile regression for incomplete longitudinal data with selection by death, Statistical Methods in Medical Research, 10.1177/0962280220909986, (096228022090998), (2020).
  • A multiple robust propensity score method for longitudinal analysis with intermittent missing data, Biometrics, 10.1111/biom.13330, 0, 0, (2020).
  • A framework for covariate balance using Bregman distances, Scandinavian Journal of Statistics, 10.1111/sjos.12457, 0, 0, (2020).
  • ‘In search of lost time’: Identifying the causative role of cumulative competition load and competition time-loss in professional tennis using a structural nested mean model, PLOS ONE, 10.1371/journal.pone.0231568, 15, 4, (e0231568), (2020).
  • Multiply robust bootstrap variance estimation in the presence of singly imputed survey data, Journal of Survey Statistics and Methodology, 10.1093/jssam/smaa004, (2020).
  • Causal inference for recurrent event data using pseudo-observations, Biostatistics, 10.1093/biostatistics/kxaa020, (2020).
  • For Rural Hospitals That Merged, Inpatient Charges Decreased and Outpatient Charges Increased: A Pre‐/Post‐Comparison of Rural Hospitals That Merged and Rural Hospitals That Did Not Merge Between 2005 and 2015, The Journal of Rural Health, 10.1111/jrh.12461, 0, 0, (2020).
  • The Causal Effects of Parental Divorce and Parental Temporary Separation on Children’s Cognitive Abilities and Psychological Well-being According to Parental Relationship Quality, Social Indicators Research, 10.1007/s11205-020-02428-2, (2020).
  • Third-Party Policing Approaches Against Organized Crime: An Evaluation of the Yakuza Exclusion Ordinances, Journal of Quantitative Criminology, 10.1007/s10940-020-09466-6, (2020).
  • Comparison Between Ticagrelor and Clopidogrel in Elderly Patients with an Acute Coronary Syndrome: Insights from the SWEDEHEART Registry, Circulation, 10.1161/CIRCULATIONAHA.120.050645, (2020).
  • Robust and efficient semi‐supervised estimation of average treatment effects with application to electronic health records data, Biometrics, 10.1111/biom.13298, 0, 0, (2020).
  • Design and Analysis Considerations for a Sequentially Randomized HIV Prevention Trial, Statistics in Biosciences, 10.1007/s12561-020-09274-3, (2020).
  • Evaluating multiple surrogate markers with censored data, Biometrics, 10.1111/biom.13370, 0, 0, (2020).
  • Assessing the impact of case-mix heterogeneity in individual participant data meta-analysis: Novel use of I 2 statistic and prediction interval , Research Methods in Medicine & Health Sciences, 10.1177/2632084320957207, (263208432095720), (2020).
  • A Semiparametric Approach to Model Effect Modification, Journal of the American Statistical Association, 10.1080/01621459.2020.1811099, (1-33), (2020).
  • Causal survival analysis: A guide to estimating intention-to-treat and per-protocol effects from randomized clinical trials with non-adherence, Research Methods in Medicine & Health Sciences, 10.1177/2632084320961043, (263208432096104), (2020).
  • A robust joint modeling approach for longitudinal data with informative dropouts, Computational Statistics, 10.1007/s00180-020-00972-6, (2020).
  • Causal inference with noisy data: Bias analysis and estimation approaches to simultaneously addressing missingness and misclassification in binary outcomes, Statistics in Medicine, 10.1002/sim.8419, 39, 4, (456-468), (2019).
  • Data‐adaptive longitudinal model selection in causal inference with collaborative targeted minimum loss‐based estimation, Biometrics, 10.1111/biom.13135, 76, 1, (145-157), (2019).
  • Examining the Use of Real‐World Evidence in the Regulatory Process, Clinical Pharmacology & Therapeutics, 10.1002/cpt.1658, 107, 4, (843-852), (2019).
  • Further Evidence on the Effect of Clean Indoor Air Laws on Smoking: The Italian Case, Southern Economic Journal, 10.1002/soej.12409, 86, 3, (1110-1132), (2019).
  • The effect of unemployment on the smoking behavior of couples, Health Economics, 10.1002/hec.3961, 29, 2, (154-170), (2019).
  • Robust estimation of the causal effect of time‐varying neighborhood factors on health outcomes, Statistics in Medicine, 10.1002/sim.8423, 39, 5, (544-561), (2019).
  • Propensity weighting: how to minimise comparative bias in non-randomised studies?, European Journal of Cardiovascular Nursing, 10.1177/1474515119888972, 19, 1, (83-88), (2019).
  • Implementing lower‐risk brief resolved unexplained events guideline reduces admissions in a modelled population, Journal of Evaluation in Clinical Practice, 10.1111/jep.13211, 26, 1, (343-356), (2019).
  • Experimental characterization of fracture of glass fiber reinforced composites laminates subjected to freeze‐thaw cycles, Fatigue & Fracture of Engineering Materials & Structures, 10.1111/ffe.13109, 43, 2, (242-249), (2019).
  • A refined diffuse cohesive approach for the failure analysis in quasibrittle materials—part I: Theoretical formulation and numerical calibration, Fatigue & Fracture of Engineering Materials & Structures, 10.1111/ffe.13107, 43, 2, (221-241), (2019).
  • Models of evaluation under ceteris imparibus: System dynamics and the example of emergency care, Systems Research and Behavioral Science, 10.1002/sres.2615, 37, 2, (211-222), (2019).
  • System dynamics approaches to public–private partnerships: A literature review, Systems Research and Behavioral Science, 10.1002/sres.2626, 37, 2, (277-291), (2019).
  • Invited Commentary: Two-Phase, Generalized Case-Control Designs for Quantitative Longitudinal Outcomes and Evolution of the Case-Control Study, American Journal of Epidemiology, 10.1093/aje/kwz200, 189, 2, (91-94), (2019).
  • The role of elective neck dissection in high‐grade parotid malignancy: A hospital‐based cohort study, The Laryngoscope, 10.1002/lary.28238, 130, 6, (1487-1495), (2019).
  • Safety surveillance and the estimation of risk in select populations: Flexible methods to control for confounding while targeting marginal comparisons via standardization, Statistics in Medicine, 10.1002/sim.8410, 39, 4, (369-386), (2019).
  • Discussion of PENCOMP, Journal of the American Statistical Association, 10.1080/01621459.2018.1537914, 114, 525, (24-27), (2019).
  • Police contact and health among urban adolescents: The role of perceived injustice, Social Science & Medicine, 10.1016/j.socscimed.2019.112487, (112487), (2019).
  • Estimating the Population Average Treatment Effect in Observational Studies with Choice-Based Sampling, The International Journal of Biostatistics, 10.1515/ijb-2018-0093, 0, 0, (2019).
  • A Nonparametric Dynamic Causal Model for Macroeconometrics, SSRN Electronic Journal, 10.2139/ssrn.3345325, (2019).
  • Metalworking Fluids and Colon Cancer Risk, Environmental Epidemiology, 10.1097/EE9.0000000000000035, 3, 1, (e035), (2019).
  • Personalization in biomedical-informatics: methodological considerations and recommendations, Journal of Biomedical Informatics, 10.1016/j.jbi.2018.12.002, (2019).
  • Calibration estimation of semiparametric copula models with data missing at random, Journal of Multivariate Analysis, 10.1016/j.jmva.2019.02.003, (2019).
  • Assessing the effectiveness of Japan's community-based direct payment scheme for hilly and mountainous areas, Ecological Economics, 10.1016/j.ecolecon.2019.01.036, 160, (62-75), (2019).
  • Mental Health Changes and the Willingness to Take Risks, SSRN Electronic Journal, 10.2139/ssrn.3352663, (2019).
  • Best practices for analyzing large-scale health data from wearables and smartphone apps, npj Digital Medicine, 10.1038/s41746-019-0121-1, 2, 1, (2019).
  • Misuse of Regression Adjustment for Additional Confounders Following Insufficient Propensity Score Balancing, Epidemiology, 10.1097/EDE.0000000000001023, 30, 4, (541-548), (2019).
  • IFAD Impact Assesment - Agricultural Value Chains Support Project (PAFA): Senegal, SSRN Electronic Journal, 10.2139/ssrn.3389322, (2019).
  • undefined, Proceedings of the Twelfth ACM International Conference on Web Search and Data Mining - WSDM '19, 10.1145/3289600.3290958, (447-455), (2019).
  • Gender Income Disparity Among White-Collar Regular Employees: Explaining the Causes Responsible for 80% of the Disparity and Its Mechanisms, Gender Inequalities in the Japanese Workplace and Employment, 10.1007/978-981-13-7681-8_4, (111-142), (2019).
  • Impacts of Companies’ Promotion of Work–Life Balance and the Restrictive Regular Employment System on Gender Wage Gap, Gender Inequalities in the Japanese Workplace and Employment, 10.1007/978-981-13-7681-8_5, (143-166), (2019).
  • Association of sleep quality with job burnout among Chinese coal mine staff: a propensity score weighting analysis, Scientific Reports, 10.1038/s41598-019-45329-2, 9, 1, (2019).
  • See more

The full text of this article hosted at iucr.org is unavailable due to technical difficulties.