Advanced Statistics: Missing Data in Clinical Research—Part 1: An Introduction and Conceptual Framework

Authors

  • Jason S. Haukoos MD, MS,

    Corresponding author
    1. Department of Emergency Medicine, Denver Health Medical Center, and the Department of Preventive Medicine and Biometrics, University of Colorado Health Sciences Center (JSH), Denver, CO
    Search for more papers by this author
  • Craig D. Newgard MD, MPH

    1. Center for Policy and Research in Emergency Medicine, Department of Emergency Medicine, Oregon Health & Science University, Portland, OR
    Search for more papers by this author

(Email: jason.haukoos@dhha.org).

Abstract

Missing data are commonly encountered in clinical research. Unfortunately, they are often neglected or not properly handled during analytic procedures, and this may substantially bias the results of the study, reduce study power, and lead to invalid conclusions. In this two-part series, the authors will introduce key concepts regarding missing data in clinical research, provide a conceptual framework for how to approach missing data in this setting, describe typical mechanisms and patterns of censoring of data and their relationships to specific methods of handling incomplete data, and describe in detail several simple and more complex methods of handling such data. In part 1, the authors will describe relatively simple approaches to handling missing data, including complete-case analysis, available-case analysis, and several forms of single imputation, including mean imputation, regression imputation, hot and cold deck imputation, last observation carried forward, and worst case analysis. In part 2, the authors will describe in detail multiple imputation, a more sophisticated and valid method for handling missing data.

Ancillary