Evaluating the added predictive ability of a new marker: From area under the ROC curve to reclassification and beyond
Article first published online: 13 JUN 2007
Copyright © 2007 John Wiley & Sons, Ltd.
Statistics in Medicine
Volume 27, Issue 2, pages 157–172, 30 January 2008
How to Cite
Pencina, M. J., D' Agostino, R. B., D' Agostino, R. B. and Vasan, R. S. (2008), Evaluating the added predictive ability of a new marker: From area under the ROC curve to reclassification and beyond. Statist. Med., 27: 157–172. doi: 10.1002/sim.2929
- Issue published online: 20 DEC 2007
- Article first published online: 13 JUN 2007
- Manuscript Accepted: 5 APR 2007
- Manuscript Received: 4 APR 2007
- National Heart, Lung, and Blood Institute's Framingham Heart Study. Grant Numbers: N01-HC-25195, 2K24 HL 04334
- model performance;
- risk prediction;
Identification of key factors associated with the risk of developing cardiovascular disease and quantification of this risk using multivariable prediction algorithms are among the major advances made in preventive cardiology and cardiovascular epidemiology in the 20th century. The ongoing discovery of new risk markers by scientists presents opportunities and challenges for statisticians and clinicians to evaluate these biomarkers and to develop new risk formulations that incorporate them. One of the key questions is how best to assess and quantify the improvement in risk prediction offered by these new models. Demonstration of a statistically significant association of a new biomarker with cardiovascular risk is not enough. Some researchers have advanced that the improvement in the area under the receiver-operating-characteristic curve (AUC) should be the main criterion, whereas others argue that better measures of performance of prediction models are needed. In this paper, we address this question by introducing two new measures, one based on integrated sensitivity and specificity and the other on reclassification tables. These new measures offer incremental information over the AUC. We discuss the properties of these new measures and contrast them with the AUC. We also develop simple asymptotic tests of significance. We illustrate the use of these measures with an example from the Framingham Heart Study. We propose that scientists consider these types of measures in addition to the AUC when assessing the performance of newer biomarkers. Copyright © 2007 John Wiley & Sons, Ltd.