Volume 27, Issue 2
Research Article

Evaluating the added predictive ability of a new marker: From area under the ROC curve to reclassification and beyond

Michael J. Pencina

Corresponding Author

E-mail address: mpencina@bu.edu

Department of Mathematics and Statistics, Framingham Heart Study, Boston University, 111 Cummington St., Boston, MA 02215, U.S.A.

Department of Mathematics and Statistics, Boston University, 111 Cummington Street, Boston, MA 02215, U.S.A.Search for more papers by this author
Ralph B. D' Agostino Sr

Department of Mathematics and Statistics, Framingham Heart Study, Boston University, 111 Cummington St., Boston, MA 02215, U.S.A.

Search for more papers by this author
Ralph B. D' Agostino Jr

Department of Biostatistical Sciences, Wake Forest University School of Medicine, Medical Center Boulevard, Winston‐Salem, NC 27157, U.S.A.

Search for more papers by this author
Ramachandran S. Vasan

Framingham Heart Study, Boston University School of Medicine, 73 Mount Wayte Avenue, Suite 2, Framingham, MA 01702‐5803, U.S.A.

Search for more papers by this author
First published: 13 June 2007
Citations: 3,645

Abstract

Identification of key factors associated with the risk of developing cardiovascular disease and quantification of this risk using multivariable prediction algorithms are among the major advances made in preventive cardiology and cardiovascular epidemiology in the 20th century. The ongoing discovery of new risk markers by scientists presents opportunities and challenges for statisticians and clinicians to evaluate these biomarkers and to develop new risk formulations that incorporate them. One of the key questions is how best to assess and quantify the improvement in risk prediction offered by these new models. Demonstration of a statistically significant association of a new biomarker with cardiovascular risk is not enough. Some researchers have advanced that the improvement in the area under the receiver‐operating‐characteristic curve (AUC) should be the main criterion, whereas others argue that better measures of performance of prediction models are needed. In this paper, we address this question by introducing two new measures, one based on integrated sensitivity and specificity and the other on reclassification tables. These new measures offer incremental information over the AUC. We discuss the properties of these new measures and contrast them with the AUC. We also develop simple asymptotic tests of significance. We illustrate the use of these measures with an example from the Framingham Heart Study. We propose that scientists consider these types of measures in addition to the AUC when assessing the performance of newer biomarkers. Copyright © 2007 John Wiley & Sons, Ltd.

Number of times cited according to CrossRef: 3645

  • Generation and Applicability of Genetic Risk Scores (GRS) in Stroke, Stroke Biomarkers, 10.1007/978-1-4939-9682-7_3, (23-34), (2020).
  • Urinary Biomarkers may Complement the Cleveland Score for Prediction of Adverse Kidney Events After Cardiac Surgery: A Pilot Study, Annals of Laboratory Medicine, 10.3343/alm.2020.40.2.131, 40, 2, (131), (2020).
  • The specific ex vivo released cytokine profile is associated with ischemic stroke outcome and improves its prediction, Journal of Neuroinflammation, 10.1186/s12974-019-1691-1, 17, 1, (2020).
  • Development of Predictive Equations for Nocturnal Hypertension and Nondipping Systolic Blood Pressure, Journal of the American Heart Association, 10.1161/JAHA.119.013696, 9, 2, (2020).
  • Association of Machine Learning–Derived Phenogroupings of Echocardiographic Variables with Heart Failure in Stable Coronary Artery Disease: The Heart and Soul Study, Journal of the American Society of Echocardiography, 10.1016/j.echo.2019.09.010, (2020).
  • Association between socioeconomic factors at diagnosis and survival in breast cancer: A population‐based study, Cancer Medicine, 10.1002/cam4.2842, 9, 5, (1922-1936), (2020).
  • Echocardiographic evaluation of left ventricular filling pressure in patients with heart failure with preserved ejection fraction: usefulness of inferior vena cava measurements and 2016 EACVI/ASE recommendations, Journal of Cardiac Failure, 10.1016/j.cardfail.2020.01.018, (2020).
  • Predictive value of the combination of age, creatinine, and ejection fraction score and diabetes in patients with ST-segment elevation myocardial infarction undergoing percutaneous coronary intervention, Coronary Artery Disease, 10.1097/MCA.0000000000000791, 31, 2, (109-117), (2020).
  • Nomograms predicting long-term survival in patients with invasive intraductal papillary mucinous neoplasms of the pancreas: A population-based study, World Journal of Gastroenterology, 10.3748/wjg.v26.i5.535, 26, 5, (535-549), (2020).
  • Genomic prediction of alcohol-related morbidity and mortality, Translational Psychiatry, 10.1038/s41398-019-0676-2, 10, 1, (2020).
  • Mitochondrial 8-hydroxy-2′-deoxyguanosine and coronary artery disease in patients with type 2 diabetes mellitus, Cardiovascular Diabetology, 10.1186/s12933-020-00998-6, 19, 1, (2020).
  • Hypochloremia Is a Noninvasive Predictor of Mortality in Pulmonary Arterial Hypertension, Journal of the American Heart Association, 10.1161/JAHA.119.015221, 9, 5, (2020).
  • Machine Learning Framework to Identify Individuals at Risk of Rapid Progression of Coronary Atherosclerosis: From the PARADIGM Registry, Journal of the American Heart Association, 10.1161/JAHA.119.013958, 9, 5, (2020).
  • The prognostic value of the serum albumin level for long‐term prognosis in patients with acute pulmonary embolism, The Clinical Respiratory Journal, 10.1111/crj.13176, 14, 6, (578-585), (2020).
  • Prediction modelling - Part 1 - Regression modelling, Kidney International, 10.1016/j.kint.2020.02.007, (2020).
  • Frailty assessment and risk prediction by GRACE score in older patients with acute myocardial infarction, BMC Geriatrics, 10.1186/s12877-020-1500-9, 20, 1, (2020).
  • Impact of monocyte to high-density lipoprotein ratio on prevalent hyperuricemia: findings from a rural Chinese population, Lipids in Health and Disease, 10.1186/s12944-020-01226-6, 19, 1, (2020).
  • Degraded microarchitecture by low trabecular bone score is associated with prevalent vertebral fractures in patients with systemic lupus erythematosus, Archives of Osteoporosis, 10.1007/s11657-020-00726-3, 15, 1, (2020).
  • The Synergic Association of hs-CRP and Serum Amyloid P Component in Predicting All-Cause Mortality in Patients With Type 2 Diabetes, Diabetes Care, 10.2337/dc19-2489, 43, 5, (1025-1032), (2020).
  • Prognostic Relevance of Cardiorespiratory Fitness as Assessed by Submaximal Exercise Testing for All-Cause Mortality: A UK Biobank Prospective Study, Mayo Clinic Proceedings, 10.1016/j.mayocp.2019.12.030, 95, 5, (867-878), (2020).
  • Additional prognostic value of electrocardiographic left ventricular hypertrophy in traditional cardiovascular risk assessments in chronic kidney disease, Journal of Hypertension, 10.1097/HJH.0000000000002394, 38, 6, (1149-1157), (2020).
  • Validation of the Kidney Failure Risk Equation in Kidney Transplant Recipients, Canadian Journal of Kidney Health and Disease, 10.1177/2054358120922627, 7, (205435812092262), (2020).
  • Eosinopenia and elevated C-reactive protein facilitate triage of COVID-19 patients in fever clinic: A retrospective case-control study, EClinicalMedicine, 10.1016/j.eclinm.2020.100375, (100375), (2020).
  • Mental health, pain, and risk of drug misuse: a nationwide cohort study, Addictive Behaviors, 10.1016/j.addbeh.2020.106467, (106467), (2020).
  • Importancia pronóstica de la enfermedad arterial periférica diagnosticada mediante el índice tobillo-brazo en población general española, Atención Primaria, 10.1016/j.aprim.2020.03.005, (2020).
  • Impact of spine-hip discordance on fracture risk assessment and treatment qualification in Canada: the Manitoba BMD registry, Archives of Osteoporosis, 10.1007/s11657-020-00763-y, 15, 1, (2020).
  • Validating the doubly weighted genetic risk score for the prediction of type 2 diabetes in the Lifelines and Estonian Biobank cohorts, Genetic Epidemiology, 10.1002/gepi.22327, 44, 6, (589-600), (2020).
  • Admission Bedside Lung Ultrasound Reclassifies Mortality Prediction in Patients With ST-Segment–Elevation Myocardial Infarction, Circulation: Cardiovascular Imaging, 10.1161/CIRCIMAGING.119.010269, 13, 6, (2020).
  • Simultaneous Assessment of MicroRNAs 126 and 192 in Diabetic Nephropathy Patients and the Relation of these MicroRNAs with Urinary Albumin, Current Molecular Medicine, 10.2174/1566524019666191019103918, 20, 5, (361-371), (2020).
  • Bioactive Adrenomedullin, Organ Support Therapies, and Survival in the Critically Ill, Critical Care Medicine, 10.1097/CCM.0000000000004044, 48, 1, (49-55), (2020).
  • Comparative Performance of Body Composition Parameters in Prediction of Death in Hospitalized Patients on Maintenance Hemodialysis: A Cohort Study, Scientific Reports, 10.1038/s41598-020-67019-0, 10, 1, (2020).
  • Serum N-terminal pro-B-type natriuretic peptide as a predictor for future development of atrial fibrillation in a general population: the Hisayama Study, International Journal of Cardiology, 10.1016/j.ijcard.2020.06.018, (2020).
  • Biomarkers versus traditional risk factors to predict cardiovascular events in very old adults: cross-validated prospective cohort study, BMJ Open, 10.1136/bmjopen-2019-035809, 10, 6, (e035809), (2020).
  • Statistical inference for decision curve analysis, with applications to cataract diagnosis, Statistics in Medicine, 10.1002/sim.8588, 39, 22, (2980-3002), (2020).
  • Development of a nomogram to predict 30-day mortality of patients with sepsis-associated encephalopathy: a retrospective cohort study, Journal of Intensive Care, 10.1186/s40560-020-00459-y, 8, 1, (2020).
  • Biomarkers for Prediction of Cardiovascular Events in Community-Dwelling Adults Aged 40 or Older, International Heart Journal, 10.1536/ihj.19-240, (2020).
  • undefined, 2020 Seventh International Conference on eDemocracy & eGovernment (ICEDEG), 10.1109/ICEDEG48599.2020.9096851, (165-174), (2020).
  • 30-minute postload plasma glucose levels during an oral glucose tolerance test predict the risk of future type 2 diabetes: the Hisayama Study, BMJ Open Diabetes Research & Care, 10.1136/bmjdrc-2019-001156, 8, 1, (e001156), (2020).
  • Utility of the “omics” in kidney disease: Methods of analysis, sampling considerations, and technical approaches in renal biomarkers, Kidney Biomarkers, 10.1016/B978-0-12-815923-1.00002-X, (19-153), (2020).
  • Comparison of the CAMI-NSTEMI and GRACE Risk Model for Predicting In-Hospital Mortality in Chinese Non-ST-Segment Elevation Myocardial Infarction Patients, Cardiology Research and Practice, 10.1155/2020/2469281, 2020, (1-6), (2020).
  • An independent validation of the kidney failure risk equation in an Asian population, Scientific Reports, 10.1038/s41598-020-69715-3, 10, 1, (2020).
  • First‐trimester blood urea nitrogen and risk of gestational diabetes mellitus, Journal of Cellular and Molecular Medicine, 10.1111/jcmm.14924, 24, 4, (2416-2422), (2020).
  • Serum dickkopf-3 is associated with death and vascular events after ischemic stroke: an observational study from CATIS, Journal of Neuroinflammation, 10.1186/s12974-019-1680-4, 17, 1, (2020).
  • Amino acids levels in early pregnancy predict subsequent gestational diabetes, Journal of Diabetes, 10.1111/1753-0407.13018, 12, 7, (503-511), (2020).
  • A Data Censoring Approach for Predictive Error Modeling of Flow in Ephemeral Rivers, Water Resources Research, 10.1029/2019WR026128, 56, 1, (2020).
  • Biomarkers in patients with heart failure and central sleep apnoea: findings from the SERVE‐HF trial, ESC Heart Failure, 10.1002/ehf2.12521, 7, 2, (503-511), (2020).
  • Osteoprotegerin promotes intimal hyperplasia and contributes to in-stent restenosis: Role of an αVβ3/FAK dependent YAP pathway, Journal of Molecular and Cellular Cardiology, 10.1016/j.yjmcc.2020.01.006, (2020).
  • Association of soluble ST2 with all-cause and cardiovascular mortality in renal transplant recipients: a single-centre cohort study, BMC Nephrology, 10.1186/s12882-020-1690-6, 21, 1, (2020).
  • Plasma circular RNA hsa_circ_0001445 and coronary artery disease: Performance as a biomarker, The FASEB Journal, 10.1096/fj.201902507R, 34, 3, (4403-4414), (2020).
  • A threshold‐free summary index for quantifying the capacity of covariates to yield efficient treatment rules, Statistics in Medicine, 10.1002/sim.8481, 39, 9, (1362-1373), (2020).
  • Bleeding and recurrent VTE with apixaban vs warfarin as outpatient treatment: time-course and subgroup analyses, Blood Advances, 10.1182/bloodadvances.2019001081, 4, 2, (432-439), (2020).
  • Eosinopenia as a diagnostic marker of bloodstream infection in a general internal medicine setting: a cohort study, BMC Infectious Diseases, 10.1186/s12879-020-4814-5, 20, 1, (2020).
  • Association between non-high-density lipoprotein cholesterol and haemorrhagic transformation in patients with acute ischaemic stroke, BMC Neurology, 10.1186/s12883-020-1615-9, 20, 1, (2020).
  • Development of a Cardiovascular Disease Risk Prediction Model Using the Suita Study, a Population-Based Prospective Cohort Study in Japan, Journal of Atherosclerosis and Thrombosis, 10.5551/jat.48843, (2020).
  • Prehospital triage of acute aortic syndrome using a machine learning algorithm, BJS (British Journal of Surgery), 10.1002/bjs.11442, 107, 8, (995-1003), (2020).
  • Recalibration and validation of the Charlson Comorbidity Index in an Asian population: the National Health Insurance Service-National Sample Cohort study, Scientific Reports, 10.1038/s41598-020-70624-8, 10, 1, (2020).
  • Anaesthesia geriatric evaluation to guide patient selection for preoperative multidisciplinary team care in cardiac surgery, British Journal of Anaesthesia, 10.1016/j.bja.2019.12.042, (2020).
  • Exercise cardiac power and the risk of heart failure in men: A population-based follow-up study, Journal of Sport and Health Science, 10.1016/j.jshs.2020.02.008, (2020).
  • The Association and Predictive Ability of ECG Abnormalities with Cardiovascular Diseases: A Prospective Analysis, Global Heart, 10.5334/gh.790, 15, 1, (2020).
  • Evaluation of the usefulness of red blood cell distribution width in critically ill pediatric patients, Medicine, 10.1097/MD.0000000000022075, 99, 36, (e22075), (2020).
  • Pre-Pregnancy Obesity vs. Other Risk Factors in Probability Models of Preeclampsia and Gestational Hypertension, Nutrients, 10.3390/nu12092681, 12, 9, (2681), (2020).
  • Use of static cutoffs of hypertension to determine high cIMT in children and adolescents: An international collaboration study, Canadian Journal of Cardiology, 10.1016/j.cjca.2020.02.093, (2020).
  • A common variant in PNPLA3 is associated with age at diagnosis of NAFLD in patients from a multi-ethnic biobank, Journal of Hepatology, 10.1016/j.jhep.2020.01.029, (2020).
  • The 2018 ICM minor criteria for chronic hip and knee PJI: Validation from a single center, The Journal of Arthroplasty, 10.1016/j.arth.2020.03.014, (2020).
  • Associations of Serum Dickkopf‐1 and Sclerostin With Cardiovascular Events: Results From the Prospective Bruneck Study, Journal of the American Heart Association, 10.1161/JAHA.119.014816, 9, 6, (2020).
  • A Pharmaceutical Dispensing–based Index of Mortality Risk From Long-term Conditions Performed as well as Hospital Record–based Indices, Medical Care, 10.1097/MLR.0000000000001217, 58, 2, (e9-e16), (2020).
  • Including mRECIST in the Metroticket 2.0 criteria improves prediction of hepatocellular carcinoma-related death after liver transplant, Journal of Hepatology, 10.1016/j.jhep.2020.03.018, (2020).
  • Preoperative chronic kidney disease predicts poor prognosis in patients with primary non–muscle-invasive bladder cancer who underwent transurethral resection of bladder tumor, Urologic Oncology: Seminars and Original Investigations, 10.1016/j.urolonc.2020.02.001, (2020).
  • Long-Term Prognostic Value of Simultaneous Assessment of Atherosclerosis and Ischemia in Patients with Suspected Angina: Implications for Routine Use of Carotid Ultrasound during Stress Echocardiography, Journal of the American Society of Echocardiography, 10.1016/j.echo.2019.11.019, (2020).
  • Alzheimer's Disease Diagnosis Using Misfolding Proteins in Blood, Dementia and Neurocognitive Disorders, 10.12779/dnd.2020.19.1.1, 19, 1, (1), (2020).
  • Risk prediction model for lung cancer incorporating metabolic markers: Development and internal validation in a Chinese population, Cancer Medicine, 10.1002/cam4.3025, 9, 11, (3983-3994), (2020).
  • Early Pregnancy Prediction of Gestational Diabetes Mellitus Risk Using Prenatal Screening Biomarkers in Nulliparous Women, Diabetes Research and Clinical Practice, 10.1016/j.diabres.2020.108139, (108139), (2020).
  • A 17-Gene Panel Genomic Prostate Score has Similar Predictive Accuracy for Adverse Pathology at Radical Prostatectomy in African American and European American Men, Urology, 10.1016/j.urology.2020.01.052, (2020).
  • Preoperative Serum Fibrinogen is Associated With Acute Kidney Injury after Cardiac Valve Replacement Surgery, Scientific Reports, 10.1038/s41598-020-63522-6, 10, 1, (2020).
  • Validating the Framingham Hypertension Risk Score: A 4‐year follow‐up from the Brazilian Longitudinal Study of the Adult Health (ELSA‐Brasil), The Journal of Clinical Hypertension, 10.1111/jch.13855, 22, 5, (850-856), (2020).
  • Corrigendum to ‘Relationship between optical coherence tomography-derived morphological criteria and functional relevance as determined by fractional flow reserve’ [J. Cardiol. 71 (2018) 359–366/4], Journal of Cardiology, 10.1016/j.jjcc.2020.04.001, (2020).
  • Urine NGAL as a biomarker for septic AKI: a critical appraisal of clinical utility—data from the observational FINNAKI study, Annals of Intensive Care, 10.1186/s13613-020-00667-7, 10, 1, (2020).
  • Early prediction model of organ/space surgical site infection after elective gastrointestinal or hepatopancreatobiliary cancer surgery, Journal of Infection and Chemotherapy, 10.1016/j.jiac.2020.04.009, (2020).
  • Serum Eosinophil-derived Neurotoxin Better Reflect Asthma Control Status Than Blood Eosinophil Counts, The Journal of Allergy and Clinical Immunology: In Practice, 10.1016/j.jaip.2020.03.035, (2020).
  • Circulating MicroRNA-423-3p Improves the Prediction of Coronary Artery Disease in a General Population ― Six-Year Follow-up Results From the China-Cardiovascular Disease Study ―, Circulation Journal, 10.1253/circj.CJ-19-1181, (2020).
  • Usefulness of the Simplified Frailty Scale in Predicting Risk of Readmission or Mortality in Elderly Patients Hospitalized with Cardiovascular Disease, International Heart Journal, 10.1536/ihj.19-557, (2020).
  • Reducing Bias Due to Outcome Misclassification for Epidemiologic Studies Using EHR-derived Probabilistic Phenotypes, Epidemiology, 10.1097/EDE.0000000000001193, Publish Ahead of Print, (2020).
  • International prognostic indices in diffuse large B-cell lymphoma: a comparison of IPI, R-IPI, and NCCN-IPI, Blood, 10.1182/blood.2019002729, 135, 23, (2041-2048), (2020).
  • IgG Glycosylation Profile and the Glycan Score Are Associated with Type 2 Diabetes in Independent Chinese Populations: A Case-Control Study, Journal of Diabetes Research, 10.1155/2020/5041346, 2020, (1-8), (2020).
  • The first composite score predicting Digital Ulcers in systemic sclerosis patients using Clinical data, Imaging and Patient history—CIP-DUS, Arthritis Research & Therapy, 10.1186/s13075-020-02235-7, 22, 1, (2020).
  • Association between faecal haemoglobin concentration and the risk of cardiovascular diseases among Taiwanese adults in a community-based screening cohort, BMJ Open, 10.1136/bmjopen-2019-032633, 10, 6, (e032633), (2020).
  • Horibe GI bleeding prediction score: a simple score for triage decision-making in patients with suspected upper GI bleeding, Gastrointestinal Endoscopy, 10.1016/j.gie.2020.03.3846, (2020).
  • sTREM-1 predicts mortality in hospitalized patients with infection in a tropical, middle-income country, BMC Medicine, 10.1186/s12916-020-01627-5, 18, 1, (2020).
  • External validation of the European risk assessment tool for chronic cardio-metabolic disorders in a Middle Eastern population, Journal of Translational Medicine, 10.1186/s12967-020-02434-5, 18, 1, (2020).
  • Probing the relationship between late endogenous ERP components with fluid intelligence in healthy older adults, Scientific Reports, 10.1038/s41598-020-67924-4, 10, 1, (2020).
  • Penetrance of Breast and Ovarian Cancer in Women Who Carry a BRCA1/2 Mutation and Do Not Use Risk-Reducing Salpingo-Oophorectomy: An Updated Meta-Analysis, JNCI Cancer Spectrum, 10.1093/jncics/pkaa029, 4, 4, (2020).
  • Multi-biomarker strategy for prediction of myocardial dysfunction and mortality in sepsis预测脓毒症患者心脏功能障碍和死亡率的多生物标记物策略, Journal of Zhejiang University-SCIENCE B, 10.1631/jzus.B2000049, 21, 7, (537-548), (2020).
  • Role of Post-Stent Physiological Assessment in a Risk Prediction Model After Coronary Stent Implantation, JACC: Cardiovascular Interventions, 10.1016/j.jcin.2020.04.041, 13, 14, (1639-1650), (2020).
  • Left atrial cross-sectional area is a novel measure of atrial shape associated with cardioembolic strokes, Heart, 10.1136/heartjnl-2019-315964, 106, 15, (1176-1182), (2020).
  • Association of protein-energy wasting and inflammation status with mortality after coronary revascularisation in patients on haemodialysis, Open Heart, 10.1136/openhrt-2020-001276, 7, 2, (e001276), (2020).
  • Simple Methods for Evaluating 4 Types of Biomarkers: Surrogate Endpoint, Prognostic, Predictive, and Cancer Screening, Biomarker Insights, 10.1177/1177271920946715, 15, (117727192094671), (2020).
  • Kinesiophobia is not required to predict chronic low back pain in workers: a decision curve analysis, BMC Musculoskeletal Disorders, 10.1186/s12891-020-3186-8, 21, 1, (2020).
  • Radial Pulse Wave Signals Combined with Ba-PWV for the Risk Prediction of Hypertension and the Monitoring of Its Accompanying Metabolic Risk Factors, Evidence-Based Complementary and Alternative Medicine, 10.1155/2020/3926851, 2020, (1-9), (2020).
  • Prognostic importance of visit-to-visit blood pressure variability for micro- and macrovascular outcomes in patients with type 2 diabetes: The Rio de Janeiro Type 2 Diabetes Cohort Study, Cardiovascular Diabetology, 10.1186/s12933-020-01030-7, 19, 1, (2020).
  • Validation and Comparison of Tools for Selecting Individuals to Screen for Barrett’s Esophagus and Early Neoplasia, Gastroenterology, 10.1053/j.gastro.2020.02.037, (2020).
  • See more

The full text of this article hosted at iucr.org is unavailable due to technical difficulties.