Volume 16, Issue 9
Research Article

A COMPARISON OF GOODNESS‐OF‐FIT TESTS FOR THE LOGISTIC REGRESSION MODEL

D. W. HOSMER

Department of Biostatistics and Epidemiology, University of Massachusetts, Arnold House, Box 30430, Amherst, MA 01004‐0430, U.S.A.

Search for more papers by this author
T. HOSMER

University Computer Centre, University of Massachusetts, Amherst, MA 01004, U.S.A.

Search for more papers by this author
S. LE CESSIE

Department of Medical Statistics, University of Leiden, The Netherlands

Search for more papers by this author
S. LEMESHOW

Department of Biostatistics and Epidemiology, University of Massachusetts, Arnold House, Box 30430, Amherst, MA 01004‐0430, U.S.A.

Search for more papers by this author

Abstract

Recent work has shown that there may be disadvantages in the use of the chi‐square‐like goodness‐of‐fit tests for the logistic regression model proposed by Hosmer and Lemeshow that use fixed groups of the estimated probabilities. A particular concern with these grouping strategies based on estimated probabilities, fitted values, is that groups may contain subjects with widely different values of the covariates. It is possible to demonstrate situations where one set of fixed groups shows the model fits while the test rejects fit using a different set of fixed groups. We compare the performance by simulation of these tests to tests based on smoothed residuals proposed by le Cessie and Van Houwelingen and Royston, a score test for an extended logistic regression model proposed by Stukel, the Pearson chi‐square and the unweighted residual sum‐of‐ squares. These simulations demonstrate that all but one of Royston's tests have the correct size. An examination of the performance of the tests when the correct model has a quadratic term but a model containing only the linear term has been fit shows that the Pearson chi‐square, the unweighted sum‐of‐squares, the Hosmer–Lemeshow decile of risk, the smoothed residual sum‐of‐squares and Stukel's score test, have power exceeding 50 per cent to detect moderate departures from linearity when the sample size is 100 and have power over 90 per cent for these same alternatives for samples of size 500. All tests had no power when the correct model had an interaction between a dichotomous and continuous covariate but only the continuous covariate model was fit. Power to detect an incorrectly specified link was poor for samples of size 100. For samples of size 500 Stukel's score test had the best power but it only exceeded 50 per cent to detect an asymmetric link function. The power of the unweighted sum‐of‐squares test to detect an incorrectly specified link function was slightly less than Stukel's score test. We illustrate the tests within the context of a model for factors associated with low birth weight. © 1997 by John Wiley & Sons, Ltd. Stat. Med., Vol. 16, 965–980 (1997).

Number of times cited according to CrossRef: 957

  • Ecological indicators of near-surface permafrost habitat at the southern margin of the boreal forest in China, Ecological Indicators, 10.1016/j.ecolind.2019.105714, 108, (105714), (2020).
  • Leveraging Electronic Health Records and Machine Learning to Tailor Nursing Care for Patients at High Risk for Readmissions, Journal of Nursing Care Quality, 10.1097/NCQ.0000000000000412, 35, 1, (27-33), (2020).
  • Prediction modelling - Part 1 - Regression modelling, Kidney International, 10.1016/j.kint.2020.02.007, (2020).
  • Mortality prediction by SOFA score in ICU-patients after cardiac surgery; comparison with traditional prognostic–models, BMC Anesthesiology, 10.1186/s12871-020-00975-2, 20, 1, (2020).
  • The current wave and determinants of brain-drain migration from China, Human Systems Management, 10.3233/HSM-190622, (1-14), (2020).
  • Malignant peritoneal cytology and decreased survival of women with stage I endometrioid endometrial cancer, European Journal of Cancer, 10.1016/j.ejca.2020.03.031, 133, (33-46), (2020).
  • Implementation of colon surgical site infection prevention bundle – the successes and challenges, American Journal of Infection Control, 10.1016/j.ajic.2020.05.010, (2020).
  • Hemoplasmas Are Endemic and Cause Asymptomatic Infection in the Endangered Darwin’s Fox ( Lycalopex fulvipes ) , Applied and Environmental Microbiology, 10.1128/AEM.00779-20, 86, 12, (2020).
  • Does exposure to workplace hazards cluster by occupational or sociodemographic characteristics? An analysis of foreign‐born workers in Australia, American Journal of Industrial Medicine, 10.1002/ajim.23146, 63, 9, (803-816), (2020).
  • Awareness about and willingness to use long-acting injectable pre-exposure prophylaxis (LAI-PrEP) among people who use drugs, Journal of Substance Abuse Treatment, 10.1016/j.jsat.2020.108058, (108058), (2020).
  • sTREM-1 predicts mortality in hospitalized patients with infection in a tropical, middle-income country, BMC Medicine, 10.1186/s12916-020-01627-5, 18, 1, (2020).
  • Predicting Recurrent Instability of the Shoulder (PRIS): A Valid Tool to Predict Which Patients Will Not Have Repeat Shoulder Instability After First-Time Traumatic Anterior Dislocation, Journal of Orthopaedic & Sports Physical Therapy, 10.2519/jospt.2020.9284, 50, 8, (431-437), (2020).
  • Machine Learning Predicts Prolonged Acute Hypoxemic Respiratory Failure in Pediatric Severe Influenza, Critical Care Explorations, 10.1097/CCE.0000000000000175, 2, 8, (e0175), (2020).
  • Logistic regression modeling of implementation of corporate safety policy in international infrastructures, Engineering, Construction and Architectural Management, 10.1108/ECAM-03-2019-0155, ahead-of-print, ahead-of-print, (2020).
  • Plasma Glycated CD59 Predicts Early Gestational Diabetes and Large for Gestational Age Newborns, The Journal of Clinical Endocrinology & Metabolism, 10.1210/clinem/dgaa087, 105, 4, (2020).
  • Validating a methodology to measure frailty syndromes at hospital level utilising administrative data, Clinical Medicine, 10.7861/clinmed.2019-0249, 20, 2, (183-188), (2020).
  • Delayed Hyponatremia Following Surgery for Pituitary Adenomas: An Under-recognized Complication, Neurology India, 10.4103/0028-3886.280637, 0, 0, (0), (2020).
  • Severe functional limitation due to pain & emotional distress and subsequent receipt of prescription medications among older adults with cancer, Journal of Geriatric Oncology, 10.1016/j.jgo.2020.02.006, (2020).
  • Logistic Regression and Related Methods, Principles and Practice of Clinical Trials, 10.1007/978-3-319-52677-5, (1-23), (2020).
  • Quiescent stem cell marker genes in glioma gene networks are sufficient to distinguish between normal and glioblastoma (GBM) samples, Scientific Reports, 10.1038/s41598-020-67753-5, 10, 1, (2020).
  • Exosomes co‐expressing AQP5‐targeting miRNAs and IL‐4 receptor‐binding peptide inhibit the migration of human breast cancer cells, The FASEB Journal, 10.1096/fj.201902434R, 34, 2, (3379-3398), (2020).
  • S100A6 is a positive regulator of PPP5C‐FKBP51‐dependent regulation of endothelial calcium signaling, The FASEB Journal, 10.1096/fj.201901777R, 34, 2, (3179-3196), (2020).
  • Interleukin 35 ameliorates myocardial ischemia‐reperfusion injury by activating the gp130‐STAT3 axis, The FASEB Journal, 10.1096/fj.201901718RR, 34, 2, (3224-3238), (2020).
  • Prospective development of a prostate cancer risk calculator in a racially diverse population: The Kaiser Permanente Prostate Cancer Risk Calculator, Urologic Oncology: Seminars and Original Investigations, 10.1016/j.urolonc.2020.05.011, (2020).
  • Urine Neutrophil Gelatinase-associated Lipocalin (NGAL) for Prediction of Persistent AKI and Major Adverse Kidney Events, Scientific Reports, 10.1038/s41598-020-65764-w, 10, 1, (2020).
  • Protocol for a prospective evaluation of postpartum engagement in HIV care among women living with HIV in South Africa, BMJ Open, 10.1136/bmjopen-2019-035465, 10, 1, (e035465), (2020).
  • Populational trends and outcomes of postoperative radiotherapy for high-risk early-stage cervical cancer with lymph node metastasis: concurrent chemo-radiotherapy versus radiotherapy alone, American Journal of Obstetrics and Gynecology, 10.1016/j.ajog.2019.10.010, 222, 5, (484.e1-484.e15), (2020).
  • The association between parents and children meeting physical activity guidelines, Journal of Pediatric Nursing, 10.1016/j.pedn.2020.03.007, 52, (70-75), (2020).
  • Logistic Regression and Related Methods, Principles and Practice of Clinical Trials, 10.1007/978-3-319-52677-5, (1-23), (2020).
  • Bioengineering for multiple PAHs degradation for contaminated sediments: Response surface methodology (RSM) and artificial neural network (ANN), Chemometrics and Intelligent Laboratory Systems, 10.1016/j.chemolab.2020.104033, (104033), (2020).
  • Goodness‐of‐fit testing in high dimensional generalized linear models, Journal of the Royal Statistical Society: Series B (Statistical Methodology), 10.1111/rssb.12371, 82, 3, (773-795), (2020).
  • Exposure to baby-friendly hospital practices and mothers’ achievement of their planned duration of breastfeeding, BMC Pregnancy and Childbirth, 10.1186/s12884-020-02904-0, 20, 1, (2020).
  • Disparities in the prevalence and risk factors of anaemia among children aged 6–24 months and 25–59 months in Ethiopia, Journal of Nutritional Science, 10.1017/jns.2020.29, 9, (2020).
  • Mental distress in orthodontic patients during the coronavirus disease 2019 pandemic, American Journal of Orthodontics and Dentofacial Orthopedics, 10.1016/j.ajodo.2020.07.005, (2020).
  • Evaluation of POSSUM scoring systems in predicting postoperative morbidity and mortality in indian patients operated for esophageal cancer, Bali Journal of Anesthesiology, 10.4103/BJOA.BJOA_13_20, 4, 2, (53), (2020).
  • A Scoring System to Predict No-reflow Phenomenon in Elective Percutaneous Coronary Intervention: The RECOVER Score, Current Problems in Cardiology, 10.1016/j.cpcardiol.2020.100676, (100676), (2020).
  • Abhängige Variablen mit begrenztem Wertebereich, Regressionsanalyse in der empirischen Wirtschafts- und Sozialforschung Band 2, 10.1007/978-3-662-61438-9, (29-107), (2020).
  • Early detection of black Sigatoka in banana leaves using hyperspectral images, Applications in Plant Sciences, 10.1002/aps3.11383, 8, 8, (2020).
  • Can Ratios Between Prognostic Factors Predict the Clinical Pregnancy Rate in an IVF/ICSI Program with a GnRH Agonist-FSH/hMG Protocol? An Assessment of 2421 Embryo Transfers, and a Review of the Literature, Reproductive Sciences, 10.1007/s43032-020-00307-2, (2020).
  • Association of Cardiovascular Mortality and Deep Learning-Funduscopic Atherosclerosis Score derived from Retinal Fundus Images, American Journal of Ophthalmology, 10.1016/j.ajo.2020.03.027, (2020).
  • Cancer cachexia: comparing diagnostic criteria in patients with incurable cancer, Nutrition, 10.1016/j.nut.2020.110945, (110945), (2020).
  • The impact of employee participation in online innovation communities on idea quality, Kybernetes, 10.1108/K-04-2020-0228, ahead-of-print, ahead-of-print, (2020).
  • Statistical characterization of frost zones: Case of tea freeze damage in the Kenyan highlands, International Journal of Applied Earth Observation and Geoinformation, 10.1016/j.jag.2019.101971, 84, (101971), (2020).
  • Equipping the American Joint Committee on Cancer Staging for Resectable Pancreatic Ductal Adenocarcinoma with Tumor Grade: A Novel Staging System, Journal of Oncology, 10.1155/2020/9093729, 2020, (1-9), (2020).
  • Predictors of Long-Term Pain After Hip Arthroplasty in Patients With Femoral Neck Fractures: A Cohort Study, Journal of Orthopaedic Trauma, 10.1097/BOT.0000000000001929, 34, 3, (S55-S63), (2020).
  • Expressed breast milk feeding practices in Hong Kong Chinese women: a descriptive study, Midwifery, 10.1016/j.midw.2020.102835, (102835), (2020).
  • Recruiting migrant workers in Australia for Public Health surveys: how sampling strategy make a difference in estimates of workplace hazards, BMC Research Notes, 10.1186/s13104-020-05320-x, 13, 1, (2020).
  • Development of a Risk Model for Pediatric Hospital-Acquired Thrombosis: A Report from the CHAT Consortium, The Journal of Pediatrics, 10.1016/j.jpeds.2020.09.016, (2020).
  • Risk factors for hospitalization in youth with type 1 diabetes: Development and validation of a multivariable prediction model, Pediatric Diabetes, 10.1111/pedi.13090, 21, 7, (1268-1276), (2020).
  • Extended SAFPH℞ (Systems Analysis for Formal Pharmaceutical Human Reliability): Two approaches based on extended CREAM and a comparative analysis, Safety Science, 10.1016/j.ssci.2020.104944, 132, (104944), (2020).
  • Transfusion, Not Just Injury Severity, Leads to Posttrauma Infection: A Matched Cohort Study, The American Surgeon, 10.1177/000313480907500408, 75, 4, (307-312), (2020).
  • Social Networks: Their Role in Access to Financial Services in Britain, National Institute Economic Review, 10.1177/002795010418900110, 189, (99-109), (2020).
  • Mediation Effect of Health Beliefs in the Relationship Between Health Knowledge and Uptake of Mammography in a National Breast Cancer Screening Program in Taiwan, Journal of Cancer Education, 10.1007/s13187-020-01711-7, (2020).
  • Effort avoidance is not simply error avoidance, Psychological Research, 10.1007/s00426-020-01331-2, (2020).
  • Using experimental manipulation of questionnaire design and a Kenyan panel to test for the reliability of reported perceptions of climate change and adaptation, Climatic Change, 10.1007/s10584-020-02709-2, (2020).
  • Predictors of Health Insurance Enrollment among HIV Positive Pregnant Women in Kenya: Potential for Adverse Selection and Implications for HIV Treatment and Prevention, International Journal of Environmental Research and Public Health, 10.3390/ijerph17082892, 17, 8, (2892), (2020).
  • Do the Elderly Need Wider Parking Spaces? Evidence from Experimental and Questionnaire Surveys, Sustainability, 10.3390/su12093800, 12, 9, (3800), (2020).
  • A Sign of the Crimes: Examining Officers’ Identification of, and Arrest for, Stalking in Domestic Violence Complaints, Police Quarterly, 10.1177/1098611120923155, (109861112092315), (2020).
  • Inferences About the Probability of Success, Given the Value of a Covariate, Using a Nonparametric Smoother, Journal of Modern Applied Statistical Methods, 10.22237/jmasm/1556670240, 18, 1, (2020).
  • Determinants of motor, language, cognitive, and global developmental delay in children with complicated severe acute malnutrition at the time of discharge: An observational study from Central India, PLOS ONE, 10.1371/journal.pone.0233949, 15, 6, (e0233949), (2020).
  • Improved wrong-model inference for generalized linear models for binary responses in the presence of link misspecification, Statistical Methods & Applications, 10.1007/s10260-020-00529-3, (2020).
  • Albumin–Bilirubin (ALBI) Grade-Based Nomogram for Patients with Hepatocellular Carcinoma Undergoing Transarterial Chemoembolization, Digestive Diseases and Sciences, 10.1007/s10620-020-06384-2, (2020).
  • Development and validation of risk prediction models for multiple cardiovascular diseases and Type 2 diabetes, PLOS ONE, 10.1371/journal.pone.0235758, 15, 7, (e0235758), (2020).
  • Multivariable prediction model to estimate the probability of restenosis at proximal edge after 2nd-generation drug-eluting-stent implantation: development and internal validation using a quantitative coronary angiography from the post-marketing surveillance studies of everolimus-eluting stent in Japan, Cardiovascular Intervention and Therapeutics, 10.1007/s12928-020-00666-2, (2020).
  • A 2-Biomarker Model Augments Clinical Prediction of Mortality in Melioidosis, Clinical Infectious Diseases, 10.1093/cid/ciaa126, (2020).
  • iCARE: An R package to build, validate and apply absolute risk models, PLOS ONE, 10.1371/journal.pone.0228198, 15, 2, (e0228198), (2020).
  • Comparison of the Peritoneal Cancer Index and Dutch region count as tools to stage patients with peritoneal metastases of colorectal cancer, BJS Open, 10.1002/bjs5.50313, 0, 0, (2020).
  • Malnutrition and its associated factors among elderly Chinese with physical functional dependency, Public Health Nutrition, 10.1017/S1368980019005299, (1-11), (2020).
  • Factors Associated With the Choice of Oral Anticoagulant Class in the Older Patients: An Observational Study, Journal of Cardiovascular Pharmacology and Therapeutics, 10.1177/1074248420917811, (107424842091781), (2020).
  • Validation and update of the thoracic surgery scoring system (Thoracoscore) risk model, European Journal of Cardio-Thoracic Surgery, 10.1093/ejcts/ezaa056, (2020).
  • Artificial intelligence–enabled rapid diagnosis of patients with COVID-19, Nature Medicine, 10.1038/s41591-020-0931-3, (2020).
  • Comparison of Four Risk Prediction Models for Diabetes Remission after Roux-en-Y Gastric Bypass Surgery in Obese Chinese Patients with Type 2 Diabetes Mellitus, Obesity Surgery, 10.1007/s11695-019-04371-9, (2020).
  • Willingness to Use HIV Self-Testing and Associated Factors Among Transgender Women in Malaysia, Transgender Health, 10.1089/trgh.2019.0085, (2020).
  • Specialist palliative cancer care in acute hospitals and place of death: a population study, BMJ Supportive & Palliative Care, 10.1136/bmjspcare-2020-002232, (bmjspcare-2020-002232), (2020).
  • Why Let the Dogs Out? Exploring Variables Associated with Dog Confinement and General Characteristics of the Free-ranging Owned-Dog Population in a Peri-urban Area, Journal of Applied Animal Welfare Science, 10.1080/10888705.2020.1820334, (1-15), (2020).
  • Engagement in Harm Reduction Strategies After Suspected Fentanyl Contamination Among Opioid-Dependent Individuals, Journal of Community Health, 10.1007/s10900-020-00928-3, (2020).
  • Comparison of revised Functional Capacity Index scores with Abbreviated Injury Scale 2008 scores in predicting 12-month severe trauma outcomes, Injury Prevention, 10.1136/injuryprev-2018-043085, 26, 2, (138-146), (2019).
  • Big Data in Total Shoulder Arthroplasty: An In-depth Comparison of National Outcomes Databases, Journal of the American Academy of Orthopaedic Surgeons, 10.5435/JAAOS-D-19-00173, 28, 14, (e626-e632), (2019).
  • Validation of the Norwegian survival prediction model in trauma (NORMIT) in Swedish trauma populations, BJS (British Journal of Surgery), 10.1002/bjs.11306, 107, 4, (381-390), (2019).
  • RTK signaling requires C3ar1/C5ar1 and IL‐6R joint signaling to repress dominant PTEN, SOCS1/3 and PHLPP restraint, The FASEB Journal, 10.1096/fj.201900677R, 34, 2, (2105-2125), (2019).
  • Time‐course of sodium transport along the nephron in nephrotic syndrome: The role of potassium, The FASEB Journal, 10.1096/fj.201901345R, 34, 2, (2408-2424), (2019).
  • Comparison of two simple models for prediction of short term mortality in patients after severe traumatic brain injury, Injury, 10.1016/j.injury.2018.08.022, 50, 1, (65-72), (2019).
  • A novel risk prediction model for 30-day severe adverse events and readmissions following bariatric surgery based on the MBSAQIP database, Surgery for Obesity and Related Diseases, 10.1016/j.soard.2019.03.005, (2019).
  • Predictive performance of the SOFA and mSOFA scoring systems for predicting in-hospital mortality in the emergency department, The American Journal of Emergency Medicine, 10.1016/j.ajem.2018.09.011, 37, 7, (1237-1241), (2019).
  • Examining factors associated with adherence to hormonal therapy in breast cancer patients, Research in Social and Administrative Pharmacy, 10.1016/j.sapharm.2019.08.005, (2019).
  • Retrospective Analysis of the Effect of Postdischarge Telephone Calls by Hospitalists on Improvement of Patient Satisfaction and Readmission Rates, Southern Medical Journal, 10.14423/SMJ.0000000000000994, 112, 7, (357-362), (2019).
  • Determinants of environmental conservation in Lake Tana Biosphere Reserve, Ethiopia, Heliyon, 10.1016/j.heliyon.2019.e01997, 5, 7, (e01997), (2019).
  • Health parameters and their association with price in young calves sold at auction for veal operations in Québec, Canada, Journal of Dairy Science, 10.3168/jds.2018-16051, 102, 7, (6454-6465), (2019).
  • Combining procalcitonin with the qSOFA and sepsis mortality prediction, Medicine, 10.1097/MD.0000000000015981, 98, 23, (e15981), (2019).
  • Predicting protein-ligand interactions based on bow-pharmacological space and Bayesian additive regression trees, Scientific Reports, 10.1038/s41598-019-43125-6, 9, 1, (2019).
  • Accurate WiFi Localization by Unsupervised Fusion of Extended Candidate Location Set, IEEE Internet of Things Journal, 10.1109/JIOT.2018.2870659, 6, 2, (2476-2485), (2019).
  • References, Leading and Managing Change in the Age of Disruption and Artificial Intelligence, 10.1108/9781787563674, (169-192), (2019).
  • Examining health literacy on cholera in an endemic community in Accra, Ghana: a cross-sectional study, Tropical Medicine and Health, 10.1186/s41182-019-0157-6, 47, 1, (2019).
  • Influential design factors on occupant satisfaction with indoor environment in workplaces, Building and Environment, 10.1016/j.buildenv.2019.05.002, (2019).
  • Determinants of functional decline in older adults experiencing cancer (the INCAPAC study), Journal of Geriatric Oncology, 10.1016/j.jgo.2019.03.006, (2019).
  • Development of predictive nomograms for clinical use to quantify the risk of isolating resistance prone organisms in patients with infected foot ulcers, Epidemiology and Infection, 10.1017/S0950268818003667, 147, (2019).
  • Risk Factors for Complications in Children with Staphylococcus aureus Bacteremia, The Journal of Pediatrics, 10.1016/j.jpeds.2018.12.002, (2019).
  • Developing Prediction Models for 30-Day Unplanned Readmission Among Children With Medical Complexity, Hospital Pediatrics, 10.1542/hpeds.2018-0174, 9, 3, (201-208), (2019).
  • New risk prediction model of coronary heart disease in participants with and without diabetes: Assessments of the Framingham risk and Suita scores in 3-year longitudinal database in a Japanese population, Scientific Reports, 10.1038/s41598-019-39049-w, 9, 1, (2019).
  • Predictors for pathological parametrial invasion in clinical stage IIB cervical cancer, European Journal of Surgical Oncology, 10.1016/j.ejso.2019.02.019, (2019).
  • See more

The full text of this article hosted at iucr.org is unavailable due to technical difficulties.