Volume 25, Issue 4
Research Article

Probabilistic index: an intuitive non‐parametric approach to measuring the size of treatment effects

Laura Acion

Corresponding Author

E-mail address: laura‐acion@uiowa.edu

Department of Biostatistics, College of Public Health, University of Iowa, IA, U.S.A.

Department of Psychiatry, College of Medicine, University of Iowa, IA, U.S.A.

1‐192 MEB/Psychiatry Research, The University of Iowa College of Medicine, Iowa City, IA 52242‐1000, U.S.A.

John J. Peterson

Statistical Sciences Department, GlaxoSmithKline Pharmaceuticals, King of Prussia, PA, U.S.A.

Scott Temple

Department of Psychiatry, College of Medicine, University of Iowa, IA, U.S.A.

Stephan Arndt

Department of Biostatistics, College of Public Health, University of Iowa, IA, U.S.A.

Department of Psychiatry, College of Medicine, University of Iowa, IA, U.S.A.

First published: 05 September 2005
Citations: 149

An earlier version of portions of this paper was presented as a poster session at the Joint Statistical Meetings 2004, Toronto, Canada.

Abstract

Effect sizes (ES) tell the magnitude of the difference between treatments and, ideally, should tell clinicians how likely their patients are to benefit from the treatment. Currently used ES are expressed in statistical rather than in clinically useful terms and may not give clinicians the appropriate information. We restrict our discussion to studies with two groups: one with n patients receiving a new treatment and the other with m patients receiving the usual or no treatment. The standardized mean difference (e.g. Cohen's d) is a well‐known index for continuous outcomes. There is some intuitive value to d, but measuring improvement in standard deviations (SD) is a statistical concept that may not help a clinician. How much improvement is a half SD? A more intuitive and simple‐to‐calculate ES is the probability that the response of a randomly chosen patient given the new treatment (X) is better than that of a randomly chosen patient given the old or no treatment (Y) (i.e. P(X > Y), larger values meaning better outcomes). This probability is identical to the area under the curve (AUC) of the receiver operating characteristic (ROC) curve comparing responses to the two treatments. It can also be easily calculated from the Mann–Whitney U, Wilcoxon, or Kendall τ statistics. We describe the characteristics of an ideal ES. We propose P(X > Y) as an alternative index, summarize its correspondence with well‐known non‐parametric statistics, compare it to the standardized mean difference index, and illustrate with clinical data. Copyright © 2005 John Wiley & Sons, Ltd.
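
As an illustration of the calculation the abstract describes, the following is a minimal Python sketch, not the authors' code. It estimates P(X > Y) by counting, over all n × m treatment and control pairs, how often the treated response is larger; that count is the Mann–Whitney U statistic, and dividing it by n × m gives the index. Counting ties as one half is an assumption made here, as are the function names and the example scores, which are invented for illustration. The second function uses the standard result that, if both groups are normal with a common SD, P(X > Y) = Φ(d/√2), which gives one possible answer to the abstract's question about a half SD.

```python
from itertools import product
from math import sqrt
from statistics import NormalDist


def probabilistic_index(treated, control):
    """Estimate P(X > Y) as U / (n * m).

    Ties are counted as one half (an assumption of this sketch).
    """
    if not treated or not control:
        raise ValueError("both groups must be non-empty")
    u = 0.0  # Mann-Whitney U: number of pairs in which treated beats control
    for x, y in product(treated, control):
        if x > y:
            u += 1.0
        elif x == y:
            u += 0.5
    return u / (len(treated) * len(control))


def index_from_cohens_d(d):
    """P(X > Y) implied by Cohen's d, assuming normal outcomes with equal SD."""
    return NormalDist().cdf(d / sqrt(2))


# Hypothetical outcome scores (higher is better); not data from the paper.
new_treatment = [7, 9, 6, 8]
usual_care = [5, 6, 4, 7]

print(probabilistic_index(new_treatment, usual_care))  # 14 of 16 pairs -> 0.875
print(round(index_from_cohens_d(0.5), 2))  # a half SD corresponds to roughly 0.64
```

The pairwise loop is quadratic in the sample sizes, which is acceptable for typical two-arm clinical samples; a rank-based computation would be preferable for very large groups.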

