Volume 25, Issue 9
Research Article

Assessment on homogeneity tests for kappa statistics under equal prevalence across studies in reliability

Jun‐mo Nam

Corresponding Author

E-mail address: namj@mail.nih.gov

Biostatistics Branch, Division of Cancer Epidemiology and Genetics, National Cancer Institute, National Institutes of Health, Department of Health & Human Services, Executive Plaza South, Room 8028, 6120 Executive Boulevard, MSC 7240, Rockville, MD 20892‐7240, U.S.A.

Biostatistics Branch, Division of Cancer Epidemiology and Genetics, National Cancer Institute, National Institutes of Health, Department of Health & Human Services, Executive Plaza South, Room 8028, 6120 Executive Boulevard, MSC 7240, Rockville, MD 20892‐7240, U.S.ASearch for more papers by this author
First published: 11 October 2005
Citations: 5

This article is a U.S. Government work and is in the public domain in the U.S.A.

Abstract

In this paper, we assess the performance of homogeneity tests for two or more kappa statistics when prevalence rates across reliability studies are assumed to be equal. The likelihood score method and the chi‐square goodness‐of‐fit (GOF) test provide type 1 error rates that are satisfactorily close to the nominal level, but a Fleiss‐like test is not satisfactory for small or moderate sample sizes. Simulations show that the score test is more powerful than the chi‐square GOF test and the approximate sample size required for a specific power of the former is substantially smaller than the latter. In addition, the score test is robust to deviations from the equal prevalence assumption, while the GOF test is highly sensitive and it may give a grossly misleading type 1 error rate when the assumption of equal prevalence is violated. We conclude that the homogeneity score test is the preferred method. Published in 2005 by John Wiley & Sons, Ltd.

Number of times cited according to CrossRef: 5

  • kaphom: An R package for testing the homogeneity of intra-class kappa statistics, Communications in Statistics - Simulation and Computation, 10.1080/03610918.2018.1538457, (1-16), (2019).
  • Exact approaches for testing hypotheses based on the intra‐class kappa coefficient, Statistics in Medicine, 10.1002/sim.6135, 33, 17, (2998-3012), (2014).
  • Paramedic Determinations of Medical Necessity: A Meta-Analysis, Prehospital Emergency Care, 10.1080/10903120903144809, 13, 4, (516-527), (2009).
  • Sample Size for Comparing Correlated Concordance Rates, Journal of Biopharmaceutical Statistics, 10.1080/10543400701697216, 18, 2, (359-369), (2008).
  • Diagnosing Barrettʼs oesophagus: factors related to agreement between endoscopy and histology, European Journal of Gastroenterology & Hepatology, 10.1097/MEG.0b013e3282cf5018, 19, 10, (870-877), (2007).

The full text of this article hosted at iucr.org is unavailable due to technical difficulties.