Homogeneity Score Test for the Intraclass Version of the Kappa Statistics and Sample‐Size Determination in Multiple or Stratified Studies
Abstract
Summary. When the intraclass correlation coefficient or the equivalent version of the kappa agreement coefficient have been estimated from several independent studies or from a stratified study, we have the problem of comparing the kappa statistics and combining the information regarding the kappa statistics in a common kappa when the assumption of homogeneity of kappa coefficients holds. In this article, using the likelihood score theory extended to nuisance parameters (Tarone, 1988, Communications in Statistics—Theory and Methods17(5), 1549–1556) we present an efficient homogeneity test for comparing several independent kappa statistics and, also, give a modified homogeneity score method using a noniterative and consistent estimator as an alternative. We provide the sample size using the modified homogeneity score method and compare it with that using the goodness‐of‐fit method (GOF) (Donner, Eliasziw, and Klar, 1996, Biometrics52, 176–183). A simulation study for small and moderate sample sizes showed that the actual level of the homogeneity score test using the maximum likelihood estimators (MLEs) of parameters is satisfactorily close to the nominal and it is smaller than those of the modified homogeneity score and the goodness‐of‐fit tests. We investigated statistical properties of several noniterative estimators of a common kappa. The estimator (Donner et al., 1996) is essentially efficient and can be used as an alternative to the iterative MLE. An efficient interval estimation of a common kappa using the likelihood score method is presented.
Citing Literature
Number of times cited according to CrossRef: 17
- Chikara Honda, Tetsuji Ohyama, Homogeneity score test of AC1 statistics and estimation of common AC1 in multiple or stratified inter-rater agreement studies, BMC Medical Research Methodology, 10.1186/s12874-019-0887-5, 20, 1, (2020).
- Muammer Albayrak, Kemal Turhan, Yasemin Yavuz, Zeliha Aydin Kasap, kaphom: An R package for testing the homogeneity of intra-class kappa statistics, Communications in Statistics - Simulation and Computation, 10.1080/03610918.2018.1538457, (1-16), (2019).
- Tetsuji Ohyama, Statistical inference of agreement coefficient between two raters with binary outcomes, Communications in Statistics - Theory and Methods, 10.1080/03610926.2019.1576894, (1-11), (2019).
- M. Ganjali, N. Moradzadeh, T. Baghfalaki, Bayesian testing of agreement criteria under order constraints, Journal of the Korean Statistical Society, 10.1016/j.jkss.2016.06.004, 46, 1, (78-87), (2017).
- Yan Guo, Kasey Vickers, Yanhua Xiong, Shilin Zhao, Quanhu Sheng, Pan Zhang, Wanding Zhou, Charles R. Flynn, Comprehensive evaluation of extracellular small RNA isolation methods from serum in high throughput sequencing, BMC Genomics, 10.1186/s12864-016-3470-z, 18, 1, (2017).
- Zhao Yang, Ming Zhou, Kappa statistic for clustered matched‐pair data, Statistics in Medicine, 10.1002/sim.6113, 33, 15, (2612-2633), (2014).
- Gregory E. Wilding, Joseph D. Consiglio, Guogen Shan, Exact approaches for testing hypotheses based on the intra‐class kappa coefficient, Statistics in Medicine, 10.1002/sim.6135, 33, 17, (2998-3012), (2014).
- Nian-Sheng Tang, Bo Zhang, Hu-Qiong Li, Homogeneity Test of Difference Between Two Correlated Proportions in Stratified Matched-Pair Studies, Journal of Biopharmaceutical Statistics, 10.1080/10543406.2013.834915, 23, 6, (1261-1280), (2013).
- Hui-Qiong Li, Liu-Cang Wu, Sample Size Determination via Non-unity Relative Risk for Stratified Matched-Pair Studies, Modeling Risk Management for Resources and Environment in China, 10.1007/978-3-642-18387-4_54, (493-500), (2011).
- Hui‐Qiong Li, Nian‐Sheng Tang, Homogeneity test of rate ratios in stratified matched‐pair studies, Biometrical Journal, 10.1002/bimj.201000074, 53, 4, (614-627), (2011).
- Adelmo de Souza Machado Neto, Tarcisio Matos Andrade, Gilênio Borges Fernandes, Helder Paulo Zacharias, Fernando Martins Carvalho, Ana Paula Souza Machado, Ana Carmen Costa Dias, Ana Carolina Rocha Garcia, Lauro Reis Santana, Carlos Eduardo Rolin, Cyntia Sampaio, Gisele Ghiraldi, Francisco Inácio Bastos, Reliability of a questionnaire on substance use among adolescent students, Brazil, Revista de Saúde Pública, 10.1590/S0034-89102010000500008, 44, 5, (830-839), (2010).
- Shun‐Fang Wang, Nian‐Sheng Tang, Bo Zhang, Xue‐Ren Wang, Statistical inference of risk difference in K correlated 2×2 tables with structural zero, Pharmaceutical Statistics, 10.1002/pst.360, 8, 4, (317-332), (2009).
- Hui-Qiong Li, Nian-Sheng Tang, Liu-Cang Wu, Statistical analysis of non-inferiority via non-zero risk difference in stratified matched-pair studies, Journal of Statistical Planning and Inference, 10.1016/j.jspi.2008.03.001, 138, 12, (4055-4067), (2008).
- Chris Roberts, Modelling patterns of agreement for nominal scales, Statistics in Medicine, 10.1002/sim.2945, 27, 6, (810-830), (2007).
- Ty A. Ridenour, Bethany C. Bray, Linda B. Cottler, Reliability of use, abuse, and dependence of four types of inhalants in adolescents and young adults, Drug and Alcohol Dependence, 10.1016/j.drugalcdep.2007.05.004, 91, 1, (40-49), (2007).
- Jun‐mo Nam, Assessment on homogeneity tests for kappa statistics under equal prevalence across studies in reliability, Statistics in Medicine, 10.1002/sim.2321, 25, 9, (1521-1531), (2005).
- G. Y. Zou, Statistical Methods for the Analysis of Genetic Association Studies, Annals of Human Genetics, 10.1111/j.1529-8817.2005.00213.x, 70, 2, (262-276), (2005).




