Volume 51, Issue 4
Original Article

A Comparison of Different Psychometric Approaches to Modeling Testlet Structures: An Example with C‐Tests

Alexander Robitzsch

Federal Institute for Education Research, Innovation & Development of the Austrian Schooling System (BIFIE Salzburg)

Search for more papers by this author
Stefan Schipolowski

Institute for Educational Quality Improvement, Humboldt‐Universität zu Berlin

Search for more papers by this author
First published: 10 December 2014
Citations: 5

During the preparation of this manuscript, Stefan Schipolowski was a fellow of the International Max Planck Research School “The Life Course: Evolutionary and Ontogenetic Dynamics (LIFE).”

Abstract

C‐tests are a specific variant of cloze tests that are considered time‐efficient, valid indicators of general language proficiency. They are commonly analyzed with models of item response theory assuming local item independence. In this article we estimated local interdependencies for 12 C‐tests and compared the changes in item difficulties, reliability estimates, and person parameter estimates for different modeling approaches: (a) Rasch, (b) testlet, (c) partial credit, and (d) copula models. The results are complemented with findings of a simulation study in which sample size, number of testlets, and strength of residual correlations between items were systematically manipulated. Results are discussed with regard to the pivotal question whether residual dependencies between items are an artifact or part of the construct.

Number of times cited according to CrossRef: 5

  • Vergleichende Messung fachsprachlicher Fähigkeiten in den Domänen Physik und SportComparative analysis of special language skills in the domains of physics and sports, Zeitschrift für Didaktik der Naturwissenschaften, 10.1007/s40573-017-0055-6, 23, 1, (53-69), (2017).
  • Setting cut scores on an EFL placement test using the prototype group method: A receiver operating characteristic (ROC) analysis, Language Testing, 10.1177/0265532216672703, 34, 3, (383-411), (2016).
  • Analyses of Testlet Data, Pacific Rim Objective Measurement Symposium (PROMS) 2015 Conference Proceedings, 10.1007/978-981-10-1687-5, (199-214), (2016).
  • Evidence for a positive relation between interest and achievement: Examining between-person and within-person variation in five domains, Contemporary Educational Psychology, 10.1016/j.cedpsych.2016.05.004, 46, (116-127), (2016).
  • Analyse der Aufgaben zur Evaluation der Bildungsstandards in Physik - Differenzierung von schriftsprachlichen Fähigkeiten und FachlichkeitAnalyses of the tasks for evaluating the educational standards in physics – Differentiation between written language proficiency and content knowledge, Zeitschrift für Erziehungswissenschaft, 10.1007/s11618-015-0646-2, 18, 4, (763-779), (2015).

The full text of this article hosted at iucr.org is unavailable due to technical difficulties.