The effect of a ‘don't know’ option on test scores: number-right and formula scoring compared
Article first published online: 4 JAN 2002
Volume 33, Issue 4, pages 267–275, April 1999
How to Cite
Muijtjens, Mameren, H. v., Hoogenboom, Evers and Vleuten, C. P. M. v. d. (1999), The effect of a ‘don't know’ option on test scores: number-right and formula scoring compared. Medical Education, 33: 267–275. doi: 10.1046/j.1365-2923.1999.00292.x
- Issue published online: 4 JAN 2002
- Article first published online: 4 JAN 2002
- editorial comments to authors
- Cited By
- Educational measurement methods;
- medical undergraduate education methods;
In multiple-choice tests using a `don't-know' option the number of correct minus incorrect answers was used as the test score (formula scoring) in order to reduce the measurement error resulting from random guessing. In the literature diverging results are reported when comparing formula scoring and number-right scoring, the scoring method without the don't-know option.
To investigate which method was most appropriate, both scoring methods were used in true–false tests (block tests) taken at the end of a second- and third-year educational module (block). The students were asked to answer each item initially by choosing from the response options true, false or don't know, and secondly to replace all don't-know answers by a true–false answer.
Maastricht University, The Netherlands.
The correct scores for the don't-know answered items were found to be 4·5% and 5·9%, respectively, higher than expected with pure random guesswork. This represents a source of bias with formula scoring, because students who were less willing to guess obtained lower scores. The average difference in the correct minus incorrect score for the two scoring methods (2·5%, P < 0·001, and 3·4%, P < 0·001, respectively) indicates the size of the bias (compare: the standard deviation of the score equals 11%). Test reliability was higher with formula scoring (0·72 vs. 0·66 and 0·74 vs. 0·66), but the difference decreased when the test was restricted to items which were close to the core content of the block (0·81 vs. 0·77, resp. 0·75 vs. 0·70).
In deciding what scoring method to use, less bias (number-right scoring) has to be weighed against higher reliability (formula scoring). Apart from these psychometric reasons educational factors must be considered.