Special Issue Paper
Effect of dependent errors in the assessment of diagnostic or screening test accuracy when the reference standard is imperfect

Article first published online: 21 FEB 2012
DOI: 10.1002/sim.4444
Copyright © 2012 John Wiley & Sons, Ltd.
Issue
1097-0258/asset/cover.gif?v=1&s=64ebf4a6597e744f418c952845cddf175ccc795f)
Statistics in Medicine
Special Issue: Papers from the 31st Annual Conference of the International Society for Clinical Biostatistics
Volume 31, Issue 11-12, pages 1129–1138, 20-30 May 2012
Additional Information
How to Cite
Walter, S.D., Macaskill, P., Lord, S. J. and Irwig, L. (2012), Effect of dependent errors in the assessment of diagnostic or screening test accuracy when the reference standard is imperfect. Statist. Med., 31: 1129–1138. doi: 10.1002/sim.4444
Publication History
- Issue published online: 15 MAY 2012
- Article first published online: 21 FEB 2012
- Manuscript Accepted: 28 SEP 2011
- Manuscript Received: 22 OCT 2010
Funded by
- Australian National Health and Medical Research Council Program. Grant Numbers: 402764, 633003
- Abstract
- Article
- References
- Cited By
Keywords:
- diagnosis;
- screening;
- test accuracy;
- imperfect reference standard;
- correlated test errors;
- bias
When no gold standard is available to evaluate a diagnostic or screening test, as is often the case, an imperfect reference standard test must be used instead. Furthermore, the errors of the test and its reference standard may not be independent. Some authors have opined that positively dependent errors will lead to overestimation of test performance. Although positive dependence does increase agreement between the test and the reference standard, it is not clear if test accuracy will necessarily be overestimated in this situation, and the case of negatively associated test errors is even less clear. To examine this issue in more detail, we derive the apparent sensitivity, specificity, and overall accuracy of a test relative to an imperfect reference standard and the bias in these parameters. We demonstrate that either positive or negative bias can occur if the reference standard is imperfect. The type and magnitude of bias depend on several components: the disease prevalence, the true test sensitivity and specificity, the covariance between the false-negative test errors among the true disease cases, and the covariance between the false-positive test errors among the true noncases. If, for example, sensitivity and specificity are 0.8 for both the test and reference standard and the errors have a moderate positive dependence, test sensitivity is then underestimated at low prevalence but overestimated at high prevalence, while the opposite occurs for specificity. We illustrate these ideas through general numerical calculations and an empirical example of screening for breast cancer with magnetic resonance imaging and mammography. Copyright © 2012 John Wiley & Sons, Ltd.

1097-0258/asset/SIM_left.gif?v=1&s=1b631772c3897aa95941da3609d901cd1d389e83)
1097-0258/asset/olbannerright.gif?v=1&s=6d257623b3308a7485294c87b3b5e1e665484099)