These authors contributed equally to this work.
On the prognostic value of survival models with application to gene expression signatures
Article first published online: 8 MAR 2010
Copyright © 2010 John Wiley & Sons, Ltd.
Statistics in Medicine
Special Issue: Papers from the 29th Annual Conference of the International Society for Clinical Biostatistics
Volume 29, Issue 7-8, pages 818–829, 30 March - 15 April 2010
How to Cite
Hielscher, T., Zucknick, M., Werft, W. and Benner, A. (2010), On the prognostic value of survival models with application to gene expression signatures. Statist. Med., 29: 818–829. doi: 10.1002/sim.3768
- Issue published online: 8 MAR 2010
- Article first published online: 8 MAR 2010
- Manuscript Accepted: 22 SEP 2009
- Manuscript Received: 5 NOV 2008
- censored data;
- R2 coefficient
As part of the validation of any statistical model, it is a good statistical practice to quantify the prediction accuracy and the amount of prognostic information represented by the model; this includes gene expression signatures derived from high-dimensional microarray data. Several approaches exist for right-censored survival data measuring the gain in prognostic information compared with established clinical parameters or biomarkers in terms of explained variation or explained randomness. They are either model-based or use estimates of prediction accuracy.
As these measures differ in their underlying mechanisms, they vary in their interpretation, assumptions and properties, in particular in how they deal with the presence of censoring. It remains unclear, under what conditions and to what extent they are comparable. We present a comparison of several common measures and illustrate their behaviour in high-dimensional situations in simulation examples as well as in applications to real gene expression microarray data sets. An overview of available software implementations in R is given. Copyright © 2010 John Wiley & Sons, Ltd.