Tutorial in Biostatistics
Categorizing a prognostic variable: review of methods, code for easy implementation and applications to decision-making about cancer treatments
Article first published online: 5 JAN 2000
DOI: 10.1002/(SICI)1097-0258(20000115)19:1<113::AID-SIM245>3.0.CO;2-O
Copyright © 2000 John Wiley & Sons, Ltd.
Additional Information
How to Cite
Mazumdar, M. and Glassman, J. R. (2000), Categorizing a prognostic variable: review of methods, code for easy implementation and applications to decision-making about cancer treatments. Statistics in Medicine, 19: 113–132. doi: 10.1002/(SICI)1097-0258(20000115)19:1<113::AID-SIM245>3.0.CO;2-O
Publication History
- Issue published online: 5 JAN 2000
- Article first published online: 5 JAN 2000
- Manuscript Accepted:
- Manuscript Received:
Funded by
- Cancer Chemotherapy Program Project. Grant Number: CA 05826–35
- Abstract
- References
- Cited By
Abstract
Categorizing prognostic variables is essential for their use in clinical decision-making. Often a single cutpoint that stratifies patients into high-risk and low-risk categories is sought. These categories may be used for making treatment recommendations, determining study eligibility, or to control for varying patient prognoses in the design of a clinical trial.
Methods used to categorize variables include: biological determination (most desirable but often unavailable); arbitrary selection of a cutpoint at the median value; graphical examination of the data for a threshold effect; and exploration of all observed values for the one which best separates the risk groups according to a chi-squared test. The last method, called the minimum p-value approach, involves multiple testing which inflates the type I error rates. Several methods for adjusting the inflated p-values have been proposed but remain infrequently used.
Exploratory methods for categorization and the minimum p-value approach with its various p-value corrections are reviewed, and code for their easy implementation is provided. The combined use of these methods is recommended, and demonstrated in the context of two cancer-related examples which highlight a variety of the issues involved in the categorization of prognostic variables. Copyright © 2000 John Wiley & Sons, Ltd.

1097-0258/asset/SIM_left.gif?v=1&s=1b631772c3897aa95941da3609d901cd1d389e83)
1097-0258/asset/olbannerright.gif?v=1&s=6d257623b3308a7485294c87b3b5e1e665484099)
1097-0258/asset/cover.gif?v=1&s=64ebf4a6597e744f418c952845cddf175ccc795f)