Some theoretical and practical considerations for multivariate statistical cell classification useful in autologous stem cell transplantation and tumor cell purging

Authors

  • James A. Hokanson,

    1. Department of Preventive Medicine and Community Health, Division of Infectious Diseases, University of Texas Medical Branch, Galveston, Texas
  • Judah I. Rosenblatt,

    1. Department of Preventive Medicine and Community Health, Division of Infectious Diseases, University of Texas Medical Branch, Galveston, Texas
  • James F. Leary

    Corresponding author
    1. Department of Internal Medicine, Division of Infectious Diseases, University of Texas Medical Branch, Galveston, Texas
    • Molecular Cytometry Unit, Route 0835, University of Texas Medical Branch, Galveston, TX 77555–0835.

Abstract

Background:

As flow cytometric data become more complex, it becomes increasingly difficult to classify cells with conventional techniques that rely on visual inspection of the data and user-drawn regions. This paper presents some simple applications of multivariate statistical classification to flow cytometric data.

Methods:

Discriminant function analysis (DFA) and logistic regression (LR) were evaluated for their potential utility in detecting human breast cancer cells among normal bone marrow cells. Data sets with defined properties were used to evaluate these statistical classification techniques, and their performance was measured by receiver operating characteristic (ROC) analysis.

Results:

Two extreme but reasonable situations are presented: (1) data where the separation of cells was obvious by visual inspection and (2) data where major overlaps in the values of the individual flow cytometry (FCM) parameters made intuitive classification improbable. Both DFA and LR classified the cells of each type with acceptable accuracy and yield.
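The second situation, in which each parameter overlaps heavily on its own yet the cell types separate when the parameters are considered jointly, can be illustrated with a minimal sketch. It is not the authors' implementation and uses no data from the paper: the two simulated parameters, the class means, the noise levels, and all function names are assumptions chosen for illustration. The sketch fits a two-class Fisher linear discriminant (one simple form of DFA) and compares the area under the ROC curve for the discriminant score against each parameter taken alone.

```python
import random


def mean(values):
    return sum(values) / len(values)


def make_cells(n, x_mean, y_offset, rng):
    """Simulate n cells with two correlated parameters (hypothetical values)."""
    cells = []
    for _ in range(n):
        x = rng.gauss(x_mean, 1.0)
        y = x + y_offset + rng.gauss(0.0, 0.3)
        cells.append((x, y))
    return cells


def scatter(cells):
    """Sums of squares and cross-products about the class mean: (sxx, sxy, syy)."""
    mx = mean([c[0] for c in cells])
    my = mean([c[1] for c in cells])
    sxx = sum((x - mx) ** 2 for x, _ in cells)
    sxy = sum((x - mx) * (y - my) for x, y in cells)
    syy = sum((y - my) ** 2 for _, y in cells)
    return sxx, sxy, syy


def fisher_weights(class0, class1):
    """Fisher discriminant direction w = Sw^-1 (mu1 - mu0) for two parameters."""
    s0 = scatter(class0)
    s1 = scatter(class1)
    sxx, sxy, syy = (a + b for a, b in zip(s0, s1))  # pooled within-class scatter
    det = sxx * syy - sxy * sxy
    dx = mean([c[0] for c in class1]) - mean([c[0] for c in class0])
    dy = mean([c[1] for c in class1]) - mean([c[1] for c in class0])
    # Multiply the closed-form 2x2 inverse of Sw by the mean difference.
    wx = (syy * dx - sxy * dy) / det
    wy = (-sxy * dx + sxx * dy) / det
    return wx, wy


def auc(neg_scores, pos_scores):
    """Area under the ROC curve as P(positive score > negative score)."""
    wins = 0.0
    for p in pos_scores:
        for n in neg_scores:
            if p > n:
                wins += 1.0
            elif p == n:
                wins += 0.5
    return wins / (len(pos_scores) * len(neg_scores))


def single_parameter_auc(class0, class1, index):
    """AUC of one parameter alone, taking whichever direction separates better."""
    a = auc([c[index] for c in class0], [c[index] for c in class1])
    return max(a, 1.0 - a)


rng = random.Random(0)
normal = make_cells(300, 0.0, 0.0, rng)   # simulated "normal" cells
tumor = make_cells(300, -0.5, 1.0, rng)   # simulated "tumor" cells

wx, wy = fisher_weights(normal, tumor)
auc_dfa = auc([wx * x + wy * y for x, y in normal],
              [wx * x + wy * y for x, y in tumor])
auc_x = single_parameter_auc(normal, tumor, 0)
auc_y = single_parameter_auc(normal, tumor, 1)

print(f"AUC, parameter 1 alone: {auc_x:.3f}")
print(f"AUC, parameter 2 alone: {auc_y:.3f}")
print(f"AUC, discriminant score: {auc_dfa:.3f}")
```

In this simulated setting the discriminant score should give a markedly higher AUC than either parameter alone, because the linear combination exploits the correlation between the two parameters. The discriminant is fitted and scored on the same simulated cells, so the AUC is an optimistic in-sample figure; a realistic evaluation would use held-out data.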

Conclusions:

The excellent empirical performance of both DFA and LR techniques suggests that they offer promising approaches for classifying multiparameter FCM data using objective rules, which may represent an improvement over commonly employed ad hoc approaches. Cytometry 36:60–70, 1999. © 1999 Wiley-Liss, Inc.
