Quality assessment of CD34+ stem cell enumeration: experience of the United Kingdom National External Quality Assessment Scheme (UK NEQAS) using a unique stable whole blood preparation

Authors


Dr Barnett UK NEQAS for Leucocyte Immunophenotyping, Department of Haematology, Royal Hallamshire Hospital, Glossop Road, Sheffield S10 2JF.

Abstract

CD34+ peripheral blood stem cell (PBSC) mobilization and harvesting has rapidly replaced autologous bone marrow as a source of stem cells for transplantation. Timing and adequacy of harvests rely upon the accurate enumeration of circulating CD34+ cells. However, previous EQA programmes have reported interlaboratory CVs as high as 284%, suggesting the need for greater standardization. In addition the routine use of fresh and/or frozen cells as analytes also introduces antigen instability as a variable factor. To circumvent this problem and achieve a true reflection of interlaboratory variation, we have used a novel whole blood preparation in which the antigenic profiles of PBSCs, as determined by flow cytometry, are retained for > 200 d. This international scheme, currently the largest in the world, distributes aliquots of stabilized whole blood bi-monthly to 91 laboratories in 20 countries (44 U.K., 47 overseas). Participants are required to determine the percentage and absolute values for CD34+ PBSCs using in-house techniques. Adopting such a preparation, a more accurate determination of interlaboratory variation has been possible when compared to previous EQA studies, with CVs as low as 22% and 24% for percentage and absolute counts. In addition the programme has established that a wide range of methods are in routine use, emphasizing the urgent requirement for national/international consensus guidelines.

It is well established that the 1–3% of cells in the bone marrow that express the CD34 antigen, a heavily glycosylated mucin-like structure, are capable of reconstituting long-term multilineage haemopoiesis following ablative therapy ( Berenson et al, 1988 ; Andrews et al, 1992 ). CD34+ cells are extremely rare in the peripheral blood of normal individuals (approximately 0.01–0.05%). However, current treatment regimes, including chemotherapy and/or haemopoietic growth factors, can significantly increase circulating CD34+ stem cell counts in patients and donors. Peripheral blood stem cells (PBSC) have now virtually replaced bone marrow as the primary source of stem cells for autologous transplantation after myeloablative therapy ( Gratwohl et al, 1996 ), and the procedure is also being used for allogeneic transplantation between HLA-identical siblings ( Russell et al, 1996 ). Advantages of PBSCs include the generally shorter engraftment time, reduced hospitalization costs ( Ager et al, 1995 ), the presence of large numbers of T lymphocytes and NK cells which may reduce post-transplant relapse ( Dreger et al, 1994 ), and the elimination of a general anaesthetic. In addition, the PBSC product is more suitable for ex vivo manipulation, including CD34+ cell selection ( Brugger et al, 1994 ), tumour purging ( Ross et al, 1995 ) and gene manipulation ( Bregni et al, 1992 ).

Transplant centres routinely rely upon the enumeration of CD34+ cells as an indicator for the optimal timing and adequacy of PBSC harvests (Haas et al, 1994). Assessment of haemopoietic progenitors by colony-forming assays is laborious and time-consuming (Appelbaum, 1979) and has the disadvantage of not enabling real-time planning of PBSC collections. A minimum threshold level of between 2 and 5 × 10⁶ CD34+ cells/kg has been observed in multiple clinical settings to result in adequate engraftment (Krause et al, 1996). However, the lack of assay standardization prevents a more exact definition of the threshold level (Bender et al, 1992). For example, a variety of flow cytometric gating strategies for CD34+ cell enumeration have been developed based primarily upon the detection of total CD34+, or CD34+CD45dim cells (Gratama et al, 1997; Siena et al, 1991; Sutherland et al, 1996; Verwer & Ward, 1997). In addition, there is a marked variation in the choice of monoclonal antibody, fluorochrome and lysing reagent used. Furthermore, the absolute enumeration of CD34+ cells can be determined using either a dual- or single-platform approach. The former derives the CD34+ value from the combination of a flow cytometrically determined percentage CD34+ count and an absolute nucleated cell count generated by a haematological analyser. In contrast, single-platform technology derives an absolute CD34+ cell count directly from the flow cytometer using either precision fluidics or micro-beads (Mercolino et al, 1995; Verwer & Ward, 1997; Chin-Yee et al, 1997a; Keeney et al, 1998). Such variation in methodologies and the requirement for precise CD34+ cell enumeration have made standardization difficult, and the best approach has been the subject of much recent discussion (Johnsen, 1997; Sutherland et al, 1997).
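The dual-platform derivation described above reduces to a single multiplication. The following is a minimal sketch of that arithmetic, assuming the percentage and the analyser count share the same denominator (total nucleated cells); the function name and the example values are illustrative and are not taken from the scheme data.

```python
def dual_platform_cd34_per_ul(percent_cd34: float, wcc_per_ul: float) -> float:
    """Absolute CD34+ count (cells/µL) from two instruments.

    percent_cd34: CD34+ cells as a percentage of nucleated cells (flow cytometer).
    wcc_per_ul:   nucleated/white cell count in cells/µL (haematology analyser).
    """
    if percent_cd34 < 0 or wcc_per_ul < 0:
        raise ValueError("inputs must be non-negative")
    return percent_cd34 / 100.0 * wcc_per_ul


# Hypothetical example: 0.2% CD34+ with a WCC of 30 x 10^9/L (30 000 cells/µL)
absolute = dual_platform_cd34_per_ul(0.2, 30_000)
print(f"{absolute:.1f} CD34+ cells/µL")
```

Because the result is a product of two independent measurements, error in either the percentage or the white cell count carries through to the absolute value.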

In an attempt to address the problems of standardization, several national external quality assurance (EQA) programmes (Gratama et al, 1997; Lowdell & Bainbridge, 1996; Lumley et al, 1996; Chang & Ma, 1996; Brecher et al, 1996; Chin-Yee et al, 1997b) and international workshops (Gee & Lamb, 1994; Johnsen, 1995; Wunder et al, 1992; Johnsen & Knudsen, 1996) have been set up during the last 5 years. The EQA programmes have reported widely varying interlaboratory coefficients of variation (CVs; summarized in Table I), the main cause of which has been attributed to the use of fresh, or cryopreserved, specimens (Gratama et al, 1997). This view is supported by the findings of a recent Australian study which documented a marked reduction of interlaboratory CVs when only list mode data was analysed (Chang & Ma, 1996). Chang & Ma (1996) additionally demonstrated that gating strategies were a major contributing factor to result variability and that only one gating strategy, the ISHAGE protocol, gave reproducible results from all centres, within ± 10% of the median CD34+ cell value on both peripheral blood (PB) and apheresis samples. In response to such findings, UK NEQAS initiated an EQA scheme, originally involving 64 participants in 12 countries and using whole blood stabilized in a manner previously described (Barnett et al, 1995). Use of such material has previously been shown to circumvent analyte instability, facilitating a more accurate and detailed analysis of interlaboratory variation for CD4+ T-lymphocyte enumeration and, in addition, has enabled transportation of EQA specimens overseas by post, thus reducing transportation costs (Barnett et al, 1996).

We report the findings from the first 18 months of the UK NEQAS CD34+ stem cell enumeration programme, currently the largest such EQA scheme described to date, using stabilized whole blood specifically prepared for PBSC enumeration. The use of such a preparation within the programme has resulted in reduced interlaboratory variation for CD34+ stem cell quantitation when compared to previously published studies ( Table I) and removed the sample variability seen when fresh or cryopreserved specimens are used. Thus, for the first time, in any EQA for CD34+ stem cell enumeration, a more accurate assessment of performance, both on a national and international scale, has been achieved. However, our data still highlights the need for further improvement in methodological approach and underlines the urgent requirement for national/international consensus guidelines.

METHODS

Specimen collection and distribution

50 ml of peripheral blood was obtained from nine patients undergoing G-CSF stem cell mobilization and prior to peripheral blood stem cell harvesting. In addition, a single 50 ml sample of cord blood was obtained for one issue. All samples were stabilized using a procedure described previously ( Barnett et al, 1995 , 1996) and which is now used under licence to produce Ortho AbsoluteControl (Ortho Diagnostic Systems Inc., Raritan, U.S.A.). Longitudinal studies have previously shown that flow cytometric profiles are retained ( Barnett et al, 1996 ; Janossy et al, 1998 ) and confirmatory studies for the stability of the CD34 antigen were performed prior to the use of the stabilized material in the EQA programme (see below). All material used throughout this study was obtained following informed consent.

Each centre was issued with a 1 ml aliquot of stabilized peripheral blood, transported by either post or, if specifically requested, commercial courier. The UK NEQAS Scheme for Leucocyte Immunophenotyping has provided stabilized quality assessment specimens for many years to laboratories worldwide that perform leukaemia immunophenotyping and HIV lymphocyte subset analysis ( Barnett et al, 1994, 1996). The CD34+ Stem Cell Enumeration Scheme initially issued samples to 64 participants (38 U.K., 26 overseas), a number which increased to 91 (44 U.K., 47 overseas) after 18 months. Non-U.K. countries included Australia, Brazil, Canada, Denmark, Eire, Germany, Mexico, Netherlands, New Zealand, Portugal, Spain, Sweden, Switzerland and U.S.A.

Longitudinal specimen testing

In order to determine the degree to which the percentage value and mean channel fluorescence drifted with time, a 50 ml aliquot of peripheral blood was stabilized as previously described (Barnett et al, 1995) and stored in 1 ml lots at 4°C until use. CD34+ PBSC enumeration was undertaken according to the non-sequential gating strategy of Siena et al (1991). This protocol was employed in order to identify any increase in non-specific staining due to debris and/or platelets. In addition, it was the approach most widely used by the participants (information obtained from a pre-survey questionnaire). Samples were analysed over a total of 228 d using a class III antibody (HPCA-2, Becton Dickinson, San Jose, U.S.A.), conjugated with phycoerythrin. An IgG1 fluorochrome-matched control antibody was used to correct for background staining and all antibodies were employed in accordance with the manufacturers' recommendations at saturating concentrations. Briefly, 20 μl of anti-CD34 PE, or control antibody as appropriate, was added to 100 μl of stabilized whole blood and incubated for 15 min at room temperature (RT). The erythrocytes were then lysed by incubating the sample with 1 ml of Ortho-mune lysing Solution (Ortho Clinical Diagnostics, Raritan, U.S.A.) for 15 min. After this period the sample was analysed without washing.

Flow cytometric calibration (FACScan, Becton Dickinson, San Jose, U.S.A.) was performed on a daily basis in accordance with manufacturers' instructions and the light scatter and immunostaining characteristics of stabilized samples compared with those of fresh samples as previously described ( Barnett et al, 1996 ; Janossy et al, 1998 ). Instrument settings remained constant for fresh and test samples.

Data reporting

Each centre was required to analyse the QC material within 3 weeks of issue, using local clinical laboratory procedures. A questionnaire, issued with each sample, requested details of total white cell count (WCC) and CD34+ cell count (both percentage and absolute values), as well as information on gating strategy, flow cytometer, sample preparation procedure (i.e. lyse-no-wash, lysis reagent, density centrifugation, etc.), antibody type (i.e. class I, II or III), antibody source/clone and fluorochrome, number of events analysed and whether an isotype control was employed (i.e. CD34 events corrected for IgG subclass binding).

Percentage and absolute CD34+ cell counts for a given laboratory were compared with the results obtained from other participants. Following the issue of the sixth specimen, a performance scoring system was introduced for absolute counts. This employed the use of stratified centiles as target ranges. Median and centile ranges were used for statistical evaluation due to the non-parametric nature of the data. Selected target ranges were the 5th, 10th, 25th, 75th, 90th and 95th centiles, criteria set by the Department of Medical Statistics and Evaluation (Royal Postgraduate Medical School, London) following 20 000 simulations using data from the first six issues. Development of such a system has enabled the identification of persistent unsatisfactory performers (PUPs), based on scoring criteria determined by the scheme Steering Committee. Laboratories scoring 100 points or greater, over a rolling three-sample window, were defined as PUPs. In brief, a score of 50 was awarded to laboratories whose absolute CD34+ value fell below the 5th or above the 95th centile, with 35 points for values between the 5th and 10th, or the 90th and 95th centile, 20 points for values between the 10th and 25th, or the 75th and 90th centile, and zero points if the absolute value was between the 25th and 75th centile. Laboratories were therefore classified as a PUP if the results fell outside the central 80% on three consecutive samples (the chosen window of analysis). Failure to return a result (nil return) was also penalized under a separate scoring system (50 points for a nil return), such that a score of 100 points or greater, over a rolling three-specimen window, identified the participant as a PUP. The two scoring systems for performance and nil returns were mutually exclusive.
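The performance scoring described above can be sketched as follows. This is an illustrative reading of the stated rules, not the scheme's own software: the function names, the treatment of values lying exactly on a centile boundary, and the example centile values are all assumptions. Only the performance scores are covered; nil returns are scored separately.

```python
from typing import Dict, List


def performance_score(value: float, centiles: Dict[int, float]) -> int:
    """Score one absolute CD34+ result against the participant centiles.

    `centiles` maps 5, 10, 25, 75, 90 and 95 to the corresponding values.
    A value exactly on a boundary takes the lower score (an assumption).
    """
    if value < centiles[5] or value > centiles[95]:
        return 50
    if value < centiles[10] or value > centiles[90]:
        return 35
    if value < centiles[25] or value > centiles[75]:
        return 20
    return 0


def is_pup(scores: List[int], window: int = 3, threshold: int = 100) -> bool:
    """True if any `window` consecutive scores total `threshold` or more."""
    n_windows = max(len(scores) - window + 1, 1)
    return any(sum(scores[i:i + window]) >= threshold for i in range(n_windows))


# Hypothetical centile values (cells/µL) for one specimen
example = {5: 10.0, 10: 12.0, 25: 15.0, 75: 25.0, 90: 30.0, 95: 35.0}
scores = [performance_score(v, example) for v in (11.0, 31.0, 32.0)]
print(scores, is_pup(scores))
```

Three consecutive results outside the central 80% each score at least 35 points, giving a window total of at least 105, which is why such a run crosses the 100-point threshold.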

RESULTS

Longitudinal stability studies

Previous whole blood studies, using the current stabilization procedure, have shown that all leucocyte antigens investigated remain stable for > 200 d ( Barnett et al, 1996 ) without any significant loss of antigen density for CD45 and CD34 ( Janossy et al, 1998 ). In addition, stability studies performed on one of the samples used in the CD34 scheme (sample 2) on days 7, 70, 146 and 228, revealed minimal alteration with time for CD34 antigen expression. The percentage CD34+ values were 0.2%, 0.22%, 0.21% and 0.25% respectively. The cells exhibited very low levels of autofluorescence and, in addition, non-specific binding of the isotype control gave similar values to those recorded with fresh whole blood throughout the longitudinal stability study. In addition, the flow cytometric profiles remained constant, with all populations, including granulocytes, monocytes and lymphocytes as well as debris, being discrete and identifiable. Importantly, we have previously shown that there was no significant difference in CD34 and CD45 antigen density post stabilization when compared to the original fresh sample ( Janossy et al, 1998 ). For example, over the study period (following stabilization) the mean channel fluorescence intensity (MCFI) and percentage CD34+ value remained unchanged (day 7 MCFI 662, 0.2%; day 146 MCFI 661, 0.21%), confirming the antigen stability of the sample (note: the same flow cytometer settings were utilized throughout) (Fig 1). Finally, complete erythrocyte lysis was achieved. The increasing use of the ISHAGE protocol in the programme prompted us to examine the performance and suitability of the stabilized specimen for use with such sequential gating strategies. Following stabilization, CD45 can still be used for sequential gating strategies ( Figs 2 and 3). No negative comments from participants were received in relation to the performance of the material when used with sequential gating strategies.

Fig 1(a). Stabilized peripheral whole blood analysed at day 7 for CD34+ peripheral blood stem cells using the Milan protocol (CD34+ peripheral blood stem cells equal 0.2%, mean channel fluorescence intensity equals 662).

Fig 1(b). Stabilized peripheral whole blood analysed at day 146 for CD34+ peripheral blood stem cells using the Milan protocol (CD34+ peripheral blood stem cells equal 0.21%, mean channel fluorescence intensity equals 661).

Fig 2. ... CD34+ PBSCs using the ISHAGE sequential gating protocol.

Survey findings

Interlaboratory CVs

Over the initial 18-month period 10 specimens were issued with a mean response rate of 91% for each trial. The initial issue (sample 1 in April 1996) was analysed by 91% of 64 laboratories, and resulted in an interlaboratory CV for absolute and percentage values of 76% and 83% respectively (Table II), with a CV for the WCC of 8%. The CVs for the following nine issues, involving up to 77 laboratories, never exceeded these initial values, with specimens 7 and 9 having the lowest CVs (< 25%). The mean CV for percentage and absolute values for all 10 specimens was 39.8% and 44.4% respectively. It should be noted, however, that for trials 497 (samples 7 and 8) and 597 (samples 9 and 10) both known (samples 7 and 9) and unknown (samples 8 and 10) specimens were issued. When samples 7 and 9 were excluded, the overall CVs for the remaining eight specimens were 46.6% and 50.8% for percentage and absolute values respectively. Using the known sample, laboratories were required to optimize their flow cytometer and gate settings, before analysing the unknown sample. Samples 9 and 10 (known and unknown) were identical. The use of such ‘reference samples’ clearly resulted in improved CVs for unknown samples (Table II). Sample 8 was cord blood, issued primarily to mimic the flow cytometric characteristics of fresh cord blood samples (increased debris that may affect analysis).
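For readers unfamiliar with the statistic, an interlaboratory CV expresses the spread of the returned results as a percentage of their mean. The sketch below uses the conventional definition (sample standard deviation divided by the mean); the text does not state which standard deviation convention or outlier handling the scheme applied, so both are assumptions here, and the example values are invented.

```python
import statistics
from typing import Sequence


def interlaboratory_cv(values: Sequence[float]) -> float:
    """Coefficient of variation (%) across laboratories for one specimen.

    Uses the sample standard deviation (n - 1 denominator), an assumption.
    """
    if len(values) < 2:
        raise ValueError("at least two results are required")
    mean = statistics.mean(values)
    if mean == 0:
        raise ValueError("CV is undefined when the mean is zero")
    return 100.0 * statistics.stdev(values) / mean


# Hypothetical percentage CD34+ results returned by five laboratories
returned = [0.18, 0.20, 0.22, 0.25, 0.15]
print(f"CV = {interlaboratory_cv(returned):.1f}%")
```

Because the CV depends on the mean and standard deviation, a single extreme result can inflate it markedly, which is one reason the scheme used medians and centiles for performance scoring.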

Table II. Median and interlaboratory CV values for CD34+ PBSC estimations and total WCC on the first 10 specimens issued.

Gating procedures

Participants were required to provide details of the gating strategies employed to enumerate PBSCs. The number of laboratories using the ‘Milan protocol’ ( Siena et al, 1991 ) remained relatively constant throughout the study, whereas there was a continual increase in the use of sequential or Boolean gating strategies, as in the ‘ISHAGE protocol’ ( Sutherland et al, 1996 ) and the ‘ProCount’ approach ( Verwer & Ward, 1997). Our material was suitable for use with such techniques ( Figs 2 and 3). There were no significant differences in the results obtained with the various gating strategies for 9/10 samples, the exception being sample 8 (cord blood). For the latter, the ‘ISHAGE protocol’ resulted in a 20% reduction in percentage value (median 0.185%, n = 6), when compared to the ‘Milan protocol’ (median 0.23%, n = 26). Incorporation of CD45 to identify leucocytes and to exclude platelets and debris ( Bender et al, 1994 ) resulted in an 11% reduction compared to the ‘Milan protocol’ (median 0.205%, n = 24) and a 10.8% higher value than the ‘ISHAGE protocol’. These differences reflect the nature of cord blood samples and indicate the importance of a standard approach to the analysis of such specimens. 12 centres did not use isotype controls, despite nine using the Milan protocol in which the incorporation of an isotype control is implicit to the technique. The remainder used a CD45 non-sequential gating strategy ( Bender et al, 1994 ).

Determination of absolute counts

Table II details the median CD34+ absolute values and WCC for each specimen issued to date. A progressive improvement in CVs for absolute CD34+ counts was seen, ranging from 76% (sample 1) to 24% (sample 10) (note: samples 7 and 9 were issued with known values). A dual-platform approach was employed by 88% of participants with 20 different methods used for determining the white cell count (WCC), incorporating 16 different haematological analysers. Although the majority of participants used a dual-platform approach, lower interlaboratory CVs were observed with single-platform instrumentation. For example, analysis of data returned for sample 10 showed that the overall interlaboratory CV for centres using methods employing beads, or precision fluidics, was 9.9% compared to 24% for laboratories using dual-platform technology.

Detection systems

Up to 10 different flow cytometers were used by the 77 participants and included benchtop (Becton Dickinson FACScan, Becton Dickinson FACSCalibur, Coulter XL, Coulter Profile I and Coulter Profile II and Ortho CytoronAbsolute) and stream-in-air (Coulter Epics 753 and Epics Elite, Becton Dickinson FACStar and Becton Dickinson FACSort) flow cytometers. It is well recognized that stream-in-air instruments may not be as sensitive as benchtop analysers. However, those using stream-in-air technology compensated for this, by incorporating phycoerythrin conjugated anti-CD34 antibodies in conjunction with FITC labelled anti-CD45. No significant differences in results were noted between this group and the benchtop group.

Number of events counted

A marked variation in the number of events counted was apparent throughout the study, regardless of anti-CD45 utilization. For example, when sample 1 was analysed, 8% of participants collected ≥ 100 000 events, while a similar number acquired and analysed ≤ 10 000 events, of whom one collected only 2000 events. Over the course of the programme the median number of events acquired and analysed has remained constant at approximately 55 000. For example, for sample 10, 17% of centres collected ≥ 100 000 events but 4% still collected ≤ 10 000 events.
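One way to see why the number of events matters for a rare population is to estimate the counting precision. The sketch below assumes that the number of CD34+ events follows Poisson statistics, so that the counting CV is 100/√n for n positive events; this assumption is not stated in the survey itself, and the 0.2% frequency is used purely as an illustrative value.

```python
import math


def expected_positive_events(total_events: int, percent_positive: float) -> float:
    """Expected number of positive events among `total_events` acquired."""
    return total_events * percent_positive / 100.0


def poisson_counting_cv(positive_events: float) -> float:
    """Counting CV (%) under a Poisson assumption: 100 / sqrt(n)."""
    if positive_events <= 0:
        raise ValueError("positive_events must be greater than zero")
    return 100.0 / math.sqrt(positive_events)


# Event totals mentioned in the survey, at an illustrative 0.2% CD34+ frequency
for total in (2_000, 10_000, 55_000, 100_000):
    n = expected_positive_events(total, 0.2)
    print(f"{total:>7} events: about {n:.0f} CD34+ events, "
          f"counting CV about {poisson_counting_cv(n):.1f}%")
```

Under this assumption, acquiring only a few thousand events yields a handful of CD34+ events, and the counting error alone becomes a substantial share of the interlaboratory variation.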

Antibody source, class and fluorochrome effect

Tables III and IV highlight the use of anti-CD34 antibodies, including the class and nature of fluorochrome. Only one laboratory (from sample 3 onwards) used a class I antibody and, interestingly, was identified as a persistent unsatisfactory performer (see below). Initially (sample 1), four different antibody clones, of either class II or III, were used by 54/57 participants who provided data. Of these 54 participants, three used class II (QBEnd 10) antibodies (two PE-conjugated, one FITC-conjugated). The remaining 51 participants used class III antibodies (45, HPCA-2; five, BIRMA K3; one, clone 581), of which 42 were PE- and nine FITC-conjugated. There were no significant differences between the median values obtained for percentage and absolute CD34+ counts using HPCA-2 or BIRMA K3, irrespective of the fluorochrome used. However, FITC-conjugated antibodies gave an increased median value for percentage and absolute CD34+ counts in 7/10 specimens issued, a finding independent of antibody class (one laboratory used a PE-conjugated antibody but failed to provide data regarding the clone or class). Analysis of interlaboratory variation by fluorochrome for sample 1 showed that laboratories that used PE-conjugated anti-CD34 had interlaboratory CVs of 83.3% and 71.5% for percentage and absolute values respectively. However, laboratories that used FITC-conjugated anti-CD34 had interlaboratory CVs of 1447% and 583% for percentage and absolute values respectively. The use of FITC-conjugated antibodies has shown a steady decline during the programme, making continued analysis of fluorochrome effect statistically invalid. Only one participant used FITC HPCA-2 (class III) for samples 9 and 10, the remaining 76 using PE-conjugated class II or III antibodies. No laboratory used class II FITC-conjugated antibodies after sample 4.

Table III. Effect of fluorochrome upon the median values obtained for CD34+ PBSC estimations on the first 10 samples issued.
Table IV. Effect of CD34 antibody class upon median values obtained for CD34+ PBSC estimations of the first 10 samples issued.

Cell isolation methods

The majority of laboratories (66%) employed a lyse and wash technique, of whom the largest group (47%) used FACS lysing reagent (the remainder used a variety of lysing reagents). Interestingly, 22% of the lyse and wash group used an in-house reagent, whereas four participants used the Coulter Q-Prep method with an additional wash stage after lysis. The lyse–no-wash technique was employed by 31% of participants, of whom 67% used Coulter Q-Prep. A single laboratory used a no-lyse–no-wash technique employing the nucleic acid dye (NAD) LDS-751, whereas one centre used Ficoll density gradient separation before switching to the whole blood lysis methodology after sample 6. The proportion of participants using the different techniques has remained relatively constant throughout the programme. The lyse–no-wash technique gave a higher percentage group mean than the lyse–wash group (data from samples 7 and 9 were excluded because they were issued with known values). For example, the lyse–no-wash group mean was 86% and 92% higher than the lyse–wash group mean for samples 4 and 8 respectively (0.171% v 0.092% and 0.5% v 0.26%), i.e. 186% and 192% of the lyse–wash value. Overall, of the eight samples analysed, the lyse–no-wash technique gave a 21% mean increase in absolute CD34+ cell count, when compared with the lyse–wash technique.
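The comparison of group means for samples 4 and 8 can be checked with a short calculation. The sketch below distinguishes a value expressed as a percentage of a reference from the percentage by which it exceeds that reference; the function names are illustrative, and the inputs are the group means quoted in the text.

```python
def percent_of(reference: float, value: float) -> float:
    """`value` expressed as a percentage of `reference`."""
    return 100.0 * value / reference


def percent_higher(reference: float, value: float) -> float:
    """Percentage by which `value` exceeds `reference`."""
    return 100.0 * (value - reference) / reference


# (lyse-wash mean, lyse-no-wash mean) in % CD34+, as quoted for samples 4 and 8
group_means = {"sample 4": (0.092, 0.171), "sample 8": (0.26, 0.5)}
for name, (wash, no_wash) in group_means.items():
    print(f"{name}: {percent_of(wash, no_wash):.0f}% of the lyse-wash mean, "
          f"{percent_higher(wash, no_wash):.0f}% higher")
```

For sample 4 the lyse–no-wash mean is roughly 186% of the lyse–wash mean, which corresponds to an increase of roughly 86%; the two ways of quoting the figure differ by exactly 100 percentage points.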

Overall methodology analysis

Nine areas of analytical variation were identified, namely haematological analyser, flow cytometer, the incorporation of anti-CD45, anti-CD34 (including class and fluorochrome), lysing reagent, lysis procedure (i.e. lyse–wash, lyse–no-wash, etc.) and gating protocol. 74 different approaches were used by the 77 laboratories that returned a complete data set. Interestingly, we calculate that > 200 000 possible permutations exist for determining CD34+ PBSCs (taking account that certain reagents may not be compatible with particular instruments or gating strategies).

Poor performance

Five participants (all U.K.) were identified as PUPs from analysis of the last four issues. No consistent methodological approach could be identified to account for poor performance. However, the single laboratory using a class I antibody conjugated with FITC has since changed to a class III with a resulting improvement in performance. Five additional laboratories (one U.K., four overseas) persistently failed to return data and were identified as PUPs on this basis.

DISCUSSION

The determination of leucocyte antigens in EQA programmes, for example CD4+ lymphocyte quantitation, has previously involved the use of fresh whole blood, necessitating rapid distribution and incurring high transportation costs ( Edwards et al, 1989 ; Goguel et al, 1993 ; Homburger et al, 1993 ; Paxton et al, 1989 ; Schonwald & Jilch, 1994). However, there is no guarantee that the testing laboratory will receive the analyte in perfect condition, a fact which presents problems in identifying poor or inadequately performing laboratories. Indeed, the use of fresh or cryopreserved material in EQA schemes for CD4+ lymphocyte quantitation is known to have a significant effect on the results (high CV and SD values) ( Edwards et al, 1989 ; Goguel et al, 1993 ; Homburger et al, 1993 ; Paxton et al, 1989 ; Schonwald & Jilch, 1994; Barnett et al, 1996 ). Furthermore, recent EQA schemes for CD34+ stem cells have also identified analyte instability as a major contributing factor to the high CVs (CVs > 100%) ( Gratama et al, 1997 ). In an attempt to circumvent this problem, Lowdell & Bainbridge (1996) constructed an EQA programme which ‘clustered’ laboratories, based upon a single centre sending samples to four other institutions, who in turn repeated the sample issue process. However, even within such a complex system, it was recognized that sample stability was an issue. Recent data has also indicated that CD34 expression is affected by storage at room temperature, cryopreservation and fixatives used with specific cell labelling methods ( Macey et al, 1997 ). Furthermore, a recent Australian study has reported that when flow cytometric list mode data is issued, instead of samples, inter-laboratory CVs of < 20% are achieved, underlining the problem of using fresh specimens in EQA programmes ( Chang & Ma, 1996). 
Therefore, following the successful introduction of a novel stabilized whole blood preparation within the UK NEQAS Immune Monitoring scheme ( Barnett et al, 1996 ), we examined its role in an EQA scheme for CD34+ stem cell enumeration.

The current study demonstrates that stabilized whole blood, obtained from either patients undergoing peripheral blood stem cell mobilization or from cord blood, can be successfully introduced into such an EQA programme. Longitudinal studies have confirmed the stability of the CD34 antigen for over 200 d (independent analysis of a sample 615 d old on both Becton Dickinson and Coulter platforms gave equivalent results to day zero; R. Sutherland and M. Keeney, Ontario, Canada, personal communications). Furthermore, CD45 expression is preserved well enough to enable the use of such material in sequential gating strategies that employ CD45 detection (Figs 2 and 3) such as the ISHAGE protocol. In addition, the material is suitable for use with strategies that employ nucleic acid dyes (i.e. LDS-751 and the ‘ProCount’, data not shown). We have previously reported the suitability of such material for use with a variety of flow cytometers (e.g. FACScan, Ortho CytoronAbsolute), lysing reagents (e.g. FACS Lysing solution, Becton Dickinson, San Jose, Mountain View, Calif., and Ortho-mune lysing reagent, Ortho Diagnostic Systems Inc., Raritan, N.J.) and no-wash–no-lyse techniques (Barnett et al, 1996). The striking effect of the preparation's introduction as an EQA material in this programme was the low interlaboratory CVs obtained, when compared to other EQA schemes that lacked standardized protocols. Furthermore, when the stabilized material was used as a reference material, to optimize flow cytometer setup, acquisition and analysis, further reductions in interlaboratory CVs were noted (e.g. 22% and 24% for percentage and absolute CD34+ PBSC values respectively for sample 10).

In agreement with published data from other EQA schemes and workshops, the UK NEQAS CD34+ stem cell enumeration programme has observed a wide variation in methodology ( Chang & Ma, 1996; Brecher et al, 1996 ; Lumley et al, 1996 ; Gratama et al, 1997 ; Chin-Yee et al, 1997b ). The use of a stabilized material, in conjunction with the largest participant base reported to date (102 participants, March 1998), has enabled a detailed analysis of the various methodological approaches. The major factors affecting the results and thus requiring future standardization were identified as follows: the means of determining the total nucleated cell count, the lysing reagent, the class of anti-CD34 antibody and the fluorochrome used, the number of events collected and the gating strategy employed.

The absolute CD34+ count for a given specimen will vary depending on whether a dual- or single-platform system is used. The development of single-platform instrument systems resulted from the requirement for precise and accurate T-lymphocyte subset analysis (Connelly et al, 1995; Strauss et al, 1996). Traditionally, absolute CD4+ T-cell counts were calculated using a dual-platform system; the percentage of CD4+ T lymphocytes being derived from the flow cytometer and the absolute lymphocyte, or total white cell count, from a haematological analyser. It became apparent, however, that such an approach can result in considerable variation (Robinson et al, 1992; Goguel et al, 1993) and led to the development of the FACSCount and CytoronAbsolute, capable of producing data in both absolute and percentage format (Mercolino et al, 1995; Connelly et al, 1995; Strauss et al, 1996). Further software developments have enabled single-platform approaches for CD34+ enumeration on instruments such as the FACScan and FACSCalibur (Verwer & Ward, 1997). Furthermore, recent studies (Chin-Yee et al, 1997a; Keeney et al, 1998) have shown that the ISHAGE protocol can also be modified to become a single-platform approach. The addition of a known number of Flow-Count fluorescent microspheres (Coulter Corporation, Miami, Fla., U.S.A.) to the sample, combined with a lyse–no-wash sample processing approach, enables absolute CD34+ cell counts to be obtained directly from the flow cytometer. However, the majority of participants in the present study employed dual-platform instrumentation, a factor that contributed to the increased variation in absolute CD34+ counts. For example, the overall interlaboratory CV for single-platform analysers (sample 10) was 9.9% compared to 24% for laboratories using dual-platform technology.
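The bead-based single-platform calculation can be sketched as a ratio of event counts scaled by the number of beads added per unit volume of blood. The sketch below is a generic illustration of that principle and not the procedure of any particular kit; the function name, parameter names and example values are assumptions, and it presumes that no cells or beads are lost during processing (hence the lyse–no-wash requirement).

```python
def bead_based_cd34_per_ul(
    cd34_events: int,
    bead_events: int,
    beads_added: float,
    blood_volume_ul: float,
) -> float:
    """Absolute CD34+ count (cells/µL of blood) from a single acquisition.

    cd34_events:     CD34+ events counted in the gate.
    bead_events:     bead events counted in the same acquisition.
    beads_added:     known number of beads added to the tube.
    blood_volume_ul: volume of blood stained, in µL.
    """
    if bead_events <= 0 or blood_volume_ul <= 0:
        raise ValueError("bead_events and blood_volume_ul must be positive")
    return cd34_events / bead_events * beads_added / blood_volume_ul


# Hypothetical acquisition: 200 CD34+ events and 10 000 bead events,
# with 100 000 beads added to 100 µL of blood
count = bead_based_cd34_per_ul(200, 10_000, 100_000, 100.0)
print(f"{count:.1f} CD34+ cells/µL")
```

Because both counts come from the same tube and the same acquisition, the result does not depend on a separate haematology analyser, removing one source of between-laboratory variation.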

A further major factor which contributed to the variation observed in absolute CD34+ count was the lysing solution employed. It is well documented that systems utilizing lyse–no-wash, or no-lyse–no-wash techniques have reduced variability and tighter CVs for CD4+ T-lymphocyte enumeration ( Connelly et al, 1995 ; Strauss et al, 1996 ; Barnett et al, 1996 ). We have confirmed this observation for CD34+ cell enumeration and found that laboratories using lyse–no-wash systems returned CD34 counts approximately 20% higher than those using lyse–wash methods, suggesting that cells are lost during the washing process. It should also be stressed that certain lysing reagents may reduce antigen expression and therefore be a source of additional variability ( McCarthy et al, 1994 ; Macey et al, 1997 ). It is quite feasible that diminution of antigen density due to particular lysing reagents, coupled with the use of FITC and a lyse–wash technique, will result in PBSCs being significantly underestimated; therefore such an approach should not be used.

The CD34 antigen has been reviewed in detail elsewhere ( Sutherland & Keating, 1992; Sutherland et al, 1992 ). It is a heavily glycosylated mucin-like structure with three epitopes, defined by sensitivity to neuraminidase and O-sialo-glycoprotease from Pasteurella haemolytica. Epitopes recognized by class I antibodies are sensitive to both enzymes, those recognized by class II antibodies are sensitive to glycoprotease only, whereas those detected by class III antibodies are insensitive to both enzymes ( Sutherland et al, 1992 ). Class I antibodies fail to detect all glycoforms of the CD34 antigen and may only weakly bind to CD34 expressed on some leukaemias and leukaemia-derived cell lines. An additional decrease in sensitivity is observed if class I and II antibodies are FITC-conjugated. As a result, only PE-conjugated class II and either FITC- or PE-conjugated class III antibodies are recommended for CD34 detection ( Sutherland et al, 1996 ). Not surprisingly, therefore, the single participant that used a class I antibody, variously conjugated to FITC or PE, was identified as a persistent unsatisfactory performer, returning consistently low CD34 counts, with performance only improving when the laboratory switched to a PE-conjugated class III anti-CD34. Initially, 10 laboratories used FITC-conjugated antibodies, although the numbers reduced during the programme to a single participant (class III, FITC-conjugated). Interestingly, the use of FITC-labelled antibodies generally resulted in higher CD34 counts (see Table III), when compared to centres using PE. This finding was most marked with the cord blood, the most likely cause being the different fluorochrome sensitivities and types of gating strategies employed. It is well established that sequential gating strategies exclude debris and define ‘true’ CD34+ CD45dim haemopoietic cells. However, the use of non-sequential gating methods, such as the Milan protocol ( Siena et al, 1991 ), will potentially include debris and non-viable cells which may bind anti-CD34 non-specifically ( Sutherland et al, 1997 ), resulting in a falsely high count. Such an approach, coupled with the use of FITC conjugates, will make the discrimination of debris and CD34+ cells more subjective, accounting for the observed difference in results, especially when analysing cord blood samples.

The present study has revealed a marked variation in the number of CD34+ cells routinely counted. Reducing the number of events per analysis will reduce the reliability of the estimation to unacceptable levels. Because the standard error of the number of positive cells per analysis is given by the square root of the number of positive cells, the larger the acquisition, the lower the coefficient of variation ( Wunder et al, 1992 ). As a result, to maintain precision and also to ensure a methodological CV of 10%, a minimum of 100 CD34+ events should be collected from at least 75 000 CD45 events ( Sutherland et al, 1996 ). This approach is supported by the CD34 Task Force on behalf of the European Working Group on Clinical Cell Analysis ( Gratama et al, 1998 ). Despite these views, 75% of laboratories in the current study collected fewer than 75 000 CD45+ events, with 8% of participants (sample 1) collecting 20 000 or fewer events. Surprisingly, two participants collected as few as 29 and 40 CD34+ events from a total of 14 500 and 10 500 CD45+ events respectively. Furthermore, nine centres using the Milan protocol failed to use an isotype control, despite this being intrinsic to the gating strategy.
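The square-root relation described above implies a counting CV of 100/√N per cent for N positive events. The following minimal sketch applies it to the event counts quoted in this section, assuming that counting error alone (Poisson statistics) determines the methodological CV. The function names are illustrative only.

```python
import math


def counting_cv_percent(positive_events):
    """CV (%) of a count of N positive events: sqrt(N) / N * 100 = 100 / sqrt(N)."""
    if positive_events <= 0:
        raise ValueError("at least one positive event is required")
    return 100.0 / math.sqrt(positive_events)


def events_needed(target_cv_percent):
    """Smallest number of positive events giving a counting CV at or below the target."""
    return math.ceil((100.0 / target_cv_percent) ** 2)


# CD34+ event counts quoted in the text: the two lowest reported
# acquisitions (29 and 40) and the recommended minimum (100).
for n in (29, 40, 100):
    print(f"{n:>4} CD34+ events -> counting CV {counting_cv_percent(n):.1f}%")

print("events needed for a 10% CV:", events_needed(10))
```

Under this assumption, acquisitions of 29 and 40 CD34+ events correspond to counting CVs of roughly 19% and 16%, well above the 10% obtained with the recommended minimum of 100 events.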

In conclusion, we have demonstrated the advantage of utilizing stabilized whole blood samples for EQA of CD34+ cell enumeration. This, in turn, has facilitated the development of an international EQA programme, significantly larger than other schemes reported to date, involving 91 participants in 20 countries. More importantly, it has demonstrated that interlaboratory CVs can, on an international scale, be reduced to < 25%, without using specified gating criteria. However, our findings indicate that further improvements are possible if standardized protocols are adopted, such as those proposed by ISHAGE and EWGCCA ( Sutherland et al, 1996 ; Gratama et al, 1998 ), and highlight the urgent need for nationally and internationally agreed consensus guidelines.

Acknowledgements

We thank Professor G. Janossy (Royal Free Hospital, London) and Associate Professor D. R. Sutherland (Toronto Hospital, Toronto, Canada) for reviewing the manuscript and for the help and advice they have given to this UK NEQAS programme over the past 18 months. We thank Dr C. Doré, Department of Medical Statistics and Evaluation, Royal Postgraduate Medical School, London, and Professor G. Raab, Mathematics Department, Napier University, Edinburgh, for their statistical advice. We also extend our gratitude to the UK NEQAS Haematology Steering Committee and the medical and technical staff within the Department of Haematology, Royal Hallamshire Hospital, for their support of this programme.
