• subcellular features;
• automated recognition;
• CHO cells;
• Vero cells;
• GFP;
• rejection rate


Systemic analysis of subcellular protein localization (location proteomics) provides clues for understanding gene function and the physiological condition of cells. However, recognizing subcellular structures in cell images depends heavily on experience and becomes the rate-limiting step when classifying subcellular protein localization. Several research groups have extracted specific numerical features for recognizing subcellular protein localization, but these recognition systems are restricted to images of a single cell line acquired with one specific imaging system and have not been applied to a range of cell image sources. In this study, we establish a single system for automated subcellular structure recognition that identifies cell images from various sources. Cell images of subcellular structures from two different sources, 317 Vero and 875 CHO cell images, were used to train and test the system. When the system was trained on a single source of images, the recognition rate was high but specific to that source. The system trained on CHO cell images gave a high average recognition accuracy of 96% for CHO cells, but this fell to 46% for Vero images. When we trained the system on a mixture of CHO and Vero cell images, the average recognition accuracy reached 86.6% for both CHO and Vero cell images. The system can reject images with low confidence and identify the correctly recognized cell images, avoiding manual reconfirmation. In summary, we have established a single system that can recognize subcellular protein localizations from two different sources for location-proteomic studies. Microsc. Res. Tech., 2008. © 2007 Wiley-Liss, Inc.
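The rejection of low-confidence images mentioned above can be illustrated with a minimal sketch. The abstract does not say how confidence is computed, so everything here is an assumption: the classifier is taken to output one score per class, the top score is used as the confidence, and the 0.8 threshold, the class names, and the function names `classify_with_rejection` and `summarize` are hypothetical.

```python
from typing import List, Optional, Sequence, Tuple


def classify_with_rejection(
    scores: Sequence[float],
    class_names: Sequence[str],
    threshold: float = 0.8,  # hypothetical cut-off, not taken from the paper
) -> Optional[str]:
    """Return the top-scoring class, or None (rejected) if its score is below threshold."""
    if len(scores) != len(class_names):
        raise ValueError("scores and class_names must have the same length")
    best = max(range(len(scores)), key=lambda i: scores[i])
    return class_names[best] if scores[best] >= threshold else None


def summarize(
    predictions: Sequence[Optional[str]], truths: Sequence[str]
) -> Tuple[float, float]:
    """Rejection rate over all images, and accuracy over the accepted images only."""
    accepted = [(p, t) for p, t in zip(predictions, truths) if p is not None]
    rejection_rate = 1 - len(accepted) / len(predictions)
    if not accepted:
        return rejection_rate, float("nan")
    accuracy = sum(p == t for p, t in accepted) / len(accepted)
    return rejection_rate, accuracy


# Illustrative data only: made-up class names and scores, not results from the study.
classes: List[str] = ["nucleus", "mitochondria", "golgi"]
batch = [
    [0.95, 0.03, 0.02],
    [0.40, 0.35, 0.25],  # ambiguous, so it falls below the threshold
    [0.05, 0.90, 0.05],
    [0.10, 0.05, 0.85],
]
truths = ["nucleus", "golgi", "mitochondria", "golgi"]

preds = [classify_with_rejection(s, classes) for s in batch]
rejection_rate, accuracy = summarize(preds, truths)
```

Raising the threshold trades a higher rejection rate for higher accuracy on the accepted images; rejected images would go to manual review, while accepted ones would not need reconfirmation.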