11. Document Image Processing and Classification

  1. Frank Y. Shih

Published Online: 4 AUG 2010

DOI: 10.1002/9780470590416.ch11

Image Processing and Pattern Recognition

Image Processing and Pattern Recognition

How to Cite

Shih, F. Y. (2010) Document Image Processing and Classification, in Image Processing and Pattern Recognition, John Wiley & Sons, Inc., Hoboken, NJ, USA. doi: 10.1002/9780470590416.ch11

Publication History

  1. Published Online: 4 AUG 2010
  2. Published Print: 12 APR 2010

ISBN Information

Print ISBN: 9780470404614

Online ISBN: 9780470590416



  • block segmentation;
  • document processing;
  • fuzzy typographical analysis (FTA);
  • image processing;
  • logo identification;
  • rule-based character recognition system (CRS)


In a document processing system, the data in any text content take on a variety of sizes and fonts of characters, graphics, and pictures. Numerous algorithms have been proposed for document processing. The interpretation of graphics and pictures is achieved by image processing and computer vision accompanied with rule-based technologies alluded to in the descriptive paragraphs. Therefore, the document processing system is the state-of-the-art enterprise of automating and integrating a wide range of processes and representations for document perception. It integrates many techniques involved in computer graphics, image processing, computer vision, and pattern recognition. This chapter is organized as follows: Block segmentation and classification are presented followed by a rule-based character recognition system (CRS). The chapter describes Logo identification and presents the Fuzzy typographical analysis (FTA) for character preclassification.

Controlled Vocabulary Terms

character recognition equipment; fuzzy systems; image processing