Script Identification of Camera-Based Images

Executive Summary

Script identification determines the script in which a document image is written. This paper reports a statistical script identification technique for document images, especially camera-based images, which suffer from perspective distortion. The technique represents a document image as a frequency vector of affine-invariant character signatures and identifies the script by comparing that vector with script templates prepared in advance. Experimental results show that the authors' method tolerates moderate perspective distortion, document skew, and various kinds of image noise.
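
The template-comparison step described above can be illustrated with a minimal sketch. It is not the authors' implementation: the summary does not say how signatures are computed or which similarity measure is used, so the sketch assumes that each character has already been reduced to a discrete signature label and that cosine similarity is an acceptable stand-in for the comparison. The names `frequency_vector`, `identify_script`, `VOCABULARY`, and `TEMPLATES`, and all numeric values, are hypothetical.

```python
import math
from collections import Counter


def frequency_vector(signatures, vocabulary):
    """Return the relative frequency of each vocabulary signature in a document."""
    counts = Counter(signatures)
    total = sum(counts[s] for s in vocabulary)
    if total == 0:
        # No known signatures were found; return an all-zero vector.
        return [0.0] * len(vocabulary)
    return [counts[s] / total for s in vocabulary]


def cosine_similarity(a, b):
    """Cosine similarity (an assumed measure, not taken from the paper)."""
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    if norm_a == 0.0 or norm_b == 0.0:
        return 0.0
    return dot / (norm_a * norm_b)


def identify_script(signatures, templates, vocabulary):
    """Pick the template whose frequency vector is most similar to the document's."""
    vec = frequency_vector(signatures, vocabulary)
    return max(templates, key=lambda name: cosine_similarity(vec, templates[name]))


# Hypothetical signature labels and script templates, for illustration only.
VOCABULARY = ["s0", "s1", "s2", "s3"]
TEMPLATES = {
    "script_A": [0.6, 0.3, 0.1, 0.0],
    "script_B": [0.1, 0.1, 0.3, 0.5],
}

if __name__ == "__main__":
    document_signatures = ["s0", "s0", "s1", "s2"]
    print(identify_script(document_signatures, TEMPLATES, VOCABULARY))
```

Because the document is summarized by signature frequencies rather than by character positions, a comparison of this kind depends only on how well the signatures themselves survive perspective distortion, skew, and noise.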

