Design and Development of Artificial Neural Network Based Tamil Unicode Symbols Identification System
The design and development of systems that recognize Unicode characters, especially for Indian scripts, is an active area of research. This work attempts to identify the symbols of Tamil, a vernacular language of southern India and the official language of Tamil Nadu. The Tamil language presents great challenges to an OCR designer because of the large number of letters in its alphabet (247), the sophisticated ways in which they combine, and the complicated graphemes that result. Conventional programming methods, which map symbol images into matrices, analyze pixel and/or vector data, and try to decide which symbol corresponds to which character, yield little or no realistic results.
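To illustrate where the figure of 247 letters comes from and how the letters combine, the following is a minimal Python sketch. It is not part of the system described here. It assumes the traditional grouping of the Tamil alphabet into 12 vowels, 18 consonants, 216 consonant-vowel compounds, and the aytham, and it builds each letter from code points in the Unicode Tamil block. The names `VOWELS`, `CONSONANTS`, `VOWEL_SIGNS`, and `tamil_letters` are chosen for illustration only.

```python
# Illustrative sketch: enumerate the 247 Tamil letters as Unicode sequences.
# Assumes the traditional count 12 + 18 + 216 + 1 (Grantha letters excluded).

# The 12 independent vowels.
VOWELS = [chr(c) for c in (0x0B85, 0x0B86, 0x0B87, 0x0B88, 0x0B89, 0x0B8A,
                           0x0B8E, 0x0B8F, 0x0B90, 0x0B92, 0x0B93, 0x0B94)]

# The aytham, encoded as TAMIL SIGN VISARGA.
AYTHAM = chr(0x0B83)

# The 18 consonants; in Unicode each one carries an inherent "a" vowel.
CONSONANTS = [chr(c) for c in (0x0B95, 0x0B99, 0x0B9A, 0x0B9E, 0x0B9F, 0x0BA3,
                               0x0BA4, 0x0BA8, 0x0BAA, 0x0BAE, 0x0BAF, 0x0BB0,
                               0x0BB2, 0x0BB5, 0x0BB4, 0x0BB3, 0x0BB1, 0x0BA9)]

# The empty string stands for the inherent "a"; the other 11 are vowel signs.
VOWEL_SIGNS = [""] + [chr(c) for c in (0x0BBE, 0x0BBF, 0x0BC0, 0x0BC1, 0x0BC2,
                                       0x0BC6, 0x0BC7, 0x0BC8, 0x0BCA, 0x0BCB,
                                       0x0BCC)]

# The pulli (virama) suppresses the inherent vowel, giving a pure consonant.
PULLI = chr(0x0BCD)


def tamil_letters():
    """Return the Tamil letters as a list of Unicode strings."""
    pure = [c + PULLI for c in CONSONANTS]                       # 18 letters
    compound = [c + s for c in CONSONANTS for s in VOWEL_SIGNS]  # 18 * 12 = 216
    return VOWELS + [AYTHAM] + pure + compound


if __name__ == "__main__":
    letters = tamil_letters()
    print(len(letters))  # 247
```

Most letters in this enumeration are sequences of two code points rather than a single one, and several vowel signs change the rendered shape of the consonant they attach to. A recognizer therefore has to map one visual grapheme to a sequence of Unicode code points, not to a single symbol.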