International Research Publication House (IRPH)
Data mining is typically concerned with the detection of patterns in numeric data, but very often important (e.g., critical to business) information is stored in the form of text. Unlike numeric data, text is often amorphous, and difficult to deal with. Text mining generally consists of the analysis of (multiple) text documents by extracting key phrases, concepts, etc. and the preparation of the text processed in that manner for further analyses with numeric data mining techniques. In this paper, the authors have presented an overview of text mining and surveyed some of the techniques used to discover knowledge from text databases.