International Journal of Computer Science and Network Solutions (IJCSNS)
The increasing amount of XML datasets available to casual users increases the necessity of investigating techniques to extract knowledge from these data. Data mining is widely applied in the database research area in order to extract frequent values from both structured and semi structured datasets. Extracting information from semi structured documents is a very hard task, and is going to become more and more critical as the amount of digital information available on the Internet grows. Documents are often so large that the data set returned as answer to a query may be too big to convey required knowledge.