Information Extraction Using Metadata and Solving Polysemy Problems
Data mining is the exploration and evaluation of large quantity of data to discover substantial, novel, useful and effectively understandable data. Hence determining the knowledge of a document becomes a necessary task in data mining. There are three approaches of metadata in general. They are stylistic, machine learning and knowledge bases. Sometimes the problem occurs when mining a document that contains polysemic words which leads to irrelevant extraction and increased processing time. Polysemy refers to coexistence of many possible meaning for a word or phrase. In order to extract exact information, polysemy like issue should be solved.