DiscoTEX: A Framework of Combining IE and KDD for Text Mining

DiscoTEX is a text mining framework that integrates Information Extraction (IE) with traditional Knowledge Discovery from Databases (KDD). Traditional data mining assumes that the information to be mined is already in the form of a relational database, an assumption that does not hold in many cases. The authors first present the idea of combining IE and KDD serially for text mining, then explain how a document in this system can be represented as a vector of textual elements, and finally show empirically that rules mined from IE-extracted data are nearly as accurate as those discovered from manually extracted data.
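The serial arrangement described above (extract first, then mine) can be illustrated with a minimal Python sketch. It is not the authors' implementation: the keyword list, the sample documents, the function names extract and mine_rules, and the support and confidence thresholds are all hypothetical, and a simple keyword lookup stands in for a real IE system.

```python
import re
from collections import Counter
from itertools import permutations

# Hypothetical keyword list standing in for a trained IE system.
KNOWN_TERMS = ["java", "sql", "html", "linux"]


def extract(document):
    """Toy IE step: return the set of known terms found in the text."""
    words = set(re.findall(r"[a-z]+", document.lower()))
    return frozenset(t for t in KNOWN_TERMS if t in words)


def mine_rules(records, min_support=2, min_confidence=0.6):
    """Toy KDD step: mine single-antecedent rules a -> b from the records."""
    item_counts = Counter()
    pair_counts = Counter()
    for rec in records:
        item_counts.update(rec)
        # Count every ordered pair (a, b) that co-occurs in one record.
        pair_counts.update(permutations(sorted(rec), 2))
    rules = {}
    for (a, b), n in pair_counts.items():
        confidence = n / item_counts[a]
        if n >= min_support and confidence >= min_confidence:
            rules[(a, b)] = confidence
    return rules


# Hypothetical sample documents, invented for illustration only.
docs = [
    "Seeking Java developer with SQL experience.",
    "Java and SQL required; Linux a plus.",
    "Web designer: HTML skills needed.",
    "Java programmer wanted.",
]

# Serial pipeline: IE turns free text into structured records, then KDD mines them.
records = [extract(d) for d in docs]
rules = mine_rules(records)

for (a, b), confidence in sorted(rules.items()):
    print(f"{a} -> {b} (confidence {confidence:.2f})")
```

Each record here is a set of extracted terms, a simplified stand-in for the vector of textual elements mentioned in the abstract. The mining step never sees the raw text, which is the point of running the two stages serially.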

Provided by: Kurukshetra University
Topic: Big Data
Date Added: Jun 2012
Format: PDF
