An Effective Text Processing Approach with MapReduce
Information extraction is a technology that is innovative from the user’s point of view in the current information-driven world. Rather than indicating which documents need to be read by a user, it extracts pieces of information that are salient to the user’s needs. Links between the extracted information and the original documents are maintained to allow the user to reference context for example Named Entity Recognition (NER). It helps machine to recognize proper nouns (entities) in text and associating them with the appropriate types.