Date Added: Nov 2010
This paper compares several methods of information extraction on the internet. Today, internet has become a treasure of knowledge. Every year, thousands of pieces of different information are posted on the internet. So, extracted information on the internet for many different purposes has become an important problem today. Users may extract information based on some available tools such as Lapis, Risk, Rapier, Wien, and Stalker? However, these tools have a disadvantage: The authors must update the training data when the website changes. So SVM and CRF associated with natural language processing are the best solutions to solve this problem.