Science & Engineering Research Support soCiety (SERSC)
In this paper, the authors proposed an easy approach for XML ifying of crude corpus in the field of opinion mining. The XMLification is done based on regular expressions. Corpus is the plural form of 'Corpora'. It is nothing but the collection of linguistic data. In this paper, the corpus is reviews posted on web sites; more specifically some product reviews. The reviews or the opinions are in the html files which are collected from sites like Cnet.com, Epinions.com, Amazon.com, ebay.com etc.