On Multi Class Vector Space Model-based Information Retrieval

This paper proposes a new information retrieval model, the multi class vector space model, which builds on the traditional vector space model. Web documents are semi-structured. Indexing keywords or terms may occur at any location in a web document, and the content of a given location can represent important information in that document. The traditional vector space model ignores the importance of a term's position when calculating the weights of indexing terms. Experimental results show that the proposed method improves on the performance of the vector space model, saves storage space, and speeds up retrieval, while achieving high precision and recall.
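The idea of weighting a term by where it occurs can be illustrated with a minimal sketch. It is not the paper's algorithm: the location classes (`title`, `heading`, `body`), the values in `LOCATION_WEIGHTS`, the smoothed IDF, and all function names are assumptions chosen only for illustration. The sketch scales each term occurrence by a weight for its location before applying a standard TF-IDF and cosine similarity.

```python
import math
from collections import Counter

# Hypothetical location classes and weights (assumed; not taken from the paper).
LOCATION_WEIGHTS = {"title": 3.0, "heading": 2.0, "body": 1.0}


def location_weighted_tf(doc):
    """Sum term occurrences, scaling each by the weight of its location.

    `doc` maps a location class to the list of tokens found there.
    Unknown locations fall back to a weight of 1.0.
    """
    tf = Counter()
    for location, tokens in doc.items():
        weight = LOCATION_WEIGHTS.get(location, 1.0)
        for token in tokens:
            tf[token] += weight
    return tf


def idf_table(docs):
    """Smoothed inverse document frequency: log(N / df) + 1 (an assumed variant)."""
    n = len(docs)
    df = Counter()
    for doc in docs:
        df.update({t for tokens in doc.values() for t in tokens})
    return {t: math.log(n / count) + 1.0 for t, count in df.items()}


def vectorize(doc, idf):
    """Build a sparse term vector of location-weighted TF times IDF."""
    tf = location_weighted_tf(doc)
    return {t: f * idf.get(t, 0.0) for t, f in tf.items()}


def cosine(u, v):
    """Cosine similarity between two sparse vectors stored as dicts."""
    dot = sum(w * v.get(t, 0.0) for t, w in u.items())
    norm_u = math.sqrt(sum(w * w for w in u.values()))
    norm_v = math.sqrt(sum(w * w for w in v.values()))
    if norm_u == 0.0 or norm_v == 0.0:
        return 0.0
    return dot / (norm_u * norm_v)
```

Under these assumed weights, a document with a query term in its title scores higher than a document with the same term only in its body, which a position-blind vector space model would not distinguish when the raw counts are equal.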