Provided by:
International Journal of Computer Applications
Topic:
Big Data
Format:
PDF
As well actual clustering algorithms have to deal with explosive growth of documents of various sizes and terms of various frequencies, an appropriate term-weighting scheme has a crucial impact on the overall performance of such systems. Term-weighting is one of the critical processes for document retrieval and ranking in most search result clustering systems. In this paper, the authors introduce a new technique for clustering algorithms that solve the problem of indexing the terms of big datasets and their characteristics which exist in most of current clustering approaches.