International Journal of Computer & Organization Trends (IJCOT)
Document clustering is a specific technique for unsupervised document organization and is generally considered to be a centralized process. Clustering methods can be used to automatically group retrieved documents into a list of meaningful categories. This paper gives an overview of some of the most widely used document clustering techniques and introduces the MATLAB tool, which provides many functions that help in clustering documents. In particular, the authors concentrate on the two techniques most commonly used for document clustering, agglomerative hierarchical clustering and K-means, and on the related functions available in the MATLAB toolbox.