Multi-Core CUDA Architecture for Parallelization of Hierarchical Text Clustering

Provided by: IRD India
Topic: Data Management
Format: PDF
Text clustering is the problem of dividing text documents into groups, such that documents in same group are similar to one another and different from documents in other groups. Because of the general tendency of texts forming hierarchies, text clustering is best performed by using a hierarchical clustering method. An important aspect while clustering large text databases is that of high dimensionality of the representation space. Not only does it take lot of space in storing hierarchy trees but also a lot of time is spent in similarity calculations while clustering these documents.

Find By Topic