Vector Quantization for Privacy During Big Data Analysis and for Compression of Big Data
Now-a-days, data on the web is increasing like water in the ocean. Big data is one of the emerging areas that deal with large velocity of data. “Big data is not defined in single world data sets that are too large and complex to manipulate or interrogate with standard methods or tools”. Big data is a new term used to identify the datasets that due to their large size and complexity, the authors cannot manage them with their current methodologies or data mining software tools. Big data mining is the capability of extracting useful information from these large datasets or streams of data, that due to its volume, variability and velocity, it was not possible before to do it.