University of Madhya Pradesh
Today data is increasing in volume, variety and velocity. To manage this data, the authors have to use databases with massively parallel software running on tens, hundreds, or more than thousands of servers. So Big data platforms are used to acquire organize and analyze these types of data. In this paper, first of all, they will acquire data from social media using Flume. Flume can take log files as source and after collecting data, it can store it directly to file system like HDFS or GFS. Then, organize this data by using different distributed file system such as Google file system or Hadoop file system.