The International Journal of Innovative Research in Computer and Communication Engineering
Big data is a popular term used to describe the exponential growth and availability of data, both structured and unstructured. Big data may be important to business and society as the Internet has become. Big data is so large that it's difficult to process using traditional database and software techniques. Big data analytics refers to the process of collecting, organizing and analyzing large sets of data ("Big data") to discover patterns and other useful information systems. Hadoop is based on a simple data model, any data will fit. HDFS (Hadoop Distributed File System) designed to hold very large amounts of data (terabytes or petabytes or even zeta bytes) and provide high-throughput access to this information.