Weblog Analysis with Map-Reduce and Performance Comparison of Single v/s Multinode Hadoop Cluster

Provided by: Auricle Technologies
Topic: Big Data
Format: PDF
In the internet era, websites are a useful source of information. Because of the growing popularity of the World Wide Web (WWW), a website can receive thousands to millions of requests per day, so its log files grow in size day by day. These log files are a useful source of information for identifying user behavior. This paper attempts to analyze weblogs using the Hadoop MapReduce programming model. Hadoop is an open source framework that provides distributed storage and parallel processing of large datasets. The paper uses this capability to analyze the large, semi-structured dataset of website logs.
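As a rough illustration of the map and reduce steps such an analysis involves, the sketch below counts requests per URL in plain Python, without Hadoop. It is not the paper's implementation: the log format (Common Log Format), the regular expression, the sample lines, and all function names are assumptions made for this example.

```python
import re
from collections import defaultdict

# Assumed Common Log Format; the paper's actual log layout is not given here.
LOG_PATTERN = re.compile(
    r'^(\S+) \S+ \S+ \[[^\]]+\] "(\S+) (\S+)[^"]*" (\d{3}) (\S+)'
)

def map_line(line):
    """Map step: emit (requested_path, 1) for each parseable log line."""
    match = LOG_PATTERN.match(line)
    if match is None:
        return []  # skip malformed lines, since weblogs are semi-structured
    return [(match.group(3), 1)]

def shuffle(pairs):
    """Group values by key, as the framework does between map and reduce."""
    groups = defaultdict(list)
    for key, value in pairs:
        groups[key].append(value)
    return groups

def reduce_counts(key, values):
    """Reduce step: sum the counts emitted for one path."""
    return key, sum(values)

def count_hits(lines):
    """Run map, shuffle, and reduce in a single process."""
    pairs = [pair for line in lines for pair in map_line(line)]
    return dict(reduce_counts(k, v) for k, v in shuffle(pairs).items())

# Hypothetical sample data, invented for illustration only.
SAMPLE_LOG = [
    '10.0.0.1 - - [10/Oct/2023:13:55:36 +0000] "GET /index.html HTTP/1.1" 200 2326',
    '10.0.0.2 - - [10/Oct/2023:13:55:40 +0000] "GET /about.html HTTP/1.1" 200 1200',
    '10.0.0.1 - - [10/Oct/2023:13:56:01 +0000] "GET /index.html HTTP/1.1" 304 -',
    'malformed line',
]

if __name__ == "__main__":
    print(count_hits(SAMPLE_LOG))
```

On a real cluster the map and reduce functions would run in parallel on different nodes over blocks of the log file; this single-process version only shows the data flow.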
