International Journal Of Electronics,Communication And Soft Computing Science &Engineering (IJECSCSE)
Large scale data analysis is increasingly important in both the academics and enterprise. Statistical languages provide rich functionality and ease of use for large data analysis. Hadoop has changed the economics and the dynamics of large scale computing. It enables scalable and cost effectively. To collect the insights from this data, R is very amazing tool which allows running advanced statistical model on data. This paper gives an overview of large scale data analysis by Hadoop and using R on Hadoop.