A Survey on Text Based Indexing Techniques in Hadoop

Provided by: International Journal of Advanced Research in Computer Science and Software Engineering (IJARCSSE)
Topic: Cloud
Format: PDF
Cloud computing is an emerging area within the field of Information Technology (IT). In a pure cloud computing model, this means having all the software and data hosted on a server or a pool of servers, and accessing them through the internet without the need for very much (if any) local hard disk, memory, or processor capacity, allowing the use of very light weight client computers by the end user. Current Cloud systems rely on underlying Distributed File Systems (DFS) to manage data. Examples include Google's GFS and Hadoop's HDFS. The challenges here lie in how to partition data among nodes and how to have nodes collaborate for a specific job.

Find By Topic