Improving the Read Performance of the Distributed File System Through Anticipated Parallel Processing
In the emerging big data scenario, Distributed File Systems (DFSs) are used to store and access information in a scalable manner, and many cloud computing systems use a DFS as their main storage component. Big data applications deployed in cloud computing systems perform read operations more frequently than write operations, so improving read performance has become an important research issue for DFSs. In this paper, several client-side caching techniques with appropriate prefetching are proposed to improve the read performance of the DFS.
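As a rough illustration of the general idea of client-side caching combined with prefetching, the following minimal Python sketch keeps recently read blocks in a small LRU cache and, after every read, also loads the next few blocks in anticipation of sequential access. It is not the algorithm proposed in the paper: the class name `PrefetchingBlockCache`, the `fetch_block` callback standing in for a remote DFS read, the `capacity` and `read_ahead` parameters, and the sequential read-ahead policy are all assumptions made for this example. For simplicity the prefetch runs synchronously, whereas a real client would likely issue it in the background.

```python
from collections import OrderedDict


class PrefetchingBlockCache:
    """Hypothetical client-side LRU block cache with sequential read-ahead."""

    def __init__(self, fetch_block, capacity=8, read_ahead=2):
        self.fetch_block = fetch_block  # assumed remote read: block_id -> bytes
        self.capacity = capacity        # maximum number of cached blocks
        self.read_ahead = read_ahead    # blocks to prefetch after each read
        self.blocks = OrderedDict()     # block_id -> data, oldest entry first
        self.hits = 0
        self.misses = 0

    def _store(self, block_id, data):
        # Insert the block as most recently used, evicting the oldest if full.
        self.blocks[block_id] = data
        self.blocks.move_to_end(block_id)
        while len(self.blocks) > self.capacity:
            self.blocks.popitem(last=False)

    def read(self, block_id):
        if block_id in self.blocks:
            self.hits += 1
            self.blocks.move_to_end(block_id)
            data = self.blocks[block_id]
        else:
            self.misses += 1
            data = self.fetch_block(block_id)
            self._store(block_id, data)
        # Anticipate sequential access: load the following blocks now
        # (synchronously here; a real client would do this in the background).
        for nxt in range(block_id + 1, block_id + 1 + self.read_ahead):
            if nxt not in self.blocks:
                self._store(nxt, self.fetch_block(nxt))
        return data


def demo():
    """Read six consecutive blocks through the cache; return (hits, misses)."""
    def fake_remote_fetch(block_id):
        # Stand-in for a network read from the DFS.
        return f"block-{block_id}".encode()

    cache = PrefetchingBlockCache(fake_remote_fetch)
    for block_id in range(6):
        cache.read(block_id)
    return cache.hits, cache.misses
```

With a sequential access pattern such as the one in `demo()`, only the first read has to wait for the remote fetch; later reads are served from blocks that were prefetched earlier. Random access patterns would benefit far less from this particular policy.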