The Google File System

Pranavi Adusumilli

Abstract: The Google File System (GFS) is a scalable distributed file system for large, distributed, data-intensive applications. It provides fault tolerance while running on inexpensive commodity hardware, and it delivers high aggregate performance to a large number of clients. While it shares many of the goals of earlier distributed file systems, its design has been driven by observations of Google's application workloads and technological environment, both current and anticipated, which reflect a marked departure from some earlier file system assumptions. This led its designers to reexamine traditional choices and explore radically different design points. The file system has successfully met Google's storage needs. It is widely deployed within Google as the storage platform for generating and processing the data used by Google's services, as well as for research and development efforts that require large data sets. The largest cluster to date provides hundreds of terabytes of storage across thousands of disks on over a thousand machines, and it is concurrently accessed by hundreds of clients. In this paper, I present the file system interface extensions that Google designed to support distributed applications, discuss many aspects of the design, and report measurements from both micro-benchmarks and real-world use.

Keywords: Google File System, large distributed data-intensive applications.

International Journal of Computer Science and Information Technology Research

ISSN 2348-1196 (print), ISSN 2348-120X (online)

Research Publish Journals

Vol. 3, Issue 2, April 2015 - June 2015
