Galley: a New Parallel File System for Scientiic Workloads

نویسندگان

  • Nils Nieuwejaar
  • Je Becker
  • Russell Carter
  • Sam Fineberg
چکیده

Most current multiprocessor le systems are designed to use multiple disks in parallel, using the high aggregate bandwidth to meet the growing I/O requirements of parallel scienti c applications. Most multiprocessor le systems provide applications with a conventional Unix-like interface, allowing the application to access those multiple disks transparently. This interface conceals the parallelism within the le system, increasing the ease of programmability, but making it di cult or impossible for sophisticated application and library programmers to use knowledge about their I/O to exploit that parallelism. In addition to providing an insu cient interface, most current multiprocessor le systems are optimized for a di erent workload than they are being asked to support. In this work we examine current multiprocessor le systems, as well as how those le systems are used by scienti c applications. Contrary to the expectations of the designers of current parallel le systems, the workloads on those systems are dominated by requests to read and write small pieces of data. Furthermore, rather than being accessed sequentially and contiguously, as in uniprocessor and supercomputer workloads, les in multiprocessor le systems are accessed in regular, structured, but non-contiguous patterns. Based on our observations of multiprocessor workloads, we have designed Galley, a new parallel le system that is intended to e ciently support realistic scienti c multiprocessor workloads. In this work, we introduce Galley and discuss its design and implementation. We describe Galley's new three-dimensional le structure and discuss how that structure can be used by parallel applications to achieve higher performance. We introduce several new data-access interfaces, which allow applications to explicitly describe the regular access patterns we found to be common in parallel le system workloads. We show how these new interfaces allow parallel applications to achieve tremendous increases in I/O performance. Finally, we discuss how Galley's new le structure and data-access interfaces can be useful in practice.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Performance of the Galley Parallel File System

As the I/O needs of parallel scientific applications increase, file systems for multiprocessors are being designed to provide applications with parallel access to multiple disks. Many parallel file systems present applications with a conventional Unix-like interface that allows the application to access multiple disks transparently. This interface conceals the parallelism within the file system...

متن کامل

Performance of the Galley Parallel File System Nils

As the I O needs of parallel scienti c applications increase le systems for multiprocessors are being designed to provide applications with parallel access to multiple disks Many parallel le systems present applications with a conventional Unix like interface that allows the application to access mul tiple disks transparently This interface conceals the paral lelism within the le system which i...

متن کامل

The Galley Parallel File System Nils

Most current multiprocessor le systems are designed to use multiple disks in parallel using the high aggregate bandwidth to meet the growing I O requirements of parallel scienti c applications Many multiprocessor le systems provide applications with a conventional Unix like interface allowing the application to access multiple disks transparently This interface conceals the parallelism within t...

متن کامل

File-Access Characteristics of Parallel Scientific Workloads

Phenomenal improvements in the computational performance of multiprocessors have not been matched by comparable gains in I/O system performance. This imbalance has resulted in I/O becoming a significant bottleneck for many scientific applications. One key to overcoming this bottleneck is improving the performance of parallel file systems. The design of a high-performance parallel file system re...

متن کامل

Parallel Scientific Workloads

Phenomenal improvements in the computational performance of multiprocessors have not been matched by comparable gains in I/O system performance. This imbalance has resulted in I/O becoming a significant bottleneck for many scientific applications. One key to overcoming this bottleneck is improving the performance of parallel file systems. The design of a high-performance parallel file system re...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 1996