Introducing Endpoints into the EMPI4Re MPI library

نویسنده

  • Daniel Holmes
چکیده

Endpoints is a topic under discussion within the MPI forum [3] that proposes to introduce a new communicator creation function that creates a communicator with multiple ranks for each MPI process in a parent communicator. In this talk we will discuss our initial implementation of endpoints in our research MPI library, called EMPI4Re. The Exascale MPI library for Research (the EMPI4Re library) is an MPI-1 library created by EPCC at The University of Edinburgh as a vehicle for research into new MPI functionality. The library adopts the conceptual model of Partitioned Global Address Space (PGAS) and assumes hardware support for Remote Direct Memory Access (RDMA) operations. This conceptual model enables efficient implementation of remotely accessible double-buffered first-in-first-out (FIFO) queues, used for point-to-point operations, and distributed state control structures, used for collective operations. Modern machine architectures, such as the Cray XC30, support RDMA operations only for exposed segments of process memory. Data movement of arbitrary application buffers is enabled by collective window creation operations (using MPI terminology) or symmetric memory allocation operations (using DMAPP terminology). The code-base for the EMPI4Re library currently consists of 55,495 lines of C code (c.f. 933,889 for OpenMPI version 1.8.6) including source code and header files for both the library and its test suite. The endpoint function, MPI_COMM_CREATE_ENDPOINTS, is collective over all the MPI processes in the parent communicator and returns an array of handles for the new communicator. Each handle in the array corresponds to one of the ranks in the new communicator. The ranks represented by the handles in the array are contiguous and the starting rank is defined by the rank of the calling MPI process in the

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Flexible Intermediate Library for MPI-2 Support on an SCore Cluster System

A flexible intermediate library named Stampi for MPI-2 support on a heterogeneous computing environment has been implemented on an SCore cluster system. With the help of a flexible communication mechanism of this library, users can execute MPI functions without awareness of underlying communication mechanism. In message transfer of Stampi, a vendor-supplied MPI library and TCP sockets are used ...

متن کامل

MGF: A Grid-Enabled MPI Library with a Delegation Mechanism to Improve Collective Operations

The success of Grid technologies depends on the ability of libraries and tools to hide the heterogeneous complexity of Grid systems. MPI-based programming libraries can make this environment more accessible to developers with parallel programming skills. In this paper we presentMGF, anMPI library which extends the existingMPICH-G2.MGF aims are: to allow parallel MPI applications to be executed ...

متن کامل

Lightweight threading with MPI using Persistent Communications Semantics

Multi-threaded performance in MPI is of concern for future systems, particularly at Exascale, where massive concurrency will be necessary to leverage the full power of systems. While MPI provides generalized solutions and additional proposals like endpoints expand this general model, examining common use cases that have good solutions that may not be universally applicable is a viable additiona...

متن کامل

Parleda: a Library for Parallel Processing in Computational Geometry Applications

ParLeda is a software library that provides the basic primitives needed for parallel implementation of computational geometry applications. It can also be used in implementing a parallel application that uses geometric data structures. The parallel model that we use is based on a new heterogeneous parallel model named HBSP, which is based on BSP and is introduced here. ParLeda uses two main lib...

متن کامل

Automatic Pro ling of MPI Applications with Hardware Performance Counters

This paper presents an automatic counter instrumentation and pro ling module added to the MPI library on Cray T3E and SGI Origin2000 systems. A detailed summary of the hardware performance counters and the MPI calls of any MPI production program is gathered during execution and written in MPI Finalize on a special syslog le. The user can get the same information in a di erent le. Statistical su...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2015