Overcoming Distributed Debugging Challenges in the MPI+OpenMP Programming Model

نویسندگان

  • Lai Wei
  • Ignacio Laguna
  • Dong H. Ahn
  • Matthew P. LeGendre
  • Gregory L. Lee
چکیده

There is a general consensus that exascale computing will employ a wide range of programming models to harness the many levels of architectural parallelism [1], including models to exploit parallelism in CPUs and devices, such as OpenMP. To aid programmers in managing the complexities arising from multiple programming models, debugging tools must enable programmers to identify errors at the level of the programming model. However, the question of what the effective levels for debugging in hybrid distributed models are, remains unanswered. In this work, we present a novel framework to build an intuitive stack trace view of MPI+OpenMP programs. We develop a methodology to reconstruct call stacks for OpenMP threads and share our lessons learned from incorporating OpenMP awareness into a highly-scalable, lightweight debugging tool for MPI applications: the Stack Trace Analysis Tool (STAT) [2]. Our framework leverages OMPD [3], an emerging debugging interface for OpenMP, so that we can evaluate the effective levels of debugging for MPI+OpenMP. Our easy-to-understand stack trace views help users debug MPI+OpenMP programs at the user code level by mapping the stack traces to the high-level abstractions provided by programming models.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Static/Dynamic Analyses for Validation and Improvements of Multi-Model HPC Applications. (Analyse statique/dynamique pour la validation et l'amélioration des applications parallèles multi-modèles)

Supercomputing plays an important role in several innovative fields, speeding up prototyping or validating scientific theories. However, supercomputers are evolving rapidly with now millions of processing units, posing the questions of their programmability. Despite the emergence of more widespread and functional parallel programming models, developing correct and effective parallel application...

متن کامل

A Hybrid MPI+OpenMP Application for Processing Big Trajectory Data

In this paper, we present the use of parallel/distributed programming frameworks, MPI and OpenMP, in processing and analysis of big trajectory data. We developed a distributed application that initially performs a spatial join between big trajectory data and regions of interest, and further aggregates join results to provide analysis of movement. The solution was implemented using hybrid distri...

متن کامل

Parallel computing using MPI and OpenMP on self-configured platform, UMZHPC.

Parallel computing is a topic of interest for a broad scientific community since it facilitates many time-consuming algorithms in different application domains.In this paper, we introduce a novel platform for parallel computing by using MPI and OpenMP programming languages based on set of networked PCs. UMZHPC is a free Linux-based parallel computing infrastructure that has been developed to cr...

متن کامل

Integrating MPI and the Nanothreads Programming Model

This paper presents a prototype runtime system that integrates MPI, used on distributed memory systems, and Nanothreads Programming Model (NPM), a programming model for shared memory multiprocessors. This integration does not alter the independence of the two models, since the runtime system is based on a multilevel design that supports each of them individually but offers the capability to com...

متن کامل

Study of parallel programming models on computer clusters with Intel MIC coprocessors

Coprocessors based on the Intel Many Integrated Core (MIC) Architecture have been adopted in many highperformance computer clusters. Typical parallel programming models, such as MPI and OpenMP, are supported on MIC processors to achieve the parallelism. In this work, we conduct a detailed study on the performance and scalability of the MIC processors under different programming models using the...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2015