Dynamic balancing of communication and computation load for HLA-based simulations on large-scale distributed systems
نویسندگان
چکیده
Dynamic balancing of computation and communication load is vital for the execution stability and performance of distributed, parallel simulations deployed on the shared, unreliable resources of large-scale environments. High Level Architecture (HLA) based simulations can experience a decrease in performance due to imbalances that are produced initially and/or during run time. These imbalances are generated by the dynamic load changes of distributed simulations or by unknown, non-managed background processes resulting from the non-dedication of shared resources. Due to the dynamic execution characteristics of elements that compose distributed applications, the computational load and interaction dependencies of each simulation entity change during run time. These dynamic changes lead to an irregular load and communication distribution, which increases overhead of resources and latencies. A static partitioning of load is limited to deterministic applications and is incapable of predicting the dynamic changes caused by distributed applications or by external background processes. Therefore, a scheme for balancing the communication and computational load during the execution of distributed simulations is devised in a scalable hierarchical architecture. The proposed balancing system employs local and cluster monitoring mechanisms in order to observe the distributed load changes and identify imbalances, repartitioning policies to determine a distribution of load and minimize imbalances. A migration technique is also employed by this proposed balancing system to perform reliable and low-latency load transfers. Such a system successfully improves the use of shared resources and increases distributed simulations’ performance by minimizing communication latencies and partitioning the load evenly. Experiments and comparative analyses were conducted in order to identify the gains that the proposed balancing scheme provides to large-scale distributed simulations.
منابع مشابه
Improved Prediction-based Dynamic Load Balancing Systems for HLA-Based Distributed Simulations
Due to the dependency of High-Level Architecture (HLA)-Based simulations on the resources of distributed environments, simulations can face load imbalances and can suffer from low performance in terms of execution time. HLA is a framework that simplifies the implementation of distributed simulations; it also has been built with dedicated resources in mind. As technology is nowadays shifting tow...
متن کاملOrganization detection for dynamic load balancing in individual-based simulations
Large-scale individual-based simulations can benefit a lot from high performance computing environments. The benefit that can be hopped depends greatly on a good load distribution among the processing ressources together with the minimization of the communication overhead. However, minimizing both idle time and communication overhead requires the search for a trade-off. Inspired by complex syst...
متن کاملAn Efficient and Secure Cloud-Based Distributed Simulation System
For the deficiency of High Level Architecture (HLA), it is not well suited for large-scale distributed simulation systems. To enhance the capability of HLA and satisfy the new requirements of large-scale distributed simulations, an efficient and secure cloudbased distributed simulation system, namely CDS, based on the cloud computing technology and HLA is proposed. CDS provides a service-orient...
متن کاملA Parallel Architecture for IISPH Fluids
We present an architecture for parallel computation of incompressible IISPH simulations on distributed memory systems. We use orthogonal recursive bisection for domain decomposition and present a stable and fast converging load balancing controller. The neighbor search data structure is derived such that it optimally fits into the parallel pipeline. We further show how symmetry aspects of the s...
متن کاملDesign and Analysis of a Dynamic Load Balancing Strategy for Large-Scale Distributed Association Rule Mining
Association rule mining is one of the most important data mining techniques. Algorithms of this technique search a large space, considering numerous different alternatives and scanning the data repeatedly. Parallelism seems to be the natural solution in order to be able to work with industrial-sized databases. Large-scale computing systems, such as Grid computing environments, are recently rega...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
- J. Parallel Distrib. Comput.
دوره 71 شماره
صفحات -
تاریخ انتشار 2011