Enabling Queries Using the Grid-brick Approach: A Distributed Data Storage Architecture
نویسندگان
چکیده
This paper presents a Grid-based parallel Event Processing System (GEPS) that is applicable to all the domains where large collections of separate blocks of data (images) need to be processed, stored and analyzed. Data intensive applications are becoming increasingly important in many science areas. In many domains the need for future computing and data management capabilities is difficult to accommodate within the expected technology improvements and the usage of scalable solutions, distributed over a large number of nodes, becomes essential. The processing, storage and analysis of High Energy Physics collisions illustrates the main challenges of this research area. Using the Globus grid toolkit, we have developed the GEPS system that provides a framework for data storage, processing and analysis that is based on the Grid-brick approach. The main concept of this approach is that individual farm nodes mirror the same architecture to provide all the available services for the data stored locally that is to each node. The data is split over different nodes that are responsible fro processing the local queries. Performance results indicate that event processing and filtering can be effectively implemented on GEPS, encouraging the continued effort to improve the GEPS prototype.
منابع مشابه
Grid-Brick Event Processing Framework in GEPS
Experiments like ATLAS at LHC involve a scale of computing and data management that greatly exceeds the capability of existing systems, making it necessary to resort to Grid-based Parallel Event Processing Systems (GEPS). Traditional Grid systems concentrate the data in central data servers which have to be accessed by many nodes each time an analysis or processing job starts. These systems req...
متن کاملAgent-Based Query Optimisation in a Grid Environment
+ IASTED International Conference on Applied Informatics, Innsbruck, Austria, February 2001 Abstract The next generation experiments in High Energy Physics are the driving force for setting up an International Data Grid at CERN, the European Organization for Nuclear Research. Hundreds of Petabytes of data will be distributed and replicated all over the globe starting from 2005. In order to anal...
متن کاملImproving Data Grids Performance by Using Modified Dynamic Hierarchical Replication Strategy
Abstract: A Data Grid connects a collection of geographically distributed computational and storage resources that enables users to share data and other resources. Data replication, a technique much discussed by Data Grid researchers in recent years creates multiple copies of file and places them in various locations to shorten file access times. In this paper, a dynamic data replication strate...
متن کاملA grid middleware for data management exploiting peer-to-peer techniques
In this paper, we describe a service-oriented middleware architecture for Grid environments which enables efficient datamanagement. Our design introduces concepts fromPeer-to-Peer computing in order to provide a scalable and reliable infrastructure for storage, search and retrieval of annotated content. To ensure fast file lookups in the distributed repositories, our system incorporates a multi...
متن کاملGreen Energy Generation in Buildings: Grid-Tied Distributed Generation Systems (DGS) With Energy Storage Applications to Sustain the Smart Grid Transformation
The challenge of electricity distribution’s upgrade to incorporate new technologies is big, and electric utilities are mandated to work diligently on this agenda, thus making investments to ensure that current networks maintain their electricity supply commitments secure and reliable in face of disruptions and adverse environmental conditions from a variety of sources. The paper presents a new ...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
دوره شماره
صفحات -
تاریخ انتشار 2002