Blue Gene/L compute chip: Memory and Ethernet subsystem
نویسندگان
چکیده
compute chip: Memory and Ethernet subsystem M. Ohmacht R. A. Bergamaschi S. Bhattacharya A. Gara M. E. Giampapa B. Gopalsamy R. A. Haring D. Hoenicke D. J. Krolak J. A. Marcella B. J. Nathanson V. Salapura M. E. Wazlowski The Blue Genet/L compute chip is a dual-processor system-on-a-chip capable of delivering an arithmetic peak performance of 5.6 gigaflops. To match the memory speed to the high compute performance, the system implements an aggressive three-level on-chip cache hierarchy. The implemented hierarchy offers high bandwidth and integrated prefetching on cache hierarchy levels 2 and 3 (L2 and L3) to reduce memory access time. A Gigabit Ethernet interface driven by direct memory access (DMA) is integrated in the cache hierarchy, requiring only an external physical link layer chip to connect to the media. The integrated L3 cache stores a total of 4 MB of data, using multibank embedded dynamic random access memory (DRAM). The 1,024-bit-wide data port of the embedded DRAM provides 22.4 GB/s bandwidth to serve the speculative prefetching demands of the two processor cores and the Gigabit Ethernet DMA engine. To reduce hardware overhead due to cache coherence intervention requests, memory coherence is maintained by software. This is particularly efficient for regular highly parallel applications with partitionable working sets. The system further integrates an on-chip double-data-rate (DDR) DRAM controller for direct attachment of main memory modules to optimize overall memory performance and cost. For booting the system and low-latency interprocessor communication and synchronization, a 16-KB static random access memory (SRAM) and hardware locks have been added to the design.
منابع مشابه
Design of the IBM Blue Gene/Q Compute chip
The heart of a Blue GeneA/Q system is the Blue Gene/Q Compute (BQC) chip, which combines processors, memory, and communication functions on a single chip. The Blue Gene/Q Compute chip has 16 þ 1 þ 1 processor cores, each with a quad single-instruction, multiple-data (SIMD) floating-point unit, and a multi-versioned Level 2 cache that provides hardware support for transactional memory, speculati...
متن کاملEmbedded DRAM: Technology platform for the Blue Gene/L chip
The Blue Genet/L chip is a technological tour de force that embodies the system-on-a-chip concept in its entirety. This paper outlines the salient features of this 130-nm complementary metal oxide semiconductor (CMOS) technology, including the IBM unique embedded dynamic random access memory (DRAM) technology. Crucial to the execution of Blue Gene/L is the simultaneous instantiation of multiple...
متن کاملBlue Gene/L compute chip: Control, test, and bring-up infrastructure
compute chip: Control, test, and bring-up infrastructure R. A. Haring R. Bellofatto A. A. Bright P. G. Crumley M. B. Dombrowa S. M. Douskey M. R. Ellavsky B. Gopalsamy D. Hoenicke T. A. Liebsch J. A. Marcella M. Ohmacht The Blue Genet/L compute (BLC) and Blue Gene/L link (BLL) chips have extensive facilities for control, bring-up, self-test, debug, and nonintrusive performance monitoring built ...
متن کاملPerformance and Scalability Evaluation of 'Big Memory' on Blue Gene Linux
We address memory performance issues observed in Blue Gene Linux and discuss the design and implementation of “Big Memory”—an alternative, transparent memory space introduced to eliminate the memory performance issues. We evaluate the performance of Big Memory using custom memory benchmarks, NAS Parallel Benchmarks, and the Parallel Ocean Program, at a scale of up to 4096 nodes. We find that Bi...
متن کاملCharacterizing the Performance of Big Memory on Blue Gene Linux
Using Linux for high-performance applications on the compute nodes of IBM Blue Gene/P is challenging because of TLB misses and difficulties with programming the network DMA engine. We present a design and implementation of “big memory”—an alternative, transparent memory space for computational processes, which addresses these difficulties. The big memory uses extremely large memory pages availa...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
- IBM Journal of Research and Development
دوره 49 شماره
صفحات -
تاریخ انتشار 2005