Turboscalar: A High Frequency High IPC Microarchitecture

نویسندگان

  • Bryan Black
  • John Paul Shen
چکیده

There is significant performance motivation to build larger and wider superscalar machines, however the implementation complexity can be overwhelming. When superscalar machines grow they necessarily become deeper in order to maintain frequency. As the pipeline depth increases the performance gained by a wide instruction fetch and dispatch is lost to branch misprediction penalty cycles. This work proposes the new Turboscalar microarchitecture, which is strongly based on the superscalar paradigm. Turboscalar utilizes run time information to optimize instruction execution. This new microarchitecture increases performance by reducing implementation complexity, allowing the construction of very shallow wide pipelines, which yield high performance. Results: A realistic Turboscalar implementation is proposed, that improves performance 66% over a wide deep superscalar that utilizes a block-based trace cache.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Microarchitecture Level Interconnect Modeling Considering Layout Optimization

— In this paper, we study microarchitecture-level interconnect modeling for power and performance. Considering structural interconnects, layer assignment, and concurrent repeater and Flip-Flop (FF) insertion, we develop cycle-accurate microarchitecture-level power and throughput simulation and obtain an accurate modeling of interconnects at the early design stage. Experiment show that the simul...

متن کامل

An Evaluation of Stratified Sampling of Microarchitecture Simulations

Recent research advocates applying sampling to accelerate microarchitecture simulation. Simple random sampling offers accurate performance estimates (with a high quantifiable confidence) by taking a large number (e.g., 10,000) of short performance measurements over the full length of a benchmark. Simple random sampling does not exploit the often repetitive behaviors of benchmarks, collecting ma...

متن کامل

Levo - A Scalable Processor With High IPC

UHT, MORANO, KHALAFI & KAELI 2 description of the microarchitecture, a description of the physical layout (a floorplan), and many new simulation results. The paper is organized as follows. In Section 2 we review major impediments to high IPC realization. Section 3 provides the Levo logical description, and discusses Levo’s solutions to the high IPC problems. Other implementation issues are addr...

متن کامل

Superspeculative Microarchitecture for Beyond AD 2000

I n its brief lifetime of 26 years, the microprocessor has achieved a total performance growth of 10,000 times thanks to technology improvements and microarchitecture innovations. Transistor count and clock frequency have increased by an order of magnitude in each of the first two decades of microprocessors; transistor count increased from 10,000 to 100,000 in the 1970s and up to 1 million in t...

متن کامل

Kool Chips Workshop Sunday 10 th December , 2000 in conjunction with MICRO

This paper discusses early results from a project called Morph where the goal is to develop a microarchitecture that can adapt its intrinsic performance dynamically. By observing that power dissipation is, to a first approximation, proportional to a power of the performance, reducing the performance characteristics thus reduces the power even faster, resulting in greatly improved performance/ w...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2000