Performance / Price Sort

Authors

  • Jim Gray
  • Joshua Coates
  • Chris Nyberg
Abstract

NTsort is an external sort on Windows NT 5.0. It has minimal functionality but excellent price performance. In particular, running on mail-order hardware, it can sort 1.5 GB for a penny. NT 5.0 is not yet available. Among commercially available sorts, Postman Sort from Robert Ramey Software Development has elapsed-time performance comparable to NTsort while using less processor time. It can sort 1.27 GB (12.7 million records) for a penny. These sorts set new price-performance records. This paper documents these results and proposes that the PennySort benchmark be revised to Performance/Price sort: a simple GB/$ sort metric based on a two-pass external sort.

Why does anyone care about sorting and sort performance? The prosaic reason is that sorting is a common task: it is frequently used in database systems, data analysis, and data mining. Another important reason is that sorting is a simple, balanced workload involving memory access, IO, and CPU, so it evaluates a computer system's overall performance. Being simple, sorting is easily ported from one system to another and easily scaled to large SMP systems and to computer clusters.

The first public sort benchmark was defined in "A Measure of Transaction Processing Performance", Datamation, April 1, 1985 [1]. That article defined DatamationSort to measure how fast you can sort a million records. The records are 100 bytes, with 10-byte keys in random order. The sort is external (disk-to-disk). The time includes starting the program, creating the target file, and doing the sort. Prices are list prices depreciated over 3 years. Since then, there has been steady improvement in sort performance: sort speeds have approximately doubled each year, and price performance has approximately doubled each year, improving a thousand-fold every decade. In part, this has been due to Moore's law (things get faster every year), but that is only 40% of the story. The other 60% came from better algorithms and from parallelism.

The current champion, NOWsort, used 95 UltraSPARCs to sort 8.4 GB in a minute. DatamationSort times were getting tiny (a few seconds), so it seemed better to have a fixed-time benchmark rather than a fixed-size sort. MinuteSort (how much can you sort in a minute) replaced DatamationSort in 1994. MinuteSort is a "biggest bang" (price is no object) test. PennySort is a "bang-for-the-buck" measure: how many 100-byte records you can sort for a penny, if the system cost is depreciated over three years. Inexpensive systems are allowed to run for a …
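The PennySort rule described above (system cost depreciated over three years, with a budget of one cent) can be turned into a time budget with simple arithmetic. The sketch below is only an illustration under stated assumptions: the $1,000 system price is hypothetical and not taken from the abstract, a year is taken as 365 days, a GB as 10^9 bytes (consistent with 12.7 million 100-byte records being 1.27 GB), and `gb_per_dollar` is one plausible reading of the proposed GB/$ metric rather than the paper's exact definition.

```python
SECONDS_PER_YEAR = 365 * 24 * 3600   # assumption: 365-day year
DEPRECIATION_YEARS = 3               # from the abstract: 3-year depreciation
RECORD_BYTES = 100                   # from the abstract: 100-byte records


def penny_time_budget(system_price_dollars: float) -> float:
    """Seconds of machine time that one cent buys under 3-year depreciation."""
    lifetime_seconds = DEPRECIATION_YEARS * SECONDS_PER_YEAR
    cost_per_second = system_price_dollars / lifetime_seconds
    return 0.01 / cost_per_second


def records_to_gb(records: int) -> float:
    """Size of `records` 100-byte records in GB (assumption: 1 GB = 10**9 bytes)."""
    return records * RECORD_BYTES / 1e9


def gb_per_dollar(gb_sorted: float, elapsed_seconds: float,
                  system_price_dollars: float) -> float:
    """GB sorted per dollar of depreciated machine time (illustrative reading)."""
    lifetime_seconds = DEPRECIATION_YEARS * SECONDS_PER_YEAR
    run_cost = elapsed_seconds * system_price_dollars / lifetime_seconds
    return gb_sorted / run_cost


if __name__ == "__main__":
    price = 1000.0  # hypothetical system price in dollars
    budget = penny_time_budget(price)
    print(f"time budget for one cent: {budget:.2f} s")
    print(f"12.7 million records = {records_to_gb(12_700_000):.2f} GB")
    print(f"GB/$ if 1.27 GB fits the budget: "
          f"{gb_per_dollar(1.27, budget, price):.1f}")
```

Under these assumptions, a cheaper system gets a proportionally longer run for the same penny, which is why inexpensive hardware can compete on this measure.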




Journal title:
  • CoRR

Volume cs.DB/9809004  Issue 

Pages  -

Publication date 1998